Discussion:
Index server not indexing all files
(too old to reply)
Tracey
2006-03-01 18:05:28 UTC
Permalink
Hi all,

I am running index server on Windows 2003.

When I queried the catalog with a simple query (a particular word) I
found it wasn't returning some wpd files (from folder A). If I run an
advanced query such '@size >= 1' it returns all the files (as
expected). So it looks as though the index server can see these files
but are not indexing the contents of them properly.

I initially thought it was an issue with wpd files (I have installed
wpd filter), but found that if I moved a wpd file from folder A into
folder B then it was being returned in the results.
However I have checked the security permissions on the folders and
ensured that they have it same rights and it still doesn't make a
difference. Also a .doc in folder A is being returned in the simple
query results so it looks as though the index server has permissions
to index this folder.

It has been a fairly confusing process so I apologise if the problem
description is confusing, but it would be great if anyone can suggest
anything to resolve this issue.

Cheers,

Tracey
Rainer Mohr
2006-03-02 07:20:45 UTC
Permalink
Hi Tracey,

I´m experiencing similar problems, mine are even more confusing a
times... The identical query (in an ASP-page) on 2 identical machines
are returning different results... And I cant find the problem.

Have you checked, if the words that are not returned are included in the
noise files? (windows\system32\noise.[your language]). Try emptying all
the noise files you see, re-index the files and try again.

Cheers,
Rainer
Tracey
2006-03-02 09:51:12 UTC
Permalink
Hi Rainer,

Thanks for your reply. I had a look in the noise file, but this
doesn't seem to be the issue. On simple queries (not words included
in the noise file) some files are being returned, just not all files
that I know contain that word.

The summary of my problem is that wpd files in a particular folder
(folder A) don't seem to have their content indexed and don't appear in
the results. Moved to any other folders these wpd files are returned
in results. However files such as .doc files in folder A do seem to
have their content indexed as they are returned in query results.

Are you getting completely different results or just additional results
on one of the machines?

Thanks

Tracey
Rainer Mohr
2006-03-02 14:25:33 UTC
Permalink
Hi Tracey,
Post by Tracey
The summary of my problem is that wpd files in a particular folder
(folder A) don't seem to have their content indexed and don't appear in
the results. Moved to any other folders these wpd files are returned
in results. However files such as .doc files in folder A do seem to
have their content indexed as they are returned in query results.
That IS strange, no idea why that happens...
Post by Tracey
Are you getting completely different results or just additional results
on one of the machines?
Both machines indexed the same files (319 documents).
Queries like 'first_word AND NOT second_word' worked on both machines
with hundreds of words, except for 2 words we identified, that were
simply ignored on one of the machines. There were 9 documents containing
the second_word and 56 containing the first_word. When we excluded the
second_word using the 2 identified words, we got the same results as we
got for first_word only.
On the other machine, it all worked fine, we got 47 results.

We found out, that the ignored words were in the index (so not ignored
by any noise list), because when we used another tool on the "strange"
machine, it worked fine as well.

I changed from the OleDbConnection("Provider=MSIDXS") to the ixxso today
and now it works on all machines. Seems to be an issue with the one ODBC
Driver on the one machine. No idea, it cost me a few days and a lot of
nerves... :-)

Good luck with your wpd files,
Regards,
Rainer

Hilary Cotter
2006-03-02 11:50:48 UTC
Permalink
for certain properties Indexing Services does a grep of the file system.
Searching on size is one of these properties. It does not indicate whether
the file is indexed or not.
--
Hilary Cotter
Director of Text Mining and Database Strategy
RelevantNOISE.Com - Dedicated to mining blogs for business intelligence.

This posting is my own and doesn't necessarily represent RelevantNoise's
positions, strategies or opinions.

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
Post by Tracey
Hi all,
I am running index server on Windows 2003.
When I queried the catalog with a simple query (a particular word) I
found it wasn't returning some wpd files (from folder A). If I run an
expected). So it looks as though the index server can see these files
but are not indexing the contents of them properly.
I initially thought it was an issue with wpd files (I have installed
wpd filter), but found that if I moved a wpd file from folder A into
folder B then it was being returned in the results.
However I have checked the security permissions on the folders and
ensured that they have it same rights and it still doesn't make a
difference. Also a .doc in folder A is being returned in the simple
query results so it looks as though the index server has permissions
to index this folder.
It has been a fairly confusing process so I apologise if the problem
description is confusing, but it would be great if anyone can suggest
anything to resolve this issue.
Cheers,
Tracey
Loading...