Discussion:
indexing service forget a lot of files.
(too old to reply)
Roberto Gerlando
2007-11-07 15:18:48 UTC
Permalink
Raw Message
Hello to all
I've my directory called Archive with 400k files (about 200gb of text
data), they are doc, pdf, html, xls, txt and rtf.
Pdf Ifilter is istalled.
When I ask via mmc to build the catalog, the indexing service works
for about 48 hours and when It finish I have a catalog of about 7
Gbyte.
All the query seems to work fine but.... It seems to all is fine but
itsn't
For example If I search for the sentence ' my name is Frank" I get
1000 results.

Ok now If I rebuild my catalog and My directory Archive is empty and I
Move smothly my 400k files in my directory, if for examples I move 10
files each 30 seconds inside my directory Archive, when after some
weeks I finish this incredible work of moving smoothly the files, if I
search for the sentence ' my name is Frank" I get 1800 results!!!!

Indexing seems to be more accurate and good if you move files in
groups of 4 - 10 in your target directory.
If I ask indexing service to index my directory with all my 400k
files, it seems to be Stifle and so it seems that it ignores a lot of
files!!

Has somebody experienced that? Any suggestions?
Sorry for my poor english

Regards
Roberto Gerlando
Craig Humphrey
2007-11-23 00:36:00 UTC
Permalink
Raw Message
Hi Roberto,

I've got a similar problem, only it's with 50+k files (~14gig), all of them
PDFs (from electronic documents, so fully content searchable) on a Win2003
server.

When I run the indexer over them I get a 6meg catalog!!! Searches for
content often return incomplete results, while searches for filenames are
fine.

On the old Win2000 server I'm migrating this from, I have a ~4gig catalog.

Both servers are using the the Adobe iFilter v6.0.

If you get an answer to this, I'd love to hear it.

BTW I've raised this with Adobe, but it's slow going...

Soon'ish
Craig
--
--
Craig dot Humphrey at ChapmanTripp dot com
Post by Roberto Gerlando
Hello to all
I've my directory called Archive with 400k files (about 200gb of text
data), they are doc, pdf, html, xls, txt and rtf.
Pdf Ifilter is istalled.
When I ask via mmc to build the catalog, the indexing service works
for about 48 hours and when It finish I have a catalog of about 7
Gbyte.
All the query seems to work fine but.... It seems to all is fine but
itsn't
For example If I search for the sentence ' my name is Frank" I get
1000 results.
Ok now If I rebuild my catalog and My directory Archive is empty and I
Move smothly my 400k files in my directory, if for examples I move 10
files each 30 seconds inside my directory Archive, when after some
weeks I finish this incredible work of moving smoothly the files, if I
search for the sentence ' my name is Frank" I get 1800 results!!!!
Indexing seems to be more accurate and good if you move files in
groups of 4 - 10 in your target directory.
If I ask indexing service to index my directory with all my 400k
files, it seems to be Stifle and so it seems that it ignores a lot of
files!!
Has somebody experienced that? Any suggestions?
Sorry for my poor english
Regards
Roberto Gerlando
Loading...