Discussion:
Noise Words file
chris
2007-07-18 23:08:04 UTC
Has anyone taken all the words out of the noise words file, but left in all
the letters of the alphabet and the digits 0-9?
I have a catalog of about 400 MB with 500,000 documents indexed.
If I took out all the words, would that double or triple the size of my
catalog? I know no one has an exact answer to this question, but I was hoping
that someone could share their experience of deleting words from
the /winnt/system32/noise.enu file.
Thanks for any replies.
Hilary Cotter
2007-07-19 01:43:40 UTC
The catalogs are lightly compressed. I would expect to see a 20-30%
increase, but this depends on the density of those words in your
documents.
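
To put that estimate in concrete terms for a 400 MB catalog, here is a rough sketch of the arithmetic. The function name and its parameters are made up for illustration, and the 20-30% range is only the guess given above, not a measured figure.

```python
def estimate_catalog_size(current_mb, low_pct=20, high_pct=30):
    """Project a catalog size range in MB from a guessed percentage increase.

    The default percentages are the rough 20-30% estimate from this reply;
    they are an assumption, not a measurement.
    """
    low = current_mb * (100 + low_pct) / 100
    high = current_mb * (100 + high_pct) / 100
    return low, high


low, high = estimate_catalog_size(400)
print(f"Estimated catalog size: {low:.0f} MB to {high:.0f} MB")
```

So under that guess the catalog would land somewhere around 480 to 520 MB, well short of doubling. The real number will vary with how often the removed words occur in the documents.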
--
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com