Discussion:
Searching for decimal numbers
(too old to reply)
AC
2006-11-19 01:36:39 UTC
Permalink
Hello,

I have removed the numbers from my noise.enu file (using American English
locale). I can get results for whole numbers and alpha-numeric words. A
decimal number does not show up when I search for it. Some of the decimal
numbers are currency so I tried with the dollar sign as well. I would
appreciate suggestions from someone that has solved this problem. I find
very little when searching for this.

I am using the Indexing Service Query Form with Standard and Advanced query
modes. I am only putting in the numbers. No special instructions.

Will *not* find:
$30.20
847.8
(732.5)

Will find:
76750035
(H72.5)
5534.22C
--
AC
Gang_Warily
2006-11-20 13:45:01 UTC
Permalink
Hi

I presume you've emptied & rebuilt the catalog ?

Is it possible that the word-breaker is breaking the numbers at the decimal
point ?
Breaking dates at '/' or '-' could be helpful to find 2006 in 20-11-2006.

There is a utility called LRtest that might help
http://support.microsoft.com/default.aspx/kb/890613

I don't know what all that means, but it seems that using an 'NN' prefix
might help find numbers ?

ie 'NN10' seems to find instances of '10'
so 'nn123 NEAR nn456' might find 123.456

Of course, it may also find 456.123 ?


Also 'NN1' to 'NN9' seem to be hard-coded as noise words, even though they
aren't in the noise-word file.

I'm doing a lot of guesswork here - let us know what you find !
Any response from the experts or those that have time to experiment would
also be most welcome.
Post by AC
Hello,
I have removed the numbers from my noise.enu file (using American English
locale). I can get results for whole numbers and alpha-numeric words. A
decimal number does not show up when I search for it. Some of the decimal
numbers are currency so I tried with the dollar sign as well. I would
appreciate suggestions from someone that has solved this problem. I find
very little when searching for this.
I am using the Indexing Service Query Form with Standard and Advanced query
modes. I am only putting in the numbers. No special instructions.
$30.20
847.8
(732.5)
76750035
(H72.5)
5534.22C
--
AC
AC
2006-11-20 15:13:30 UTC
Permalink
I created a brand new catalog with a subset of data.

I will try NN tonight (I don't have the data with me today).

Thanks
Post by Gang_Warily
Hi
I presume you've emptied & rebuilt the catalog ?
Is it possible that the word-breaker is breaking the numbers at the decimal
point ?
Breaking dates at '/' or '-' could be helpful to find 2006 in 20-11-2006.
There is a utility called LRtest that might help
http://support.microsoft.com/default.aspx/kb/890613
I don't know what all that means, but it seems that using an 'NN' prefix
might help find numbers ?
ie 'NN10' seems to find instances of '10'
so 'nn123 NEAR nn456' might find 123.456
Of course, it may also find 456.123 ?
Also 'NN1' to 'NN9' seem to be hard-coded as noise words, even though they
aren't in the noise-word file.
I'm doing a lot of guesswork here - let us know what you find !
Any response from the experts or those that have time to experiment would
also be most welcome.
Post by AC
Hello,
I have removed the numbers from my noise.enu file (using American English
locale). I can get results for whole numbers and alpha-numeric words. A
decimal number does not show up when I search for it. Some of the decimal
numbers are currency so I tried with the dollar sign as well. I would
appreciate suggestions from someone that has solved this problem. I find
very little when searching for this.
I am using the Indexing Service Query Form with Standard and Advanced query
modes. I am only putting in the numbers. No special instructions.
$30.20
847.8
(732.5)
76750035
(H72.5)
5534.22C
--
AC
AC
2006-11-28 18:49:10 UTC
Permalink
NN didn't work :(

I tried NN30 AND NN80 as well as NN30.80

Thanks for the suggestions.
--AC
Post by Gang_Warily
Hi
I presume you've emptied & rebuilt the catalog ?
Is it possible that the word-breaker is breaking the numbers at the decimal
point ?
Breaking dates at '/' or '-' could be helpful to find 2006 in 20-11-2006.
There is a utility called LRtest that might help
http://support.microsoft.com/default.aspx/kb/890613
I don't know what all that means, but it seems that using an 'NN' prefix
might help find numbers ?
ie 'NN10' seems to find instances of '10'
so 'nn123 NEAR nn456' might find 123.456
Of course, it may also find 456.123 ?
Also 'NN1' to 'NN9' seem to be hard-coded as noise words, even though they
aren't in the noise-word file.
I'm doing a lot of guesswork here - let us know what you find !
Any response from the experts or those that have time to experiment would
also be most welcome.
Post by AC
Hello,
I have removed the numbers from my noise.enu file (using American English
locale). I can get results for whole numbers and alpha-numeric words. A
decimal number does not show up when I search for it. Some of the decimal
numbers are currency so I tried with the dollar sign as well. I would
appreciate suggestions from someone that has solved this problem. I find
very little when searching for this.
I am using the Indexing Service Query Form with Standard and Advanced query
modes. I am only putting in the numbers. No special instructions.
$30.20
847.8
(732.5)
76750035
(H72.5)
5534.22C
--
AC
AC
2007-02-17 03:46:26 UTC
Permalink
Thanks!! I think this might help. Need to test it and see.

Regards
http://msdn2.microsoft.com/en-gb/library/ms693168.aspx
When you create a word breaker, it is recommended that the word breaker
normalize numbers to a canonical representation by using the pattern
"NNddDcc," where "NN" is the literal sequence "NN," dd is the integer
portion
of the number, "D" is the literal "D," and cc is the fractional portion of
the number. Word breakers do not restrict the number of digits for either
the
integer or the fraction portion of the number. It is recommended that word
breakers recognize numerical patterns that are delimited by both periods
(.)
and commas (,). For example, Indexing Service represents both "1,000.2"
and
"1.000,2" as "NN1000D2."
Hi
I'm not sure how this can be used, but I'm sure it's relevant somehow !
Are you using the SQL query language, or one of the other two dialects of
SQL Query Language ?
http://msdn2.microsoft.com/en-gb/library/ms690580.aspx
Post by AC
NN didn't work :(
I tried NN30 AND NN80 as well as NN30.80
Thanks for the suggestions.
--AC
Post by Gang_Warily
Hi
I presume you've emptied & rebuilt the catalog ?
Is it possible that the word-breaker is breaking the numbers at the decimal
point ?
Breaking dates at '/' or '-' could be helpful to find 2006 in 20-11-2006.
There is a utility called LRtest that might help
http://support.microsoft.com/default.aspx/kb/890613
I don't know what all that means, but it seems that using an 'NN' prefix
might help find numbers ?
ie 'NN10' seems to find instances of '10'
so 'nn123 NEAR nn456' might find 123.456
Of course, it may also find 456.123 ?
Also 'NN1' to 'NN9' seem to be hard-coded as noise words, even though they
aren't in the noise-word file.
I'm doing a lot of guesswork here - let us know what you find !
Any response from the experts or those that have time to experiment would
also be most welcome.
Post by AC
Hello,
I have removed the numbers from my noise.enu file (using American English
locale). I can get results for whole numbers and alpha-numeric words.
A
decimal number does not show up when I search for it. Some of the decimal
numbers are currency so I tried with the dollar sign as well. I would
appreciate suggestions from someone that has solved this problem. I find
very little when searching for this.
I am using the Indexing Service Query Form with Standard and Advanced query
modes. I am only putting in the numbers. No special instructions.
$30.20
847.8
(732.5)
76750035
(H72.5)
5534.22C
--
AC
Loading...