Charlotte
2007-03-10 16:46:50 UTC
Hi,
We have a file server with mainly office docs and PDF files that sometime
contains mixed languages e.g. text in English and a few names in both
transcribed and orginal formats (Russian, Arabic, Chinese etc). When I
search in english the hits are fine, but using the other char sets to set up
a question returns nothing.
I suspect this has to do with some limited support for these languages. Is
that true and would it in such case be possible to design your own word
breaker/ noise word files to use?
Many thanks for your support!
We have a file server with mainly office docs and PDF files that sometime
contains mixed languages e.g. text in English and a few names in both
transcribed and orginal formats (Russian, Arabic, Chinese etc). When I
search in english the hits are fine, but using the other char sets to set up
a question returns nothing.
I suspect this has to do with some limited support for these languages. Is
that true and would it in such case be possible to design your own word
breaker/ noise word files to use?
Many thanks for your support!