Hilary Cotter
2006-12-05 11:30:58 UTC
You need to add the ms.locale meta tag to each page, with the content value
set to the page's locale code, e.g. EN-US, FR, etc.
The problem is that when you query across different languages you will get
false cognates, or false friends. E.g. querying for gift in English will
also give hits on the German word Gift, which means poison.
So you can restrict your search by subweb, as you have suggested, or by
checking the value of the ms.locale tag.
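
Since the pages currently carry no language information, the tag has to be added to every page first. A rough sketch of how that could be scripted, assuming static HTML files that each have a <head> element; the helper name add_locale_meta is made up for illustration and is not part of Index Server:

```python
import re

# Hypothetical helper, not part of Index Server: stamps a page with an
# ms.locale meta tag so each page declares which language it is in.
_HEAD_RE = re.compile(r"<head(\s[^>]*)?>", re.IGNORECASE)
_LOCALE_RE = re.compile(r"<meta\s+name=[\"']?ms\.locale[\"']?", re.IGNORECASE)


def add_locale_meta(html, locale):
    """Return html with an ms.locale meta tag inserted right after <head>."""
    if _LOCALE_RE.search(html):
        return html  # page is already tagged, leave it untouched
    match = _HEAD_RE.search(html)
    if match is None:
        raise ValueError("no <head> element found")
    tag = '\n<meta name="ms.locale" content="%s">' % locale
    return html[:match.end()] + tag + html[match.end():]
```

You would run this once per subweb, passing the locale code for that subweb (for example DE for the German pages), and then let the catalog rescan the files.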
--
Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html
Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
Hello,
on our web server we are running subwebs in various languages, such as
English, Spanish, German, Russian, Turkish, Japanese and others. All web
pages are UTF-8 encoded and do not contain any further language/character
set information. Is there any document describing what I have to do to
configure Index Server to support correct searching in different languages?
First of all, I have to insert a meta tag with the character set used, I
guess. But then: do I have to set up a different index for each subweb?
Where do I get the required files, e.g. for word stemming, which Index
Server needs for non-(Western-)European languages? How do I search the
proper index if, for example, the searched expression is in Turkish but my
browser's language preference is set to Swedish?
The Index Server documentation on MSDN is very cryptic here, so any help is
welcome.
Regards
Kallo