Discussion:
display hit results in proper format
Steve
2006-10-25 13:48:23 UTC
Hi,
is there a way to set the response content type of the .htw file (or
whatever extension you use) that is in charge of displaying and
formatting the hits for a searched string in a file? I mean the file
that webhits.dll uses to display summary or full highlighting of the
search.
If you get a match inside a PDF or RTF file, it always displays the
result in a single format, whereas I need a display format that matches
the MIME type of the file.
I've written some VBScript that reads the file and displays it with the
hits highlighted, but it's a pain in the ass. I'm sure there's a better
way to achieve that!

Thank you
Hilary Cotter
2006-10-26 12:47:51 UTC
You can't do this. You are stuck with the formatting webhits.dll returns.
--
Hilary Cotter
Director of Text Mining and Database Strategy
RelevantNOISE.Com - Dedicated to mining blogs for business intelligence.

This posting is my own and doesn't necessarily represent RelevantNoise's
positions, strategies or opinions.

Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
Steve
2006-10-26 13:02:41 UTC
Hey Hilary,
thanks for the reply.
You're amazing. I've been searching through the posts and wow, I'm
really head over heels in here, you're everywhere!!
Save the boys some, will ya!!
OK, now back to business.
I have some questions about this issue:

1- The result returned by webhits.dll is just simple HTML, right? But
how is it converted from all kinds of files into HTML? Is that done at
indexing time?

2- I wrote a hit highlighter script that works only on HTML files, and I
intend to write highlighters for other file types as well, e.g. PDF. So
I could use some guidance on where to look and what to look for, and if
you know of a ready-made product I'd be very grateful.

Thanks,
keep up the good work
Hilary Cotter
2006-10-26 15:40:32 UTC
1) It's done at query time. The text is extracted from the file, marked up,
and displayed on the web. IIRC they launch the IFilter for the document type
in process and use that to extract the text.

2) The problem is that you really need to use the Microsoft word
breakers/stemmers to correctly mark up free-text searches, so that a search
on "mice" also matches "mouse". A grep or regex won't be able to do this,
but the Microsoft stemmers/word breakers can, especially for non-English
languages where the endings get even more irregular.
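
To illustrate point 2, here is a minimal Python sketch (not what
webhits.dll actually does) comparing literal matching with matching on a
base form. The BASE_FORMS table and the base_form and highlight functions
are hypothetical names made up for this example; the two-entry table only
stands in for a real stemmer, which would cover the whole language.

```python
import html
import re

# Hypothetical stand-in for a real stemmer: maps an inflected form to its
# base form. A real word breaker/stemmer would cover the whole language.
BASE_FORMS = {"mice": "mouse", "mouses": "mouse"}


def base_form(word):
    """Return a crude base form: lowercase, then look up in BASE_FORMS."""
    lowered = word.lower()
    return BASE_FORMS.get(lowered, lowered)


def highlight(text, term, normalize):
    """Return text as HTML with words equal to term (after normalize) in <b>.

    normalize is the function applied to both the search term and each
    word of the text before comparing them.
    """
    target = normalize(term)
    parts = []
    # re.split with a capturing group alternates separators (even indexes)
    # and words (odd indexes), so nothing from the text is lost.
    for index, piece in enumerate(re.split(r"(\w+)", text)):
        escaped = html.escape(piece)
        if index % 2 == 1 and normalize(piece) == target:
            escaped = "<b>" + escaped + "</b>"
        parts.append(escaped)
    return "".join(parts)


if __name__ == "__main__":
    sample = "Two mice & one mouse."
    # Literal, case-insensitive matching: only the exact word is marked.
    print(highlight(sample, "mice", str.lower))
    # Matching on base forms: "mouse" is marked as well.
    print(highlight(sample, "mice", base_form))
```

With literal matching only "mice" gets marked; with the base-form lookup
"mouse" is marked too, which is the behaviour a plain grep or regex cannot
give you without some kind of stemming data behind it.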