Discussion:
Catastrophic failure on LOAD of Document
terry17
2006-08-09 16:46:01 UTC
Hi,
I am trying to use IFilters to extract the text from documents
for our search application.

For the most part it works fine, but a few documents
give me the error "Catastrophic failure".

I tried running "filtdump" on those .DOC documents, but I get an error:
Error 0x8000ffff loading IFilter

But if I open the document, resave it as RTF, and then apply the RTF
IFilter, it does work. If I just rename it from .doc to .rtf, though, it
does not work.

Is there some way I can solve this issue? Any help is appreciated.

Note: I am applying the MS Office Filter (GUID:
f07f3920-7b8c-11cf-9be8-00aa004b9986)
############################################
MY CODE (fails on .LOAD):
System.Runtime.InteropServices.UCOMIPersistFile ipf =
(System.Runtime.InteropServices.UCOMIPersistFile)(iFilter);
ipf.Load(sFileName, 0);
############################################

thanks in advance,
--
Terry
Woodrow Windischman
2006-08-30 13:56:36 UTC
Word documents are notorious for accumulating subtle corruptions over time.
Usually they aren't noticeable, in that Word is happy to open the doc, and it
"looks" fine, but Index Server (probably the Word IFilter) is more
sensitive, and will balk at certain things. It particularly doesn't like
"poorly set" document metadata properties, but there are probably other
things as well.

I'm not surprised that RTF works. Saving as an RTF document will get rid of
many types of corruption, and you can usually then re-save the RTF as a DOC
and all will be well.
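
Since the damaged files cannot be repaired from inside the filter, one option is to catch the failure and set those documents aside for the RTF round trip described above. The following is only a minimal sketch: the FilterLoader class and TryLoad method are hypothetical names, and it assumes the error reaches managed code as a COMException whose ErrorCode is 0x8000FFFF (E_UNEXPECTED, which Windows reports as "Catastrophic failure"). It uses System.Runtime.InteropServices.ComTypes.IPersistFile, the replacement for the UCOMIPersistFile interface that .NET 2.0 marked obsolete.

```csharp
using System;
using System.Runtime.InteropServices;
using System.Runtime.InteropServices.ComTypes;

// Hypothetical helper, not part of any IFilter API.
static class FilterLoader
{
    // E_UNEXPECTED; its system message text is "Catastrophic failure".
    const int E_UNEXPECTED = unchecked((int)0x8000FFFF);

    // Returns true if the filter loaded the file, and false if it rejected
    // the file with E_UNEXPECTED, so the caller can skip the document or
    // queue it for a manual re-save. Any other error is rethrown.
    public static bool TryLoad(object iFilter, string fileName)
    {
        IPersistFile ipf = (IPersistFile)iFilter;
        try
        {
            ipf.Load(fileName, 0); // mode 0 = read-only, as in the original call
            return true;
        }
        catch (COMException ex)
        {
            if (ex.ErrorCode == E_UNEXPECTED)
            {
                return false;
            }
            throw;
        }
    }
}
```

A caller could log every file name for which TryLoad returns false and re-save those documents in Word (DOC to RTF and back to DOC) before indexing them again.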