Discussion:
Abstracts not being returned for any document type
ChrisMK
2006-09-06 09:15:01 UTC
I am building an index server test system on server 200 sp1

I have the Adobe IFilter installed and "Generate abstracts" checked in the
catalog properties.

I am using the OLE DB provider for queries, including the Characterization
column, but no abstracts are returned for PDF or text files.

Is there anything that I have missed that is needed to produce abstracts?
ChrisMK
2006-09-06 09:18:01 UTC
Correction: that should have read
Server 2003 SP1
WenJun Zhang[msft]
2006-09-07 07:48:06 UTC
Hi Chris,

If a large number of PDF files need to be indexed when the catalog is
first created, the PDF files will appear in the search results but their
abstracts will not be displayed. This is expected behavior. Generally
the abstracts will show up a couple of hours later.

The first step in indexing is to scan the files, which produces a list of
files to filter. Once the files are scanned, the name, file type, and size
are known. The CiDaemon process is then called for the files in this list,
the IFilter is loaded, and the content is filtered. If this fails for some
reason, or is deferred for a later retry because the file is in use, the
file will show up as unfiltered. Unfiltered docs can still be located by a
#filename search or a @size search.
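The two stages described above can be sketched as a toy model. This is only an illustration of why an unfiltered file still has a name and size but no abstract; it is not how Index Server is implemented, and every name in it (IndexEntry, scan, filter_entries, extract_ascii) is hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class IndexEntry:
    """What the toy index knows about one file."""
    filename: str
    size: int
    filtered: bool = False          # set once content filtering succeeds
    abstract: Optional[str] = None  # only available after filtering


def scan(files: Dict[str, bytes]) -> List[IndexEntry]:
    """Stage 1: record name and size only; content is not read yet."""
    return [IndexEntry(name, len(data)) for name, data in files.items()]


def filter_entries(entries: List[IndexEntry],
                   files: Dict[str, bytes],
                   extract_text: Callable[[bytes], str],
                   abstract_len: int = 40) -> None:
    """Stage 2: run a (hypothetical) text extractor on each scanned file."""
    for entry in entries:
        try:
            text = extract_text(files[entry.filename])
        except Exception:
            # Filtering failed or was deferred: the entry stays unfiltered.
            continue
        entry.filtered = True
        entry.abstract = text[:abstract_len]


def extract_ascii(data: bytes) -> str:
    """Stand-in for an IFilter; raises on bytes it cannot decode."""
    return data.decode("ascii")


files = {"a.txt": b"hello world", "locked.pdf": b"\xff\xfe"}
entries = scan(files)
filter_entries(entries, files, extract_ascii)
for entry in entries:
    print(entry.filename, entry.size, entry.filtered, entry.abstract)
```

In this model the second file is still findable by name or size after the scan, which mirrors the #filename and @size behaviour described above, but it has no abstract until filtering succeeds.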

After the "Generate abstracts" option is set, the files must be rescanned.
If CiForceUseCi is set to true, then Index Server will use the index even if
it is out of date, so old data could be returned while the rescan is in
progress.

If the indexing performance adjustment is not set to Instant, it will take
a while to filter the PDF documents and create the abstracts.

Indexing Service Catalog Updates Take Several Hours or Days to Show Changes
http://support.microsoft.com/kb/270056/EN-US/

Generating an abstract will use more CPU time during the indexing process.
More disk space will also be used for the corpus. By default, abstract
creation is disabled on Windows 2003 Index Server. For abstract information
to be generated and returned, abstract generation must be enabled and the
characterization property must be cached.

Abstract generation can be set at the Indexing Service level and inherited
by catalogs, or it can be overridden at the catalog level.

1) Run the Index Server MMC.
2) Right-click on Indexing Service, and then select Properties. Click on
the Generation tab. Check the "Generate abstracts" option to enable this
for all catalogs.
3) Right-click on a catalog and select Properties. Click on the Generation
tab. If the "Inherit above settings from Service" option is checked, then
the Indexing Service settings will be used. Otherwise, check the "Generate
abstracts" option.

Use the following procedure to enable caching of the characterization
property.
1) Run the Index Server MMC.
2) Expand the Indexing Service. Expand a catalog. Click on the Properties
folder.
3) Locate the property that has a Friendly Name of characterization.
Right-click on characterization and select Properties.
4) Check the box labeled Cached. Set the datatype to VT_LPWSTR.

After making the above changes, restart the Indexing Service, and perform a
Full Rescan of the directories associated with the catalog.
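Once the rescan has finished, a query that selects the Characterization column is one way to check whether abstracts are now being stored. The following is a minimal sketch, not a tested procedure: the catalog name "TestCatalog" and the helper names are made up for illustration, the fetch function assumes Windows with the third-party pywin32 package and ADO available, and the exact CONTAINS predicate form should be checked against the Indexing Service SQL documentation.

```python
def build_connection_string(catalog: str) -> str:
    """ADO connection string for the Indexing Service OLE DB provider (MSIDXS)."""
    return f"Provider=MSIDXS;Data Source={catalog}"


def build_abstract_query(search_term: str) -> str:
    """SQL-style query that asks for the Characterization (abstract) column.

    Intended for a single-word search term; single quotes are doubled so the
    term stays inside the string literal.
    """
    escaped = search_term.replace("'", "''")
    return (
        "SELECT Filename, Size, Characterization "
        "FROM SCOPE() "
        f"WHERE CONTAINS(Contents, '{escaped}')"
    )


def fetch_abstracts(catalog: str, search_term: str):
    """Run the query through ADO and return (filename, abstract) pairs.

    Assumption: pywin32 is installed and the Indexing Service is running.
    """
    import win32com.client  # third-party (pywin32); imported only when needed

    rs = win32com.client.Dispatch("ADODB.Recordset")
    rs.Open(build_abstract_query(search_term), build_connection_string(catalog))
    rows = []
    try:
        while not rs.EOF:
            rows.append((rs.Fields.Item("Filename").Value,
                         rs.Fields.Item("Characterization").Value))
            rs.MoveNext()
    finally:
        rs.Close()
    return rows


# The string builders run anywhere; "TestCatalog" is a hypothetical catalog name.
print(build_connection_string("TestCatalog"))
print(build_abstract_query("invoice"))
```

If the Characterization values come back empty for every file type, the catalog configuration is the more likely cause; if they are empty only for PDF files, the filter is the more likely cause.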

Also, please note that Microsoft does not provide support for third-party
software. You will need to contact Adobe regarding the PDF IFilter. As far
as I know, abstracts should be supported by the latest PDF IFilter. Please
verify that you have the latest version of the IFilter installed.

Filtering Adobe Acrobat Files for Use with Index Server
http://support.microsoft.com/default.aspx?scid=kb;en-us;174780

Microsoft does not provide a filter for PDF files.

Adobe Downloads
http://www.adobe.com/support/downloads/product.jsp?product=1&platform=Windows

PDF Ifilter 6.0
http://www.adobe.com/support/downloads/detail.jsp?ftpID=2611

Hope the info above helps. Please update here if you have further questions.

Thanks.

Sincerely,

WenJun Zhang
Microsoft Online Partner Support

When responding to posts, please "Reply to Group" via your newsreader so
that others may learn and benefit from your issue.

=====================================================

Business-Critical Phone Support (BCPS) provides you with technical phone
support at no charge during critical LAN outages or "business down"
situations. This benefit is available 24 hours a day, 7 days a week to all
Microsoft technology partners in the United States and Canada.

This and other support options are available here:

BCPS:
https://partner.microsoft.com/US/technicalsupport/supportoverview/40010469
Others: https://partner.microsoft.com/US/technicalsupport/supportoverview/

If you are outside the United States, please visit our International
Support page: http://support.microsoft.com/common/international.aspx

======================================================

This posting is provided "AS IS" with no warranties, and confers no rights.
WenJun Zhang[msft]
2006-09-11 14:19:43 UTC
Hi Chris,

I just wanted to check how the issue is going and whether you have any
further questions on it.

Thanks.

Sincerely,

WenJun Zhang

Microsoft Online Community Support

==================================================

Get notification to my posts through email? Please refer to:
http://msdn.microsoft.com/subscriptions/managednewsgroups/default.aspx#notifications.

Note: The MSDN Managed Newsgroup support offering is for non-urgent issues
where an initial response from the community or a Microsoft Support
Engineer within 1 business day is acceptable. Please note that each follow
up response may take approximately 2 business days as the support
professional working with you may need further investigation to reach the
most efficient resolution. The offering is not appropriate for situations
that require urgent, real-time or phone-based interactions or complex
project analysis and dump analysis issues. Issues of this nature are best
handled working with a dedicated Microsoft Support Engineer by contacting
Microsoft Customer Support Services (CSS) at:

http://msdn.microsoft.com/subscriptions/support/default.aspx.

==================================================

Hilary Cotter
2006-09-06 23:30:36 UTC
As long as the characterization property is cached, you should be generating
characterizations for most file types. IIRC the Adobe IFilter does not
support characterizations.
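
One way to tell a catalog-wide problem from a filter-specific one is to count, per file extension, how many results actually carry a characterization. The sketch below assumes the query results have already been fetched as (filename, characterization) pairs; the function name and the sample rows are made up for illustration.

```python
import os
from collections import defaultdict


def abstract_coverage(rows):
    """Map file extension -> (files with an abstract, total files)."""
    counts = defaultdict(lambda: [0, 0])
    for filename, characterization in rows:
        ext = os.path.splitext(filename)[1].lower() or "(none)"
        counts[ext][1] += 1
        # Treat None, empty, and whitespace-only values as "no abstract".
        if characterization and characterization.strip():
            counts[ext][0] += 1
    return {ext: tuple(pair) for ext, pair in counts.items()}


# Hypothetical sample rows standing in for real query results.
rows = [
    ("a.txt", "Some text from the start of the file"),
    ("b.TXT", None),
    ("c.pdf", ""),
    ("d.pdf", None),
]
coverage = abstract_coverage(rows)
for ext, (with_abstract, total) in sorted(coverage.items()):
    print(f"{ext}: {with_abstract}/{total} files have an abstract")
```

If text files show abstracts and PDF files do not, that points at the PDF filter; if no extension shows any, the property caching or abstract generation settings are worth rechecking first.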
--
Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html

Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com