
Re: Oracle Text, PDF files

From: Khalid Eidoo <khalid.eidoo_at_utoronto>
Date: Sat, 29 Mar 2003 02:32:24 GMT
Message-ID: <YA7ha.34323$KlE.1199@news04.bloor.is.net.cable.rogers.com>


You didn't mention which version of Oracle you were using. We recently upgraded to 9.2.0.2 on Linux, which specifically had some updates to the INSO filter. We find that indexing PDFs using the new filter provides marginally better results.

We primarily perform gists in our application, and the fact that there is less garbage in them indicates to us that the INSO filter is doing a better job. We haven't tried highlighting yet though.

Khalid.
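
For anyone wanting to try the setup described above, here is a minimal sketch of indexing PDFs through the INSO filter and then generating a gist. The table names (docs, gist_results), the index name (docs_idx) and the document key are hypothetical and not from the original post. It assumes the PDFs are stored in a BLOB column and that the Oracle Text knowledge base needed for gists is installed; treat it as a starting point rather than a tested configuration.

```sql
-- Hypothetical table holding the PDF documents as BLOBs.
CREATE TABLE docs (
  id   NUMBER PRIMARY KEY,
  body BLOB
);

-- CONTEXT index that passes each document through the INSO filter
-- before the text is tokenized.
CREATE INDEX docs_idx ON docs (body)
  INDEXTYPE IS CTXSYS.CONTEXT
  PARAMETERS ('FILTER CTXSYS.INSO_FILTER');

-- Result table with the columns CTX_DOC.GIST writes to.
CREATE TABLE gist_results (
  query_id NUMBER,
  pov      VARCHAR2(80),
  gist     CLOB
);

-- Generate a paragraph-level gist for the document whose key is 1.
BEGIN
  CTX_DOC.GIST(
    index_name => 'docs_idx',
    textkey    => '1',
    restab     => 'gist_results',
    query_id   => 1,
    glevel     => 'P'
  );
END;
/

-- Inspect the generated gists; filter debris tends to show up here.
SELECT pov, gist FROM gist_results WHERE query_id = 1;
```

Comparing the gist output for the same PDF before and after a filter upgrade is one way to judge, as described above, whether the filter is producing less garbage.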

"Randy Nichols" <randynichols_at_yahoo.com> wrote in message news:NSGga.22238$jA2.1996308_at_newsread2.prod.itd.earthlink.net...
> Oracle Text apparently has problems indexing and highlighting certain PDF
> files.
>
> Is anyone aware of any good solutions for making Oracle Text more robust
> with respect to PDF files?
>
> Is there a service pack for Oracle Text that fixes some problems concerning
> PDF files?
>
> Anyone have experience with third-party filters that do a good job indexing
> and highlighting (markup) of PDF files with Oracle Text?
>
> Thanks,
>
> R. Nichols
>
>
Received on Fri Mar 28 2003 - 20:32:24 CST
