Oracle FAQ Your Portal to the Oracle Knowledge Grid
HOME | ASK QUESTION | ADD INFO | SEARCH | E-MAIL US
 

Home -> Community -> Usenet -> c.d.o.server -> Re: Oracle Text, PDF files

Re: Oracle Text, PDF files

From: jelena27 <member29652_at_dbforums.com>
Date: Wed, 14 May 2003 17:08:16 +0000
Message-ID: <2878529.1052932096@dbforums.com>

Hi,
I use oracle 9.2.0.3 on solaris, and i'm trying to index 100GB of data, and i tried first normal way, so use sync_index. It was terrible, on some bigger files, specially excel and pdf, ctxhx was just hanging taking 100%cpu. Now, i choose to index it externally, somehow synchronise cron job and database, and run 2-step indexing First one is unix script with all ctxhx what to_what... and second part is indexing htmls in database (that's fast) Problem was first with pdf's, then i found other parser for pdf's, xpdf and now it works fast. So, now the problem is that ctxhx comes from time to time, to index huge excel file, and hangs. Now, in the help of ctxhx it's listed that you can give timeout, but i can't make it run!!!!
Did anyone manage to make ctxhx exit if it doesn't index the file???

Or did you find any other solution for that problem???

Any help is very wellcome!!!
Thanks,
Jelena

--
Posted via http://dbforums.com
Received on Wed May 14 2003 - 12:08:16 CDT

Original text of this message

HOME | ASK QUESTION | ADD INFO | SEARCH | E-MAIL US