Oracle FAQ | Your Portal to the Oracle Knowledge Grid |
Home -> Community -> Usenet -> c.d.o.server -> Re: Oracle Text, PDF files
Hi,
I use oracle 9.2.0.3 on solaris, and i'm trying to index 100GB of data,
and i tried first normal way, so use sync_index. It was terrible, on
some bigger files, specially excel and pdf, ctxhx was just hanging
taking 100%cpu. Now, i choose to index it externally, somehow
synchronise cron job and database, and run 2-step indexing
First one is unix script with all ctxhx what to_what...
and second part is indexing htmls in database (that's fast)
Problem was first with pdf's, then i found other parser for pdf's, xpdf
and now it works fast. So, now the problem is that ctxhx comes from time
to time, to index huge excel file, and hangs.
Now, in the help of ctxhx it's listed that you can give timeout, but i
can't make it run!!!!
Did anyone manage to make ctxhx exit if it doesn't index the file???
Or did you find any other solution for that problem???
Any help is very wellcome!!!
Thanks,
Jelena
-- Posted via http://dbforums.comReceived on Wed May 14 2003 - 12:08:16 CDT