Re: Oracle Text / Office 2007 question / 10gR2
Date: Mon, 17 Mar 2008 16:48:12 -0700 (PDT)
On Mar 17, 4:24 pm, Frank van Bortel <frank.van.bor..._at_gmail.com> wrote:
> BicycleRepairman wrote:
> > Ouch! You're right there, being skewered on the horns of a dilemma.
> > Oracle licenses the filters: in 10g and previous using Stellent
> > (INSO), and using Autonomy (Verity) in 11g.
> There was a switch between 10G Rel1 and Rel2.
> [snip!]
> > So, if I was in your shoes, I'd either use automation to convert
> > the .docx to PDF, then insert the PDF, or use automation to crack open
> > the .docx file (actually, a zipped set of XML files) and insert/index
> > the document.xml file. The first path is easy and not complicated, but
> > you'll need Word 2007 to perform its magic. The second path is a lot
> > more risky, but it would be interesting to see how difficult it is to
> > handle. In terms of being able to index/search for keywords, it might
> > not be that bad at all.
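For what it's worth, the "crack it open" path can be sketched in a few lines. This is only a rough illustration, assuming Python is available for the automation step; the function name docx_to_text is made up for the example. It relies on the fact that a .docx is a zip archive whose main body lives in word/document.xml, with the visible text in w:t elements. Loading the result into the table that Oracle Text indexes is left out.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by the elements inside word/document.xml.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_to_text(source):
    """Return the plain text of a .docx, one line per paragraph.

    `source` is a file path or a binary file-like object.
    Hypothetical helper for illustration; headers, footers, footnotes
    and comments live in other parts of the archive and are ignored.
    """
    # A .docx is a zip archive; the main body is word/document.xml.
    with zipfile.ZipFile(source) as archive:
        xml_bytes = archive.read("word/document.xml")

    root = ET.fromstring(xml_bytes)
    paragraphs = []
    # Each w:p is a paragraph; its text is split across w:t runs.
    for para in root.iter(f"{{{W_NS}}}p"):
        runs = [node.text or "" for node in para.iter(f"{{{W_NS}}}t")]
        paragraphs.append("".join(runs))
    return "\n".join(paragraphs)
```

Calling docx_to_text("report.docx") (the file name is a placeholder) gives back a plain string that could be inserted into a CLOB column and indexed like any other text. Alternatively, the raw document.xml could be stored as-is, at the cost of indexing the markup along with the words.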
> You do realize, PDF's are indexed without problem?
> I fail to see the benefit of "cracking open" the xml, once you
> have the PDF
> Frank van Bortel
> Top-posting in UseNet newsgroups is one way to shut me up
As I said, EITHER PDF OR cracking open the XML. I think I even laid out some reasonable rationale for why you might choose one over the other. As opposed to, say, telling someone to start over with a new database.
Received on Mon Mar 17 2008 - 18:48:12 CDT