Indexing XML & Binary Files with interMedia Text

From: Andy Bowes <abowes_at_silverstream.com>
Date: 30 Aug 2001 09:14:39 -0700
Message-ID: <d3e361c9.0108300814.5b2f30ec_at_posting.google.com>


Hi,

We have a single table with the following structure:

CREATE TABLE cmdoccontents
(
  docid        CHAR(32) NOT NULL,
  uuid         CHAR(32) NOT NULL,
  mimetype     VARCHAR2(80),
  contentdata  LONG RAW,
  contentsize  NUMBER(9),
  versionid    NUMBER(9)
);

This table will contain both XML documents and Word/PDF documents.

I have tried adding an interMedia Text index on the contentdata column as follows:

CREATE INDEX CTSYST.FTS_DOCCONTENTS
  ON SILVERPORTAL.CMDOCCONTENTS(CONTENTDATA)
  INDEXTYPE IS CTXSYS.CONTEXT
  PARAMETERS('FILTER CTXSYS.INSO_FILTER');

This lets us find the PDF and Word documents that contain the keywords, but the XML documents are not returned.
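
For reference, a query of roughly the following shape is the kind that exercises this index. It is only a sketch: the search term 'keyword' is a placeholder, and the selected columns are chosen for illustration.

-- Full-text lookup against the CONTEXT index on contentdata.
-- 'keyword' is a placeholder search term, not a value from our application.
SELECT docid, mimetype
  FROM silverportal.cmdoccontents
 WHERE CONTAINS(contentdata, 'keyword') > 0;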

Is it possible to add a single index on this table that will let us find keywords inside specific XML tags (if the record contains XML) and within binary documents (if the record is PDF, Word, etc.)?

Thanks in advance,

Andy

Received on Thu Aug 30 2001 - 18:14:39 CEST
