Intermedia Text Issue

From: Ronald Mohr <mail_at_ronald-mohr.de>
Date: Mon, 23 Jul 2001 22:44:35 +0200
Message-ID: <9ji2ep$opc$1_at_news.online.de>


Hy all
I´m having a hard time figuring out problems with Intermedia Text. I'd like to have all my Oracle-Dokuments ( *.doc / *.pdf ) stored in file-system be indexed. Language is mixed : german, english Table holds title, format ( binary or text ), language, url-location. Problem is : everything works fine with my german docs and pdfs, but english doku from otn gives no query result after indexing, although index really grew a lot after loading those pdfs. ctx_user_index_errors recorded some few errors saying :

    ORA_DOKU_IDX                   22.07.01     AAAGJoAAFAAAAKjAA6
    DRG-11207: user filter command exited with status 1 ( guess it means inso_filter )

nls is left on default american.america. Oracle 8.1.7 on Suse 7.0. As discribed in Intermedia Text Reference I tried using Multi_Lexer, Inso_filter and url_datastore. Maybe someone could point me in the right direction. Thanks in advance.
Ronald

Here's the whole thing :

-----------sorry abut the size, maybe its good for

info -----------------------------------------------------------
begin
  Ctx_Ddl.Create_Preference ( 'my_datastore', 'URL_DATASTORE' );
  Ctx_Ddl.Set_Attribute     ( 'my_datastore', 'URLSIZE', '512');
  Ctx_Ddl.Set_Attribute     ( 'my_datastore', 'MAXDOCSIZE', '8388608');
  Ctx_Ddl.Set_Attribute     ( 'my_datastore', 'TIMEOUT', '15');
end;
/
begin
  Ctx_Ddl.Create_Preference ( 'my_filter', 'INSO_FILTER' ); end;
/
begin
  Ctx_Ddl.Create_Preference ( 'english_lexer', 'basic_lexer' );
  Ctx_Ddl.Set_Attribute     ( 'english_lexer', 'index_text',   'YES'  );
  Ctx_Ddl.Set_Attribute     ( 'english_lexer', 'index_themes', 'YES' );
  Ctx_Ddl.Set_Attribute     ( 'english_lexer', 'theme_language',
'english' );
  Ctx_Ddl.Set_Attribute     ( 'english_lexer', 'PRINTJOINS', '_-' );

  Ctx_Ddl.Create_Preference ( 'german_lexer', 'basic_lexer' );
  Ctx_Ddl.Set_Attribute     ( 'german_lexer', 'index_text',   'YES'  );
  Ctx_Ddl.Set_Attribute     ( 'german_lexer', 'composite', 'german' );
  Ctx_Ddl.Set_Attribute     ( 'german_lexer', 'mixed_case', 'yes' );
  Ctx_Ddl.Set_Attribute     ( 'german_lexer', 'alternate_spelling',
'german' );
  Ctx_Ddl.Set_Attribute     ( 'german_lexer', 'PRINTJOINS', '_-' );

  Ctx_Ddl.Create_Preference ( 'global_lexer', 'multi_lexer' );
  Ctx_Ddl.add_sub_lexer ( 'global_lexer', 'default', 'german_lexer' );   Ctx_Ddl.add_sub_lexer ( 'global_lexer', 'english', 'english_lexer', 'en' );
end;
/
begin
  Ctx_Ddl.Create_Preference ( 'my_wordlist', 'basic_wordlist'         );
  Ctx_Ddl.Set_Attribute     ( 'my_wordlist', 'substring_index', 'true');
end;
/
begin

   ctx_ddl.create_preference('mystore', 'BASIC_STORAGE');    ctx_ddl.set_attribute('mystore', 'I_TABLE_CLAUSE',
'tablespace indx storage (initial 512K next 512K)');
   ctx_ddl.set_attribute('mystore', 'K_TABLE_CLAUSE',
'tablespace indx storage (initial 512K next 512K)');
   ctx_ddl.set_attribute('mystore', 'R_TABLE_CLAUSE',
'tablespace indx storage (initial 512K next 512K)');
   ctx_ddl.set_attribute('mystore', 'N_TABLE_CLAUSE',
'tablespace indx storage (initial 512K next 512K)');
   ctx_ddl.set_attribute('mystore', 'I_INDEX_CLAUSE',
'tablespace indx storage (initial 512K next 512K)');
   ctx_ddl.set_attribute('mystore', 'P_TABLE_CLAUSE',
'tablespace indx storage (initial 512K next 512K)');
end;
/
create index ora_doku_idx on oracle8i_doku ( file_data )

    indextype is ctxsys.context
    parameters ('datastore my_datastore
   filter my_filter format column frmt
   lexer global_lexer language column lang    wordlist my_wordlist
   storage mystore
   stoplist ctxsys.empty_stoplist' ); Received on Mon Jul 23 2001 - 22:44:35 CEST

Original text of this message