Intermedia Text Issue
Date: Mon, 23 Jul 2001 22:44:35 +0200
Message-ID: <9ji2ep$opc$1_at_news.online.de>
Hy all
I´m having a hard time figuring out problems with Intermedia Text. I'd like
to have all my Oracle-Dokuments ( *.doc / *.pdf ) stored in file-system be
indexed. Language is mixed : german, english
Table holds title, format ( binary or text ), language, url-location.
Problem is : everything works fine with my german docs and pdfs, but english
doku from otn gives no query result after indexing, although index really
grew a lot after loading those pdfs. ctx_user_index_errors recorded some few
errors saying :
ORA_DOKU_IDX 22.07.01 AAAGJoAAFAAAAKjAA6DRG-11207: user filter command exited with status 1 ( guess it means inso_filter )
nls is left on default american.america. Oracle 8.1.7 on Suse 7.0. As
discribed in Intermedia Text Reference I tried using Multi_Lexer,
Inso_filter and url_datastore. Maybe someone could point me in the right
direction. Thanks in advance.
Ronald
Here's the whole thing :
-----------sorry abut the size, maybe its good for
info -----------------------------------------------------------begin
Ctx_Ddl.Create_Preference ( 'my_datastore', 'URL_DATASTORE' ); Ctx_Ddl.Set_Attribute ( 'my_datastore', 'URLSIZE', '512'); Ctx_Ddl.Set_Attribute ( 'my_datastore', 'MAXDOCSIZE', '8388608'); Ctx_Ddl.Set_Attribute ( 'my_datastore', 'TIMEOUT', '15');end;
/
begin
Ctx_Ddl.Create_Preference ( 'my_filter', 'INSO_FILTER' ); end;
/
begin
Ctx_Ddl.Create_Preference ( 'english_lexer', 'basic_lexer' ); Ctx_Ddl.Set_Attribute ( 'english_lexer', 'index_text', 'YES' ); Ctx_Ddl.Set_Attribute ( 'english_lexer', 'index_themes', 'YES' ); Ctx_Ddl.Set_Attribute ( 'english_lexer', 'theme_language', 'english' ); Ctx_Ddl.Set_Attribute ( 'english_lexer', 'PRINTJOINS', '_-' ); Ctx_Ddl.Create_Preference ( 'german_lexer', 'basic_lexer' ); Ctx_Ddl.Set_Attribute ( 'german_lexer', 'index_text', 'YES' ); Ctx_Ddl.Set_Attribute ( 'german_lexer', 'composite', 'german' ); Ctx_Ddl.Set_Attribute ( 'german_lexer', 'mixed_case', 'yes' ); Ctx_Ddl.Set_Attribute ( 'german_lexer', 'alternate_spelling', 'german' ); Ctx_Ddl.Set_Attribute ( 'german_lexer', 'PRINTJOINS', '_-' ); Ctx_Ddl.Create_Preference ( 'global_lexer', 'multi_lexer' );Ctx_Ddl.add_sub_lexer ( 'global_lexer', 'default', 'german_lexer' ); Ctx_Ddl.add_sub_lexer ( 'global_lexer', 'english', 'english_lexer', 'en' );
end;
/
begin
Ctx_Ddl.Create_Preference ( 'my_wordlist', 'basic_wordlist' ); Ctx_Ddl.Set_Attribute ( 'my_wordlist', 'substring_index', 'true');end;
/
begin
ctx_ddl.create_preference('mystore', 'BASIC_STORAGE');
ctx_ddl.set_attribute('mystore', 'I_TABLE_CLAUSE',
'tablespace indx storage (initial 512K next 512K)');
ctx_ddl.set_attribute('mystore', 'K_TABLE_CLAUSE',
'tablespace indx storage (initial 512K next 512K)');
ctx_ddl.set_attribute('mystore', 'R_TABLE_CLAUSE',
'tablespace indx storage (initial 512K next 512K)');
ctx_ddl.set_attribute('mystore', 'N_TABLE_CLAUSE',
'tablespace indx storage (initial 512K next 512K)');
ctx_ddl.set_attribute('mystore', 'I_INDEX_CLAUSE',
'tablespace indx storage (initial 512K next 512K)');
ctx_ddl.set_attribute('mystore', 'P_TABLE_CLAUSE',
'tablespace indx storage (initial 512K next 512K)');
end;
/
create index ora_doku_idx on oracle8i_doku ( file_data )
indextype is ctxsys.context
parameters ('datastore my_datastore
filter my_filter format column frmt
lexer global_lexer language column lang
wordlist my_wordlist
storage mystore
stoplist ctxsys.empty_stoplist' );
Received on Mon Jul 23 2001 - 22:44:35 CEST