Oracle FAQ Your Portal to the Oracle Knowledge Grid
HOME | ASK QUESTION | ADD INFO | SEARCH | E-MAIL US
 

Home -> Community -> Mailing Lists -> Oracle-L -> Re: Removing NUMGROUP from lexer in ConTerMedText index

Re: Removing NUMGROUP from lexer in ConTerMedText index

From: Hemant K Chitale <hkchital_at_singnet.com.sg>
Date: Thu, 30 Oct 2003 07:24:25 -0800
Message-ID: <F001.005D5149.20031030072425@fatcity.com>

I can't provide a solution.
But I do agree with your nomenclature
: "Context/Intermedia/Text/whateverTheHell" and "ConTerMedText".

I have [or rather HAD] Intermedia/Text setup in an 9iFS 1.1.9 and 9iFS 9.0.1 environment. The IFS_TEXT index was successful for not very many files. We've given up doing searches on IFS_TEXT ! We still do have Intermedia/Text in Oracle Portal 3.0.9 but I doubt if users user
that either.

Hemant

At 02:44 PM 29-10-03 -0800, you wrote:
>Hey all,
>
>I've setup a Context/Intermedia/Text/whateverTheHell index on 8.1.7.4 on
>HP/UX to index about 250000 description fields in order for our users to
>search on them. This was two years ago, and now someone has discovered at
>least one issue.
>
>One description contains something like:
>
> BLEAH,120,1/4W
>
>Using the default lexer, this stupidly parses into tokens of "BLEAH",
>"120,1" and "4W" instead of "BLEAH", "120", and "1/4W" (or even "1" and
>"4W"). I think this is because of the default NUMGROUP for US languages,
>which is a comma (","). So when a user looks for "120 AND 1/4W", this
>description is missed because "120" isn't a valid token with the default
>lexer.
>
>There can be numerous other issues with NUMGROUP when lexing a
>free-formatted description, so I really don't want a NUMGROUP. I tried
>setting it to null using:
>
> ctx_ddl.set_attribute('MYLEXER','NUMGROUP','');
>
>..but this bombs with:
>
> ORA-20000: interMedia Text error:
> DRG-10705: invalid value NULL for attribute NUMGROUP
>
>Other than trying to find some char that will work with 250K rows, is there
>a way to turn this off? The thing that gets me is that "120,1" isn't even a
>proper number, but ConTerMedText thinks it is and tokenizes it.
>
>TIA,
>Rich
>
>Rich Jesse System/Database Administrator
>rjesse_at_qtiworld.com Quad/Tech Inc, Sussex, WI USA
>--
>Please see the official ORACLE-L FAQ: http://www.orafaq.net
>--
>Author: Jesse, Rich
> INET: Rich.Jesse_at_qtiworld.com
>
>Fat City Network Services -- 858-538-5051 http://www.fatcity.com
>San Diego, California -- Mailing list and web hosting services
>---------------------------------------------------------------------
>To REMOVE yourself from this mailing list, send an E-Mail message
>to: ListGuru_at_fatcity.com (note EXACT spelling of 'ListGuru') and in
>the message BODY, include a line containing: UNSUB ORACLE-L
>(or the name of mailing list you want to be removed from). You may
>also send the HELP command for other information (like subscribing).

Hemant K Chitale
Oracle 9i Database Administrator Certified Professional My personal web site is : http://hkchital.tripod.com

-- 
Please see the official ORACLE-L FAQ: http://www.orafaq.net
-- 
Author: Hemant K Chitale
  INET: hkchital_at_singnet.com.sg

Fat City Network Services    -- 858-538-5051 http://www.fatcity.com
San Diego, California        -- Mailing list and web hosting services
---------------------------------------------------------------------
To REMOVE yourself from this mailing list, send an E-Mail message
to: ListGuru_at_fatcity.com (note EXACT spelling of 'ListGuru') and in
the message BODY, include a line containing: UNSUB ORACLE-L
(or the name of mailing list you want to be removed from).  You may
also send the HELP command for other information (like subscribing).
Received on Thu Oct 30 2003 - 09:24:25 CST

Original text of this message

HOME | ASK QUESTION | ADD INFO | SEARCH | E-MAIL US