Oracle FAQ Your Portal to the Oracle Knowledge Grid
HOME | ASK QUESTION | ADD INFO | SEARCH | E-MAIL US
 


Re: slightly OT - cleaning up "dirty" keys?

From: bugbear <bugbear_at_trim_papermule.co.uk_trim>
Date: Thu, 02 Mar 2006 10:12:50 +0000
Message-ID: <4406c522$0$6978$ed2619ec@ptn-nntp-reader02.plus.net>


sybrandb_at_yahoo.com wrote:
> Try the SOUNDEX function available in Oracle on your data.
> It is heavily English-oriented, but that doesn't seem to be a problem
> in your case.
> SOUNDEX will provide the 'phonetic' representation of a name.

That would be a useful "distance" measure.

It would let me match user input against the dirty data, but it doesn't help me cluster (and re-normalise) the existing cruddy data.
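To make the matching half concrete, here is a rough sketch in Python of the classic Soundex rules as usually described (keep the first letter, code the remaining consonants, collapse adjacent duplicates, pad to four characters). It is not Oracle's implementation, and edge cases may differ from what SOUNDEX returns in the database. The names `soundex`, `build_index`, `match`, and the sample keys are all made up for illustration.

```python
from collections import defaultdict

# Letter-to-digit table from the classic Soundex description.
_CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for ch in letters:
        _CODES[ch] = digit


def soundex(name):
    """Return a 4-character Soundex code, or '' if name has no ASCII letters."""
    letters = [c for c in name.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    code = letters[0]
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters that share a digit
        digit = _CODES.get(ch, "")
        if digit and digit != prev:
            code += digit
        prev = digit  # a vowel clears prev, so the same digit after a vowel is kept
    return (code + "000")[:4]


def build_index(keys):
    """Bucket the existing (dirty) keys by their Soundex code."""
    index = defaultdict(list)
    for key in keys:
        index[soundex(key)].append(key)
    return index


def match(user_input, index):
    """Return the stored keys whose code equals that of the user input."""
    return index.get(soundex(user_input), [])


# Hypothetical sample data, not taken from any real table.
dirty_keys = ["Robert", "Rupert", "Robbert", "Smith", "Smyth", "Smithe"]
index = build_index(dirty_keys)
candidates = match("Robbart", index)
```

With this sample, a misspelt input such as "Robbart" lands in the same bucket as "Robert" and "Robbert", which is the matching behaviour wanted. "Rupert" lands there too, so the buckets are coarse: equal codes suggest candidates, but they do not say which spelling is the canonical one.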

   BugBear

Received on Thu Mar 02 2006 - 04:12:50 CST

