Oracle FAQ Your Portal to the Oracle Knowledge Grid
HOME | ASK QUESTION | ADD INFO | SEARCH | E-MAIL US
 

Home -> Community -> Usenet -> c.d.o.server -> Re: Inconsistent data within a column

Re: Inconsistent data within a column

From: Denise Williams <dwilliams_at_nospam.ntwrks.com>
Date: Wed, 11 Nov 1998 08:34:13 -0500
Message-ID: <36499255.D70A0C0D@nospam.ntwrks.com>


Richard,

If you can't normalize and have a separate table with keys for company name, there is a tool available from DataFlux (called the dfPower Series) which allows you to quickly standardize data within a column by building standardization schemes for any type of data, not just company names. And it is pretty easy to use because it is entirely point and click driven. The other thing I like about it is that it connects directly to the database via ODBC. It does other things like identify duplicate data via 'fuzzy logic' etc.

They provide a free 'data quality' diagnostic tool that allows you to determine the extent of data quality problems that you might have in the database based on your own data:

http://www.dataflux.com/dfroi.htm

And if that works for you then you can move up to the dfPower Series.

Hope this is helpful!

Denise

Richard Crawford wrote:

> Is there any way I can make data consistent within a column, or
> standardize data easily? I've got about 15 different ways a company name
> appears in a column, such as 'IBM', 'International Business
> Machines','I.B.M.', etc.. making queries and reporting very difficult,
> and since this data is collected via a form on the Internet there is no
> way to normalize it - people can enter data the way they want. Any easy
> solution?
>
> Thanks,
>
> Richard
Received on Wed Nov 11 1998 - 07:34:13 CST

Original text of this message

HOME | ASK QUESTION | ADD INFO | SEARCH | E-MAIL US