MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----_=_NextPart_001_01C83B6C.373664E0"
Subject: Various soundex algorithms implementations in 10g?
Date: Mon, 10 Dec 2007 15:35:34 -0500
Message-ID: <AA29A27627F842409E1D18FB19CDCF270E427B66@AABO-EXCHANGE02.bos.il.pqe>
From: "Bobak, Mark" <Mark.Bobak@il.proquest.com>
To: "oracle-l" <oracle-l@freelists.org>
------_=_NextPart_001_01C83B6C.373664E0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable

Hi,

I've just had a query from a developer about soundex algorithms in
Oracle 10gR2.  I know about the soundex() function, which is an
implementation of the Russell soundex (described by Knuth in TAOCP).
The developer seems to think that there are some gaps in the Oracle
implementation, though (I'm working on getting specifics of what he
means by gaps).  He's also asked about some other algorithms, namely
Daitch-Mokotoff and Double Metaphone.  Does anyone have any experience
with either of these on Oracle?  Does anyone have PL/SQL
implementations they'd be willing to share? :-)  For now, I think I'm
looking to code this in PL/SQL, and we won't be using any Oracle Text
features.
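
To pin down what "gaps" might mean, it can help to have a reference
outside the database to compare soundex() output against.  Below is a
minimal Python sketch of the Russell soundex rules as they are usually
stated (keep the first letter, code the remaining consonants 1-6,
collapse adjacent equal codes, let h and w pass through without
separating equal codes, pad to four characters).  The function name
and the handling of non-letters are my own assumptions, and this is
not claimed to match Oracle's implementation byte for byte.

```python
# Letter groups for codes 1..6; vowels and h, w, y carry no code.
_GROUPS = ("BFPV", "CGJKQSXZ", "DT", "L", "MN", "R")
_CODES = {ch: str(d) for d, grp in enumerate(_GROUPS, start=1) for ch in grp}


def soundex(name):
    """Return a 4-character soundex code, or '' if name has no A-Z letters."""
    # Assumption: anything outside A-Z (spaces, hyphens, accents) is ignored.
    letters = [c for c in name.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    out = [letters[0]]
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # h and w do not separate letters with the same code
        code = _CODES.get(ch, "")
        if code and code != prev:
            out.append(code)
        prev = code  # a vowel resets prev, so a repeated code is kept
    return ("".join(out) + "000")[:4]


if __name__ == "__main__":
    for n in ("Robert", "Rupert", "Ashcraft", "Tymczak"):
        print(n, soundex(n))
```

Running the same names through SELECT soundex(...) FROM dual and
diffing the two lists should show quickly whether any disagreement
comes from the h/w rule, from non-letter characters, or from
something else.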

Finally, these are US Census names that we'll be searching, and from
what I've read, there's an algorithm called Jaro-Winkler which was
specifically written for US Census name data, *and* Oracle has an
implementation of it in UTL_MATCH on 10gR2.  So, I've asked the
developer whether he's considered this algorithm.
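
For anyone who hasn't met Jaro-Winkler, here is a rough Python sketch
of the textbook definition: a Jaro score built from matching
characters and transpositions, then a bonus for a shared prefix.  The
prefix weight of 0.1 and the 4-character prefix cap are the commonly
quoted defaults and are assumptions here; I have not checked that
UTL_MATCH.JARO_WINKLER uses exactly the same constants or case
handling, so treat this as an illustration rather than a reference for
Oracle's results.

```python
def jaro(s1, s2):
    """Jaro similarity in [0, 1]."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Characters match only if they are within this distance of each other.
    window = max(max(len1, len2) // 2 - 1, 0)
    hit1 = [False] * len1
    hit2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        for j in range(max(0, i - window), min(len2, i + window + 1)):
            if not hit2[j] and s2[j] == ch:
                hit1[i] = hit2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Walk the matched characters of both strings in order; each
    # out-of-order pair counts as half a transposition.
    k = 0
    mismatched = 0
    for i in range(len1):
        if hit1[i]:
            while not hit2[k]:
                k += 1
            if s1[i] != s2[k]:
                mismatched += 1
            k += 1
    t = mismatched / 2
    return (matches / len1 + matches / len2 + (matches - t) / matches) / 3


def jaro_winkler(s1, s2, prefix_weight=0.1, max_prefix=4):
    """Jaro score boosted for a common prefix (assumed default constants)."""
    score = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1[:max_prefix], s2[:max_prefix]):
        if a != b:
            break
        prefix += 1
    return score + prefix * prefix_weight * (1 - score)


if __name__ == "__main__":
    print(round(jaro_winkler("MARTHA", "MARHTA"), 3))  # 0.961
    print(round(jaro_winkler("DWAYNE", "DUANE"), 3))   # 0.84
```

Note that this gives a similarity score for a pair of names, not a
key you can index on the way you can with soundex, so the two would be
used quite differently in a search.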

I guess I'm just looking for anyone with experience implementing any
of the above algorithms: any thoughts or things to look out for,
pointers to public implementations of any of them, etc.

-Mark

--
Mark J. Bobak
Senior Database Administrator, System & Product Technologies
ProQuest
789 E. Eisenhower Parkway, P.O. Box 1346
Ann Arbor MI 48106-1346
+1.734.997.4059  or +1.800.521.0600 x 4059
mark.bobak@il.proquest.com
www.proquest.com
www.csa.com

ProQuest...Start here.

------_=_NextPart_001_01C83B6C.373664E0--
--
http://www.freelists.org/webpage/oracle-l


