public marks

PUBLIC MARKS with tags c & "string matching"

December 2006

JoshDrew.com

by parmentierf
Last month, I needed to use the metaphone and edit (Levenshtein) distance algorithms for a fuzzy search of a MySQL table. Of course, neither is available as a built-in MySQL function. So, I had to install them as UDFs. The MySQL source distribution includes a metaphone UDF function in udf_example.cc. However, I couldn't find a Levenshtein UDF anywhere, so I wrote one, by converting a C implementation by Lorenzo Seidenari. I suspect that other people could benefit from this code, and you can download it from joshdrew.com. I compared the function's output to that of the PHP levensthein() function for a couple million word pairs; the results agreed completely - that's good enough for me. (this code comes with no warranty whatsoever, but I really hope you find it useful)

September 2006

SourceForge.net: The bitap / agrep library

by parmentierf & 1 other
The bitap library is a clean implementation of regular expression (regex/grep) string matching using the bitap algorithm. Approximate (a.k.a. fuzzy) matching is allowed.

PUBLIC TAGS related to tag c

cinema +   circus +   cirque +   dafoe +   de +   etrange +   film +   Freak +   Hayek +   john +   l +   le +   of +   paul +   reilly +   Salma +   vampire +   weitz +   Willem +  

Active users

parmentierf
last mark : 21/12/2006 08:10