public marks

PUBLIC MARKS from parmentierf with tags "moteur de recherche" & algorithme

22 September 2006 08:00

Tame the Beast by Matching Similar Strings

I described the algorithms in two classes: equivalence methods and similarity ranking methods. Equivalence methods return a Boolean result, whereas the similarity ranking methods return a numeric similarity measure or distance metric. In information retrieval systems, it is possible to mix methods to produce a faster hybrid approach. A typical approach is to employ a two-pass mechanism in which an equivalence method is used by the database as a first pass filter, and a ranked similarity method is applied to the filtered entries for the second pass. Ranked similarity methods tend to be algorithmically more complex than equivalence methods, so are usually implemented as custom code outside of the database.

20 September 2006 12:00

Indexed Approximate String Searching

by 1 other
Une bonne présentation sur la recherche approximative de chaînes (références, algos, ...)