08 November 2006
Algorithmes vectoriels et bioinformatique
Doctoral thesis by Sylvie Hamel on approximate string matching.
22 September 2006
Tame the Beast by Matching Similar Strings
I described the algorithms in two classes: equivalence methods and similarity ranking methods. Equivalence methods return a Boolean result, whereas similarity ranking methods return a numeric similarity measure or distance metric. In information retrieval systems, the two kinds of method can be mixed to produce a faster hybrid approach. A typical approach is a two-pass mechanism: the database uses an equivalence method as a first-pass filter, and a ranked similarity method is then applied to the filtered entries in the second pass. Ranked similarity methods tend to be algorithmically more complex than equivalence methods, so they are usually implemented as custom code outside of the database.
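The two-pass mechanism described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the article's own code: `skeleton_key` is a hypothetical equivalence key invented for this example (it is not Soundex or any other standard phonetic code), the candidate list stands in for database rows, and the second pass uses the ratio from Python's standard `difflib.SequenceMatcher` as the ranked similarity measure.

```python
from difflib import SequenceMatcher


def skeleton_key(name: str) -> str:
    """Hypothetical equivalence key (an assumption for this sketch):
    the first letter plus later consonants, lowercased, with adjacent
    repeats collapsed."""
    letters = [c for c in name.lower() if c.isalpha()]
    if not letters:
        return ""
    key = [letters[0]]
    for c in letters[1:]:
        if c in "aeiouyhw":  # drop vowels and vowel-like letters
            continue
        if c != key[-1]:  # collapse adjacent repeats
            key.append(c)
    return "".join(key)


def is_equivalent(a: str, b: str) -> bool:
    """Equivalence method: returns a Boolean result."""
    return skeleton_key(a) == skeleton_key(b)


def similarity(a: str, b: str) -> float:
    """Similarity ranking method: returns a score between 0.0 and 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def two_pass_search(query: str, candidates: list[str]) -> list[tuple[float, str]]:
    # Pass 1: cheap Boolean filter (in practice this could be an indexed
    # key comparison done by the database).
    filtered = [c for c in candidates if is_equivalent(query, c)]
    # Pass 2: more expensive ranking, applied only to the survivors.
    scored = [(similarity(query, c), c) for c in filtered]
    return sorted(scored, key=lambda pair: (-pair[0], pair[1]))


if __name__ == "__main__":
    names = ["Hammel", "Hamill", "Holmes", "Hamlin", "Hamel"]
    for score, name in two_pass_search("Hamel", names):
        print(f"{score:.3f}  {name}")
```

With this sample list, "Holmes" and "Hamlin" are dropped by the first pass because their keys differ from the query's key, so only the remaining names are scored. In a real system the key would typically be precomputed and stored in an indexed column, which is what makes the first pass cheap.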
A Guided Tour to Approximate String Matching - Navarro (ResearchIndex)
We survey the current techniques to cope with the problem of string matching allowing errors. This is becoming a more and more relevant issue for many fast growing areas such as information retrieval and computational biology. We focus on online searching and mostly on edit distance, explaining the problem and its relevance, its statistical behavior, its history and current developments, and the central ideas of the algorithms and their complexities.
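As a rough illustration of what online searching under edit distance means, here is a minimal sketch of the classical dynamic-programming approach. It is an assumption-laden example rather than code from the survey: the function names are invented here, every insertion, deletion, and substitution costs 1, and none of the faster techniques the survey covers are attempted. The only difference between the two functions is the initial value of each column: starting every text column at 0 lets a match begin at any text position.

```python
def edit_distance(a: str, b: str) -> int:
    """Unit-cost edit distance between two whole strings."""
    prev = list(range(len(b) + 1))  # distance from "" to each prefix of b
    for i, ca in enumerate(a, 1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def search_with_errors(pattern: str, text: str, k: int) -> list[tuple[int, int]]:
    """Scan the text once, left to right. Return (end_position, distance)
    for every 1-based text position where some substring ending there is
    within k edits of the pattern."""
    m = len(pattern)
    col = list(range(m + 1))  # column for the empty text prefix
    hits = []
    for pos, tc in enumerate(text, 1):
        new = [0]  # zero cost: a match may start anywhere in the text
        for i in range(1, m + 1):
            cost = 0 if pattern[i - 1] == tc else 1
            new.append(min(col[i] + 1,
                           new[i - 1] + 1,
                           col[i - 1] + cost))
        col = new
        if col[m] <= k:
            hits.append((pos, col[m]))
    return hits


if __name__ == "__main__":
    print(edit_distance("kitten", "sitting"))
    print(search_with_errors("abc", "xxabcxx", 1))
```

The search keeps only one column of the table at a time, so memory use is proportional to the pattern length while time is proportional to pattern length times text length.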
20 September 2006
Indexed Approximate String Searching
A good presentation on approximate string searching (references, algorithms, ...)