Stemming & Hello!
Daniël de Kok
me at danieldk.eu
Fri Dec 28 07:31:14 GMT 2012
On Dec 28, 2012, at 4:59 AM, Mark Wotton <mwotton at gmail.com> wrote:
> oh, also: I've been playing around with a suggester using levenshtein
> distance to many possible target strings. This isn't the fastest thing
> in the world: is there a better algorithm in the literature?
If you are trying to find words at a small edit-distance, you might want to look at Burkhard-Keller trees:
http://blog.notdot.net/2007/4/Damn-Cool-Algorithms-Part-1-BK-Trees
http://www.kafsemo.org/2010/08/03_bk-tree-performance-notes.html
http://hackage.haskell.org/package/bk-tree
-- Daniël
More information about the NLP
mailing list