Ann: Chatter - a simple library for language processing
    Rogan Creswick 
    creswick at gmail.com
       
    Mon Nov 18 21:53:42 GMT 2013
    
    
  
I've been working on a simple NLP library over the past month or two, and I
think it may finally be useful to others.  I would love to hear comments,
criticisms, contributions, etc... ;)
My main objective was to make basic NLP tasks in Haskell, such as POS
tagging and document similarity, extremely easy. Chunking, NER, and
co-reference resolution are planned for later.
The best example of this is part-of-speech tagging with Chatter:
{{{
$ cabal install chatter
$ ghci
> :m +NLP.POS
> t <- defaultTagger
> tagStr t "This is a test."
"This/dt is/bez a/at test/nn ./."
}}}
Chatter provides POS tagging (with backoff taggers, and a ~83% accurate
trained default tagger), TF-IDF measures, and cosine document similarity.
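For anyone unfamiliar with the TF-IDF and cosine similarity measures, here is a minimal, self-contained sketch of the idea. It is NOT Chatter's API: every name in it (tokenize, termFreq, invDocFreq, tfIdf, cosine) is hypothetical and chosen for illustration only. It assumes raw term counts for TF, idf(t) = log (N / df(t)), and a naive letters-only tokenizer over String instead of Text.

```haskell
import qualified Data.Map.Strict as M
import Data.Char (isAlpha, toLower)

-- Illustrative types only; Chatter's own types may differ.
type Doc = [String]
type Vec = M.Map String Double

-- Naive tokenizer (an assumption): lowercase, split on non-letters.
tokenize :: String -> Doc
tokenize = words . map (\c -> if isAlpha c then toLower c else ' ')

-- Raw term counts for a single document.
termFreq :: Doc -> Vec
termFreq doc = M.fromListWith (+) [(w, 1) | w <- doc]

-- idf(t) = log (N / df(t)), where df(t) is the number of
-- documents in the corpus that contain t at least once.
invDocFreq :: [Doc] -> Vec
invDocFreq docs = M.map (\df -> log (n / df)) docFreq
  where
    n = fromIntegral (length docs)
    docFreq = M.fromListWith (+)
      [(w, 1) | d <- docs, w <- M.keys (termFreq d)]

-- Weight each term count in a document by the term's idf.
-- Terms missing from the idf table get weight 0.
tfIdf :: Vec -> Doc -> Vec
tfIdf idf doc =
  M.mapWithKey (\w tf -> tf * M.findWithDefault 0 w idf) (termFreq doc)

-- Cosine of the angle between two sparse vectors; 0 if either is all zeros.
cosine :: Vec -> Vec -> Double
cosine a b
  | na == 0 || nb == 0 = 0
  | otherwise = dot / (na * nb)
  where
    dot = sum (M.elems (M.intersectionWith (*) a b))
    norm v = sqrt (sum [x * x | x <- M.elems v])
    na = norm a
    nb = norm b
```

To compare documents, build the idf table once from the whole corpus, map tfIdf over the tokenized documents, and call cosine on any pair of the resulting vectors. Documents with no terms in common score 0, and a document compared with itself scores 1 (up to floating-point error).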
It also currently contains an adapted version of the Tokenize library,
because I wanted to tokenize Text.  That's a short-term solution; I haven't
had time to make a patch to the tokenize lib.
Links:
 - Hackage: http://hackage.haskell.org/package/chatter-0.0.0.2
 - Github: http://github.com/creswick/chatter
--Rogan