Ann: Chatter - a simple library for language processing
Rogan Creswick
creswick at gmail.com
Mon Nov 18 21:53:42 GMT 2013
I've been working on a simple NLP library over the past month or two, and I
think it may finally be useful to others. I would love to hear comments,
criticisms, contributions, etc... ;)
My main objective was to make it extremely easy to do basic NLP tasks in
Haskell, such as POS tagging and document similarity. (and later, Chunking,
NER, co-ref resolution, etc...).
The best example of this is Part-of-speech tagging with Chatter:
{{{
cabal install chatter
ghci
> :m +NLP.POS
> t <- defaultTagger
> tagStr t "This is a test."
"This/dt is/bez a/at test/nn ./."
}}}
Chatter provides POS tagging (with backoff taggers, and a ~83% accurate
trained default tagger), TF-IDF measures, and cosine document similarity.
It also currently contains an adapted version of the Tokenize library,
because I wanted to tokenize Text. That's a short-term solution; I haven't
had time to make a patch to the tokenize lib.
Links:
- Hackage: http://hackage.haskell.org/package/chatter-0.0.0.2
- Github: http://github.com/creswick/chatter
--Rogan
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://projects.haskell.org/pipermail/nlp/attachments/20131118/382f21a5/attachment.htm>
More information about the NLP
mailing list