Ann: Chatter - a simple library for language processing
Greg Matheson
drbean at freeshell.org
Fri Nov 22 03:12:05 GMT 2013
On Mon, 18 Nov 2013, Rogan Creswick wrote:
> I've been working on a simple NLP library over the past month or two, and I
> think it may finally be useful to others. I would love to hear comments,
> criticisms, contributions, etc... ;)
I am on Fedora 19, running haskell-platform-2012. I needed to
specify containers, text and bytestring versions to get it to
build. GHC.Generics was also in the hidden ghc-prim.
I submitted a pull request.
> My main objective was to make it extremely easy to do basic NLP tasks in
> Haskell, such as POS tagging and document similarity. (and later, Chunking,
> NER, co-ref resolution, etc...).
> The best example of this is Part-of-speech tagging with Chatter:
> {{{
> cabal install chatter
> ghci
> > :m +NLP.POS
> > t <- defaultTagger
> > tagStr t "This is a test."
> "This/dt is/bez a/at test/nn ./."
> }}}
> Chatter provides POS tagging (with backoff taggers, and a ~83% accurate
> trained default tagger), TF-IDF measures, and cosine document similarity.
> It also currently contains an adapted version of the Tokenize library,
> because I wanted to tokenize Text. That's a short-term solution; I haven't
> had time to make a patch to the tokenize lib.
> Links:
> - Hackage: http://hackage.haskell.org/package/chatter-0.0.0.2
> - Github: http://github.com/creswick/chatter
> --Rogan
> _______________________________________________
> NLP mailing list
> NLP at projects.haskell.org
> http://projects.haskell.org/cgi-bin/mailman/listinfo/nlp
More information about the NLP
mailing list