Java with NLP?

August 14, 2010

Jeff’s Search Engine Caffe: Java Open Source NLP and Text Mining Tools is a mother lode of open-source Java tools for natural language processing and text mining. Jeff is a PhD student at UMass Amherst’s prestigious Center for Intelligent Information Retrieval, and his blog is so well researched that it can serve as a reference point. Jeff’s site links to the interesting Apache Lucene Mahout project, which aims to build highly scalable machine learning libraries. Currently, Mahout focuses on recommendation mining, clustering, classification, and frequent itemset mining. The Mahout site welcomes contributors and aims to facilitate discussion of the project and its potential use cases. One of the most popular text classification frameworks is Weka, a collection of machine learning algorithms.

This site contains many useful links to incubator and implemented projects, and is worth a bookmark here in Harrod’s Creek.

Bret Quinn, August 14, 2010

Comments

One Response to “Java with NLP?”

  1. Theodore Monk on December 14th, 2010 6:52 am

    Thanks for this update to parts of Jeff Dalton’s list of tools.

    I want to learn Java so that I can modify open-source NLP code. Can you recommend helpful books, sites, or tutorials for beginners? (I have programmed in several languages.)
