Pattern is a web mining module for the Python programming language.

It bundles tools for data retrieval (Google, Twitter, and Wikipedia APIs, a web spider, an HTML DOM parser), text analysis (a rule-based shallow parser, a WordNet interface, a syntactic and semantic n-gram search algorithm, and tf-idf, cosine similarity, and LSA metrics), and data visualization (graph networks). The module ships with more than 30 example scripts.
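To make the tf-idf and cosine similarity metrics mentioned above concrete, here is a minimal sketch in plain Python using only the standard library. It is not Pattern's own API: the function names `tf_idf_vectors` and `cosine_similarity`, the whitespace tokenizer, and the unsmoothed `log(N / df)` weighting are all assumptions chosen for illustration, and Pattern's implementation may differ.

```python
import math
from collections import Counter


def tf_idf_vectors(documents):
    """Return one {term: weight} dict per document (hypothetical helper).

    Assumes naive lowercase whitespace tokenization and the plain
    idf = log(N / df) weighting, with no smoothing.
    """
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(v1, v2):
    """Cosine of the angle between two sparse {term: weight} vectors."""
    dot = sum(weight * v2.get(term, 0.0) for term, weight in v1.items())
    norm1 = math.sqrt(sum(w * w for w in v1.values()))
    norm2 = math.sqrt(sum(w * w for w in v2.values()))
    if norm1 == 0.0 or norm2 == 0.0:
        return 0.0  # an all-zero vector is treated as similar to nothing
    return dot / (norm1 * norm2)


# Example corpus (made up for this sketch).
docs = [
    "the cat sat on the mat",
    "the cat lay on the rug",
    "stock markets fell sharply today",
]
vectors = tf_idf_vectors(docs)
print(cosine_similarity(vectors[0], vectors[1]))  # shares several terms
print(cosine_similarity(vectors[0], vectors[2]))  # shares no terms
```

Documents that share terms score above zero, and documents with no terms in common score exactly zero. A term that appears in every document gets weight zero under this weighting, which is why common words contribute little to the similarity.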

http://www.clips.ua.ac.be/pages/pattern

The project has moved to GitHub: http://github.com/clips/pattern
