My favorites | Sign in
Logo
                
People details
Project owners:
  medelyan
Project committers:
jose.aguera

Summary

Maui automatically identifies main topics in text documents. Depending on the task, topics are tags, keywords, keyphrases, vocabulary terms, descriptors, index terms or titles of Wikipedia articles.

Maui performs the following tasks:

It can also be used for terminology extraction and semi-automatic topic indexing.

Read more on Download, Installation and Usage pages.

Domain and language independence

Maui has been successfully tested on computer science, agricultural, medicine, physics, biology, bioinformatics documents, as well as on blog posts and news articles.

It supplies stemmers and stopwords for English, French and Spanish, but can be extended to work in many other languages, including languages that require special encoding.

Examples are provided in Maui's Wiki pages

Background

Maui has been developed by Olena Medelyan as a part of her PhD project, under supervision of Ian H. Witten and Eibe Frank in the Department of Computer Science at the University of Waikato, New Zealand. The PhD was sponsored by a research grant from Google.

Maui builds on the keyphrase extraction algorithm Kea, but provides additional functionalities: it allows the assignment of topics to documents based on terms from Wikipedia using Wikipedia Miner. Maui also has many new features that help identify topics more accurately.

Read more about how Maui works in the Wiki pages InsideMaui and in publications about Maui.









Hosted by Google Code