maui_1.0.tar.gz maui_1.0_with_libs.tar.gz maui_1.1.tar.gz maui_1.1_with_libs.tar.gz
Summary
Maui automatically identifies main topics in text documents. Depending on the task, topics are tags, keywords, keyphrases, vocabulary terms, descriptors, index terms or titles of Wikipedia articles.
Maui performs the following tasks:
- keyphrase extraction,
- automatic tagging,
- term assignment with a controlled vocabulary,
- subject indexing,
- topic indexing with terms from Wikipedia.
It can also be used for terminology extraction and semi-automatic topic indexing.
Read more on Download, Installation and Usage pages.
Domain and language independence
Maui has been successfully tested on computer science, agricultural, medicine, physics, biology, bioinformatics documents, as well as on blog posts and news articles.
It supplies stemmers and stopwords for English, French and Spanish, but can be extended to work in many other languages, including languages that require special encoding.
Examples are provided in Maui's Wiki pages
Background
Maui has been developed by Olena Medelyan as a part of her PhD project, under supervision of Ian H. Witten and Eibe Frank in the Department of Computer Science at the University of Waikato, New Zealand. The PhD was sponsored by a research grant from Google.
Maui builds on the keyphrase extraction algorithm Kea, but provides additional functionalities: it allows the assignment of topics to documents based on terms from Wikipedia using Wikipedia Miner. Maui also has many new features that help identify topics more accurately.
Read more about how Maui works in the Wiki pages InsideMaui and in publications about Maui.