
lingua-en-semtags-engine
Lingua-EN-Semtags-Engine uses Lingua-EN-Tagger and Wordnet-Querydata to extract semantic tags (semtags) from English text. Semtags are words which reflect the semantic essence of the text (similar to topic keywords).
Lingua-EN-Semtags-Engine was designed and developed to solve a particular problem I was facing.
Problem: a user is processing blog post titles and needs to programmatically determine the posts' semantic context.
Solution: the user feeds a blog post title to Lingua-EN-Semtags and gets back a set of semtags which can be used for further processing (e.g., web searches).
Example: a blog post title like "BBtv: Graffiti Research Lab, the movie" (boingboing.net, Posted by Xeni Jardin, April 24, 2008 8:00 AM) would produce the following semtags: DECORATION WORKPLACE SHOW.
Project Information
- License: GNU GPL v3
- svn-based source control