|
Project Information
Members
Featured
Downloads
Wiki pages
Links
|
OverviewOluolu is a open source query log mining tool which works on Hadoop. This tool provides resources to add new features to search engines. Concretely Oluolu supports automatic dictionary creation such as spelling correction, context queries or frequent query n-grams from query log data. The dictionaries are applied to search engines to add features such as 'did you mean' or 'related keyword suggestion' service in search engines. News2011-11-16 oluolu 0.2.1 released 2011-05-11 oluolu 0.2.0 released
2010-10-12 oluolu 0.1.4rc2 released
2010-06-09 oluolu 0.1.2 released
2010-04-26 oluolu 0.1.1 released
2010-02-08 oluolu 0.1 released FeaturesSpelling correction dictionarySpelling correction dictionary consists of two row pair, one of them is query contains mistakes and the other is the query corrected the mistake. For example, Oluolu can extract a pair such as 'yaho' -> 'yahoo'. We can make use of the spelling correction dictionary dictionary consists of such pair building the 'did you mean' feature on search engines such as Solr or Fast ESP. Context dictionaryContext dictionary dictionary also consist of query pairs as the spelling correction dictionary. One item of the pair is the query and the other is the query contains first query. For example, a related query dictionary can have the pair such as, 'yahoo' -> 'yahoo news'. This dictionary can be apply to the 'related keyword suggestion' service. Such services implemented in Bing or Google. UsageBegin with the Oluolu quick start page (QuickStart) which shows you to the installation and the tutorial with with small input files. For detailed usage, please visit the page, Usage. To do
|