This is big. One of the secret weapons underlying the Search Inside the Music project and Project Aura is a high-quality search engine called Minion. Minion handles everything that has to do with text for these projects. In addition to traditional search, we use Minion for document similarity (the core technique used for Tagomendations), item clustering, sense disambiguation, classification, and autotagging. Minion is a research-oriented search engine, meaning that it is designed to allow for all sorts of variations. It is ultra-configurable and has a simple API.

The big news is that the process to open source the Minion engine is underway. Steve Green (aka the search guy) has created a Minion project on Java.net, and soon the Minion search engine will be available to all.

Steve promises to start posting regularly about the engine, so check out his blog, The Search Guy.

Comments:

It's so exciting! great men!

Posted by bigz on April 21, 2008 at 01:46 PM EDT

What will the license of Minion be? A free software license? How will Minion compare to Nutch? Thanks for the info.

Posted by adulau on April 21, 2008 at 04:01 PM EDT

@Adulau: The license is GPL v2. Keep an eye on Steve's blog; he'll be discussing the differences between Minion and Lucene over the next few weeks (Nutch is just the crawler; Lucene is the search engine). Steve's blog: http://blogs.sun.com/searchguy/

Posted by Paul on April 23, 2008 at 10:05 AM EDT

This blog copyright 2010 by plamere