September 29, 2005
I remember a while back researching some open source search engines and coming up with plenty of options such as lucene/plucene/clucene/nutch/other variants of lucene and other engines based on completely different code such as glimpse. They all had various strengths/weaknesses and were all slightly different even though they may have been based on similar source…not to mention DB searches such as MySQL with both geospatial searching and fulltextsearch capabilities.
It will be interesting to see how/if this code forks, etc. Performance might be great, for all I know, but I’d like to see a C variant (it’s based on Java). I’d also like to see geospatial functions (I’m biased…I like the whole “local” aspect of search in case you haven’t noticed. :-)…another thing I’d like to see in some of these engines is the realization that data is’t always unstructured/crawled data…there may be a need for structured search with filters/sort/etc on different fields. I guess the only way I’ll find out about Sentensa’s features and capabilities will be to download and try it as well. It will be interesting to see how it works: does it support a distributed environment? how/where does it store/retrieve data? how does it use filesystem/memory available to it? etc.