[Yanel-dev] API for accessing large data sets

Michael Wechner michael.wechner at wyona.com
Tue Mar 29 10:50:06 CEST 2011


Hi

I am currently thinking about introducing a new VersionableV3 interface 
to access large sets of revisions
(e.g. 50K) and make it scale better. Also it would be nice to search 
revisions for particular tags.
Hence I was looking at the search API of lucene, because it has similar 
scalability issues:

http://lucene.apache.org/java/3_0_3/api/core/org/apache/lucene/search/Searcher.html#search%28org.apache.lucene.search.Query,%20org.apache.lucene.search.Filter,%20int%29

publicTopDocs  <http://lucene.apache.org/java/3_0_3/api/core/org/apache/lucene/search/TopDocs.html>  *search*(Query  <http://lucene.apache.org/java/3_0_3/api/core/org/apache/lucene/search/Query.html>  query,
                       Filter  <http://lucene.apache.org/java/3_0_3/api/core/org/apache/lucene/search/Filter.html>  filter,
                       int n)
                throwsIOException  <http://java.sun.com/j2se/1.5/docs/api/java/io/IOException.html>


http://lucene.apache.org/java/3_0_3/api/core/org/apache/lucene/search/TopDocs.html


|ScoreDoc 
<http://lucene.apache.org/java/3_0_3/api/core/org/apache/lucene/search/ScoreDoc.html>[]| 
|*scoreDocs 
<http://lucene.apache.org/java/3_0_3/api/core/org/apache/lucene/search/TopDocs.html#scoreDocs>*| 

           The top hits for the query. | int| |*totalHits 
<http://lucene.apache.org/java/3_0_3/api/core/org/apache/lucene/search/TopDocs.html#totalHits>*| 

           The total number of hits for the query.

but also see for example

http://docs.codehaus.org/display/GEOTOOLS/Random+Data+Access

I am currently playing with the various APIs, but any suggestions are 
very welcome.

Cheers

Michael
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.wyona.org/pipermail/yanel-development/attachments/20110329/811a7478/attachment.html>


More information about the Yanel-development mailing list