  | | | Lucene | Hi all
Thanks for the answers. I was looking for a best practice guide to do
the same. If anyone already had had some practical experience with
such kind of queries it will be great to know hisis there a lucene server that persists in memory ?
i see this announcement from september in the list
http //mail-archives.apache.org/eyebrowse/ReadMsg?listName lucene-user@(protected)&msgIaneesha@(protected) wrote
> That is awesome and very inspirational!
Thank you.
>
> Carrot2 looks very interesting. Wondering if anybody has a list of all the
Technically I don 't thin
For my own amusement I 've indexed the Wikipedia and put up pages that
- display search results
- cluster the results using Carrot2 (my first use of this)
- display similar pages using the entireHi everybody
can someone help me ?
i have a problem with my index ?
on my localhost everything is ok
i can put my index directory in different places it is accessed by my jsp.
But onHi luceners
Using Reader whats the best (fastest) way to find if a documents exists with a given term. The term is unique ID meaning with that term atmost one document can exist.
I have seen is it efficient and feasible to use lucene to do full text
comparisions. eg take an entire text thats reasonably large ( eg
more than 10 words) and find the result set within the lucene search
iDoes anyone know how much stop words are supposed to affect the index size?
I did an experiment of building an index once with and once without
stop words.
The corpus is the English WikipediI didn 't get any response to this post so I wanted to follow up (you can
read the full description of my problem in the archives
http //nagoya.apache.org/eyebrowse/ReadMsg?listName lucene-user@(protected)I was indexing at the time and I was under the impression that was safe
but it looks like the indexer may have removed a file that the search
was trying to access. Is there something I should be
Great thanks for your help I understand things quickly but I need lots of
explanation .. -)
For who is interested .. I was using
int id hits.doc(i)
instead of
int id hits.idHello
how does lucene calculate the score of a given document? In the class DefaultSimilarity are some parts of this formula (e.g. tf itf) but how does these parts working together?
Thanks
M
Hi
Thanks for your quick answer I understood wot u meant by using the
indexSearcher to get the termFreqVector. But you use an int as an id to
find the termFrequency so I suppose that it I what to know your opinion about this
I 've a new portal and Lucene is the serach engine. This portal is an
integration of a lot of opensource software.
phpBB(MySql) is our election for the fo
Hi all
Im quite new in this mailing list. I ve many difficulties to find the
number of a word (occurence) in a document I need to use indexSearcher
because of the query but the score returnI am pretty new to Lucene.
In my situation there will be one most likely fairly large index and over
time a trickle of smaller indexes being created that could eventually number
into the hu*sigh* Yet again I apologize. I 'm generating altogether too much traffic
here lately!
I 'm stuck. I have a custom Directory and I *need* a callback point so I
can clean up. There 's a method for Hi
I 'm new to Lucene and also this forum.
I have a txt file which contains the path to jpg
files. These jpg files are organized into folders.
My search is limited to searching only this txtErik Hatcher wrote
>
> On Jan 10 2005 at 6 54 PM David Spencer wrote
>
> > Hi...I wrote the WordNet sandbox code - but I 'm not sure if I
> > undertand this thread. Are we saying that it dHi all.
I 'm starting to use lucene and I wonder if it is possible to make a
query syntax to ask for one string which can be in two different fields
and filter duplicated results like with distinHey There
Wondering where I can find a list of the set of characters that the
StandardAnalyzer will tokenize on when indexing text in Lucene.
Best
Shawn.
I 'm getting
Lock obtain timed out.
I was developing and forgot to close the writer. How do I recover? I
killed the program put the close in but it won 't let me open again.
Thanks
Jim.
Hi All
My company is looking to hire someone UK-based for a few day 's Lucene
consultancy. Experience with coupling Lucene to large scale web
spidering is a must experience with term vectors wo
The SearchBlox Distributed Edition is a J2EE Search Component for the Akamai
EdgeComputing Platform a J2EE Application Platform consisting of more than
14 000 servers in more than 1100 networks i
Hi Guys
Apologies...........
This Question may be asked million times on this form need some
clarifications.
1) FieldType keyword name vendor
2)FieldType text Hi all
This is jac here and I am currently in urgent need to install Lucene on a unix machine. However I am not sure where to set the paths coz I am unfamiliar with Unix & am a newbie to Java aGosh I 'm really sorry for the number of posts I 'm making.
Again to recap I 'm writing a Directory a SQL-based one.
Hi ALL
I am using a java class to query an index and return sorted results.
The "author " and "title " fields are Indexed Tokenized and Stored.
The are added in the following way
doc.add(FHi
I 'm very sorry but i just wanted to receive an answer
to my question where is the runable file to make the
software lucene work.
You told me that it is the ezlm program but i don 't
see iHow is anyone managing reindexing of pages that change? Just
periodically reindex everything or do you try to determine frequency of
each changes to each page and/or site?
Thanks
Jim.
-- |
|
 |