  | | | Lucene |
Hi
I have put this question as "urgent " because I can notice I don 't have
often answers
If I 'm asking the wrong way please tell me...
Before I delete a document I search it in the index to be sWhen performing a query and getting a result set back if one wants to
know which terms from the query actually matched is Highlighter still
the best way to go with the latest Lucene or should I staHi all
How to disable lucene norm factor?
Thanks
Maureen
Hi
I want to implement fulltext search on a collection of documents. I try
to figure out which system is the better choice - eXist or Lucene or
some combination of the two. I have some knowledgeHi Mike
How to disable/turn off the norm?is it while indexing?
Thanks
Maureen
"Michael McCandless " <lucene@(protected) > wrote
> The only simple workaround I can think of is to set maxMergeDocs to
> keep all segments "small ". But then you may have too many segments
> Hi mike
"The only simple workaround I can think of is to set maxMergeDocs to
keep all segments "small ". But then you may have too many segments
with time. Either that or find a way to reduce tHi
I need to merge several indexes (I call them incremental index) with my
main index.
Each incremental index can contain the same url 's of the main index
that 's why I have a list of url 's to upHi Mike..
"One thing that stands out in your listing is your norms file
(_1ke1.nrm) is enormous compared to all other files. Are you indexing
many tiny docs where each docs has highly variaHi all
Is it possible to search whether a term is equal to the entire contents
of a field rather than that the field contains a term?
So for example if I have a field with this text "world cup " an
Dear All
How much actually the disk space needed to optimize the index?The explanation given in documentation seems to be very different with the practical situation
I have an index file There are (at least) two ways to generate a BitSet which can be used for filtering.
Filter.bits()
BitSet bits new BitSet(reader.maxDoc())
TermDocs td reader.termDocs(new Term( "field " "tehi all i have successfully indexed .txt and .pdf files using lucene . Now i
want to index word documents Excel sheets and power point slides .and for
this i have downloaded POI api
from the followiHi All
I have a question regarding the close() CSIndexInput class why there is
no close operation defined...
Like ...
base.close()
If I analyze the memory dump of our server I can see lot of objHi Everyone
We have been using Lucene integrated with our application for over a year
now. The indexing and searching has been pretty fast until recently. But now
we are having some scalability issuI want to search for phrase ?innere Organe " bezeichnet
am using query q1 "?innere Organe\ " bezeichnet "
is there any issue with q1
am getting Exception in retrieveQuery().IndexDirec Lexicalhi all i am able to convert a pdf in to a text file using pdfbox. and this
is the code that i used
import org.pdfbox.pdfparser.PDFParser
import org.pdfbox.pdmodel.PDDocument
import org.pdfbox.util.Hi all
I 'm new to this group
I 'm using lucene for indexing. I have a problem. Any help gratly appreciate.
Please see the following code
// three fields
MultiFieldQueryParser parser new MultiFieHi there
I 'm indexing a lot of documents with lucene and I 'm really happy with
it )
Observing my users I understand they are querying only small range of
time. Often they only read results bacAm using lucene search. I have a field as "summary " in my Document which is
having large amount of content/data. In each search i want to get not more
than 200 words from the field "summary ". Now i wHi All
I have a lucene index with many fields one of which is a Keyword field IS.
The values stored in this field are the document ids like _839930494
_839930492.
But I am unable to delete the dohi all my name is ashwin i am trying to connect my servlet front end to my
backend lucene search program
these r the two programs
< <frontend servlet program > >
import javax.servlet.*
import javax.hye
I want to extract documents which contain a specific term.
I tried to do it in two different ways
1 Using the 'iterator ' termdocs reader.termDocs(term)
2 Using search and examing Hits
turnsHi all
I 'm new to this group
I 'm using lucene for indexing. I have a problem. Any help gratly appreciate.
Please see the following code
// three fields
MultiFieldQueryParser paHi All
I understand lucene has a requirement of double the size of index available
free on the disk on which the index is being optimised. But if in case the
disk gets filled up during optimisation
Hello
I run Nutch and get a whole slew of articles and when I display search
results there may be 5-6 articles that have different titles and most of
the body text is the same but I want to group(Lucene 1.9.1)
I have a "filename " field in Lucene that holds a value like this
pagefile.sys
If I run searches through QueryParser and I do a search for
pagefile.sys
pagefile
pagefilI 'm having problems with queries not returning a hit when a document
does in fact have those terms. (I 'm not worried about the ranking just
whether or not it 's a hit.)
Is anything wrong with the quhi all i have tried indexing .txt using lucene and its working fine.
now i want to index .doc .pdf .xls . ppt with lucene
can some one help in doing that
thanks
regards
ashwin
Hi
Here we use lucene to index our emails currently 500.000 Documents.
When Searching the body by a WildcardQuery the problems arises.
I did some profiling with JProfiler. I see the more BooleanCl |
|
 |