Talk:Apache Lucene

Updated
Have updated the latest Lucene implementations. --Rkthorne 03:10, 6 September 2005 (UTC)

Retired
Removed link to Lupy project since it has been retired  Open2universe 04:59, 14 January 2006 (UTC)

PHP port
Andi Gutman (responsible for PHP 3, and some other PHP stuff afterward) noted that there is now a Lucene port to PHP at the recent Scale4x Linux conference.

Ported vs bindings
I don't know the product, but I think it was not ported to so many different programming languages. I expect it to have bindings for those languages?


 * I have the same feeling. Notably, I did not find a native C++ implementation. —Preceding unsigned comment added by 83.180.228.22 (talk) 10:49, 2 August 2008 (UTC)


 * At least the PHP port is definitely a stand alone implementation. It uses the same index format, but runs without using any Java code. —Preceding unsigned comment added by 84.253.47.226 (talk) 09:57, 24 February 2009 (UTC)

How Indexing Done In Lucene
How Indexing Takes Place In Lucene

How to index images of text pages
Which one is better for indexing scanned text jLibrary, solr or nutch we currently using Lucene  — Preceding unsigned comment added by 122.173.72.124 (talk) 06:14, 22 July 2011 (UTC)

Lucene -> Lucene Java
Nowaday, it seems like Lucene become boarder than Lucene Java? 158.108.135.144 05:15, 16 March 2007 (UTC)

"Lucene" is the name of the Apache Top Level Project (TLP) which serves as an umbrella for dealing with all search related apache subprojects -- including "Lucene-Java", a java search library used as the foundation for some of the other sub projects (Nutch and Solr) and the reference implementation for some of the "port" subprojects (Lucene.Net and the now defunct Lucene4c). Some subprojects however have no direct relationship to the "Lucene Java" library, but are still included in the Lucene TLP because they are search related and share common goals and communities (Hadoop and Lucy)

In a broad discussion/context of user communities or Apache projects the term "Lucene" refers to the TLP and "Lucene-Java" (or "Java Lucene") refers to the specific sub project. Within the Lucene community (or any of the sub communities), "Lucene" is frequently used to refer to the Lucene-Java project/product as long as the context is clear.

--hossman 05:10, 28 June 2007 (UTC)

Performance
Java versus C#: Since the code is so similar, comparing the performance on the different virtual machines would make for a very interesting comparison between Java and C#.

Has anyone done such a study? —The preceding unsigned comment was added by 71.7.147.153 (talk) 20:06, 9 May 2007 (UTC).

Wikipedia standards
I don't think this article answers the "What is it?" question in the first (or second) sentence. It does not happen before in the second section, "Features and common use":

"... full text indexing and searching ..."

What do you think?

--Mortense (talk) 22:16, 25 January 2010 (UTC)

KinoSeach and Ferret
The C-based with Perl and respectively Ruby bindings "loose ports" should probably be merged here. They are briefly covered in the Lucene book, and don't have much additional coverage. Apache Lucy should be mentioned here as well. FuFoFuEd (talk) 16:07, 31 May 2011 (UTC)

Nutch Lucene based or not?
Is it really correct to say that Nutch is based on Lucene? It used to bundle Lucene as indexing component, but now you typically plug in Solr or Elastic. Janhoy (talk) 10:38, 14 October 2015 (UTC)

Lucene, the name
I was wondering where does this name Lucene from? Does anyone know the origin of it? Is it from the Switzerland city Lucerne? Biogerm (talk) 12:09, 4 December 2015 (UTC)

Unbelievable Introduction
I find it unbelievable that anywhere in the introduction is said what this software ACUTALLY DOES.

It starts with ".. is a free and open source software" ==> what actual information does that bring us ? we don't care. The first thing said should be the most relevant and essential information. And for a software it is WHAT IS DOES, not what type of legal license is associated with it.

Take this page for example : https://en.wikipedia.org/wiki/FFmpeg

it starts with "is a free and open source software blah blah blah", but at least it says what the program does in the same sentence

Lucene and Indri
On 2021-09-15 - the article was edited and the part referencing the halt/end of the indri project was removed with the hint "not notable". Abother search-engine project decidedbthat lucenen was a better codevase for their search-engine then their own. Basically switching from providing the "same functionallity" to "kind-of-extension". How can this be not notable? 2.247.248.244 (talk) 05:16, 16 September 2021 (UTC)
 * Well, your edit looked okay, really the problem is the independence of the sources you used. A third-party reliable source (such as a tech magazine, book, or academic article) discussing Indri's merger with Lucene's codebase would establish that it's notable enough to include in the this article. But, I wonder even then if that wouldn't be a detail more appropriate to include in an article on Indri. Why is this relevant to Lucene's development history? Suriname0 (talk) 14:39, 16 September 2021 (UTC)