Apache Solr - 5.0 and beyond

Report 4 Downloads 66 Views
Apache Solr - 5.0 and beyond Anshum Gupta Apache Lucene/Solr PMC Member and Committer

Who am I? •

Anshum Gupta, Apache Lucene/Solr PMC member and committer, Lucidworks Employee.



Interested in search and related stuff.



Apache Lucene since 2006 and Solr since 2010.



Organizations I am or have been a part of:

What is Lucene?



Apache Lucene is a free open source information retrieval software library



Originally written in Java by Doug Cutting.



It is supported by the Apache Software Foundation and is released under the Apache Software License.

What is Solr? •

Solr (pronounced "solar") is an open source enterprise search platform



Written in Java,



For a while now, a part of the Apache Lucene project.



Search on Lucene - Replicated (SoLR)



SolrCloud - Distributed feature set

Apache Solr is the most widely-used search solution on the planet. You use

everyday.

Solr has tens of thousands of applications in production.

Solr is both established and growing.

8,000,000+ Total downloads

250,000+ Monthly downloads

2,500+

Open Solr jobs and the largest community of developers.

Apache Solr is also one of the most active open source projects out there Activity statistics 30 Day Summary Mar 14 2015 — Apr 13 2015

160 Commits 23 Contributors Annual commits up

12 Month Summary Apr 13 2014 — Apr 13 2015

1440 Commits 31 Contributors +126 (9%)

via https://www.openhub.net/p/solr

Solr Feature Release Frequency

Solr Essentials



Search - Full text, Geo-spatial



Faceting - Values, Ranges, Pivots, etc.



Suggestor, highlighting, auto-complete



Pluggability



and of course, Speed and Scalability

What’sTitle new Text in Solr 5x?

Ease of Use •

Get started in < 5 minutes



APIs, and more APIs •

Schema



Config



Collections



Auto* - Failover, leader election, addition of replica!



One of the best official documentation, released almost with the code.

Scalability and Performance



Thousands of collections - Apple



Billions of Documents - Box



High throughput and near real time Bloomberg



Impressive indexing performance: 150 k docs/ sec per node

Solr Scalability is unmatched

Reliability



Tons of tests and quality code



Critical systems running in production



Jepsen tests - Proven again!



Independent benchmarking and testing

Features and more!



Analytics - Do more with your data!



Distributed IDF



It’s an app not a war!

Solr News

What’s coming? •

Scalability



Faster search - SOLR-6810



Improved indexing - SOLR-6816



Analytics - HyperLogLog - SOLR-6968



Security - Authentication and Authorization framework - SOLR-7230



And tons more!

The largest Lucene/Solr conference in the world

OCT 13 - 16, 2015

AUSTIN, TX

CFP is open until May 8, 2015 For more details visit: http://lucenerevolution.org

Connect @

http://www.twitter.com/anshumgupta http://www.linkedin.com/in/anshumgupta/ [email protected]