際際滷

際際滷Share a Scribd company logo
Meet Solr for the first time again 
Varun Thacker
Apache Solr has a huge install base and tremendous momentum 
Solr is both established & growing 
250,000+ 
most widely used search 
solution on the planet. 8M+ total downloads 
monthly downloads 
You use Solr everyday. 
Solr has tens of thousands 
of applications in production. 
2500+ open Solr jobs. 
Activity Summary 
30 Day summary 
Aug 18 - Sep 17 2014 
 128 Commits 
 18 Contributors 
12 Month Summary 
Sep 17, 2013 - Sep 17, 2014 
 1351 Commits 
 29 Contributors 
via https://www.openhub.net/p/solr
Search - Until recently 
 Large organizations (Enterprise) 
 Expensive 
 Complex 
 $$$$$
New Age Search 
 Everyone startups, websites 
 Special use cases 
 E-commerce 
 Mails and personal data 
 Personal data - Across devices 
 Social and Local! 
 Analytics
Decision making! 
 Short time frame 
 Confidence measure: 
 Getting started quick 
 Configure and see the tip of the iceberg 
 Issues only uncover later in the story
Until recently 
 Getting started: 
 Download 
 java -jar start.jar 
 SolrCloud, getting started. 
 Download 
 Copy example directory x times over. 
 java -Dbootstrap_confdir=./solr/collection1/conf - 
Dcollection.configName=myconf -DzkRun -DnumShards=2 -jar start.jar 
 java -Djetty.port=7574 -DzkHost=localhost:9983 -jar start.jar 
 It runs!
Times they are a changin 
 Download 
 cd solr 
 Standalone: bin/solr start 
 SolrCloud, example, interactive: 
 bin/solr start -e cloud (< 2 minutes!)
Lets index some data 
 Flexible JSON Indexing - Solr supports any JSON 
document and the document can be indexed in 
the required format in Solr 
 More reading: https://lucidworks.com/blog/ 
indexing-custom-json-data/
Managed Schema 
 Solr is the schema owner 
 REST APIs - Hide the implementation details 
 Schema-less mode 
 Update and Addition of Fields and FieldTypes 
 More reading: https://lucidworks.com/blog/ 
schemaless-solr-part-1/
Configuration APIs 
 Configure Solr using APIs 
 solrconfig.xml What did you say?
Solr Scale Toolkit 
 Easily deploy SolrCloud clusters 
 Live patching and rolling restarts 
 Dependency on AWS soon to go away 
 Chef or Puppet still are valid approaches 
 More reading: http://lucidworks.com/blog/ 
introducing-the-solr-scale-toolkit/
Talking about the Admin UI 
 Already improved from 3.x 
 Uploading documents 
 Collections API is coming soon 
Collection Actions
Recently Added Features 
 Document expiration and Time To Live (TTL) 
 Cursors: Efficient Deep Paging 
 Export Sorted Result Sets 
 SSL support in SolrCloud 
 Distributed Pivot Faceting 
 Suggester v2 
 CollapsingQParserPlugin 
 ReRankingQParserPlugin 
 Collections API improvements
Theres so much more coming up 
 Schema Bulk API 
 Distributed IDF 
 Query DSL 
 Cross Data-center replication 
 Cluster Backup and Restore 
 SOLR - Make an application, not war.
Its easy.. and stable! 
 Benchmarking 
 Tons of users testing it 
 Evolving test framework
Solr scalability is unmatched. 
 10TB+ Index Size 
 10 Billion+ Documents 
 100 Million+ Daily Requests
Where is it headed? 
 Download 
 See that server directory? 
 Use start scripts 
 Send a document, or a few 
 Things dont really look the way they should? 
 Use the schema APIs 
 Add fields not enough? 
 Add field types and then add fields 
 Configure Solr using REST APIs 
For Production: 
 Use Solr Scale Toolkit to deploy, 
patch and manage! 
 Configure Solr using REST APIs
Lucidworks Fusion 
Intelligent Search Services/API 
Recommendation Module Signal Processing Analytics Service 
Enrichment Analytics Store 
 Services 
Discovery Engine 
Analyst 
Workbench 
eCommerce 
Solution 
Admin/ 
Management 
SiLK Log 
Analysis 
Search/ 
Discovery 
Partner 
Solutions 
Connector 
Framework
Connect @ 
https://twitter.com/varunthacker 
http://in.linkedin.com/in/varunthacker 
varun.thacker@lucidworks.com
Ad

Recommended

Friends of Solr - Nutch & HDFS
Friends of Solr - Nutch & HDFS
Saumitra Srivastav
Web scraping with nutch solr part 2
Web scraping with nutch solr part 2
Mike Frampton
Large Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and Friends
lucenerevolution
Implementing Hadoop on a single cluster
Implementing Hadoop on a single cluster
Salil Navgire
SolrCloud on Hadoop
SolrCloud on Hadoop
Alex Moundalexis
HBaseConEast2016: HBase on Docker with Clusterdock
HBaseConEast2016: HBase on Docker with Clusterdock
Michael Stack
8a. How To Setup HBase with Docker
8a. How To Setup HBase with Docker
Fabio Fumarola
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...
Yahoo Developer Network
Nutch + Hadoop scaled, for crawling protected web sites (hint: Selenium)
Nutch + Hadoop scaled, for crawling protected web sites (hint: Selenium)
Mark Kerzner
Cross Datacenter Replication in Apache Solr 6
Cross Datacenter Replication in Apache Solr 6
Shalin Shekhar Mangar
Configuration management
Configuration management
Luca De Vitis
Introduction to apache nutch
Introduction to apache nutch
Sigmoid
Large Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and Friends
Julien Nioche
Get started with Developing Frameworks in Go on Apache Mesos
Get started with Developing Frameworks in Go on Apache Mesos
Joe Stein
Making Apache Kafka Elastic with Apache Mesos
Making Apache Kafka Elastic with Apache Mesos
Joe Stein
Apache HDFS - Lab Assignment
Apache HDFS - Lab Assignment
Farzad Nozarian
SphinxSE with MySQL
SphinxSE with MySQL
Ritesh Puthran
Apache Hadoop & Hive installation with movie rating exercise
Apache Hadoop & Hive installation with movie rating exercise
Shiva Rama Krishna Dasharathi
Containerized Data Persistence on Mesos
Containerized Data Persistence on Mesos
Joe Stein
Developing Frameworks for Apache Mesos
Developing Frameworks for Apache Mesos
Joe Stein
You know, for search. Querying 24 Billion Documents in 900ms
You know, for search. Querying 24 Billion Documents in 900ms
Jodok Batlogg
Scaling search in Oak with Solr
Scaling search in Oak with Solr
Tommaso Teofili
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Joe Stein
Solr cluster with SolrCloud at lucenerevolution (tutorial)
Solr cluster with SolrCloud at lucenerevolution (tutorial)
searchbox-com
Nutch as a Web data mining platform
Nutch as a Web data mining platform
abial
Ease of use in Apache Solr
Ease of use in Apache Solr
Anshum Gupta
Apache Solr - Enterprise search platform
Apache Solr - Enterprise search platform
Tommaso Teofili
20130310 solr tuorial
20130310 solr tuorial
Chris Huang
Apache Solr Web Development: Unlocking the Power of Search
Apache Solr Web Development: Unlocking the Power of Search
company
Solr on Cloud
Solr on Cloud
Gulcin Yildirim Jelinek

More Related Content

What's hot (17)

Nutch + Hadoop scaled, for crawling protected web sites (hint: Selenium)
Nutch + Hadoop scaled, for crawling protected web sites (hint: Selenium)
Mark Kerzner
Cross Datacenter Replication in Apache Solr 6
Cross Datacenter Replication in Apache Solr 6
Shalin Shekhar Mangar
Configuration management
Configuration management
Luca De Vitis
Introduction to apache nutch
Introduction to apache nutch
Sigmoid
Large Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and Friends
Julien Nioche
Get started with Developing Frameworks in Go on Apache Mesos
Get started with Developing Frameworks in Go on Apache Mesos
Joe Stein
Making Apache Kafka Elastic with Apache Mesos
Making Apache Kafka Elastic with Apache Mesos
Joe Stein
Apache HDFS - Lab Assignment
Apache HDFS - Lab Assignment
Farzad Nozarian
SphinxSE with MySQL
SphinxSE with MySQL
Ritesh Puthran
Apache Hadoop & Hive installation with movie rating exercise
Apache Hadoop & Hive installation with movie rating exercise
Shiva Rama Krishna Dasharathi
Containerized Data Persistence on Mesos
Containerized Data Persistence on Mesos
Joe Stein
Developing Frameworks for Apache Mesos
Developing Frameworks for Apache Mesos
Joe Stein
You know, for search. Querying 24 Billion Documents in 900ms
You know, for search. Querying 24 Billion Documents in 900ms
Jodok Batlogg
Scaling search in Oak with Solr
Scaling search in Oak with Solr
Tommaso Teofili
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Joe Stein
Solr cluster with SolrCloud at lucenerevolution (tutorial)
Solr cluster with SolrCloud at lucenerevolution (tutorial)
searchbox-com
Nutch as a Web data mining platform
Nutch as a Web data mining platform
abial
Nutch + Hadoop scaled, for crawling protected web sites (hint: Selenium)
Nutch + Hadoop scaled, for crawling protected web sites (hint: Selenium)
Mark Kerzner
Cross Datacenter Replication in Apache Solr 6
Cross Datacenter Replication in Apache Solr 6
Shalin Shekhar Mangar
Configuration management
Configuration management
Luca De Vitis
Introduction to apache nutch
Introduction to apache nutch
Sigmoid
Large Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and Friends
Julien Nioche
Get started with Developing Frameworks in Go on Apache Mesos
Get started with Developing Frameworks in Go on Apache Mesos
Joe Stein
Making Apache Kafka Elastic with Apache Mesos
Making Apache Kafka Elastic with Apache Mesos
Joe Stein
Apache HDFS - Lab Assignment
Apache HDFS - Lab Assignment
Farzad Nozarian
SphinxSE with MySQL
SphinxSE with MySQL
Ritesh Puthran
Apache Hadoop & Hive installation with movie rating exercise
Apache Hadoop & Hive installation with movie rating exercise
Shiva Rama Krishna Dasharathi
Containerized Data Persistence on Mesos
Containerized Data Persistence on Mesos
Joe Stein
Developing Frameworks for Apache Mesos
Developing Frameworks for Apache Mesos
Joe Stein
You know, for search. Querying 24 Billion Documents in 900ms
You know, for search. Querying 24 Billion Documents in 900ms
Jodok Batlogg
Scaling search in Oak with Solr
Scaling search in Oak with Solr
Tommaso Teofili
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Joe Stein
Solr cluster with SolrCloud at lucenerevolution (tutorial)
Solr cluster with SolrCloud at lucenerevolution (tutorial)
searchbox-com
Nutch as a Web data mining platform
Nutch as a Web data mining platform
abial

Similar to Meet Solr For The Tirst Again (20)

Ease of use in Apache Solr
Ease of use in Apache Solr
Anshum Gupta
Apache Solr - Enterprise search platform
Apache Solr - Enterprise search platform
Tommaso Teofili
20130310 solr tuorial
20130310 solr tuorial
Chris Huang
Apache Solr Web Development: Unlocking the Power of Search
Apache Solr Web Development: Unlocking the Power of Search
company
Solr on Cloud
Solr on Cloud
Gulcin Yildirim Jelinek
Apache Solr crash course
Apache Solr crash course
Tommaso Teofili
Introduction to Solr
Introduction to Solr
Erik Hatcher
Building Intelligent Search Applications with Apache Solr and PHP5
Building Intelligent Search Applications with Apache Solr and PHP5
israelekpo
Intro to Apache Solr
Intro to Apache Solr
Shalin Shekhar Mangar
Webinar: Inside Apache Solr 5
Webinar: Inside Apache Solr 5
Lucidworks
Inside Solr 5 - Bangalore Solr/Lucene Meetup
Inside Solr 5 - Bangalore Solr/Lucene Meetup
Shalin Shekhar Mangar
Solr 101
Solr 101
Findwise
Introduction to Solr
Introduction to Solr
Jayesh Bhoyar
Introduction to Apache Solr
Introduction to Apache Solr
Christos Manios
What's new in Solr 5.0
What's new in Solr 5.0
Anshum Gupta
Introduction to Apache solr
Introduction to Apache solr
Knoldus Inc.
Solr Masterclass Bangkok, June 2014
Solr Masterclass Bangkok, June 2014
Alexandre Rafalovitch
SOLR
SOLR
Matthew McCullough
Solr 8 interview
Solr 8 interview
Alihossein shahabi
Suche mit Apache Lucene & Co.
Suche mit Apache Lucene & Co.
inovex GmbH
Ease of use in Apache Solr
Ease of use in Apache Solr
Anshum Gupta
Apache Solr - Enterprise search platform
Apache Solr - Enterprise search platform
Tommaso Teofili
20130310 solr tuorial
20130310 solr tuorial
Chris Huang
Apache Solr Web Development: Unlocking the Power of Search
Apache Solr Web Development: Unlocking the Power of Search
company
Apache Solr crash course
Apache Solr crash course
Tommaso Teofili
Introduction to Solr
Introduction to Solr
Erik Hatcher
Building Intelligent Search Applications with Apache Solr and PHP5
Building Intelligent Search Applications with Apache Solr and PHP5
israelekpo
Webinar: Inside Apache Solr 5
Webinar: Inside Apache Solr 5
Lucidworks
Inside Solr 5 - Bangalore Solr/Lucene Meetup
Inside Solr 5 - Bangalore Solr/Lucene Meetup
Shalin Shekhar Mangar
Solr 101
Solr 101
Findwise
Introduction to Solr
Introduction to Solr
Jayesh Bhoyar
Introduction to Apache Solr
Introduction to Apache Solr
Christos Manios
What's new in Solr 5.0
What's new in Solr 5.0
Anshum Gupta
Introduction to Apache solr
Introduction to Apache solr
Knoldus Inc.
Solr Masterclass Bangkok, June 2014
Solr Masterclass Bangkok, June 2014
Alexandre Rafalovitch
Suche mit Apache Lucene & Co.
Suche mit Apache Lucene & Co.
inovex GmbH
Ad

Recently uploaded (20)

Fatality due to Falls at Working at Height
Fatality due to Falls at Working at Height
ssuserb8994f
Call For Papers - 17th International Conference on Wireless & Mobile Networks...
Call For Papers - 17th International Conference on Wireless & Mobile Networks...
hosseinihamid192023
Rapid Prototyping for XR: Lecture 6 - AI for Prototyping and Research Directi...
Rapid Prototyping for XR: Lecture 6 - AI for Prototyping and Research Directi...
Mark Billinghurst
Mechanical Vibration_MIC 202_iit roorkee.pdf
Mechanical Vibration_MIC 202_iit roorkee.pdf
isahiliitr
retina_biometrics ruet rajshahi bangdesh.pptx
retina_biometrics ruet rajshahi bangdesh.pptx
MdRakibulIslam697135
20CE404-Soil Mechanics - 際際滷 Share PPT
20CE404-Soil Mechanics - 際際滷 Share PPT
saravananr808639
Cadastral Maps
Cadastral Maps
Google
Rapid Prototyping for XR: Lecture 4 - High Level Prototyping.
Rapid Prototyping for XR: Lecture 4 - High Level Prototyping.
Mark Billinghurst
machine learning is a advance technology
machine learning is a advance technology
ynancy893
May 2025: Top 10 Read Articles in Data Mining & Knowledge Management Process
May 2025: Top 10 Read Articles in Data Mining & Knowledge Management Process
IJDKP
How to Un-Obsolete Your Legacy Keypad Design
How to Un-Obsolete Your Legacy Keypad Design
Epec Engineered Technologies
Rapid Prototyping for XR: Lecture 2 - Low Fidelity Prototyping.
Rapid Prototyping for XR: Lecture 2 - Low Fidelity Prototyping.
Mark Billinghurst
Introduction to Natural Language Processing - Stages in NLP Pipeline, Challen...
Introduction to Natural Language Processing - Stages in NLP Pipeline, Challen...
resming1
Structured Programming with C++ :: Kjell Backman
Structured Programming with C++ :: Kjell Backman
Shabista Imam
International Journal of Advanced Information Technology (IJAIT)
International Journal of Advanced Information Technology (IJAIT)
ijait
(Continuous Integration and Continuous Deployment/Delivery) is a fundamental ...
(Continuous Integration and Continuous Deployment/Delivery) is a fundamental ...
ketan09101
AI_Presentation (1). Artificial intelligence
AI_Presentation (1). Artificial intelligence
RoselynKaur8thD34
Introduction to sensing and Week-1.pptx
Introduction to sensing and Week-1.pptx
KNaveenKumarECE
Structural Wonderers_new and ancient.pptx
Structural Wonderers_new and ancient.pptx
nikopapa113
Microwatt: Open Tiny Core, Big Possibilities
Microwatt: Open Tiny Core, Big Possibilities
IBM
Fatality due to Falls at Working at Height
Fatality due to Falls at Working at Height
ssuserb8994f
Call For Papers - 17th International Conference on Wireless & Mobile Networks...
Call For Papers - 17th International Conference on Wireless & Mobile Networks...
hosseinihamid192023
Rapid Prototyping for XR: Lecture 6 - AI for Prototyping and Research Directi...
Rapid Prototyping for XR: Lecture 6 - AI for Prototyping and Research Directi...
Mark Billinghurst
Mechanical Vibration_MIC 202_iit roorkee.pdf
Mechanical Vibration_MIC 202_iit roorkee.pdf
isahiliitr
retina_biometrics ruet rajshahi bangdesh.pptx
retina_biometrics ruet rajshahi bangdesh.pptx
MdRakibulIslam697135
20CE404-Soil Mechanics - 際際滷 Share PPT
20CE404-Soil Mechanics - 際際滷 Share PPT
saravananr808639
Cadastral Maps
Cadastral Maps
Google
Rapid Prototyping for XR: Lecture 4 - High Level Prototyping.
Rapid Prototyping for XR: Lecture 4 - High Level Prototyping.
Mark Billinghurst
machine learning is a advance technology
machine learning is a advance technology
ynancy893
May 2025: Top 10 Read Articles in Data Mining & Knowledge Management Process
May 2025: Top 10 Read Articles in Data Mining & Knowledge Management Process
IJDKP
Rapid Prototyping for XR: Lecture 2 - Low Fidelity Prototyping.
Rapid Prototyping for XR: Lecture 2 - Low Fidelity Prototyping.
Mark Billinghurst
Introduction to Natural Language Processing - Stages in NLP Pipeline, Challen...
Introduction to Natural Language Processing - Stages in NLP Pipeline, Challen...
resming1
Structured Programming with C++ :: Kjell Backman
Structured Programming with C++ :: Kjell Backman
Shabista Imam
International Journal of Advanced Information Technology (IJAIT)
International Journal of Advanced Information Technology (IJAIT)
ijait
(Continuous Integration and Continuous Deployment/Delivery) is a fundamental ...
(Continuous Integration and Continuous Deployment/Delivery) is a fundamental ...
ketan09101
AI_Presentation (1). Artificial intelligence
AI_Presentation (1). Artificial intelligence
RoselynKaur8thD34
Introduction to sensing and Week-1.pptx
Introduction to sensing and Week-1.pptx
KNaveenKumarECE
Structural Wonderers_new and ancient.pptx
Structural Wonderers_new and ancient.pptx
nikopapa113
Microwatt: Open Tiny Core, Big Possibilities
Microwatt: Open Tiny Core, Big Possibilities
IBM
Ad

Meet Solr For The Tirst Again

  • 1. Meet Solr for the first time again Varun Thacker
  • 2. Apache Solr has a huge install base and tremendous momentum Solr is both established & growing 250,000+ most widely used search solution on the planet. 8M+ total downloads monthly downloads You use Solr everyday. Solr has tens of thousands of applications in production. 2500+ open Solr jobs. Activity Summary 30 Day summary Aug 18 - Sep 17 2014 128 Commits 18 Contributors 12 Month Summary Sep 17, 2013 - Sep 17, 2014 1351 Commits 29 Contributors via https://www.openhub.net/p/solr
  • 3. Search - Until recently Large organizations (Enterprise) Expensive Complex $$$$$
  • 4. New Age Search Everyone startups, websites Special use cases E-commerce Mails and personal data Personal data - Across devices Social and Local! Analytics
  • 5. Decision making! Short time frame Confidence measure: Getting started quick Configure and see the tip of the iceberg Issues only uncover later in the story
  • 6. Until recently Getting started: Download java -jar start.jar SolrCloud, getting started. Download Copy example directory x times over. java -Dbootstrap_confdir=./solr/collection1/conf - Dcollection.configName=myconf -DzkRun -DnumShards=2 -jar start.jar java -Djetty.port=7574 -DzkHost=localhost:9983 -jar start.jar It runs!
  • 7. Times they are a changin Download cd solr Standalone: bin/solr start SolrCloud, example, interactive: bin/solr start -e cloud (< 2 minutes!)
  • 8. Lets index some data Flexible JSON Indexing - Solr supports any JSON document and the document can be indexed in the required format in Solr More reading: https://lucidworks.com/blog/ indexing-custom-json-data/
  • 9. Managed Schema Solr is the schema owner REST APIs - Hide the implementation details Schema-less mode Update and Addition of Fields and FieldTypes More reading: https://lucidworks.com/blog/ schemaless-solr-part-1/
  • 10. Configuration APIs Configure Solr using APIs solrconfig.xml What did you say?
  • 11. Solr Scale Toolkit Easily deploy SolrCloud clusters Live patching and rolling restarts Dependency on AWS soon to go away Chef or Puppet still are valid approaches More reading: http://lucidworks.com/blog/ introducing-the-solr-scale-toolkit/
  • 12. Talking about the Admin UI Already improved from 3.x Uploading documents Collections API is coming soon Collection Actions
  • 13. Recently Added Features Document expiration and Time To Live (TTL) Cursors: Efficient Deep Paging Export Sorted Result Sets SSL support in SolrCloud Distributed Pivot Faceting Suggester v2 CollapsingQParserPlugin ReRankingQParserPlugin Collections API improvements
  • 14. Theres so much more coming up Schema Bulk API Distributed IDF Query DSL Cross Data-center replication Cluster Backup and Restore SOLR - Make an application, not war.
  • 15. Its easy.. and stable! Benchmarking Tons of users testing it Evolving test framework
  • 16. Solr scalability is unmatched. 10TB+ Index Size 10 Billion+ Documents 100 Million+ Daily Requests
  • 17. Where is it headed? Download See that server directory? Use start scripts Send a document, or a few Things dont really look the way they should? Use the schema APIs Add fields not enough? Add field types and then add fields Configure Solr using REST APIs For Production: Use Solr Scale Toolkit to deploy, patch and manage! Configure Solr using REST APIs
  • 18. Lucidworks Fusion Intelligent Search Services/API Recommendation Module Signal Processing Analytics Service Enrichment Analytics Store Services Discovery Engine Analyst Workbench eCommerce Solution Admin/ Management SiLK Log Analysis Search/ Discovery Partner Solutions Connector Framework
  • 19. Connect @ https://twitter.com/varunthacker http://in.linkedin.com/in/varunthacker varun.thacker@lucidworks.com