www.anant.us | [email protected] | 202.905.2818 1010 Wisconsin Ave, NW | Suite 250 | Washington, DC 20007 Research & Development – Comparing Lucene / SolR / Elastic & Cloud Search Providers Building Search Engines
www.anant.us | [email protected] | 202.905.28181010 Wisconsin Ave, NW | Suite 250 | Washington, DC 20007
Research & Development – Comparing Lucene / SolR / Elastic &
Cloud Search Providers
Building Search Engines
What do we do?
Streamline, Organize & Unify
Business Information
Agenda
• Challenge - Why does this matter?• Info Retrieval - Retrieval / Routing• Lucene - More than meets the eye ...• Search Engine - 30k Foot View• On Premise - Lucene / SolR / Elastic • Cloud Providers - Amazon / Azure
Challenge – Why does this matter?
Knowledge
Project Information
Client Service Information
CorporateGuides
Collaborative Documents
Assets& Files
Corporate Resources
Appleseed Framework (Portal, Base, Search)
G Drive Delta
DropBox
G Drive Delta
NutshellDropbox
Freshbooks
G DriveG Sites (KB)
G DriveWorkflowy
Evernote
G DriveDropBox
OwnCloud
PocketLeaves
AIC (WP)Anant (WP)
Document Retrieval• Google Search
• Amazon Search
• LinkedIn Search
• CMS Search *
• Portal Search *
• CRM Search *
• Search *
Document Routing• Google Alerts
• Amazon Recommendations
• Netflix Recommendations
• LinkedIn Recommendations
Information Retrieval
Lucene – Inverted Index
Lucene – More than meets the eye
WhoNext?
Think of it like a “NoSQL” Database that has great indexing.. everywhere.
Search Engine – 30 Thousand Foot View
The search index is only as good as your processed data. If you put everything you find in your index, you are going to spend a lot of time telling people how to search.
On Premise – Lucene / ES / SolR
Lucene• Library
• File System
• Format
• Fast
• Embeddable*
• Indexing Anywhere
• Need to really know Lucene
• No Interface
• No server
• Lots of house keeping
SolR• Server
• Admin / REST Interface
• Configurable
• Scalable
• Great at Text*
• Truly Open
• 10+ Years
• Good ecosystem
• Too customizable
• Schemas*
• Zookeeper Needed
ElasticSearch• Server
• Configurable
• Scalable
• Good ecosystem
• Built in Clustering
• Grouping / Filtering
• Great for Logs
• Started as a Cloud Tool
• No great OTS Interface
• Only REST Interface
Cloud Search – Amazon / Azure
Amazon• SolRCloud*
• AWS* Ecosystem
• 5 QParsers
• Dynamic Fields
• 100% Completely Managed
• Been Around for a While
• Data / Read Writes
• No nested Objects
Azure• ElasticSearch*
• Azure* Ecosystem
• 2 QParsers
• 100% Completely Managed
• Good SDK
• Few Years Old
• Data / Read Writes
• No nested Objects
• Not so Dynamic Fields
Questions & Contact
www.anant.us | [email protected] | 202.905.28181010 Wisconsin Ave, NW | Suite 250 | Washington, DC 20007
@anantcorp
facebook.com/anantCorp
linkedin.com/company/anant
[email protected]/in/xingh
Rahul SinghCEO & Founder
Questions & Contact
• Modern Enterprise• Mastering Services in the Service of Others• Hybrid Agile Project Management• Building Search Engines• CICD / DevOps• Connecting Internet Software
www.anant.us | [email protected] | 202.905.28181010 Wisconsin Ave, NW | Suite 250 | Washington, DC 20007
Streamlined DataIntegration / Data Pipelines
Organized KnowledgeSearch / Data Warehouses
Unified InterfacesPortals / Dashboards / Mobile