Top Banner
21
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Scalable Search Analytics
Page 2: Scalable Search Analytics

Scalable Search and AnalyticsRavi Krishnamurthy, VP Technical Services, [email protected]

Yann Yu, Systems Engineer, [email protected]

Page 3: Scalable Search Analytics

• Motivation: Why Search AND Analytics?

• Apache Solr and Lucidworks SILK

• Solution Architectures

• Demo(s)

• Q & A

• Resources

Agenda

Page 4: Scalable Search Analytics

Why Search AND Analytics?

AnalysisData Insight Action Value

Page 5: Scalable Search Analytics

Search is more than just a box.

Page 6: Scalable Search Analytics

personal. contextual. actionable.

Search makes data

Page 7: Scalable Search Analytics

Search is everywhere.

ecommerce

log analysis

site search

compliance

enterprise apps

Page 8: Scalable Search Analytics

Secure access to all your data through one interface, empowering everyone in your organization to access the data they need.

Search is the key to unlocking big data.

vSearch anything.

Page 9: Scalable Search Analytics

query

Traditional enterprise search was all about the query.

Page 10: Scalable Search Analytics

Search can be smarter.

location search history query permissions context

Personal, contextual, relevant results: consumer-like simplicity and power in the enterprise.

Page 11: Scalable Search Analytics

Solr in a nutshell

8M+ total downloads

Solr is both established & growing

250,000+monthly downloads

Largest community of developers.

2500+open Solr jobs.

Solr most widely used search solution on the planet.

LucidworksUnmatched Solr expertise.

1/3of the active committers

70%of the open source code is committed

Lucene/Solr Revolutionworld’s largest open source user

conference dedicated to Lucene/Solr.

Solr has tens of thousands of applications in production.

You use Solr everyday.

Page 12: Scalable Search Analytics

• Search-first NoSQL store

• Distributed, Horizontally Scalable

• Stable and Robust

• Deep Paging

• Accurate Facets and Stats

• Stats on Pivots (5.0)

• Easier to start-up; run as a service on Linux (5.0)

• Your Content, Your Way (5.0)

Solr and Analytics

Page 13: Scalable Search Analytics

• Solr - Logstash - Kibana

• http://lucidworks.com/product/integrations/silk/

• Open source at:

• https://github.com/LucidWorks/banana

• https://github.com/LucidWorks/solrlogmanager

SiLK

Page 14: Scalable Search Analytics

data enrichment

your business

your app

your datamachine learning

recommendations landing pages relevancy tuningsecurity

connector framework signal processing

api reporting admin

Lucidworks FusionEverything your team needs to rapidly design and deploy next-generation search apps to your entire organization.

Page 15: Scalable Search Analytics

Enterprise Search

Lucidworks connectors processes documents and

sends to SolrCloud

Standard document storage and search

Page 16: Scalable Search Analytics

Log record search

Machine generated log records are sent to Flume.

Flume forwards raw log record to Hadoop for archiving.

Flume simultaneously parses out data in record into a Solr document,

forwarding resulting document to Solr

Lucidworks SiLK exposes real-time statistics and analytics to end-users,

as well as full-text search

High volume indexing of many small records

Page 17: Scalable Search Analytics

Co-existence with other NoSQL solutions

Page 18: Scalable Search Analytics

eCommerce: Search is Recommendation

Catalog

Signals

Pipeline

Your App

Fusion

Page 19: Scalable Search Analytics

http://github.com/lucidworks/solr-for-datascience

Page 20: Scalable Search Analytics

• Solr: http://lucene.apache.org/solr

• Company: http://www.lucidworks.com

• Our blog: http://www.lucidworks.com/blog

• Blog on stats and facets: http://lucidworks.com/blog/you-got-stats-in-my-facets/

• Fusion: http://www.lucidworks.com/products/fusion

• Solr for Data Science code: http://github.com/lucidworks/solr-for-datascience

• Email: [email protected]; [email protected]

Resources

Page 21: Scalable Search Analytics