Top Banner
10010 10010 10010 10010 Confidential and Proprietary © Copyright 2014 Confidential and Proprietary © Copyright 2013 eCommerce Search with Apache Solr Grant Ingersoll CTO, LucidWorks Twitter: @gsingers
14

Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

Jan 27, 2015

Download

Technology

"eCommerce Search with Apache Solr", Grant Ingersoll
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013

eCommerce Search with Apache Solr

Grant IngersollCTO, LucidWorks

Twitter: @gsingers

Page 2: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013

Tales from the trenches

• The case of the missing data

• The power of suggestion

Page 3: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013

Topics

• Solr powered commerce– Companies– Features

• Relevance, relevance, relevance

• Demo

Page 4: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013

Solr Powers Leading eCommerce and Consumer Sites

Page 5: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013

Basic Features for eCommerce

• High quality OOTB relevance• Facets

– Range, Term/Category, Hierarchical, Pivot• Highlighting• Did you mean?• Boosting/Blocking/Landing Pages• Easy scale

Page 6: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013

Advanced Features

• Spatial– Local– Route finding– Open Hours, etc.

• Function Queries– Inventory, Margin

• Stats Component– Missing data– Bounds, etc.

Page 7: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013

Tips and Tricks

Page 8: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014

Look Before You Leap

• Before undertaking any relevance tuning, you need to define what “better search” means to you

• Once determined, many ways to test/measure

• Once tested, many ways to fix

http://www.betternetworker.com/files/useruploads/16675/leap.jpg

Page 9: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014

Understand your…

• Domain– Types of documents– Languages present– Document structures,

metadata and other features– Lexical resources: jargon,

synonyms, abbreviations...– Relationships between

documents

• Users– Sophistication/Expertise– Search and Discovery needs– Known Item vs. Keyword

• Tolerance for Pain– Managers– Business Interests– Release cycles– Obsession in finding the one

true relevance model (hint, it doesn’t exist)

– “explain() blindness”

Page 10: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014

Known Item vs. Keyword

eCommerce search often has a split between known item and keyword search

You probably have more “wiggle” room for relevancy on keyword search

E.g. What should be the top result for a search on “women’s shoes”?

Known Item should have best matches at the top More in a moment

Page 11: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014

Debugging

• Check the analysis (more in the next slide)• Check for data quality issues• Check your query constructs (slop, boosts, etc.)

• Try alternate query representations• (exact match)^100 OR (sloppy phrase match)^50 OR (OR query)

• Use Lucene’s explain() or Solr’s &debugQuery

Page 12: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013

Signal Processing for Search and Discovery

• Signals power modern relevance– Clicks, conversions, sharing, history, signatures

• LucidWorks 5 makes it easy to capture and leverage signals– Recommendations, analytics, discovery

• Simplifies your data workflow• Simplify your operational footprint

Page 13: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013

Solr Powered Signal Processing

• Use Case: eCommerce

• Data: – Product catalog (~1.2m items)– Click data (~3.9M clicks)

Page 14: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

10010

10010

10010 10010

Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013

Metadata

• http://www.lucidworks.com

[email protected]• @gsingers

• Lucene/Solr Revolution – Washington DC, Nov 11-14– http://www.lucenerevolution.org