Top Banner
1 ©MapR Technologies - Confidential Recommendation as Search Reflections on Symmetry
22

Recommendation as Search: Reflections on Symmetry

May 08, 2015

Download

Technology

When recommendation is described in mathematical terms as a matrix equation, a striking symmetry in the form of the equation becomes apparent.

Exploiting this symmetry allows us to build search engines that don't need meta-data and self-organizing web-sites.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Recommendation as Search: Reflections on Symmetry

1©MapR Technologies - Confidential

Recommendation as Search

Reflections on Symmetry

Page 2: Recommendation as Search: Reflections on Symmetry

2©MapR Technologies - Confidential

Company Background

MapR provides the industry’s best Hadoop Distribution– Combines the best of the Hadoop community

contributions with significant internally financed infrastructure development

Background of Team– Deep management bench with extensive analytic,

storage, virtualization, and open source experience– Google, EMC, Cisco, VMWare, Network Appliance, IBM,

Microsoft, Apache Foundation, Aster Data, Brio, ParAccel Proven – MapR used across industries (Financial Services, Media,

Telcom, Health Care, Internet Services, Government) – Strategic OEM relationship with EMC and Cisco– Over 1,000 installs

Page 3: Recommendation as Search: Reflections on Symmetry

3©MapR Technologies - Confidential

What is Hadoop?

A new style of computation

A new style of combining computation and storage

Allows very large computations

Used by all large internet companies, many other industries

Fundamentally changes the economics of large-scale computation

Page 4: Recommendation as Search: Reflections on Symmetry

4©MapR Technologies - Confidential

Why Big Data?

Because we can

Because we can learn new things

Because new economics of computation favors large scale

Because big data can be simpler than small data

Page 5: Recommendation as Search: Reflections on Symmetry

5©MapR Technologies - Confidential

Recommendations

Often known as collaborative filtering“People who bought x also bought y”

Actors (people) interact (bought) with items (x and y)– observe successful interaction

We want to suggest additional successful interactions

Observations are inherently very sparse

Page 6: Recommendation as Search: Reflections on Symmetry

6©MapR Technologies - Confidential

Examples

Customers buying books (Linden et al)

Web visitors rating music (Shardanand and Maes) or movies (Riedl, et al), (Netflix)

Internet radio listeners not skipping songs (Musicmatch)

Internet video watchers watching >30 s (Veoh)

iTunes song purchases or plays (Apple)

Page 7: Recommendation as Search: Reflections on Symmetry

7©MapR Technologies - Confidential

Fundamental Algorithm

History matrix A has the shape of actors x items

Cooccurrence matrix K has the shape of items x itemsan actor interacted with both x and ysum over all actors

A is also a linear operator

K tells us “users who interacted with x also interacted with y”

Page 8: Recommendation as Search: Reflections on Symmetry

8©MapR Technologies - Confidential

… Warning …

Page 9: Recommendation as Search: Reflections on Symmetry

9©MapR Technologies - Confidential

… Warning …

Mathematics ahead

Page 10: Recommendation as Search: Reflections on Symmetry

10©MapR Technologies - Confidential

Fundamental Algorithmic Structure

Cooccurrence

For very large data-sets

Page 11: Recommendation as Search: Reflections on Symmetry

11©MapR Technologies - Confidential

But Wait ...

Does it have to be that way?

Page 12: Recommendation as Search: Reflections on Symmetry

12©MapR Technologies - Confidential

But why not ...

Page 13: Recommendation as Search: Reflections on Symmetry

13©MapR Technologies - Confidential

But why not ...

Why just dyadic learning?

Page 14: Recommendation as Search: Reflections on Symmetry

14©MapR Technologies - Confidential

But why not ...

Why just dyadic learning?

Why not triadic learning?

Page 15: Recommendation as Search: Reflections on Symmetry

15©MapR Technologies - Confidential

But why not ...

Why just dyadic learning?

Why not p-adic learning?

Page 16: Recommendation as Search: Reflections on Symmetry

16©MapR Technologies - Confidential

For example

Users enter queries (A)– (actor = user, item=query)

Users view videos (B)– (actor = user, item=video)

A’A gives query recommendation– “did you mean to ask for”

B’B gives video recommendation– “you might like these videos”

Page 17: Recommendation as Search: Reflections on Symmetry

17©MapR Technologies - Confidential

The punch-line

B’A recommends videos in response to a query– (isn’t that a search engine?)– (not quite, it doesn’t look at content or meta-data)

Page 18: Recommendation as Search: Reflections on Symmetry

18©MapR Technologies - Confidential

Real-life example

Query: “Paco de Lucia” Conventional meta-data search results:– “hombres del paco” times 400– not much else

Recommendation based search:– Flamenco guitar and dancers– Spanish and classical guitar– Van Halen doing a classical/flamenco riff

Page 19: Recommendation as Search: Reflections on Symmetry

19©MapR Technologies - Confidential

Real-life example

Page 20: Recommendation as Search: Reflections on Symmetry

20©MapR Technologies - Confidential

Real-life example

Page 21: Recommendation as Search: Reflections on Symmetry

21©MapR Technologies - Confidential

Hypothetical Example

Want a navigational ontology? Just put labels on a web page with traffic– This gives A = users x label clicks

Remember viewing history– This gives B = users x items

Cross recommend– B’A = label to item mapping

After several users click, results are whatever users think they should be

Page 22: Recommendation as Search: Reflections on Symmetry

22©MapR Technologies - Confidential

Resources

[email protected]@ted_dunning

Slides and such:– http://info.mapr.com/ted-paris-05-2012

The original paper– Accurate Methods for the Statistics of Surprise and Coincidence– (check on citeseer)

Source code– Mahout project– contact me