Top Banner
EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381
15

EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

Jan 12, 2016

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 20121

Netflix and Your Next Movie Night

Nonlinear Programming

Ron Andrews

EMIS 8381

Page 2: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 20122

Elevator Pitch:

Situation:

Problem: You have finished your NLP homework, have some downtime but don’t know what movie to watch…

Solution: Netflix’s collaborative filter (cf) based movie recommendation engine that utilizes NLP methods.

Relevancy:

Customer: – The algorithm makes more relevant movie recommendations than nonlinear methods– Better movie choices

Company:– Decreased resource utilization (program complexity) – Increased customer retention.

Page 3: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 2012

Agenda:

• Netflix - Business of movie recommendation• Collaborative filtering• Linear Aspects• Nonlinear Aspects• Performance Improvement

3

Page 4: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 2012

Netflix - Business of movie recommendation

4

- Movie recommendation system: System that seeks to predict or anticipate a user’s preference for a film that has not been viewed by utilizing an algorithm that takes into consideration the collaborative nature of the website (user ratings via collaborative based filtering) -Utilizes linear, nonlinear and statistical methods

Page 5: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 2012

Collaborative filtering

5

•Collaborative filtering is a process to make automated recommendations based upon crowd sourced information such as preference, taste and patterns.

•Two advantages:Wisdom of crowdsLarge Numbers

•Netflix collects four pieces of information from its users:

User Movie Date of grade Grade

Group unity

Page 6: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 2012

Linear Approach:

6

• Computes a prediction for an item (i) using the weighted sum of items similar to i.• Corresponding similarity si,j

• Captures how users rate similar items

Page 7: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 2012

Linear Approach Cont.:

7

Traditionally:

NxM modeled by NxC N= users M= movies Rows= user feature columns are movie feature vectors

•Low rank approximations can be found•Data Sets are sparse•Result:

• difficult non-convex problem •approximation of gradient is difficult to approximate

•Need: nonlinear aspects.

Page 8: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 2012

Nonlinear approaches to movie recommendation:

• Nonlinear Principal Component Analysis (h-NLPCA)• PCA – Principal Component Analysis

– Well established data analysis technique

– Transformation of recorded observations to produce independent score variables

– Captures linear relationships well

– Not sufficient to capture nonlinear patterns

• Introducing ANN: Artificial Neural Network (models defining a function f: X-> Y) Function approximated.

8

NonlinearComponent,Associative network

Page 9: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 2012

Nonlinear Approach: NLPCA

• ANN allows for mappings onto a reduced dimensional space.• Relies on the SVD: Singular value decomposition (factorization of matrix

– first step for CF) Example:

9

Variable ofprincipal components

Uniquely determinedsingle value variances

Conjugate transpose

• Hidden layer enables the function to perform nonlinear mapping functions from extraction : X -> Z to generate: Z -> ^X

• Associative network performs identify mapping, reducing the squared reconstruction error: ½||x^-x||^2

Page 10: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 2012

Nonlinear Approach: NLPCA Cont.

• Nonlinear principal component analysis provides:– Optimal nonlinear subspace spanned by components (different groups formed) – Constrains nonlinear hierarchical order of linear components – A minimum error between groups using conjugate gradient descent algorithm– N components explain the maximal variance– Tries to search for a k-dimensional subspace of minimal mean square error

10

Application to class:• Is used to find a local minimum (not global)• Works when function is quadratic (twice differentiable)

= step size

Update iterationPi = nonlinear

Page 11: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 2012

Nonlinear Approach: NLPCA Algorithm

• Step 1: Data representation– Figure out missing values in original user item matrix (we know how to do

this)

• Step 2: Low rank representation– Use conjugate gradient descent algorithm– Hierarchical error minimized

• Step 3: Neighborhood Formation– Calculate similarity between each user and his closest neighbors.– A = reconstructed matrix, rij = rating of user ui on item ij. – Summations of l are calculated when both users (ua and ui) have rated a

movie

11

Page 12: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 2012

Nonlinear Approach: NLPCA Algorithm Cont.

• Step 4: Prediction Generation – Matching of neighborhood to user

12

Neighborhood formation

User ratings

Original itemaverage

Page 13: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 2012

Nonlinear effectiveness

13

Dimensions

Acc

urac

y Nonlinear can account for more variance.

True accuracy:.7843

Page 14: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 2012

Nonlinear effectiveness: Conclusions

• Faster convergence• Less resources • For small data sets, i.e. not many film ratings, nonlinear provides better

suggestions faster:

• More difficult to implement from a programming stand point

14

Page 15: EMIS 8381 – Spring 2012 1 Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.

EMIS 8381 – Spring 2012

•Questions?

15