Top Banner
Smart Personal Shopping Assistant (SPA) Arvind Rapaka Sairam Bantupalli Ravindra Nath SpotDy Inc www.spotdy.com
22

Personal Shopping Assistant - A Big Data Problem

Apr 15, 2017

Download

Technology

Arvind Rapaka
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Personal Shopping Assistant - A Big Data Problem

Smart Personal Shopping Assistant (SPA)

Arvind Rapaka

Sairam BantupalliRavindra Nath

SpotDy Incwww.spotdy.com

Page 2: Personal Shopping Assistant - A Big Data Problem

Why we need?

A Personal Shopping Assistant is an occupation where people help customers by giving advice and making suggestions. They are employed by departmental Stores.

But you have a Mobile/Web Ecommerce business. How can you enable your mobile/web application turn into a smart personal shopper for your customers? Enter - Smart Personal Shopping Assistant.

Customer

StoreAssistant

Online Store

Page 3: Personal Shopping Assistant - A Big Data Problem

Why we need ?

I need skinny pants that girls like. My size is 32 inch waist and 34 length.

Here you go. Let me know If I should filter by price, size or brand

I like it. My price range is 40-50 dollars.

I need skinny pants that girls like. 32 inch waist, 34 length.

I like this pant. Let’s buy it.

Ok, I placed the order. You should receive your order by tomorrow. Best of luck.

Page 4: Personal Shopping Assistant - A Big Data Problem

Why we need ?

Can you place an order of red skinny pants that I ordered last year

Do you want the same size?

Yes

I have placed the order. You should receive your pants by tomorrow.

Page 5: Personal Shopping Assistant - A Big Data Problem

Process Overview

ASR Image Q&A

Knowledge Graph/ Image DB

● Speech Recognition ● Image Matching● Q & A Dialogue

ASR- Acoustic Speech RecognitionQ&A - Question and Answers Dialogue

Page 6: Personal Shopping Assistant - A Big Data Problem

SPA - System Call Flow

Q&A Dialogue and IR

ASR

Image Analysis Engine

Pre-computed KD

SpotDy BigAITM Platform

Image

Text

Voice

IR - Information RetrievalKD - Knowledge Graph DB

Dialogue/Action

Dialogue/Action

Page 7: Personal Shopping Assistant - A Big Data Problem

Components

Page 8: Personal Shopping Assistant - A Big Data Problem

Knowledge Database (KD)

● Build Product Knowledge Database○ Classification (LDA, Existing Taxonomy)

○ NLP Analysis (CRF, Bayesian etc ..)

○ Image Analysis/ Text Attribution (SURF)

○ Ontologies

Page 9: Personal Shopping Assistant - A Big Data Problem

Image Matching

● Image Analysis ○ Extract Feature (SURF Feature Extraction)

■ Find keypoints

○ Grouping Descriptors (SURF Feature Descriptor)■ Keypoints are grouped in descriptors

○ Match images in the precomputed descriptor database.

○ Post Processing

Page 10: Personal Shopping Assistant - A Big Data Problem

ASR

● Speech Recognition ○ Extract Feature vectors

○ Speech Decoder■ Scoring (DNN)

■ Most Likely Text from Acoustic Model (HMM/Viterbi Algorithm)

○ Pass to Q & A system

Page 11: Personal Shopping Assistant - A Big Data Problem

Question and Answering

● Query Analysis ○ Query Processing (Stemming, lemma, Gazetteer ..)

○ Understand User Intent (HMM )■ Navigational/Specificity

■ Initiate dialogue if necessary

○ Query POS Labelling / Entity Extraction (CRF)

○ Query Rewrite/ Retrieval/Post Processing

Page 12: Personal Shopping Assistant - A Big Data Problem

Q&A Personalization

● Q & A results should be personalized and

aggregated based on:

○ Past user history

○ User Geo/Demo

○ Occasions such as Christmas, Thanksgiving etc ..

SpotDy BigAITM

Query Results

Page 13: Personal Shopping Assistant - A Big Data Problem

Algorithms

Page 14: Personal Shopping Assistant - A Big Data Problem

SURF (Speeded up Robust Features)

SURF is a feature detection process to examine an image to extract features, that are unique to the objects in the image. Based on SIFT but faster.

In our case, it help in retrieving similar products based on images.

Process Involves : ○ Build Scale Space○ LoG Approximation○ Key Point Extraction○ Generate Features

Page 15: Personal Shopping Assistant - A Big Data Problem

LoG Approximation

● The Laplacian is a 2-D isotropic measure of the 2nd spatial derivative of an image.

● The Laplacian of an image highlights regions of rapid intensity change to detect edges.

● Uses Gaussian smoothing filter in order to reduce its sensitivity to noise due to second derivation

Page 16: Personal Shopping Assistant - A Big Data Problem

GMM/DNN-HMM

HMM is a generative probabilistic model that provides a framework for modelling time-varying spectral vector sequences. In our case, we use for speech recognition.

● GMM/DNN produce posterior probabilities for HMM States● predicts likelihood of observation sequence being generated by

state sequence using Viterbi Algo● Sub word HMMs concatenate to create larger word-based HMM

Observations (Feature vectors)

GMM/DNN

HMM States(Senones)

Posterior Probabilities

Page 17: Personal Shopping Assistant - A Big Data Problem

NLP

Knowledge Database (KD) is the key for the query processing and information retrieval

● NLP is extensively used to process unstructured data in building KD.

Algorithms:● Conditional Random Fields/Maxent for POS Tagging, Entity

Extraction, concept tagging etc.● LDA for topic Analysis and Classification

Q&A Dialogue and IR

IndexedKD

Product Catalog

Product metainfo

NLP Engine

Page 18: Personal Shopping Assistant - A Big Data Problem

Query Processing

Query

Indexed KD

Annotators/Filters

Results

● User Query pass goes through various annotators. Some of the few annotators include :

○ Gazetteer, Lemmatization, Stemming, POS Tagging, Entity Extraction

● Query Rewrite ● Search - Similarity (IR). Basic

Algorithms include ○ Vector Space Modelling○ BM25F

● Result Generation

Page 19: Personal Shopping Assistant - A Big Data Problem

Scalability

Page 20: Personal Shopping Assistant - A Big Data Problem

SPA - HA Architecture

Significant computing resources are required while

scaling to millions of requests in real time.

Page 21: Personal Shopping Assistant - A Big Data Problem

BigAITM

BigAITM is purpose built for the scalability of applications such as SPA.

● Building KD (Knowledge Database)

● Image Repository Store

● Query Processing

● Scalable Machine Learning Models

Page 22: Personal Shopping Assistant - A Big Data Problem

Q&A