Top Banner
Sales Prediction with Time Series Modeling Gautam Shine, Sanjib Basak Nonlinearity induced by hidden layer = + =1 Parameters and learned from data Autoregression can be included through lagged inputs Optimization is non-convex, averaging needed ARIMA β€’ Order (5,2,0) chosen by minimizing Akaike information, which is a regularized maximum likelihood estimate β€’ Fourier terms used to introduce multiple-seasonality β€’ Predictions are too smooth to capture sales spikes Neural Net β€’ 10 autoregression lags and 14 hidden layers, chosen by minimizing generalization error β€’ Averaged prediction of 100 nets initialized with random seeds β€’ Qualitatively captures sales spikes with lower MSE than ARIMA, but low daily accuracy ARIMA + Regression β€’ Regression with indicator vectors for special days like Black Friday and Christmas greatly improves ARIMA β€’ Captures sales spikes while retaining daily accuracy β€’ Lower MSE than the neural net Autoregression (AR) = + + =1 βˆ’ βˆ’ Moving Average (MA) = + + =1 βˆ’ βˆ’ Autoregressive Integrated Moving Average (ARIMA) = + + =1 βˆ’ βˆ’ + =1 βˆ’ βˆ’ Conventional Time Series Models Feed-Forward Neural Networks Sales forecasting is critical for retailers β€’ optimal stocking of products β€’ website stability under peak traffic β€’ planning, customer support, and marketing Anomalies like Black Friday are especially difficult to capture with models Example data from online sales of tech products: Forecast Results Predicting Sales Hidden Node Count and Autoregression Order β€’ Erratic behavior can be observed when the number of hidden nodes is low β€’ Autoregression order made little difference beyond 5 or so β€’ Both chosen to minimize test MSE, but nearby values could also have been reasonable Learning Curve β€’ Curve is non-monotonic since data is not i.i.d. and set size affects prediction interval β€’ Neural nets require more data to achieve the same forecast accuracy, but can exceed ARIMA with large data sets Model Fitting
1

Sales Prediction with Time Series Modelingcs229.stanford.edu/proj2015/219_poster.pdfSales forecasting is critical for retailers β€’ optimal stocking of products β€’ website stability

May 22, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Sales Prediction with Time Series Modelingcs229.stanford.edu/proj2015/219_poster.pdfSales forecasting is critical for retailers β€’ optimal stocking of products β€’ website stability

Sales Prediction with Time Series ModelingGautam Shine, Sanjib Basak

Nonlinearity induced by hidden layer

𝑦𝑗 = 𝑏𝑗 +

𝑖=1

𝑛

𝑀𝑖𝑗 π‘₯𝑖

Parameters 𝑏𝑗 and 𝑀𝑖𝑗 learned from data

Autoregression can be included through lagged inputs

Optimization is non-convex, averaging needed

ARIMA

β€’ Order (5,2,0) chosen by minimizing Akaike information, which is a regularized maximum likelihood estimate

β€’ Fourier terms used to introduce multiple-seasonality

β€’ Predictions are too smooth to capture sales spikes

Neural Net

β€’ 10 autoregression lags and 14 hidden layers, chosen by minimizing generalization error

β€’ Averaged prediction of 100 nets initialized with random seeds

β€’ Qualitatively captures sales spikes with lower MSE than ARIMA, but low daily accuracy

ARIMA + Regression

β€’ Regression with indicator vectors for special days like Black Friday and Christmas greatly improves ARIMA

β€’ Captures sales spikes while retaining daily accuracy

β€’ Lower MSE than the neural net

Autoregression (AR)

π‘Ÿπ‘‘ = 𝑐 + πœ–π‘‘ +

𝑖=1

𝑝

πœ™π‘‘βˆ’π‘– π‘Ÿπ‘‘βˆ’π‘–

Moving Average (MA)

π‘Ÿπ‘‘ = πœ‡ + πœ–π‘‘ +

𝑖=1

π‘ž

πœƒπ‘‘βˆ’π‘– πœ–π‘‘βˆ’π‘–

Autoregressive Integrated Moving Average (ARIMA)

𝑑𝑑 = 𝑐 + πœ–π‘‘ +

𝑖=1

𝑝

πœ™π‘‘βˆ’π‘– π‘‘π‘‘βˆ’π‘– +

𝑖=1

π‘ž

πœƒπ‘‘βˆ’π‘– πœ–π‘‘βˆ’π‘–

Conventional Time Series Models

Feed-Forward Neural Networks

Sales forecasting is critical for retailers

β€’ optimal stocking of products

β€’ website stability under peak traffic

β€’ planning, customer support, and marketing

Anomalies like Black Friday are especially difficult to capture with models

Example data from online sales of tech products:

Forecast ResultsPredicting Sales

Hidden Node Count and Autoregression Order

β€’ Erratic behavior can be observed when the number of hidden nodes is low

β€’ Autoregression order made little difference beyond 5 or so

β€’ Both chosen to minimize test MSE, but nearby values could also have been reasonable

Learning Curve

β€’ Curve is non-monotonic since data is not i.i.d. and set size affects prediction interval

β€’ Neural nets require more data to achieve the same forecast accuracy, but can exceed ARIMA with large data sets

Model Fitting