Top Banner
Predictive Measures in College Football By David Affentranger MSIS 5633 OSU Fall 2012
15
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Msis5633 cfb presentation

Predictive Measures in College Football

By David AffentrangerMSIS 5633 OSU Fall 2012

Page 2: Msis5633 cfb presentation

Project Outline

Project Research

Data Sources and Tools

Data Transformation

Decision Tree

Decision Tree Results

Classification Predictive Modeling

Prediction Results

Future Efforts

Page 3: Msis5633 cfb presentation

Project Research

Projecting Point Spreads◦ http://cs229.stanford.edu/proj2010/LiuLai-BeatingTheNCAAFootballPointSpread.pdf

Predicting individual games◦ http://cfbpredictions.com/

Predicting BCS Rankings◦ http://harvardsportsanalysis.wordpress.com/2011/

11/24/making-sense-of-the-chaos-a-bcs-prediction-model/

Page 4: Msis5633 cfb presentation

Data Sources and Tools

Data Sources◦ http://www.cfbstats.com

Tools◦ Microsoft Excel

◦ Rapid Miner

Page 5: Msis5633 cfb presentation

Data Transformation

Data Challenges◦ Data home/away teams listed in separate spreadsheet

◦ Data rows lacked visiting team statistics

◦ Data rows lacked who won the game

◦ Many numerical fields difficult for classification

Data Solutions◦ Used Excel vlookups to pull in visiting team statistics into

the tuples

◦ Used the “Points” field to compare home vs away to create a “Winner” classification field.

◦ Built Formula fields to transform numerical values into a text classification (TOP, Special Teams, Penalty Yards)

Page 6: Msis5633 cfb presentation

Data Transformation

Page 7: Msis5633 cfb presentation

Decision Tree

Using Rapid Miner and a Decision Tree (Gini Index) built a model to determine key factors for winning

Page 8: Msis5633 cfb presentation

Decision Tree Results

Visit Rush Yards Most important Factor in the model >= 165 Yards Visit Winner 66% of the time

<= 165 Yards Home Winner 79% of the time

Page 9: Msis5633 cfb presentation

Decision Tree Results

Page 10: Msis5633 cfb presentation

Classification Predictive Modeling

Predictive Model Plan◦ Predict winner of an individual game based upon key statistics

◦ Use same data set as the decision tree

◦ Split the data into Testing and Training

◦ Evaluate different data elements and there affects on prediction

Page 11: Msis5633 cfb presentation

Classification Predictive Modeling

Using Rapid Miner◦ Naïve-Bayes Classification Model

◦ Split Example set into 2 sets using X-Validation

◦ Analyzed using Performance Tool

Page 12: Msis5633 cfb presentation

Prediction Results

Page 13: Msis5633 cfb presentation

Prediction Results

85.58% Home Winner precision

78.93% Visit Winner precision

Page 14: Msis5633 cfb presentation

Future Efforts

Model currently relies on a user manually entering key statistical information for a game

Model could be enhanced for a full predictive solution ◦ Using a neural network statistical data could be predicted for an individual team for a particular game

◦ Data from the neural network could feed the classification model to predict a game or series of games