Title: Football for KMS: NFL 01
1Football for KMS NFL 01
APRIL 30TH 2008
- Abhijit Kumar
- Kaijia Bao
- Vishal Rupani
Course Instructor Prof. Hsinchun Chen
2Agenda
ABHI
VISHAL
KAI
Data Cleaning Statistical Analysis Final Paper
Data Collection Client Relations Final
Presentation
Data Import Data Transformation Data Mining
Data Mining Techniques Key Findings KMS
Demonstration
Objectives Literature Overview Conclusion
Knowledge DiscoveryStatistical Analysis
3Research Objectives
- Pattern identification
- Descriptive Statistics
- Data Mining Techniques
- Prediction
- Developing a strategy
- Fantasy League
4Literature Overview
- Moneyball The Art of Winning an Unfair Game
- Michael Lewis
- Las Vegas Odds
- www.VegasInsider.com
- NFL Fantasy League
- www.Nfl.com/fantasy
5Knowledge Discovery Process
6Knowledge Discovery Process
7Dependency Network
8Dependency Network
9Intended Player Statistics
- Top 3 Intended Players for Passes for the 4
teams that played in the semi-finals
H.Ward (142), P.Burress (121), B.Shaw (44)
T.Brown (143), D.Patten (93), M.Edwards (39)
T.Holt (133), M.Faulk (104), I.Bruce (103)
J.Thrash (107), D.Staley (89), T.Pinkston (83)
10Play Direction Statistics
- Direction of Rushes for all plays in 2001 season
Right Tackle
Right Guard
Left Tackle
Left Guard
Right End
Left End
Middle
Middle
11Play Direction Statistics
- Direction of Rushes for all plays in 2001 season
Number of Rushes
Direction
12Yardage Statistics
- Yardage during each down for Pass and Rush
Passes
Rushes
Average Yards Covered
Yards To Go
13Play Decision Statistics
- Play Decisions for the 4 teams that played in the
semi-finals
Play Decision Type
Number of Decisions
14Play Decision Analysis Overview
- Discovery of what environmental and/or game
factors affect play decision - Discovery of football expert knowledge through
data mining - Prediction of play decisions based on game factors
15Play Decision ID3 Analysis
16Play Decision ID3 Analysis
17Play Decision Accuracy
18Rush Accuracy Lift Chart
19Field Goal Accuracy Lift Chart
20Play Decision Classification Matrix
21Play Decision Key Findings
- Football strategy can be discovered through data,
instead of knowledge experts - Top 3 factors affecting decision
- Down, Off Ydl, Time
- Accuracy of the models are different depending on
the decision we are trying to predict - Team specific strategies may be discovered with
more data.
22Play Direction Analysis Overview
- Discover teams strengths and weakness in their
defense and/or offense - Prediction of play directions based on game
factors
23Play Direction Accuracy
24Play Direction Key Findings (ID3)
25Intended Player Analysis Overview
- Discover each teams favored recipient of a pass
- Prediction of intended player based on game
factors
26Intended Player Lift Chart
27Intended Player Key Findings
- There are 400 intended players
- Not enough data to accurately predict intended
players - Not enough data to gain knowledge over
statistical models
28Conclusions
29Future Direction
- Increase sample set
- More instances of different scenarios
- Incorporate additional information
- Pro-football-Reference.com
- VegasInsider.com (Odds for favorites)
- Extend Analysis
- Nested case (Historical performance)
30References
- Prof. Lisa Ordóñez
- Professor in Statistics
- Steve Aldrich
- Author of Moneyball in Football
- About Football
- Glossary of terms
31Knowledge Discovery Process
32Research Objectives Literature
Overview Knowledge Discovery Statistics Intended
Player Statistics Play Direction Statistics
Yardage Statistics Play Decision
Accuracy Lift Chart Charts Analysis Play
Decision Analysis Play Direction Analysis
Intended Player Conclusions Future
Directions System Design
33Backup Slide Section
34Data Collection
55,000 rows 90 columns
47,033 rows 30 columns
Dependent 4 Independent 10 Calculated - 9
35System Design
NFL KMS
FOOTBALL DATA
FIELD STRATEGY
36Yards Analysis
- Yards gained on the play is used as a metric to
measure effort - Discover how environmental and/or game factors
affect players efforts - Key Findings Top 4 environmental factors
- Off Ydl
- Time
- Down
- Gap