Presentation is loading. Please wait.

Presentation is loading. Please wait.

By Kelly Galarneau. Don’t Attack Me Until the End.  My probability model is not perfect, and never will be.  We will discuss the assumptions and limitations.

Similar presentations


Presentation on theme: "By Kelly Galarneau. Don’t Attack Me Until the End.  My probability model is not perfect, and never will be.  We will discuss the assumptions and limitations."— Presentation transcript:

1 By Kelly Galarneau

2 Don’t Attack Me Until the End.  My probability model is not perfect, and never will be.  We will discuss the assumptions and limitations of this model.  History and Concept  NCAA tournament  Rating Systems

3 Jeff Sagarin’s Ratings  Published in the USA Today since 1985.  NCAA men’s basketball tournament selections  Bowl Championship Series (BCS) selections  Secret Formula  Each team’s rating  Home field advantage (constant for all teams)

4 Any Given Sunday  We know that a team’s performance is not always constant. On some days they play better, others worse.  Many things naturally vary according to the normal distribution, so perhaps we can assume NFL ratings do the same.  In an e-mail, Sagarin suggested that I use a standard deviation of 15 or 16 (perhaps 15.5). I chose 16.

5 Normal Distribution  Here is a graph for the theoretical performance of the Tennessee Titans, with a mean of 32.7 and standard deviation of 16.

6 Normal Curves  Now compare two teams, Titans with average of 32.7 and the Lions with an average of 7.82. We’ll pretend the Lions are playing at home and add in Sagarin’s prescribed 3.02.

7 Monte Carlo Simulation  Used for simulating probability situations.  Mostly used for business and finance applications.  Allows us to vary a parameter according to whatever distribution we choose (uniform, normal, Poisson, exponential, etc.)  As the parameter is changing, we can observe the effect on other variables.  I am using free software from www.yasai.rutgers.edu. (Rutgers University) www.yasai.rutgers.edu

8 How each trial works  We let the computer pick a random “performance rating” from the normal distribution for each team.  We see which one is greater and tally that as a win for that team.  Then repeat 1000 times. This graph shows the results of one trial, with a win going to the Titans.

9 The graph on the left represents a game in which the Lions perform better than average, the Titans perform worse than average, but the Titans still get the win. The graph on the right represents a game in which the Lions pull off the upset. Then we repeat… …1000 times.

10 Here is a graph of 1000 trials

11 Here is my actual Excel programming: ABCDEFG 1 TeamRatingS.D. NormalIf/thenOutput 2 Titans35.7216 =gennormal(C2,D2)=if(E2>E3,1,0)=simoutput(F2) 3 Lions7.8216 =gennormal(C3,D3)=if(E3>E2,1,0)=simoutput(F3) ABCDEFG 1 TeamRatingS.D. NormalIf/thenOutput 2 Titans35.721625.9881911 3 Lions7.8216-3.6104700 Here is one trial: This matchup gives us: Titans 88.8% (1000 trials) Lions 11.2%

12 Do we need a simulation?  I tried to approach the probability calculation from a theoretical perspective.  My thought was to assign each team a random variable X and Y and vary them according to a normal distribution. So: X ~ N(μ₁, σ) and Y ~ N(μ₂, σ).  A statistics textbook led me to the idea of multiplying them to get a 3-D probability distribution [X, Y, and P(X,Y)].

13 A 3-D Probability Distribution Here, X ~ N(35.72,16) and Y ~ N(10.84,16) Where is X > Y? YX Titans 88.8% Lions 11.2%

14 Integrals  To find the area under this surface, we need to evaluate the double integral.  Mathematica will not evaluate the exact integral, but it will give us a decimal approximation.

15 Comparing the results  Mathematica Input : NIntegrate[NIntegrate[(1/((16^2)*2*Pi))*(E^((- 1/2)*((x-35.72)/16)^2))*(E^((-1/2)*((y- 7.82)/16)^2)),{y,x,300}],{x,-300,300}]  Mathematica gives us: Titans 89.1% Lions 10.9%  The Monte Carlo with 1000 trials gave us: Titans 88.8% Lions 11.2%

16 The Bracket  6 teams from each conference (AFC and NFC) make the playoffs. The division winners are seeded 1-4 by record, the wild card teams are seeded 5 and 6 by record.  In each game the higher seed gets home field advantage.  In the “Wild Card” round of the playoffs:  The 3 seed plays the 6 seed  The 4 seed plays the 5 seed.  The 1 and 2 seeds get a first round bye.  In the “Divisional” round the 1 seed gets to play the lower seed of the two advancing teams.

17 So that means… 4 5 1 2 3 6 4 6 5 1 2 3 4 or 5 6 4 5 1 2 3 3 6 3 seed wins 6 seed wins

18 Possibilities…  In a playoff bracket with 12 teams, there are 11 games to be played, with a total of 2^11=2048 possibilities.  In my Excel spreadsheet I have made use of the complement rule and referencing so that I only need to calculate 66 matchups.

19 Assumptions and Limitations  We are using offensive and defensive statistics from the regular season, we are assuming that teams continue similar play into the post-season.  We are assuming that overall performance is normally distributed.  We are assuming that home field advantage is the same constant for all teams.  We are not taking anything into account for weather, injuries, matchups that might be significant, or any other variables.

20 2007 trial run (AFC) SeedTeamRating Div. Game Champ. Superbowl Superbowl Win 1pats36.42 100.0% 73.9% 48.3% 32.4% 2colts30.21 100.0% 58.0% 25.1% 13.7% 3chargers29.98 65.7% 28.6% 11.9% 6.5% 4steelers23.55 45.5% 12.8% 4.3% 1.9% 5jags27.9 54.5% 19.2% 8.0% 4.0% 6titans21.79 34.3% 7.4% 2.5% 1.0% 4 2 1 59.6%

21 2007 trial run (NFC) SeedTeam Rating Div. Game Champ. Superbowl Superbowl Win 1cowboys27.95 100.0% 62.6% 34.9% 14.5% 2packers29.41 100.0% 65.9% 35.6% 15.6% 3seahawks21.99 57.0% 18.4% 7.3% 2.3% 4bucs19.83 39.9% 12.0% 4.2% 1.2% 5giants28.31 60.1% 26.4% 12.7% 5.1% 6redskins22.21 43.0% 14.7% 5.2% 1.7% 4 2 1 40.4%

22 2008 Playoff Prediction  My OFFICIAL prediction won’t be available until after week 17 (end of regular season is Sunday, Dec. 28)  Over Christmas break you can access it at: kgalarneau.wikispaces.com/NFL+Prediction kgalarneau.wikispaces.com/NFL+Prediction  Feel free to e-mail me with your comments or suggestions: kgalarneau@notredameacademy.com

23 2008 Prediction (As of Dec. 15) SeedTeamRating Div. Game Champ. Superbowl Superbowl Win NFC1Giants28.14 100.0% 63.6% 37.7% 19.0% 2Panthers27.05 100.0% 57.5% 28.1% 13.5% 3Vikings25.66 55.5% 25.4% 11.8% 5.3% 4Cardinals20.66 48.3% 16.3% 5.8% 2.2% 5Cowboys23.66 51.7% 20.0% 8.4% 3.6% 6Bucs25.31 44.5% 17.2% 8.2% 3.7% sum 4 2 1 47.4% AFC1Titans29.95 100.0% 64.6% 39.6% 22.0% 2Steelers29.1 100.0% 65.1% 31.8% 16.9% 3Jets20.4 43.6% 14.7% 5.1% 2.0% 4Broncos17.61 39.1% 9.7% 2.8% 1.0% 5Colts26.39 60.9% 24.3% 10.8% 5.4% 6Ravens27.21 56.4% 21.5% 9.9% 5.3% sum 4 2 1 52.6% standard dev. = 16 home field advantage = 2.58

24 QQuestions? Comments?

25 But Wait… …There’s More!

26 History of Seeds  The NFL has been using the 12 team playoff system since 1990. SeedDivisionChamp.SuperbowlSuperbowl Win 1 28188 2 27105 325721 424542 512411 611111 sum72 3618

27 By Percentage: SeedDivisionalChampSuperbowlSuperbowl Win 138.9%50.0%44.4% 237.5%27.8% 369.4%9.7%5.6% 466.7%6.9%11.1% 533.3%5.6%2.8%5.6% 630.6%1.4%2.8%5.6% sum2111

28 Superbowl Winners by Seed since 1990 First 10 years – 7 #1 seedsLast 8 years – Only 1 #1 seed 9/10 are 1 or 2 seeds4/8 are 1 or 2 seeds

29 The End. Special thanks to: Harry Geiser Steve Havlichek


Download ppt "By Kelly Galarneau. Don’t Attack Me Until the End.  My probability model is not perfect, and never will be.  We will discuss the assumptions and limitations."

Similar presentations


Ads by Google