Comic Book Movies By Anthony Borkowski. Why Base Movies Off of Comic Books? Comic books are an American art form that many children in each generation.

Slides:



Advertisements
Similar presentations
Chapter 10: Re-Expressing Data: Get it Straight
Advertisements

Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Inferences for Regression.
Copyright © 2010 Pearson Education, Inc. Chapter 27 Inferences for Regression.
Chapter 27 Inferences for Regression This is just for one sample We want to talk about the relation between waist size and %body fat for the complete population.
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
Copyright © 2010 Pearson Education, Inc. Slide
Inference for Regression
CHAPTER 24: Inference for Regression
Inference for Regression 1Section 13.3, Page 284.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. *Chapter 29 Multiple Regression.
1 BA 275 Quantitative Business Methods Residual Analysis Multiple Linear Regression Adjusted R-squared Prediction Dummy Variables Agenda.
Irwin/McGraw-Hill © The McGraw-Hill Companies, Inc., 2000 LIND MASON MARCHAL 1-1 Chapter Twelve Multiple Regression and Correlation Analysis GOALS When.
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide Are the Means of Several Groups Equal? Ho:Ha: Consider the following.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 5): Outliers Fall, 2008.
Simple Linear Regression Analysis
Business Statistics - QBM117 Interval estimation for the slope and y-intercept Hypothesis tests for regression.
Welcome to class today! Chapter 12 summary sheet Jimmy Fallon video
Business Statistics - QBM117 Statistical inference for regression.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Chapter 12 Section 1 Inference for Linear Regression.
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
The Gas Guzzling Luxurious Cars Tony Dapontes and Danielle Sarlo.
Chapter 24: Comparing Means.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Inference for Regression
Inferences for Regression
Inference for Linear Regression Conditions for Regression Inference: Suppose we have n observations on an explanatory variable x and a response variable.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Chapter 10: Re-Expressing Data: Get it Straight AP Statistics.
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
Correlation & Regression
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 Linear Regression.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
DO NOW Read Pages 222 – 224 Read Pages 222 – 224 Stop before “Goals of Re-expression” Stop before “Goals of Re-expression” Answer the following questions:
Model Selection and Validation. Model-Building Process 1. Data collection and preparation 2. Reduction of explanatory or predictor variables (for exploratory.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
 Describe the association between two quantitative variables using a scatterplot’s direction, form, and strength  If the scatterplot’s form is linear,
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8- 1.
Quadratic Regression ©2005 Dr. B. C. Paul. Fitting Second Order Effects Can also use least square error formulation to fit an equation of the form Math.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Regression. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other words, there is a distribution.
Inference with Regression. Suppose we have n observations on an explanatory variable x and a response variable y. Our goal is to study or predict the.
The Box-Jenkins (ARIMA) Methodology
Linear Regression Chapter 8. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King.
Inference for Regression
Regression Analysis: A statistical procedure used to find relations among a set of variables B. Klinkenberg G
Chapter 26 Inferences for Regression. An Example: Body Fat and Waist Size Our chapter example revolves around the relationship between % body fat and.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
We will use the 2012 AP Grade Conversion Chart for Saturday’s Mock Exam.
Statistics 10 Re-Expressing Data Get it Straight.
Chapter 13 Lesson 13.2a Simple Linear Regression and Correlation: Inferential Methods 13.2: Inferences About the Slope of the Population Regression Line.
Chapter 13 Lesson 13.2a Simple Linear Regression and Correlation: Inferential Methods 13.2: Inferences About the Slope of the Population Regression Line.
Inference for Linear Regression
CHAPTER 12 More About Regression
The simple linear regression model and parameter estimation
CHAPTER 12 More About Regression
Inferences for Regression
Chapter 13 Created by Bethany Stubbe and Stephan Kogitz.
The scatterplot shows the advertised prices (in thousands of dollars) plotted against ages (in years) for a random sample of Plymouth Voyagers on several.
CHAPTER 12 More About Regression
Undergraduated Econometrics
CHAPTER 12 More About Regression
CHAPTER 12 More About Regression
Inferences for Regression
Presentation transcript:

Comic Book Movies By Anthony Borkowski

Why Base Movies Off of Comic Books? Comic books are an American art form that many children in each generation have grown up with. Characters such as Spiderman, Superman, Wonder Woman, or She-Hulk represent the values of our society, so it is only natural for a movie to be made about them and further glorify the characters.

Why SHOULD We Base Movies Off of Comic Books? Simply put, movie producers create movies based off of comic books because they know the movies will make money. But the real question is, how much money will the movie make?

How Much Money Will the Movie Make? I honestly do not know how much a movie based off of a comic book will make. I cannot predict the future, I am a statistics student. BUT, I do know a way where we can estimate the amount of money the comic book movie will make: Multiple Regression!!!!!

Multiple Regression? To create multiple regression model, we will first need to select a random sample of existing movies that are based on comic books. However, not all movies are eligible. Since many comic book movies spawn sequels and trilogies, the success on the sequels could be influenced by the original. So, for this, we will only use the FIRST movie in a series. The movie also needs to be live-action (not animated) and the comic English in origin. To define “series”, we mean storyline/plot continuation. The Batman movies, for example, from the 90’s tie into one another, so they count as a separate series that DOES NOT include Batman Begins (the 2005 film). So, both Batman films were eligible for selection

Selecting Movies Using the internet sites Internet Movie Database and Wikipedia, I compiled a list of movies based on comic books. I alphabetized the list and assigned each movie a number from starting from the top of the list down. After writing numbers on small slips of paper and mixing them up, I randomly drew 10 slips and chose the comics corresponding to the numbers on the slips.

Initial Predictors Initially, I looked at the following factors to try and predict the amount of money comic book movie would bring in.

Initial Predictors (continued) Years in publish Number of shows the series had BEFORE the film. Number of games for the series BEFORE the film. The number of times I (Anthony “WoodStock” Borkowski) saw the film. The number of films the movie director directed BEFORE this film. The month number the film fist came into theaters. The day number the movie came into theaters.

End Result Predictors After entering all of my data into Fathom, I formed a linear regression model. I arranged the predictors based on P-value and deleted the greatest values until the remaining values were below.05 (5%). The following is a list of the predictors I ended with.

End Result Predictors (continued) Number of games for the series BEFORE the film. The number of times I (Anthony “WoodStock” Borkowski) saw the film. Years in publish The day number the movie came into theaters.

Conditions To validate our multiple regression model, we had to check the following conditions: Straight Enough Independence Does the Plot Thicken? Nearly Normal

Straight Enough Condition Unfortunately, Fathom does not have the ability to create a scatter plot of residuals for multiple regression. For this condition, we will have to assume that the condition is met.

Independence Condition Since the income of one comic book movie will not affect the income of another, there is no reason to think they would affect one another. Also, I chose the movies randomly (as described earlier). This also helps ensure independence.

Does the Plot Thicken? Condition This condition is met due to the fact that all of the individual scatter plots comparing two different variables are all randomly scattered.

Does the Plot Thicken? Condition (continued)

Nearly Normal Condition Next, we had to check to see if the data is nearly normal. See the next few slides, I’ll comment on them later.

Nearly Normal Condition (continued) The data for “Number of Games” is not symmetrical in the histogram nor is it linear in a normal probability plot. If we choose to use it, we should do so with caution. However, after removing “Number of Games” I found the R-square value dropped over 60 points. For my analysis, I kept “Number of Games” and did so with caution.

Nearly Normal Condition (continued) Everything else is either a fairly linear normal probability plot or is unimodal/symmetric histogram so they meet the conditions. Note: Also, be careful of the residual plot for the variable “day”. Some could interpret it as NOT being linear, so we should also proceed with caution when using it as well.

So.. What’s that mean? Under these conditions, and proceeding with caution in regards to “Number of Games”, the multiple regression model is appropriate. The following is the output from Fathom.

Incase you can’t see it, The estimated multiple regression formula is … ^ (Predicted) income = 830,476, ,984, (games) – 80,223, (Saw) – 9,745, (publish) – 13,067, (day)

Interpretation The R-squared value is about.881, so my variables predict about 88% of the predicted movie income of a comic book movie. Looking at the bar chart in the multiple regression model, the “Number of Years in Publish for the comic book” variable seems to be responsible with about 65% of the income, followed by about 20% from the day the movie was released on.

Interpretation (continued) The Number of Games produced BEFORE the movie accounts for about 17% of the income, while the number of times I saw the film accounts for about 2%.

Interpreting the Formula If the variables “Saw It”, “years in publish”, and “day” remain at a constant, the predicted income for a comic book movie should be about $38,984, greater. If all other variables remain at a constant, the predicted income will be about $ 80,223, less for every time that I see the movie.

Interpreting the Formula If all other variables remain at a constant, the predicted movie income should be about $9,745, less for every year the comic book was in publish before the movie came out. If all other variables remain at a constant, the predicted income for a movie should be about $ 13,067, less based on the day number it is released.

Interpreting the Formula If all the variables have 0 values, the predicted income for a comic book movie should be about $ 830,476,