Alex Larson ECE 539 Fall 2013. Reasons to Predict & Goal Movies are a large part of today’s culture Movies are expensive to make Goal: To predict a movie.

Slides:



Advertisements
Similar presentations
SAMPLING METHODS OR TECHNIQUES
Advertisements

Learning Relational Probability Trees Jennifer Neville David Jensen Lisa Friedland Michael Hay Presented by Andrew Tjang.
Significance Testing.  A statistical method that uses sample data to evaluate a hypothesis about a population  1. State a hypothesis  2. Use the hypothesis.
Imbalanced data David Kauchak CS 451 – Fall 2013.
Data Analysis of Tennis Matches Fatih Çalışır. 1.ATP World Tour 250  ATP 250 Brisbane  ATP 250 Sydney... 2.ATP World Tour 500  ATP 500 Memphis  ATP.
Sophomore Slumpware Predicting Album Sales with Artificial Neural Networks Matthew Wirtala ECE 539.
Machine Learning with Discriminative Methods Lecture 18 – Structured Prediction CS Spring 2015 Alex Berg.
Scott Wiese ECE 539 Professor Hu
1 Machine Learning: Lecture 10 Unsupervised Learning (Based on Chapter 9 of Nilsson, N., Introduction to Machine Learning, 1996)
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 4 Slide 1 Slide 1 Use Case Diagram Corrected to match newer conventions.
Zvika Marx, Extending Ontology ‘s.
Comic Book Movies By Anthony Borkowski. Why Base Movies Off of Comic Books? Comic books are an American art form that many children in each generation.
Data preprocessing before classification In Kennedy et al.: “Solving data mining problems”
Do Now and Example for Notebook
Autocorrelation and Linkage Cause Bias in Evaluation of Relational Learners David Jensen and Jennifer Neville.
Chapter 17 Overview of Multivariate Analysis Methods
Psychology 202b Advanced Psychological Statistics, II February 15, 2011.
Information and Telecommunication Technology Center (ITTC) University of Kansas SmartXAutofill Intelligent Data Entry Assistant for XML Documents Danico.
Predicting Matchability - CVPR 2014 Paper -
Feature Selection Which features work best? One way to rank features: –Make a contingency table for each F –Compute abs ( log ( ad / bc ) ) –Rank the log.
MEASURING AND PREDICTING UW BADGERS’S PERFORMANCE BY QUARTERBACK AND RUNNING BACK STATS By: Tyler Chu ECE 539 Fall 2013.
Chapter 5 Data mining : A Closer Look.
Modern Navigation Thomas Herring
METU Informatics Institute Min 720 Pattern Classification with Bio-Medical Applications PART 2: Statistical Pattern Classification: Optimal Classification.
Lesson 3: Adding Polynomials
SVMLight SVMLight is an implementation of Support Vector Machine (SVM) in C. Download source from :
Motion Picture Revenue Prediction An Artificial Neural Network Method for Predicting Opening Weekend Box-Office Revenue ECE 539 – Fall 2001 Final Project.
A Random Walk on the Red Carpet: Rating Movies with User Reviews and PageRank Derry Tanti Wijaya Stéphane Bressan.
Recommender systems Drew Culbert IST /12/02.
Action Movies FEASIBILITY REPORT Prepared for Monica Ng Prepared by Edgar Carranza, Juan Ochoa, Chabrea Owens, Janak Ranchod, Rebecca Roberts.
Ramesh Sharda and Dursun Delen Institute for Research in Information Systems Department of Management Science and Information Systems William S. Spears.
ECE 539 Final Project ANN approach to help manufacturing of a better car Prabhdeep Singh Virk Fall 2010.
"Hunger games genre". Genre   
Do NOW! In your induction packs is a sheet titled Film Consumption.
WOW World of Walkover-weight “My God, it’s full of cows!” (David Bowman, 2001)
Predicting the Future With Social Media. Introduction Goal – How buzz and attention is created for different movies and how that changes over time.
Jacob Zurasky ECE5526 – Spring 2011
How do I collect sample data from population? 7.6 Sample Data and Populations Example 1 Collect data by randomly sampling A gym has 467 female members.
TEKS (6.10) Probability and statistics. The student uses statistical representations to analyze data. The student is expected to: (B) identify mean (using.
Warmup Factor Factoring Trinomials Objective: To factor quadratic trinomials. Standard 11.0.
Chapter 2 STA 200 Summer I Objective We want to be able to extrapolate results from a sample to the population at large. In order to do this (and.
Design and Implementation of a Dynamic Data MLP to Predict Motion Picture Revenue David A. Gerasimow.
Check it out! 2.1.1: Properties of Equality 1. Sydney subscribes to an online company that allows her to download electronic books. Her subscription costs.
Evolutionary Algorithms for Finding Optimal Gene Sets in Micro array Prediction. J. M. Deutsch Presented by: Shruti Sharma.
A Content-Based Approach to Collaborative Filtering Brandon Douthit-Wood CS 470 – Final Presentation.
CS378 Final Project The Netflix Data Set Class Project Ideas and Guidelines.
Quiz 1 review. Evaluating Classifiers Reading: T. Fawcett paper, link on class website, Sections 1-4 Optional reading: Davis and Goadrich paper, link.
Musical Genre Categorization Using Support Vector Machines Shu Wang.
Multiplying Polynomials
Single Digit Addition Flash cards.  The program starts with introduction.  Today, we’re going to play single digit numbers addition and flash cards.
Do MPAA Ratings Affect Revenues at the Box Office?
Scientific Method. Ask a question Finding the reason something is happening (e.g. why do the tadpoles have deformed limbs??) Make observations—be accurate!
ECE 471/571 – Lecture 3 Discriminant Function and Normal Density 08/27/15.
(4-2) Adding and Subtracting Matrices Objectives: To Add and subtract Matrices To solve certain Matrix equations.
International Comparison of Cultural Discount: Focusing on 31 countries' consumption on Hollywood movies Sungkyungkwan University Byeng-Hee Chang*, Sung-Hyun.
Human features are those things created by man.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional.
Multiplication Timed Tests.
Prediction of Box Office Gross Revenue
Deep Feedforward Networks
Predicting NFL Game Outcomes: Back-Propagating MLP
Prediction of Wine Grade
network of simple neuron-like computing elements
8-3 Using Statistics to Predict (pg )
Machine Learning and Verbatim Survey Response
Aleysha Becker Ece 539, Fall 2018
Visualizing Common Criteria of Successful and Unsuccessful Movies
Simplify 8y3 12y2.
What about −(−6) ? What quantity is in the box?.
Presentation transcript:

Alex Larson ECE 539 Fall 2013

Reasons to Predict & Goal Movies are a large part of today’s culture Movies are expensive to make Goal: To predict a movie potential box office success based on its characteristics

Data Collection Movie statistics available online on various websites Random samples of top movies from recent years Sample data from the-numbers.com

Features and Classes Release Month Distributor Genre MPAA Rating (G, PG, etc…) Is Sequel? (Y/N) 3 Classes based on Gross in release Year Gross < $49,000,000 $49,000,000 < Gross < $91,000,000 $91,000,00 < Gross

Results So far Initial results: poor low classification rate Reevaluated data Removed various outliers Limit to top Distributors/Studios Simplified genre classification

Results So Far KNN Classifier Testing Data Average C Rate(%) Confusion Matrix Maximum Likelihood Classifier Testing Data Average C Rate(%) Confusion Matrix Confusion Matrix MLP back propagation Testing Data Average C Rate(%)

Results So far Correctly Predicted for 2013: Iron Man 3 Hunger Games Oblivion Incorrectly Predicted for 2013: Gravity After Earth The Internship

Possible Improvements Refine features more, add more features like budge, director, lead actors Try with different combination of features