Predicting White Wine Quality Scores RAPHAEL MWANGI.

Slides:



Advertisements
Similar presentations
The Important Thing About By. The Important Thing About ******** The important thing about ***** is *****. It is true s/he can *****, *****, and *****.
Advertisements

SOURCES OF DATA TO DETERMINE PROGRESS AND AREAS OF NEEDED IMPROVEMENT.
Every edge is in a red ellipse (the bags). The bags are connected in a tree. The bags an original vertex is part of are connected.
Acknowledgements Contact Information Anthony Wong, MTech 1, Senthil K. Nachimuthu, MD 1, Peter J. Haug, MD 1,2 Sepsis Temporal Model Methodology Dynamic.
English only VS. L1 support Looking at English language acquisition of native Spanish speaking children.
Boosting Rong Jin.
Indian Statistical Institute Kolkata
Tools for applications improvement George Bosilca.
Consistency of the Weather Nicole Baratelle, Cara Barskey, Youjin Kwon.
Discriminant Analysis To describe multiple regression analysis and multiple discriminant analysis. Discriminant Analysis.
EXAMPLES Attributes of Information. Overview Information Systems help companies achieve their goals. How do they do it? 1. By processing raw data into.
Boosting Rong Jin. Inefficiency with Bagging D Bagging … D1D1 D2D2 DkDk Boostrap Sampling h1h1 h2h2 hkhk Inefficiency with boostrap sampling: Every example.
Developing a Hiring System Reliability of Measurement.
Predicting Protein Interactions HERPES! Team Question Mark Jeff Brown Dante Kappotis Robert Vanderley Anthony Biasella.
Digital Circuit Why Digital Circuit? – Digital signals ( 0 and 1) are very easy to handle with electronic circuits only 2 states needed: Switch ON or OFF,
Bagging LING 572 Fei Xia 1/24/06. Ensemble methods So far, we have covered several learning methods: FSA, HMM, DT, DL, TBL. Question: how to improve results?
Copyright © 2009 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 6 HUMAN CAPITAL.
Simple Linear Regression NFL Point Spreads – 2007.
Methodology Technology, IQ, and Koko the Gorilla.
2006 CAS RATEMAKING SEMINAR CONSIDERATIONS FOR SMALL BUSINESSOWNERS POLICIES (COM-3) Beth Fitzgerald, FCAS, MAAA.
Reconstructing Gene Networks Presented by Andrew Darling Based on article  “Research Towards Reconstruction of Gene Networks from Expression Data by Supervised.
Estimation Statistics with Confidence. Estimation Before we collect our sample, we know:  -3z -2z -1z 0z 1z 2z 3z Repeated sampling sample means would.
A Sampling Distribution
And Problem Solving in Chemistry Precision and Accuracy Significant Figures % error Measurement Density.
Application of Data Mining Algorithms in Atmospheric Neutrino Analyses with IceCube Tim Ruhe, TU Dortmund.
Report #1 By Team: Green Ensemble AusDM 2009 ENSEMBLE Analytical Challenge: Rules, Objectives, and Our Approach.
Defining Procedures for Decision Analysis May & Engr A April 30, 2002 Client & Faculty Advisors –Dr. Keith Adams –Dr. John Lamont –Dr. Ralph.
Air Travel Forecast Problem 1 Objectives Introduction to forecasting methods Experience with Delphi Experience with consensus-seeking techniques Strength/weaknesses.
BOOSTING David Kauchak CS451 – Fall Admin Final project.
BAGGING ALGORITHM, ONLINE BOOSTING AND VISION Se – Hoon Park.
Sampling Distribution of a Sample Mean Lecture 30 Section 8.4 Mon, Mar 19, 2007.
Forecasting Unit Demand Chester Corporation. Forecast Definition To estimate or calculate in advance To serve as an advance indication of; foreshadow:
1 Psych 5500/6500 Introduction to the F Statistic (Segue to ANOVA) Fall, 2008.
Chris and Abbie’s Vital Statistics. It’s actually quite dull!!! But......It is quite easy if you think it through. Arghhh Statistics...
Predictive Modeling for Small Commercial Risks CAS PREDICTIVE MODELING SEMINAR Beth Fitzgerald ISO October 2006.
Chapter 7/8 Examples. Example 1 A company wants to know when they raise their prices if they will lower customer satisfaction. Therefore, they complete.
Accuracy vs. Precision. Accuracy Vs. Precision Accuracy Accuracy is how close a measured value is to the actual (true) value. Precision Precision is how.
Advanced Math Topics Tests Concerning Means for Large Samples.
For her project, Maria bought 50 small bags of jelly beans. The average number of jelly beans in a small bag is 27. It is known that the standard deviation.
ICC Mission Statement Our mission at Informed Credit Consulting is to provide a quality credit repair service with a commitment to deliver fast, comprehensive.
Wisconsin Traffic Operations and Safety Laboratory Department of Civil and Environmental Engineering University of Wisconsin-Madison Improving Accuracy.
11 Oct 2008 PKAL Quirk 1 MILO SCHIELD, Augsburg College Director, W. M. Keck Statistical Literacy Project Vice President, National Numeracy Network Project.
Accuracy vs. Precision. 2 minutes: Write what you think accuracy and precision mean 1 minute: discuss at your table what you think.
Risk Solutions & Research © Copyright IBM Corporation 2005 Default Risk Modelling : Decision Tree Versus Logistic Regression Dr.Satchidananda S Sogala,Ph.D.,
1 How to Take Tests 4 How to Take a True/False Test.
Copyright © 2003, N. Ahbel Residuals. Copyright © 2003, N. Ahbel Predicted Actual Actual – Predicted = Error Source:
Science Process Skills. What is Science?
1 Scientific Method Learning Target: I can describe the steps in the scientific method.
TCSS 342 Autumn 2004 Version TCSS 342 Data Structures & Algorithms Autumn 2004 Ed Hong.
Experimentation A.P. Biology. Observation QuestionHypothesisAndPrediction Design the Experiment Collect Data AndAnalyzeConclusion Theory BackgroundResearch.
Boosting ---one of combining models Xin Li Machine Learning Course.
COLLEGE ADMISSION TEST SCORES Student IDReadingWritingMathTotal S S S S S
Getting to know your compass. Standard Compass Basics Remember: RED IS NORTH!!! Remember: RED IS NORTH!!! The arrow on a compass will usually be either.
Boosting and Additive Trees (2)
Scientific Method Jeopardy
The 3-Way Light Switch and The If-And-Only-If Statement
Machine Learning Week 1.
Navigation In Dynamic Environment
How accurate can models from social media be?
Attributes of Information
Accuracy vs. Reproducibility
Суури мэдлэг Basic Knowledge
Machine Learning Interpretability
IDSL, Intelligent Database System Lab
Validity and Reliability II: The Basics
Call of the Wild Test Analysis
Data Structures and Algorithms
Predicting Breast Cancer by Applying Deep Learning to Linked Health Records and Mammograms The algorithm predicted future breast cancer based on the index.
Analysis on Accelerated Learning Cohorts
Chapter 2, Unit E Screening Tests.
Presentation transcript:

Predicting White Wine Quality Scores RAPHAEL MWANGI

Background  Tree-models are intuitive, easy to understand, and can be used as tools for more advanced algorithms to create highly accurate models  Tree-modeling requires practically no knowledge of statistics to understand

Basic Example If Alcohol < If Alcohol > 10.85

Actual Tree Model

Bagging

Results of Single Tree Model vs. Bagged Tree Model  Each prediction we make will, on average, be: o 0.77 points off from the true wine quality score for the Single Tree Model o 0.75 points off from the true wine quality score for the Bagged Tree Model  Bagged model did slightly better