Simon Fraser University Department of Statistics and Actuarial Sciences Some Random Questions.

Slides:



Advertisements
Similar presentations
Rachel T. Johnson Douglas C. Montgomery Bradley Jones
Advertisements

Applications of one-class classification
Antony Lewis Institute of Astronomy, Cambridge
Bayesian tools for analysing and reducing uncertainty Tony OHagan University of Sheffield.
Using an emulator. Outline So we’ve built an emulator – what can we use it for? Prediction What would the simulator output y be at an untried input x.
Insert Date HereSlide 1 Using Derivative and Integral Information in the Statistical Analysis of Computer Models Gemma Stephenson March 2007.
Dialogue Policy Optimisation
Polynomial Curve Fitting BITS C464/BITS F464 Navneet Goyal Department of Computer Science, BITS-Pilani, Pilani Campus, India.
Pattern Recognition and Machine Learning
Ensemble Emulation Feb. 28 – Mar. 4, 2011 Keith Dalbey, PhD Sandia National Labs, Dept 1441 Optimization & Uncertainty Quantification Abani K. Patra, PhD.
CSC321: 2011 Introduction to Neural Networks and Machine Learning Lecture 10: The Bayesian way to fit models Geoffrey Hinton.
More MR Fingerprinting
Validating uncertain predictions Tony O’Hagan, Leo Bastos, Jeremy Oakley, University of Sheffield.
Gaussian Processes I have known
Artificial Intelligence Lecture 2 Dr. Bo Yuan, Professor Department of Computer Science and Engineering Shanghai Jiaotong University
Gaussian process emulation of multiple outputs Tony O’Hagan, MUCM, Sheffield.
Carnegie Mellon School of Computer Science Understanding SMT without the “S” (Statistics) Robert Frederking.
Analysis of Simulation Input.. Simulation Machine n Simulation can be considered as an Engine with input and output as follows: Simulation Engine Input.
Model Selection. Agenda Myung, Pitt, & Kim Olsson, Wennerholm, & Lyxzen.
Chapter 5. Operations on Multiple R. V.'s 1 Chapter 5. Operations on Multiple Random Variables 0. Introduction 1. Expected Value of a Function of Random.
Numerical Analysis - Simulation -
Gaussian process modelling
Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
Applications of Bayesian sensitivity and uncertainty analysis to the statistical analysis of computer simulators for carbon dynamics Marc Kennedy Clive.
Hierarchical Distributed Genetic Algorithm for Image Segmentation Hanchuan Peng, Fuhui Long*, Zheru Chi, and Wanshi Siu {fhlong, phc,
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
Crystal Linkletter and Derek Bingham Department of Statistics and Actuarial Science Simon Fraser University Acknowledgements This research was initiated.
Comparison of Differential Evolution and Genetic Algorithm in the Design of a 2MW Permanent Magnet Wind Generator A.D.Lilla, M.A.Khan, P.Barendse Department.
WB1440 Engineering Optimization – Concepts and Applications Engineering Optimization Concepts and Applications Fred van Keulen Matthijs Langelaar CLA H21.1.
Prognosis of Gear Health Using Gaussian Process Model Department of Adaptive systems, Institute of Information Theory and Automation, May 2011, Prague.
17 May 2007RSS Kent Local Group1 Quantifying uncertainty in the UK carbon flux Tony O’Hagan CTCD, Sheffield.
Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.
10 December, 2008 CIMCA2008 (Vienna) 1 Statistical Inferences by Gaussian Markov Random Fields on Complex Networks Kazuyuki Tanaka, Takafumi Usui, Muneki.
Selecting Input Probability Distribution. Simulation Machine Simulation can be considered as an Engine with input and output as follows: Simulation Engine.
5-1 ANSYS, Inc. Proprietary © 2009 ANSYS, Inc. All rights reserved. May 28, 2009 Inventory # Chapter 5 Six Sigma.
INTRODUCTION TO Machine Learning 3rd Edition
Reservoir Uncertainty Assessment Using Machine Learning Techniques Authors: Jincong He Department of Energy Resources Engineering AbstractIntroduction.
Extreme Value Prediction in Sloshing Response Analysis
Neural Networks Demystified by Louise Francis Francis Analytics and Actuarial Data Mining, Inc.
Over-fitting and Regularization Chapter 4 textbook Lectures 11 and 12 on amlbook.com.
Gaussian Processes For Regression, Classification, and Prediction.
Lecture 6 Your data and models are never perfect… Making choices in research design and analysis that you can defend.
How Good is a Model? How much information does AIC give us? –Model 1: 3124 –Model 2: 2932 –Model 3: 2968 –Model 4: 3204 –Model 5: 5436.
Options and generalisations. Outline Dimensionality Many inputs and/or many outputs GP structure Mean and variance functions Prior information Multi-output,
CISC Machine Learning for Solving Systems Problems Microarchitecture Design Space Exploration Lecture 4 John Cavazos Dept of Computer & Information.
… Algo 1 Algo 2 Algo 3 Algo N Meta-Learning Algo.
Future Directions in Ensemble DA for Hurricane Prediction Applications Jeff Anderson: NCAR Ryan Torn: SUNY Albany Thanks to Chris Snyder, Pavel Sakov The.
Paper: A. Kapoor, H. Ahn, and R. Picard, “Mixture of Gaussian Processes for Combining Multiple Modalities,” MIT Media Lab Technical Report, Paper.
Introduction to emulators Tony O’Hagan University of Sheffield.
A Kriging or Gaussian Process emulator has: an unadjusted mean (frequently a least squares fit: ), a correction / adjustment to the mean based on data,
Marc Kennedy, Tony O’Hagan, Clive Anderson,
Machine Learning with Spark MLlib
Robert Anderson SAS JMP
Extreme Value Prediction in Sloshing Response Analysis
Opening Routine.
Lecture 17. Boosting¶ CS 109A/AC 209A/STAT 121A Data Science: Harvard University Fall 2016 Instructors: P. Protopapas, K. Rader, W. Pan.
How Good is a Model? How much information does AIC give us?
Boosting and Additive Trees
CS548 Fall 2017 Decision Trees / Random Forest Showcase by Yimin Lin, Youqiao Ma, Ran Lin, Shaoju Wu, Bhon Bunnag Showcasing work by Cano,
Variable Selection for Gaussian Process Models in Computer Experiments
Combining Base Learners
CSCI 5822 Probabilistic Models of Human and Machine Learning
10701 / Machine Learning Today: - Cross validation,
Biointelligence Laboratory, Seoul National University
Introduction to Sensor Interpretation
Introduction to Sensor Interpretation
Surface Fairing Segmentation Meeting 07.
Probabilistic Surrogate Models
Optimization under Uncertainty
Presentation transcript:

Simon Fraser University Department of Statistics and Actuarial Sciences Some Random Questions

Simon Fraser University Department of Statistics and Actuarial Sciences Questions I have…many not smart “Parameterization” – Came up several time –Can be choice for stochastic features in a computer model –Can be parameters in PDE’s…do these have error? How to account for? Robert and Howard – How did you generate your ensembles –Wanted to understand sensitivity to certain parameters? How measure?

Simon Fraser University Department of Statistics and Actuarial Sciences Questions I have…many not smart NCAR folks.. What was helpful or what did you learn? Statisticians… new problems or new methodology?

Simon Fraser University Department of Statistics and Actuarial Sciences Questions I have…many not smart Regarding PDE’s: y~N(pde(  ),  ) ? Build physics right in? Elaine…interested in maximums (Bo?)….failure models in Engineering

Simon Fraser University Department of Statistics and Actuarial Sciences Questions I have…many not smart Guillaume – Added stochastic forcing…are models still closed Seem to have a lot of parameters…are they identifiable? I do not think I understand the data assimilation (Josh? Jeff?)

Simon Fraser University Department of Statistics and Actuarial Sciences GP’s have proven effective for emulating computer model output & data mining Gaussian Spatial Process (GP) model frequently used in modeling response from complex computer codes Emulating computer model output – output varies smoothly with input changes – output is essentially noise free – GP’s outperform other modeling approaches in this arena (mars, cart, …) Data Mining – compares favorably with other machine learning techniques – noise is a more prominent feature

Simon Fraser University Department of Statistics and Actuarial Sciences Gaussian Process Models Emulators to be used as a surrogate for the computer model 1.How to build likely model complexity into design/analysis –GP models are very complex and hard to interpret –Even more challenging in calibration/assimilation problems 2.Sample Size Issues –Do you have enough data to fit these models well?

Simon Fraser University Department of Statistics and Actuarial Sciences Complexity Important elicitation problem How complex is the response surface y(x) ? How to build likely model complexity into design/analysis –GP models are very complex and hard to interpret –Even more challenging in calibration/assimilation problems

Simon Fraser University Department of Statistics and Actuarial Sciences Complexity

Simon Fraser University Department of Statistics and Actuarial Sciences Sample Size…Emulating a computer model

Simon Fraser University Department of Statistics and Actuarial Sciences Simulation p= 27, n=50,100,200,300,500 Random design Symmetric LHS Predictions for 100 holdout x’s