Time series applied to volcanic data: A review and application to Fuego volcano. Rüdiger Escobar Wolf MTU, September 2010.

Slides:



Advertisements
Similar presentations
Value-at-Risk: A Risk Estimating Tool for Management
Advertisements

Introduction to modelling extremes
Introduction to modelling extremes Marian Scott (with thanks to Clive Anderson, Trevor Hoey) NERC August 2009.
A.S. 3.8 INTERNAL 4 CREDITS Time Series. Time Series Overview Investigate Time Series Data A.S. 3.8 AS91580 Achieve Students need to tell the story of.
Earthquake Prediction Methods. Earthquake predictions  Because earthquakes do not happen on regular intervals it is difficult to predict when the next.
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
TK 6413 / TK 5413 : ISLAMIC RISK MANAGEMENT TOPIC 6: VALUE AT RISK (VaR) 1.
Correlation and regression Dr. Ghada Abo-Zaid
Statistics 100 Lecture Set 7. Chapters 13 and 14 in this lecture set Please read these, you are responsible for all material Will be doing chapters
1 BIS APPLICATION MANAGEMENT INFORMATION SYSTEM Advance forecasting Forecasting by identifying patterns in the past data Chapter outline: 1.Extrapolation.
1 Predator-Prey Oscillations in Space (again) Sandi Merchant D-dudes meeting November 21, 2005.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Decision Making: An Introduction 1. 2 Decision Making Decision Making is a process of choosing among two or more alternative courses of action for the.
Comparing volcanic risk at Volcán de Fuego and Cerro Quemado, Guatemala. PhD research proposal by Rüdiger Escobar Wolf.
Chapter 12 - Forecasting Forecasting is important in the business decision-making process in which a current choice or decision has future implications:
Data Sources The most sophisticated forecasting model will fail if it is applied to unreliable data Data should be reliable and accurate Data should be.
Real-time volcanic cloud detection from INETER ’ s AVHRR station Peter Webley KCL.
CHAPTER 6 Statistical Analysis of Experimental Data
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Future eruptive crisis scenarios a Volcán de Fuego: what can we infer from the historic record? Rüdiger Escobar Wolf PIRE meeting Nov. 13, 2009.
HAZARD ASSESSMENT FOR EARLY WARNING PURPOSES: WHAT INFORMATION IS MOST RELEVANT FROM A USERS PERSPECTIVE? THE CASE OF VOLCAN DE FUEGO, GUATEMALA. Rüdiger.
 Catalogue No: BS-338  Credit Hours: 3  Text Book: Advanced Engineering Mathematics by E.Kreyszig  Reference Books  Probability and Statistics by.
September 20101Master of Science in Geological & Mining & Sciences Michigan Technological University – College of Engineering Historic Activity Records.
AM Recitation 2/10/11.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Chapter 8 Introduction to Hypothesis Testing
Hypothesis Testing.
Trying to Give Business Students What They Need for Their Future Bob Andrews Virginia Commonwealth University.
N318b Winter 2002 Nursing Statistics Specific statistical tests: Correlation Lecture 10.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
CE 3354 ENGINEERING HYDROLOGY Lecture 6: Probability Estimation Modeling.
Introduction: Why statistics? Petter Mostad
LECTURE 22 VAR 1. Methods of calculating VAR (Cont.) Correlation method is conceptually simple and easy to apply; it only requires the mean returns and.
Theory of Probability Statistics for Business and Economics.
Inferential Statistics 2 Maarten Buis January 11, 2006.
Statistical Analysis Topic – Math skills requirements.
Geo597 Geostatistics Ch9 Random Function Models.
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 3: The Foundations of Research 1.
Research Process Parts of the research study Parts of the research study Aim: purpose of the study Aim: purpose of the study Target population: group whose.
Chapter 11: Inference for Distributions of Categorical Data Section 11.1 Chi-Square Goodness-of-Fit Tests.
Managerial Economics Demand Estimation & Forecasting.
Maintenance Workload Forecasting
Experimental Design and Statistics. Scientific Method
Testing hypotheses Continuous variables. H H H H H L H L L L L L H H L H L H H L High Murder Low Murder Low Income 31 High Income 24 High Murder Low Murder.
+ “Statisticians use a confidence interval to describe the amount of uncertainty associated with a sample estimate of a population parameter.”confidence.
Academic Research Academic Research Dr Kishor Bhanushali M
Chapter 4 The Classical Model Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Sea level Workshop – Paris 2006 Assessing the impact of long term trends in extreme sea levels on offshore and coastal installations Ralph Rayner Marine.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
INTEGRATING DISASTER RISK ‘FIRE AND VOLCANOS ERUPTION’ REDUCTION INTO LEARNING MATHEMATICS.
Long-term forecasting of volcanic explosivity
DEPARTMENT OF MECHANICAL ENGINEERING VII-SEMESTER PRODUCTION TECHNOLOGY-II 1 CHAPTER NO.4 FORECASTING.
Chapter 13 Understanding research results: statistical inference.
EC 827 Module 2 Forecasting a Single Variable from its own History.
Methods of Presenting and Interpreting Information Class 9.
Statistical Forecasting
Statistical analysis.
Introduction to Hypothesis Test – Part 2
Statistical analysis.
12 Inferential Analysis.
Chapter 10 Verification and Validation of Simulation Models
Stat 112 Notes 4 Today: Review of p-values for one-sided tests
Introduction to Instrumentation Engineering
Basic Statistical Terms
12 Inferential Analysis.
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Analytics – Statistical Approaches
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Presentation transcript:

Time series applied to volcanic data: A review and application to Fuego volcano. Rüdiger Escobar Wolf MTU, September 2010.

Outline: 1.What are time series and where do they come from? 2.In what way are time series useful and what can they tell us? 3.How can we analyze time series and what methods are out there? 4.The Fuego case: different domains and different time scales. 5.The larger context of Fuego

What are time series and where do they come from? A variable “varying” in time. The domain of the variable: physical continuous magnitudes

Seismic… velocity.

Seismic… RSAM

GOES… Thermal

But also discrete or even categorical data

Where do time series come from? Measuring and recording the time-varying “variable”. Discretization. Sampling resolution. Reliability & uncertainty. How well do we know the variable? I. e. the size of an eruption?

Degradation with time and size. Older and smaller events are more difficult to record. Dataset or “catalogue” completeness. How far back do I think my dataset is reliable for a given size of event? Implications for the statistical properties of the dataset.

Year VEI

Measuring and recording the time variable Sampling resolution. Reliability & uncertainty. Aggregating data.

Dating… Some geologist are so boring that they end up dating rocks… Dating prehistoric events: 40 Ar/ 39 Ar, 14 C, K-Ar, U series, etc… Point vs. interval data, and how to combine them.

14 C

40 Ar/ 39 Ar

The time of historical events Chronicles and their interpretation Facts, myths and everything in between.

The year of [fifteen hundred] eighty one, on December twenty six, the volcano started to throw fire more than usual, and it was so much what it threw, and with such a fury, the next day of December twenty seven, through a mouth that it has in the highest part, that from the abundant ash that came out, the air became black and thick, such that people couldn’t see each other [in Antigua Guatemala?]… …that ash reached many leagues from Guatemala, in the province of Xoconusco, where the trees were found to be covered by it… The next month of January, at the beginning of the year [fifteen hundred] eighty two, on the fourteenth of that month, the same volcano started to throw so much fire, that a great mishap was feared, because in the twenty four hours that the fury lasted, one couldn’t see anything from the volcano but rivers of fire and very large rocks made embers, which came out of the volcanoes mouth and came down with enormous fury and impetus… That fire caused much damage from the coast to the southeast, where it ruined a pueblo de indios named San Pedro, two leagues from [Antigua] Guatemala, although there were no deaths, because it happened during the day, and prevented by fear, all the indios escaped with time, abandoning their homes… Ciudad – Real, Antonio de, 1873, Relacion breve y verdadera de algunas cosas de las muchas que sucedieron al Padre Fray Alonso Ponce en las provincias de la Nueva España, siendo comisario general de aquellas partes. Tomo I. Imprenta de la Viuda de Calero. Madrid.

The problem of defining discrete events for continuous variables Different approaches: Values over threshold Local peaks

In what way are time series useful and what can they tell us? From very straightforward, i. e. simple trends… To very complex, i. e. cyclic behavior, etc.

Seismic… RSAM

Thermal GOES

Cyclic behavior at Fuego 2002 – 2007?

Correlation between time series If they share some phenomenological (causal?) relationship, chances are that they may vary together (co-vary) Volcanologists (and other geo-scientists) try this a lot!

Seismic and thermal

Stochastic nature and statistical structure of some time series data Some element of randomness but not completely unpredictable The question is “what can be predicted or forecasted”? Parametric vs. Non-parametric

Probability distributions and fitting of data What is the “distribution of the population from which the sampled set comes from”? How to fit? MLE.

The recent (2002 – 2007) Fuego dataset

Assumptions and elegant lies… Can we just assume certain things about the distribution of the population? E. g. Normality, independence, Poisson process? Is this assumption valid/justified? How do we know? Stationary vs. Non-stationary time series.

A note on time series and early warning… The “ideal” model of increasing risk, acceptable risk threshold and warning/response to that threshold. Some “real world details”.

Crisis time path (development) posibilities Time Perceived risk (probability?) 0 1 A lethal eruption will NOT happen soon. A lethal eruption WILL happen soon. The lethal event HAPPENS Progressively narrowing time window for evacuation. As the window narrows the options for action decrease and the evacuation becomes more difficult. I II III IV Highly concave curve. This case is unlikely to be maintained for a prolonged period of time because of accustomization and desensitizing. It usually tends to become case III over time. Peak and decrease curve. This case reflects and initial increase in the perceived risk, followed by a rapid decrease, either due to a decrease of the observed activity, or due to a process of desensitizing. Linear increase curve. This case represents a steady increase in the perceived risk, pointing towards the actual occurrence of the eruption. This is arguably the best case. Highly convex curve. This case represents the “sudden” or “surprise” scenario, in which warning and evacuation actions can be severely limited (even impossible) due to the short time available to carry them out. This is arguably the worst case. Tends to…

Crisis time path (development) posibilities Time Perceived risk (probability?) 0 1 A lethal eruption will NOT happen soon. A lethal eruption WILL happen soon. The lethal event HAPPENS Progressively narrowing time window for evacuation. As the window narrows the options for action decrease and the evacuation becomes more difficult. In real life, the changes in perceived risk don’t happen as a continuously varying function of time. They tend to happen as jumps or drops (discontinuities) associated to the occurrence of key events and findings (e. g. the initiation of the eruption or the issue of a warning).

Crisis time path (development) posibilities Time Perceived risk (probability?) 0 1 A lethal eruption will NOT happen soon. A lethal eruption WILL happen soon. The lethal event HAPPENS Progressively narrowing time window for evacuation. As the window narrows the options for action decrease and the evacuation becomes more difficult. I III IV II

How can we analyze time series and what methods are out there? Organizing dataset for analysis. Database for large, multidimensional datasets… or just a simple table (the simplest database) for small, low dimensionality datasets.

Tools and platforms… From the very basic (and limited!): Excel…...to the more complex (and powerful!) Matlab, R, etc… Level of automatization and available (built in) tools…

Depending on how big and good your dataset is you can apply different tools. Borrowing tools from signal processing community (electronic and communications engineering, seismologists, etc): Time vs. frequency domain. Auto and cross-correlation. Fourier and Laplace transformations.

Focus on cyclic behavior and correlation

Stochastic structure and statistical methods Parametric: Choosing a distribution and searching for a “best fit”. Hypothesizing on why they fit the distribution or how the fit can be interpreted, for instance in physical terms.

The recent (2002 – 2007) Fuego dataset Parameter k = 1.55 > 0 Aging process?

The Fuego case: Different domains and different time scales.

Summarizing… Prehistoric data: very sparse and uncertain… goes back to 230 ka, but most relevant for the last 3.5 ka. A “fairly good” (detailed and time precise) historical record. Some issues with interpretation for assessing the size, explosivity, and other relevant characteristics of the events.

A much more detailed record of recent (since ~1960’s) activity.

Combined data sources: Pre-historic: dating and deposit assessment. Historic – older: Accounts from witnesses and scientific reports. Historic – recent: INSIVUMEH bulletins, OVFUEGO / INSIVUMEH records of lava flow lengths and number of daily eruptions, GOES and MODIS / MODVOLC thermal data, RSAM from INSIVUMEH and J. Lyons, SE-CONRED bulletins and personal notes.

Some caveats… Non-stationary: turning the volcano “on” and “off”. Trends and non-homogeneous processes. How can we account for that in the time series analysis?

Cyclic behavior at Fuego 2002 – 2007?

Year VEI Clustering of eruptions since 1524?

Thanks!

Questions? Discussion?…