BASIC STATISTICAL CONCEPTS Statistical Moments & Probability Density Functions Ocean is not “stationary” “Stationary” - statistical properties remain constant.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Chapter 6 Confidence Intervals.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Chapter 6 Sampling and Sampling Distributions
Estimation in Sampling
Introduction to Statistics
Objectives (BPS chapter 24)
ELEC 303 – Random Signals Lecture 18 – Statistics, Confidence Intervals Dr. Farinaz Koushanfar ECE Dept., Rice University Nov 10, 2009.
QUANTITATIVE DATA ANALYSIS
Chapter 7 Sampling and Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Topic 2: Statistical Concepts and Market Returns
Evaluating Hypotheses
Class notes for ISE 201 San Jose State University
CHAPTER 6 Statistical Analysis of Experimental Data
Statistics and Probability Theory Prof. Dr. Michael Havbro Faber
Chapter 2 Simple Comparative Experiments
Chapter 7 Estimation: Single Population
1 Inference About a Population Variance Sometimes we are interested in making inference about the variability of processes. Examples: –Investors use variance.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Statistical Intervals Based on a Single Sample.
Standard error of estimate & Confidence interval.
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
Review of Basic Statistics. Definitions Population - The set of all items of interest in a statistical problem e.g. - Houses in Sacramento Parameter -
Confidence Intervals Chapter 6. § 6.1 Confidence Intervals for the Mean (Large Samples)
1 More about the Sampling Distribution of the Sample Mean and introduction to the t-distribution Presentation 3.
Chapter 6 Confidence Intervals.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
1 Math 10 Part 5 Slides Confidence Intervals © Maurice Geraghty, 2009.
Topic 5 Statistical inference: point and interval estimate
1 Introduction to Estimation Chapter Concepts of Estimation The objective of estimation is to determine the value of a population parameter on the.
1 SAMPLE MEAN and its distribution. 2 CENTRAL LIMIT THEOREM: If sufficiently large sample is taken from population with any distribution with mean  and.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Measures of Dispersion CUMULATIVE FREQUENCIES INTER-QUARTILE RANGE RANGE MEAN DEVIATION VARIANCE and STANDARD DEVIATION STATISTICS: DESCRIBING VARIABILITY.
Estimation (Point Estimation)
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Statistics PSY302 Quiz One Spring A _____ places an individual into one of several groups or categories. (p. 4) a. normal curve b. spread c.
Fundamentals of Data Analysis Lecture 3 Basics of statistics.
ELEC 303 – Random Signals Lecture 18 – Classical Statistical Inference, Dr. Farinaz Koushanfar ECE Dept., Rice University Nov 4, 2010.
Probability = Relative Frequency. Typical Distribution for a Discrete Variable.
The final exam solutions. Part I, #1, Central limit theorem Let X1,X2, …, Xn be a sequence of i.i.d. random variables each having mean μ and variance.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
PCB 3043L - General Ecology Data Analysis. PCB 3043L - General Ecology Data Analysis.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
Describing Samples Based on Chapter 3 of Gotelli & Ellison (2004) and Chapter 4 of D. Heath (1995). An Introduction to Experimental Design and Statistics.
Section 6.4 Inferences for Variances. Chi-square probability densities.
CHAPTER 2: Basic Summary Statistics
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Chapter 5: The Basic Concepts of Statistics. 5.1 Population and Sample Definition 5.1 A population consists of the totality of the observations with which.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
STATISTICS People sometimes use statistics to describe the results of an experiment or an investigation. This process is referred to as data analysis or.
Fundamentals of Data Analysis Lecture 3 Basics of statistics.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
PCB 3043L - General Ecology Data Analysis Organizing an ecological study What is the aim of the study? What is the main question being asked? What are.
Dr.Theingi Community Medicine
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
A QUANTITATIVE RESEARCH PROJECT -
Confidence Intervals and Sample Size
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
Chapter 4. Inference about Process Quality
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Chapter 2 Simple Comparative Experiments
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
IE 355: Quality and Applied Statistics I Confidence Intervals
Statistics PSY302 Review Quiz One Spring 2017
Sampling Distributions (§ )
Chapter 8 Estimation.
Statistical Inference for the Mean: t-test
Presentation transcript:

BASIC STATISTICAL CONCEPTS Statistical Moments & Probability Density Functions Ocean is not “stationary” “Stationary” - statistical properties remain constant in time Data collected have signal and noise Both signal and noise are assumed to have random behavior Population Sample

Most basic descriptive parameter for any set of measurements: Sample Mean over the duration of a time series – “time average” or over an ensemble of measurements – “ensemble mean” Sample mean is an unbiased estimate of the population mean ‘  ’ The population mean, μ, can be regarded as the expected outcome E(y) of an event y. If the measurement is executed many times, μ would be the most common outcome, i.e., it’d be E(y) (e.g. the weight printed on a bag of chips)

Sample Mean - locates center of mass of data distribution such that: Weighted Sample Mean relative frequency of occurrence of i th value

Variance - describes spread about the mean or sample variability Sample variance Sample standard deviation typical difference from the mean Population variance (unbiased) N needs to be > 1 to define variance and std dev Only for N < 30 s’ and  are significantly different Computationally more efficient (only one pass through the data)

Population variance has one degree of freedom (dof) < Sample variance because we estimate population variance with sample variance (one less dependent measure) d.o.f. : = # of independent pieces of data being used to make a calculation. = measure of how certain we are that our sample is representative of the entire population The larger the more certain we are that we have sampled the entire population Example: we have 2 observations, when estimating the mean we have 2 independent observations: = 2 But when estimating the variance, we have one independent observation because the two observations are at the same distance from the mean: =1

Other values of Importance range (1.27) Median – equal number of values above and below = Mode – value occurring most often N = 1601

Mode = -0.3 Two Modes Bimodal

Probability Provides procedures to infer population distribution from sample distribution and to determine how good the inference is The probability of a particular event to occur is the ratio of the number of occurrences of that event and the total number of occurrences for all possible events P (a dice showing ‘6’) = 1/6 The probability of a continuous variable is defined by a PROBABILITY DENSITY FUNCTION -- PDF 0  P (x)  1

Probability is measured by the area underneath PDF

Probability Density Function Gauss Normal Bell Gauss or Normal or Bell  11 22 33 erf(1/(2) ½ ) = 68.3% erf(2/(2) ½ ) = 95.4% erf(3/(2) ½ ) = 99.7%

 11 22 33 68.3% 95.4% 99.7% standardized normal variable Probability Density Function Gauss Normal Bell Gauss or Normal or Bell

Probability Density FunctionGamma  = 1  = 1  = 2  = 3  = 4

Probability Density FunctionGamma  = 2  = 1  = 2  = 3  = 4

Probability Density Function Chi Square  =  /2 Special case for  = 2  = 2  = 4  = 6  = 8 4 24 2 8 28 2  12 2  16 2

CONFIDENCE INTERVALS 1 -   /2 known Confidence Interval for  with  known For N > 30 (large enough sample) confidence interval the 100 (1 -  )% confidence interval is: standardized normal variable

(1 -  /2) =  z  /2 = 1.96

C.I. 100 (1 -  )% C.I. is: If  = 0.05, z  /2 = 1.96 Suppose we have a CT sensor at the outlet of a spring into the ocean. We obtain a burst sample of 50 measurements, once per second, with a sample mean of 26.5 ºC and a stdev of 1.2 ºC for the burst. What is the range of possible values, at the 95% confidence, for the population mean?

CONFIDENCE INTERVALS 1 -   /2 unknown Confidence Interval for  with  unknown For N < 30 (small samples) confidence interval the 100 (1 -  )% confidence interval is: Student’s t-distribution with = (N-1) degrees of freedom

 /2 = d.o.f.= 19

1 -   /2 C.I. 100 (1 -  )% C.I. is: If  = 0.05, t 0.025,19 = Suppose we do 20 CTD profiles at one station in St Augustine Inlet. We obtain a mean at the surface of 16.5 ºC and a stdev of 0.7 ºC. What is the range of possible values, at the 95% confidence, for the population mean?

CONFIDENCE INTERVALS 1 -   /2  2 Confidence Interval for  2 To determine reliability of spectral peaks Need to know C.I. for  2 on the basis of s 2 = (N-1) degrees of freedom

1 -   /2 Suppose that we have = 10 spectral estimates of a tidal record. C.I. 100 (1 -  )% C.I. is: The background variance near a distinct spectral peak is 0.3 m 2 95% C.I. for variance? How large would the peak have to be to stand out, statistically, from background level?  /2 = 0.025; 1 -  /2 = Look at Chi square table:

Chi Square Table The background variance lies in this range The spectral peak has to be greater than 0.92 m 2 to distinguish it from background levels