Primer on Statistics for Interventional Cardiologists Giuseppe Sangiorgi, MD Pierfrancesco Agostoni, MD Giuseppe Biondi-Zoccai, MD.

Slides:



Advertisements
Similar presentations
Chapter 6 Sampling and Sampling Distributions
Advertisements

CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Sampling: Final and Initial Sample Size Determination
Statistics review of basic probability and statistics.
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Biostatistics Unit 4 - Probability.
1 Midterm Review Econ 240A. 2 The Big Picture The Classical Statistical Trail Descriptive Statistics Inferential Statistics Probability Discrete Random.
Statistical Methods Chichang Jou Tamkang University.
Chapter 7 Sampling and Sampling Distributions
Evaluating Hypotheses
CHAPTER 6 Statistical Analysis of Experimental Data
OMS 201 Review. Range The range of a data set is the difference between the largest and smallest data values. It is the simplest measure of dispersion.
Part III: Inference Topic 6 Sampling and Sampling Distributions
Slide 1 Statistics Workshop Tutorial 4 Probability Probability Distributions.
Lecture Slides Elementary Statistics Twelfth Edition
Inferences About Process Quality
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Sampling Theory Determining the distribution of Sample statistics.
Inferential Statistics
Physics 114: Lecture 15 Probability Tests & Linear Fitting Dale E. Gary NJIT Physics Department.
5-2 Probability Distributions This section introduces the important concept of a probability distribution, which gives the probability for each value of.
Chapter 5 Sampling Distributions
Sampling Theory Determining the distribution of Sample statistics.
1 Ch6. Sampling distribution Dr. Deshi Ye
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
1 DATA DESCRIPTION. 2 Units l Unit: entity we are studying, subject if human being l Each unit/subject has certain parameters, e.g., a student (subject)
Quantitative Skills: Data Analysis
Biostatistics Lecture 7 4/7/2015. Chapter 7 Theoretical Probability Distributions.
Continuous Probability Distributions  Continuous Random Variable  A random variable whose space (set of possible values) is an entire interval of numbers.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Random Sampling, Point Estimation and Maximum Likelihood.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
ENGR 610 Applied Statistics Fall Week 3 Marshall University CITE Jack Smith.
Biostatistics Class 3 Discrete Probability Distributions 2/8/2000.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 5-2 Random Variables.
Discrete Random Variables. Numerical Outcomes Consider associating a numerical value with each sample point in a sample space. (1,1) (1,2) (1,3) (1,4)
Primer on Statistics for Interventional Cardiologists Giuseppe Sangiorgi, MD Pierfrancesco Agostoni, MD Giuseppe Biondi-Zoccai, MD.
Primer on Statistics for Interventional Cardiologists Giuseppe Sangiorgi, MD Pierfrancesco Agostoni, MD Giuseppe Biondi-Zoccai, MD.
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
Determination of Sample Size: A Review of Statistical Theory
Confidence Intervals Lecture 3. Confidence Intervals for the Population Mean (or percentage) For studies with large samples, “approximately 95% of the.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
June 11, 2008Stat Lecture 10 - Review1 Midterm review Chapters 1-5 Statistics Lecture 10.
ENGR 610 Applied Statistics Fall Week 4 Marshall University CITE Jack Smith.
Review Lecture 51 Tue, Dec 13, Chapter 1 Sections 1.1 – 1.4. Sections 1.1 – 1.4. Be familiar with the language and principles of hypothesis testing.
Ka-fu Wong © 2003 Chap 6- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
Primer on Statistics for Interventional Cardiologists Giuseppe Sangiorgi, MD Pierfrancesco Agostoni, MD Giuseppe Biondi-Zoccai, MD.
1 Probability and Statistical Inference (9th Edition) Chapter 5 (Part 2/2) Distributions of Functions of Random Variables November 25, 2015.
Probability Theory Modelling random phenomena. Permutations the number of ways that you can order n objects is: n! = n(n-1)(n-2)(n-3)…(3)(2)(1) Definition:
Beginning Statistics Table of Contents HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Primer on Statistics for Interventional Cardiologists Giuseppe Sangiorgi, MD Pierfrancesco Agostoni, MD Giuseppe Biondi-Zoccai, MD.
Chapter 5 Probability Distributions 5-1 Overview 5-2 Random Variables 5-3 Binomial Probability Distributions 5-4 Mean, Variance and Standard Deviation.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Chapter 6 Sampling and Sampling Distributions
THE NORMAL DISTRIBUTION
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
Introduction to Probability - III John Rundle Econophysics PHYS 250
Combining Random Variables
Determining the distribution of Sample statistics
Review of Hypothesis Testing
Lecture Slides Elementary Statistics Twelfth Edition
Lecture Slides Elementary Statistics Twelfth Edition
Lecture Slides Essentials of Statistics 5th Edition
Further Topics on Random Variables: 1
Lecture Slides Essentials of Statistics 5th Edition
Introductory Statistics
Presentation transcript:

Primer on Statistics for Interventional Cardiologists Giuseppe Sangiorgi, MD Pierfrancesco Agostoni, MD Giuseppe Biondi-Zoccai, MD

What you will learn Introduction Basics Descriptive statistics Probability distributions Inferential statistics Finding differences in mean between two groups Finding differences in mean between more than 2 groups Linear regression and correlation for bivariate analysis Analysis of categorical data (contingency tables) Analysis of time-to-event data (survival analysis) Advanced statistics at a glance Conclusions and take home messages

What you will learn Probability distributions –what is it and what is it for –discrete: binomial, Poisson –continuous: normal, Chi-square, F and t –central limit theorem

What you will learn Probability distributions –what is it and what is it for –discrete: binomial, Poisson –continuous: normal, Chi-square, F and t –central limit theorem

What is a probability distribution?

Probability distribution: definition It identifies either the probability of each value of an unidentified random variable (for discrete variables), or the probability of the value falling within a particular interval (for continuous variables) The probability function describes the range of possible values that a random variable can attain and the probability that the value of the random variable is within any (measurable) subset of that range More roughly, a probability distribution is the universe of all possible cases for a given variable or function

Probability distribution: definition There are thus discrete probability distributions, when their cumulative distribution function only increases in jumps. More precisely, a probability distribution is discrete if there is a finite or countable set whose probability is 1. Otherwhise, probability distributions are called continuous if their cumulative distribution function is continuous, which means that it belongs to a random variable X.

Probability distribution: what for? Probability distributions are powerful tools which are routinely used (either explictly or implicitly) for making statistical inferences It is pivotal to identify the most appropriate distribution to be exploited for each given biostatistical problem Should you really be concerned?

Probability distribution: what for? Probability distributions are powerful tools which are routinely used (either explictly or implicitly) for making statistical inferences It is pivotal to identify the most appropriate distribution to be exploited for each given biostatistical problem Should you really be concerned? … Actually no, because when you correctly identify a given statistical test, you by default choose its corresponding probability distribution

What you will learn Probability distributions –what is it and what is it for –discrete: binomial, Poisson –continuous: normal, Chi-square, F and t –central limit theorem

Binomial distribution The binomial distribution is the discrete probability distribution of the number of successes in a sequence of n independent yes/no experiments, each of which yields success with probability p

Binomial distribution The binomial distribution and the corresponding binomial test are seldom used in clinical research, but they are the most basic example of probability distribution But, how can I recognize a biased die? Using the binomial distribution: I roll the dice 40 times, and compare my results to the results expected by the binomial model with n = 40 and p = 1/6

Poisson distribution The Poisson distribution is a discrete probability distribution that expresses the probability of a number of events occurring in a fixed period of time if these events occur with a known average rate and independently of the time since the last event. The Poisson distribution can also be used for the number of events in other specified intervals such as distance, area or volume

Poisson distribution The Poisson distribution provides a useful and efficient way to assess the percentage of time when a given range of results will be expected. You might wish to project a reasonable upper limit on some event after making a number of observations. Another potential application would be comparing rates of very rare adverse events, which occur sparsely in time and space The Poisson distribution and the corresponding tests are however seldom used in clinical research

What you will learn Probability distributions –What is it and what is it for –discrete: binomial, Poisson –continuous: normal, Chi-square, F and t –central limit theorem

Normal distribution The normal distribution, also called the Gaussian distribution, is an important family of continuous probability distributions, applicable in many fields Each member of the family may be defined by two parameters, location and scale: the mean ("average", μ) and variance (standard deviation squared, σ 2 ) respectively

Normal distribution The standard normal distribution is the normal distribution with a mean of zero and a variance of one

Normal distribution The normal distribution is probably the most powerful tool in biostatistics, with thousand uses. Why? –It can be summarized quickly and efficiently by just two numbers (μ and σ) –Many probability distributions look normal for large samples (see central limit theorem)

Chi-square distribution Describes the probability distribution of a random sum (Q) of k independent, normally distributed random variables with mean 0 and variance 1

Chi-square distribution It is commonly used for chi-square tests for goodness of fit of an observed distribution to a theoretical one, and of the independence of two criteria of classification of qualitative data It is a very powerful and robust tool in biostatistics, second only to the normal distribution, for comparing categorical variables and/or goodness of fit

F distribution The F distribution is a continuous probability distribution

F distribution Named by Snedecor as F for Ronald Aylmer Fisher, is a continuous probability distribution exploited for the comparison of continuous variables It is a complex but very potent tool in biostatistics, and forms the basis of analysis of variance (ANOVA), as well as many other complex statistical models and analyses (eg multivariable linear regression models)

t distribution Student t distribution (or simply the t distribution) is a probability distribution that arises in the problem of estimating the mean of a normally distributed population when the sample size is small Student's distribution arises when (as in nearly all practical statistical work) the population standard deviation is unknown and has to be estimated from the data.

t distribution Student t distribution (or simply the t distribution) is a probability distribution that arises in the problem of estimating the mean of a normally distributed population when the sample size is small t distribution arises when (as in nearly all practical statistical work) the population standard deviation is unknown and has to be estimated from the data Gosset

t distribution If you look behind a t distribution, you will find a…

t distribution If you look behind a t distribution, you will find a… GUINNESS!!!

t distribution The t distribution was developed in 1908 by William Sealy Gosset, while he worked at a Guinness Brewery in Dublin, as he was prohibited from publishing under his own name. So the paper was written under the pseudonym Student The t test and the associated frequentist theory became well- known through the work of R.A. Fisher, who called the distribution “Student's distribution”

t distribution The t test is a very useful and friendly test in biostatistics, probably the most commonly used one with the chi-square test

t distribution The t test is a very useful and friendly test in biostatistics, probably the most commonly used one with the chi-square test

What you will learn Probability distributions –what is it and what is it for –discrete: binomial, Poisson –continuous: normal, Chi-square, F and t –central limit theorem

Central limit theorem The central limit theorem (CLT) states that the re- averaged sum of a sufficiently large number of identically distributed independent random variables each with finite mean and variance will be approximately normally distributed In other words, any sum of many independent identically distributed random variables will tend to be distributed according to a particular "attractor distribution” Since many real populations yield distributions with finite variance (eg weight, height, IQ), this explains the prevalence of the normal probability distribution

Central limit theorem Histogram plot of average proportion of heads in a fair coin toss, over a large number of sequences of coin tosses.

Central limit theorem Histogram plot of average proportion of heads in a fair coin toss, over a large number of sequences of coin tosses. In other words, if you collect enough cases, most variables will be distributed normally around their means and according their variances, and parametric statistics and tests will be potentially applicable

Everything is connected – applications of the CLT From binomial to Poisson: –As n approaches ∞ and p approaches 0 while np remains fixed at λ > 0 or at least np approaches λ > 0, then the Binomial (n, p) distribution approaches the Poisson distribution with expected value λ From binomial to normal: –As n approaches ∞ while p remains fixed, the distribution of approaches the normal distribution with expected value 0 and variance 1 (this is just a specific case of the central limit theorem)

Value Frequency When is a distribution normal?

Rules of thumb Testing normality assumptions Rules of thumb 1.Refer to previous data or analyses (eg landmark articles, large databases) 2.Inspect tables and graphs (eg outliers, histograms) 3.Check rough equality of mean, median, mode 4.Perform ad hoc statistical tests Levene test for equality of variances Kolmogodorov-Smirnov test Moses-Shapiro test... When is a distribution normal?

Short test Sakurai et al, AJC 2007

Thank you for your attention For any correspondence: For further slides on these topics feel free to visit the metcardio.org website: