Variability and statistical tests. Where the variability comes from? Instrumental measurements Biology –Genotype –Environment –Ootype –Experimental factors.

Slides:



Advertisements
Similar presentations
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
Advertisements

Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Correlation and Regression
Statistical Issues in Research Planning and Evaluation
Design of Experiments and Analysis of Variance
Correlation and Linear Regression
QBA 260 Chapter 1. Chapter 1 Topics Samples and Populations Samples and Populations Types of Data Types of Data Variables – Independent and Dependent.
ANOVA: ANalysis Of VAriance. In the general linear model x = μ + σ 2 (Age) + σ 2 (Genotype) + σ 2 (Measurement) + σ 2 (Condition) + σ 2 (ε) Each of the.
General linear model and regression analysis. The general linear model: Y = μ + σ 2 (Age) + σ 2 (Sex) + σ 2 (Genotype) + σ 2 (Measurement) + σ 2 (Condition)
Chapter Topics Types of Regression Models
PSYC512: Research Methods PSYC512: Research Methods Lecture 9 Brian P. Dyre University of Idaho.
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
Statistical Treatment of Data Significant Figures : number of digits know with certainty + the first in doubt. Rounding off: use the same number of significant.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Correlation and Regression Analysis
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Simple Linear Regression Analysis
Relationships Among Variables
Choosing Statistical Procedures
AM Recitation 2/10/11.
Chapter 13: Inference in Regression
Variation, Validity, & Variables Lesson 3. Research Methods & Statistics n Integral relationship l Must consider both during planning n Research Methods.
1 Chapter 1: Introduction to Design of Experiments 1.1 Review of Basic Statistical Concepts (Optional) 1.2 Introduction to Experimental Design 1.3 Completely.
Evidence Based Medicine
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Chapter 1 Measurement, Statistics, and Research. What is Measurement? Measurement is the process of comparing a value to a standard Measurement is the.
Examining Relationships in Quantitative Research
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
1 Chapter 1: Introduction to Design of Experiments 1.1 Review of Basic Statistical Concepts (Optional) 1.2 Introduction to Experimental Design 1.3 Completely.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
SW388R6 Data Analysis and Computers I Slide 1 Multiple Regression Key Points about Multiple Regression Sample Homework Problem Solving the Problem with.
Chapter 22: Comparing Two Proportions. Yet Another Standard Deviation (YASD) Standard deviation of the sampling distribution The variance of the sum or.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Review Hints for Final. Descriptive Statistics: Describing a data set.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
METHODS IN BEHAVIORAL RESEARCH NINTH EDITION PAUL C. COZBY Copyright © 2007 The McGraw-Hill Companies, Inc.
Question paper 1997.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.3 Two-Way ANOVA.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 11: Models Marshall University Genomics Core Facility.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Ch 8 Estimating with Confidence 8.1: Confidence Intervals.
Research Methodology Lecture No :26 (Hypothesis Testing – Relationship)
Chapter 13 Understanding research results: statistical inference.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Hypothesis Testing and Statistical Significance
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Advanced Higher STATISTICS Spearman’s Rank (Spearman’s rank correlation coefficient) Lesson Objectives 1. Explain why it is used. 2. List the advantages.
Chapter 22 Inferential Data Analysis: Part 2 PowerPoint presentation developed by: Jennifer L. Bellamy & Sarah E. Bledsoe.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 9 Introduction to the t Statistic
Some Terminology experiment vs. correlational study IV vs. DV descriptive vs. inferential statistics sample vs. population statistic vs. parameter H 0.
STATISTICAL TESTS FOR SCIENCE FAIR PROJECTS
Review 1. Describing variables.
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
Inferential Statistics
Type I and Type II Errors
Presentation transcript:

Variability and statistical tests

Where the variability comes from? Instrumental measurements Biology –Genotype –Environment –Ootype –Experimental factors Randomly fluctuating Gradually changing in time: drift Errors -Random: not reproducible -Systematic: is reproduced in a particular setting -Major: something crucial has been overlooked in the experiment Depending on context, these can be classified into factors of: 1.Core research interest 2.Satellite ones 3.Nuisance

Variables Nominal: yellow, blue, green… Ordinal: small, big Interval: 0…10, 11…20, 21…30 etc. Ratio: p/N Continuous: … Discrete: 4, 7, 11 Binary: 0 or 1, “Yes” or “No”

Signal vs. noise On can introduce a continous varaiable,.. X Y

Class 1Class 2 …discrete classes (~bins, levels etc.) for one variable,.. X Y

Class XaClass Xb …or discrete classes for both variables X Y Class Y2 Class Y1

Quantify it! We should figure out which factors are most relevant to the phenomenon being studied. An example: 1.Age σ 2 (Age) 2.Sex σ 2 (Sex) 3.Genotype σ 2 (Genotype) 4.Measurement differencesσ 2 (Measurement) 5.Experimental conditionsσ 2 (Condition) Thus, the general linear model: Y = μ + σ 2 (Age) + σ 2 (Sex) + σ 2 (Genotype) + σ 2 (Measurement) + σ 2 (Condition) + ε Y: response of the system μ: grand mean σ 2 : variance from the factor ε: error (correctly speaking, residual or unexplained variance!)

In other words, to capture a signal y = f(x) ( an example signal : “the higher x, the better y” ), a formalization is needed.

Different methods: Work with different data (both factors and responses) Have different power in different conditions (sample size, data type, design topology) Answer different questions (defined via null hypotheses) Provide different amount of supplementary output (graphs, tables etc.)

What are variables? Variables are things that we measure, control, or manipulate in research. They differ in the role they are given in our research and in the way they are measured.

Correlational vs. experimental research In correlational research we do not influence any variables but only measure them In experimental research, we manipulate some variables and then measure the effects of this manipulation on other variables; for example, a researcher might artificially increase blood pressure and then record cholesterol level. However, “correlation-like” techniques still may be applied to experimental data. But due to a better quality of the experimental setting, they potentially provide qualitatively better information

Dependent vs. independent variables Independent varables ARE MANIPULATED in the experiment Dependent ones ARE NOT MANIPULATED Independent variables shape the experiment Dependent variables measure its result

Relations between variables Distributed in a consistent manner Systematically correspond to each other Do not forget the noise!

Features of relations Two basic features of every relation between variables “magnitude” (~strength) vs. “reliability” (~confidence, or significance): not totally independent.

Statistical significance p-level: the probability of the relation to NOT EXIST

Null hypothesis Null hypothesis H 0 which we test: –Is the reference point in the analysis –States that “The factor does not work” (or “The relation does not exist”) –Its rejection proves (at some probability) that the factor does work (“is likely to work”)! In the tests we are going to consider, the null hypothesis H0: σ 2 (The factor) = 0 almost always has an equality condition!

How to determine that a result is "really" significant? How the "level of statistical significance" is calculated? Can "no relation" be a significant result? Only after a test on the general population!

How to measure the magnitude (strength) of relations between variables? (regression) Common "general format" of most statistical tests.

Why stronger relations between variables are more significant? The stronger relation, the higher is the chance it will exceed the noise. Thus, the relation is easier to prove.

Description of the established relations: –Strong? 1.Absolutely 2.Related to other relations –Confident? By different tests –Robust? What happens if: we change the method? the distribution changes the shape?

In the general linear model x = μ + σ 2 (Age) + σ2(Sex) + σ2(Genotype) + σ2(Measurement) + σ2(Condition) + ε Each of the terms σ2 can be questioned. Moreover, their particular combinations can be studied. x = μ + … σ2(Age X Sex) + … + σ2(Sex X Genotype) + σ2(Age X Genotype X Condition) + … + ε Examples: “Does the disease prognosis deteriorate with age equally for men and women?” H0: σ2(Age X Sex) = 0 “Is not genotype AbC reaction particularly difficult to detect by measuring with tool Z?” H0: σ2(Genotype X Measurement) = 0

Pearson correlation Kind of problems that it solves Describe strength of relations between two variables SuitabilityFor normally distributed data Basic theorySums of squares Addressable questionsIf X and Y are related? Not addressable questions Format of input data How to run Interpretation of the results Presentation of the results