Estimation of authenticity of results of statistical research (part I)

The need to estimate the authenticity (reliability) of results depends on the scope of the research. In a complete study of the general aggregate (the whole population), where all units of observation are examined, each index can take only one value. The general aggregate is always reliable because it includes all units of observation. Official statistics are an example of a general aggregate.

The general aggregate is rarely used in medical and biological research; most studies are selective (sample-based). The law of large numbers is the basis for forming a reliable selective aggregate (sample). It states that, once the number of observations is large, the average of the characteristic studied in the selective aggregate will, with high probability, differ only slightly from the average in the whole general aggregate.

A selective aggregate always carries an error, because not all units of observation are included in the study. The authenticity of selective research depends on the size of this error. The greater the number of observations, the smaller the error and the smaller the random fluctuations of the index. Therefore, to decrease the error, the number of observations must be increased.
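The link between the number of observations and the size of the error can be illustrated with a small simulation. This is only a sketch: the synthetic population of 10,000 pulse-rate values, the seed, and the helper name `mean_abs_error` are assumptions made for illustration and are not data from the lecture.

```python
import random
import statistics

def mean_abs_error(population, sample_size, repeats, rng):
    """Average absolute gap between sample means and the population mean."""
    true_mean = statistics.fmean(population)
    gaps = []
    for _ in range(repeats):
        # Draw a selective aggregate (sample) without replacement.
        sample = rng.sample(population, sample_size)
        gaps.append(abs(statistics.fmean(sample) - true_mean))
    return statistics.fmean(gaps)

rng = random.Random(42)  # fixed seed so the run is repeatable
# Hypothetical general aggregate: pulse rates around 72 beats per minute.
population = [rng.gauss(72, 10) for _ in range(10_000)]

errors = {n: mean_abs_error(population, n, 200, rng) for n in (10, 100, 1000)}
for n, err in errors.items():
    print(f"n = {n:4d}: mean absolute error of the sample mean = {err:.2f}")
```

With these assumptions, the printed error shrinks as n grows, which is the behavior the law of large numbers describes.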

The distribution of birthwt is shown.

Objective: to describe the distribution and frequency of a disease in a population.

Four primary types of epidemiology studies

How to describe?
- What is the problem of the disease?
- How frequent is it?
- Who is affected? (person)
- Where and when does it occur? (place and time)

Three distributions: place, person, time

Population
- Age
- Sex
- Race
- Behavior

Age
- Frequency of disease
- Severity of disease
Young people: infectious diseases. Old people: noninfectious diseases (accumulation of environmental factors).

Examples
- Children are more susceptible to some infectious diseases, such as measles.
- The prevalence of hypertension increases with age.

Mortality rate by age (Figure 6-1)

Incidence rate by age (Figure 6-2)

Serum HDL-cholesterol (mmol/L) by age, Tromsø 1994/ (The Tromsø Study; ISM, UiT)

Sex
- The frequency and severity of disease differ between the male and female populations.
- This helps to identify risk factors for disease, e.g. endemic goiter (struma): female > male.

Prevalence of obesity in Han students aged 8-18 years in Urumqi, 2003

Race (ethnicity)
- Obesity and hypertension are more prevalent in blacks than in whites.
- T2D is very prevalent in Pima Indians.
- The prevalence of hypertension differs considerably among ethnicities. Why? Genetics and environment.

Prevalence of obesity (%) among ethnicities (adjusted for age)

Prevalence of EH among ethnic adults (1991)

Death rate in the U.S. (causes of death)
- Blacks: hypertensive heart disease, stroke, tuberculosis, syphilis, and accidental death.
- Whites: coronary artery disease, suicide, and leukemia.

Behaviors
- Cigarette smoking, alcohol consumption, drug abuse, high salt intake, fatty food, and so on.
- Determined by biological and social factors.

Place
- Countries
- Urban and rural areas
- Places at different altitudes

Estimated number of people at over 35% risk of a major cardiovascular event in the next decade, by WHO sub-region

CHD mortality. Women and men, age adjusted rates per Source: WHO Health statistics annual 1993/94 ISM, UiT

Time: when does the disease occur and spread in the population?

Mean Plasma Cholesterol Values in China

Some terms to describe the “time” of diseases
- Long-term or secular trends
- Periodic fluctuations (cyclical changes): seasonal trends, cyclical trends
- Short-term fluctuations

Secular trends: changes in the incidence of disease over a long period of time (several years or decades). CHD has shown an upward trend in developed countries over decades.

Periodic fluctuations
1. Seasonal trends
- Diarrhea: summer
- Respiratory diseases: winter
2. Cyclical trends: disease occurs in cycles spread over short periods of time (days, weeks, months or years), e.g. influenza (7-10 yrs).

Basic criteria of authenticity (representation):
- Error of representation (m)
- Confidence limits
- The coefficient of authenticity (Student's criterion, t), which assesses the authenticity of the difference between average or relative values

Basic criteria of authenticity (representation): the error of representation (m) measures the authenticity of an average or relative value. It shows how much the results of selective research differ from the results that would be obtained from a continuous study of the general aggregate.
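The slide does not give a formula for m. A minimal sketch, assuming the usual textbook forms m = σ/√n for an average and m = √(P·q/n) for a relative value expressed in percent (with q = 100 − P), could look like this. The function names and the sample figures are hypothetical.

```python
import math
import statistics

def error_of_mean(values):
    """Error of representation of an average: m = sigma / sqrt(n)."""
    sigma = statistics.stdev(values)  # sample standard deviation (n - 1)
    return sigma / math.sqrt(len(values))

def error_of_relative_value(p_percent, n):
    """Error of representation of a relative value in percent: m = sqrt(P * q / n)."""
    q = 100 - p_percent
    return math.sqrt(p_percent * q / n)

# Hypothetical pulse rates of five patients (beats per minute).
pulse = [70, 72, 74, 76, 78]
m_mean = error_of_mean(pulse)

# Hypothetical survey: 20% of 400 examined people have the condition.
m_relative = error_of_relative_value(20, 400)

print(f"m for the mean pulse: {m_mean:.2f}")
print(f"m for a 20% rate with n = 400: {m_relative:.2f}")
```

Both functions show the same dependence: the error falls in proportion to the square root of the number of observations.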

Basic criteria of authenticity (representation): confidence limits carry the properties of the selective aggregate over to the general one. They show the probable fluctuation of the index in the general aggregate, that is, the minimum and maximum values between which the value for the general aggregate can lie.
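As an illustration, confidence limits are commonly computed as M ± t·m, where t is a confidence coefficient. The sketch below assumes the large-sample rule of thumb that t = 2 corresponds to roughly 95% probability; the function name and the numbers are hypothetical.

```python
def confidence_limits(average, m, t=2.0):
    """Return (lower, upper) limits as average -/+ t * m.

    t = 2.0 is assumed here as the approximate 95% coefficient
    for large samples; small samples need a table value of t.
    """
    margin = t * m
    return average - margin, average + margin

# Hypothetical sample result: mean pulse 72 with error of representation 1.5.
lower, upper = confidence_limits(72.0, 1.5)
print(f"The general aggregate mean is expected between {lower} and {upper}")
```

A smaller m, obtained from a larger number of observations, narrows these limits.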

Basic criteria of authenticity (representation): the coefficient of authenticity (Student's criterion, t) assesses the authenticity of the difference between average or relative values. Student's criterion shows whether the corresponding indices in two separate selective aggregates differ.
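The slide gives no formula for t. A sketch under the assumption that the usual form for two independent samples is meant, t = |M1 − M2| / √(m1² + m2²), follows. The threshold t ≥ 2 for an authentic difference is the common large-sample convention, and all names and numbers here are hypothetical.

```python
import math

def student_t(value_1, m_1, value_2, m_2):
    """Coefficient of authenticity for the difference of two averages
    (or two relative values), each with its error of representation."""
    return abs(value_1 - value_2) / math.sqrt(m_1 ** 2 + m_2 ** 2)

# Hypothetical systolic pressure in two groups: 120 +/- 3 and 110 +/- 4.
t = student_t(120.0, 3.0, 110.0, 4.0)

# Assumed convention: t >= 2 is treated as an authentic difference
# (about 95% probability) when both samples are large.
is_authentic = t >= 2.0
print(f"t = {t:.2f}, authentic difference: {is_authentic}")
```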

The use of averages in health protection: to describe the organization of work in health care institutions (average bed occupancy, length of hospital stay, number of visits per inhabitant, and others);

The use of averages in health protection: to describe indices of physical development (body length, body mass, head circumference of the newborn, and others);

The use of averages in health protection: to determine medical-physiological indices of the organism (pulse rate, respiratory rate, arterial pressure level, and others);

The use of averages in health protection: to evaluate data from medical-social and sanitary-hygienic research (average number of laboratory tests, average food ration norms, level of radiation contamination, and others).

Averages: averages are widely used for comparison over time, which makes it possible to characterize the main regularities in the development of a phenomenon. For example, the regularity of growth in children of a certain age is expressed in generalized indices of physical development. Regularities in the dynamics (increase or decrease) of pulse rate, respiration, and clinical parameters in certain diseases are reflected in statistical indices that represent the physiological parameters of the organism.

Average Values
- Mean: the average of the data; sensitive to outlying data
- Median: the middle of the data; not sensitive to outlying data
- Mode: the most commonly occurring value
- Range: the difference between the largest observation and the smallest
- Interquartile range: the spread of the middle 50% of the data; commonly used for skewed data
- Standard deviation: a single number which measures how much the observations vary around the mean
- Symmetrical data: data that follow a normal distribution (mean = median = mode); report mean, standard deviation and n
- Skewed data: not normally distributed (mean ≠ median ≠ mode); report median and interquartile range
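The measures listed above can be computed with Python's standard `statistics` module. The data set below is hypothetical and deliberately contains one outlying value, so that the difference between the mean and the median is visible. Quartiles depend on the interpolation method, and the module's default is used here.

```python
import statistics

# Hypothetical length of hospital stay in days; 30 is an outlying value.
stay_days = [2, 3, 3, 4, 5, 6, 30]

mean = statistics.fmean(stay_days)
median = statistics.median(stay_days)
mode = statistics.mode(stay_days)
data_range = max(stay_days) - min(stay_days)
sd = statistics.stdev(stay_days)  # sample standard deviation

# Quartiles with the module's default ("exclusive") method.
q1, _q2, q3 = statistics.quantiles(stay_days, n=4)
iqr = q3 - q1

print(f"mean = {mean:.2f}, median = {median}, mode = {mode}")
print(f"range = {data_range}, IQR = {iqr}, SD = {sd:.2f}")
```

Because the data are skewed by the outlier, the mean is pulled well above the median, which is why the median and interquartile range are the recommended summary in that case.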

Average Values: the limit is the pair of extreme variants in a variation series: lim = Vmin to Vmax

Average Values: the amplitude is the difference between the extreme variants of a variation series: Am = Vmax - Vmin

Average Values: the average quadratic deviation (standard deviation) characterizes the dispersion of the variants around the average value (the internal structure of the aggregate).

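Average quadratic deviation, simple arithmetic method: $\sigma = \sqrt{\dfrac{\sum d^{2} p}{n}}$

where $d = V - M$ is the true deviation of a variant $V$ from the true arithmetic mean $M$, $p$ is the frequency of the variant, and $n$ is the number of observations.

Average quadratic deviation, method of moments: $\sigma = i \sqrt{\dfrac{\sum a^{2} p}{n} - \left(\dfrac{\sum a p}{n}\right)^{2}}$

where $i$ is the class interval and $a$ is the deviation of a variant from a conditional mean, measured in intervals.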
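The two methods named on these slides can be compared on a small grouped series. The formulas did not survive in the transcript, so this sketch assumes the standard textbook forms: σ = √(Σd²p / n) with d = V − M for the simple arithmetic method, and σ = i·√(Σa²p/n − (Σap/n)²) for the method of moments, where a = (V − A)/i, A is an arbitrarily chosen conditional mean and i is the class interval. The variation series itself is hypothetical.

```python
import math

# Hypothetical variation series: variants V with frequencies p.
variants = [60, 65, 70, 75, 80]
frequencies = [1, 2, 4, 2, 1]
n = sum(frequencies)

# Simple arithmetic method: deviations d = V - M from the true mean M.
M = sum(v * p for v, p in zip(variants, frequencies)) / n
sum_d2p = sum((v - M) ** 2 * p for v, p in zip(variants, frequencies))
sigma_direct = math.sqrt(sum_d2p / n)

# Method of moments: deviations a from a conditional mean A, in intervals i.
A = 65  # assumed conditional mean; any variant may be chosen
i = 5   # class interval of this series
a = [(v - A) / i for v in variants]
first_moment = sum(ai * p for ai, p in zip(a, frequencies)) / n
second_moment = sum(ai ** 2 * p for ai, p in zip(a, frequencies)) / n
sigma_moments = i * math.sqrt(second_moment - first_moment ** 2)

print(f"M = {M}, sigma (direct) = {sigma_direct:.3f}, "
      f"sigma (moments) = {sigma_moments:.3f}")
```

Both routes give the same σ; the method of moments only replaces large deviations with small whole numbers, which mattered for hand calculation.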

Average quadratic deviation is needed for:
1. Estimating how typical the arithmetic mean is (M is typical for the series if σ is less than 1/3 of the average value).
2. Obtaining the error of the average value.
3. Determining the average norm of the phenomenon under study (M±1σ), the subnorm (M±2σ) and extreme deviations (M±3σ).
4. Constructing a sigma grid for assessing the physical development of an individual.

Average quadratic deviation: the dispersion of the variants around the average is characterized by the average quadratic deviation (σ).

The coefficient of variation is a relative measure of variability: the ratio of the standard deviation to the arithmetic mean, expressed as a percentage.
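As a brief sketch of this definition, the coefficient of variation is CV = σ / M × 100%. The function name and the figures below are hypothetical.

```python
def coefficient_of_variation(sigma, mean):
    """Standard deviation as a percentage of the arithmetic mean."""
    if mean == 0:
        raise ValueError("the coefficient of variation is undefined for a zero mean")
    return sigma / mean * 100

# Hypothetical body mass of newborns: mean 3.4 kg, standard deviation 0.34 kg.
cv = coefficient_of_variation(0.34, 3.4)
print(f"CV = {cv:.1f}%")
```

Because the result is unit-free, it allows the variability of characteristics measured in different units, such as body mass and body length, to be compared.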

Terms Used To Describe The Quality Of Measurements
- Reliability is the variability between subjects divided by the sum of the between-subject variability and the measurement error.
- Validity refers to the extent to which a test or surrogate is measuring what we think it is measuring.

Measures Of Diagnostic Test Accuracy
- Sensitivity is defined as the ability of the test to identify correctly those who have the disease.
- Specificity is defined as the ability of the test to identify correctly those who do not have the disease.
- Predictive values are important for assessing how useful a test will be in the clinical setting at the individual patient level. The positive predictive value is the probability of disease in a patient with a positive test. Conversely, the negative predictive value is the probability that a patient with a negative test result does not have the disease.
- The likelihood ratio indicates how much a given diagnostic test result will raise or lower the odds of having a disease relative to the prior probability of disease.
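These measures all follow from a 2×2 table of test result against true disease status. The sketch below uses hypothetical counts (TP, FN, FP, TN) and a hypothetical function name; it is an illustration of the definitions above, not data from the lecture.

```python
def diagnostic_accuracy(tp, fn, fp, tn):
    """Compute accuracy measures from the four cells of a 2x2 table."""
    sensitivity = tp / (tp + fn)  # diseased people correctly identified
    specificity = tn / (tn + fp)  # healthy people correctly identified
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": tp / (tp + fp),    # P(disease | positive test)
        "npv": tn / (tn + fn),    # P(no disease | negative test)
        "lr_positive": sensitivity / (1 - specificity),
        "lr_negative": (1 - sensitivity) / specificity,
    }

# Hypothetical screening of 1000 people, 100 of whom have the disease.
result = diagnostic_accuracy(tp=90, fn=10, fp=30, tn=870)
for name, value in result.items():
    print(f"{name}: {value:.3f}")
```

Sensitivity and specificity describe the test itself, whereas the predictive values also depend on how common the disease is in the group tested.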

Measures Of Diagnostic Test Accuracy

Expressions Used When Making Inferences About Data
- Confidence intervals: the results of any study sample are an estimate of the true value in the entire population. The true value may actually be greater or less than what is observed.
- Type I error (alpha) is the probability of incorrectly concluding there is a statistically significant difference in the population when none exists.
- Type II error (beta) is the probability of incorrectly concluding that there is no statistically significant difference in a population when one exists.
- Power is a measure of the ability of a study to detect a true difference.

Multivariable Regression Methods
- Multiple linear regression is used when the outcome is a continuous variable such as weight. For example, one could estimate the effect of a diet on weight after adjusting for the effect of confounders such as smoking status.
- Logistic regression is used when the outcome is binary, such as cure or no cure. Logistic regression can be used to estimate the effect of an exposure on a binary outcome after adjusting for confounders.

Survival Analysis
- Kaplan-Meier analysis measures the ratio of surviving subjects (or those without an event) divided by the total number of subjects at risk for the event. Every time a subject has an event, the ratio is recalculated. These ratios are then used to generate a curve to graphically depict the probability of survival.
- Cox proportional hazards analysis is similar to the logistic regression method described above, with the added advantage that it accounts for time to a binary event in the outcome variable. Thus, one can account for variation in follow-up time among subjects.
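The recalculation described for Kaplan-Meier analysis can be sketched in plain Python. At each time with at least one event, the running survival probability is multiplied by (at risk − events) / at risk; censored subjects leave the risk set without changing the probability. The follow-up data and the function name are hypothetical.

```python
def kaplan_meier(durations, events):
    """Return [(time, survival_probability)] for each time with an event.

    durations: follow-up time of each subject
    events:    1 if the event occurred at that time, 0 if censored
    """
    data = sorted(zip(durations, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    index = 0
    while index < len(data):
        time = data[index][0]
        event_count = 0
        leaving = 0
        # Collect every subject whose follow-up ends at this time.
        while index < len(data) and data[index][0] == time:
            event_count += data[index][1]
            leaving += 1
            index += 1
        if event_count:
            survival *= (at_risk - event_count) / at_risk
            curve.append((time, survival))
        at_risk -= leaving
    return curve

# Hypothetical follow-up in months; 0 marks a censored subject.
months = [2, 3, 3, 5, 8, 8]
had_event = [1, 1, 0, 1, 0, 1]
curve = kaplan_meier(months, had_event)
for time, probability in curve:
    print(f"month {time}: survival probability {probability:.3f}")
```

Plotting these points as a step function gives the survival curve referred to above.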

Kaplan-Meier Survival Curves

Why Use Statistics?

Descriptive Statistics
- Identifies patterns in the data
- Identifies outliers
- Guides choice of statistical test

Percentage of Specimens Testing Positive for RSV (respiratory syncytial virus)

Descriptive Statistics

Distribution of Course Grades