Dr.Shaikh Shaffi Ahamed Ph.D., Associate Professor Dept. of Family & Community Medicine.

Slides:



Advertisements
Similar presentations
Revision Slides Types of Data
Advertisements

Numeracy & Quantitative Methods: Numeracy for Professional Purposes Laura Lake.
Types of Variables Objective:
Types of Data Types of data Categorical data Measurement data.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Slide 1 Spring, 2005 by Dr. Lianfen Qian Lecture 2 Describing and Visualizing Data 2-1 Overview 2-2 Frequency Distributions 2-3 Visualizing Data.
Chapter 2 Presenting Data in Tables and Charts
1 Introduction to biostatistics By Dr. S. Shaffi Ahamed Asst. Professor Dept. of Family & Community Medicine KKUH.
INTRODUCTION TO BIOSTATISTICS DR.S.Shaffi Ahamed Asst. Professor Dept. of Family and Comm. Medicine KKUH.
Research Methods in MIS
DESCRIPTIVE STATISTICS: GRAPHICAL AND NUMERICAL SUMMARIES
INTRODUCTION TO BIOSTATISTICS
Quantitative Data Analysis Definitions Examples of a data set Creating a data set Displaying and presenting data – frequency distributions Grouping and.
Sexual Activity and the Lifespan of Male Fruitflies
Introduction to Statistics
Ka-fu Wong © 2003 Chap 2-1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
Thomas Songer, PhD with acknowledgment to several slides provided by M Rahbar and Moataza Mahmoud Abdel Wahab Introduction to Research Methods In the Internet.
Introduction to Biostatistics
Statistical Techniques in Hospital Management QUA 537
The Stats Unit.
3 - 1 Module 2: Types of Data This module describes the types of data typically encountered in public health applications. Recognizing and understanding.
Frequency Distributions and Graphs
1 Statistics This lecture covers chapter 1 and 2 sections in Howell Why study maths in psychology? “Mathematics has the advantage of teaching you.
Collecting, Presenting, and Analyzing Research Data By: Zainal A. Hasibuan Research methodology and Scientific Writing W# 9 Faculty.
With Statistics Workshop with Statistics Workshop FunFunFunFun.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
Copyright © 2012 by Nelson Education Limited.2-1 Chapter 2 Basic Descriptive Statistics: Percentages, Ratios and Rates, Tables, Charts, and Graphs.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 3 Organizing and Displaying Data.
Data Presentation.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Displaying Data Visually
Smith/Davis (c) 2005 Prentice Hall Chapter Four Basic Statistical Concepts, Frequency Tables, Graphs, Frequency Distributions, and Measures of Central.
STAT 211 – 019 Dan Piett West Virginia University Lecture 1.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
1 Introduction to biostatistics By Dr. S. Shaffi Ahamed Asst. Professor Dept. of Family & Community Medicine KKUH.
Dr.Shaikh Shaffi Ahamed Ph.D., Associate Professor Dept. of Family & Community Medicine.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
Warm-Up Set your signed welcome letter on the stool. 1. What is your favorite color? 2. What is your favorite place to shop? 3. What is your favorite.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
An Overview of Statistics Section 1.1. Ch1 Larson/Farber 2 Statistics is the science of collecting, organizing, analyzing, and interpreting data in order.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 2-2 Frequency Distributions.
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
Chapter Eight: Using Statistics to Answer Questions.
Chapter 7 Measuring of data Reliability of measuring instruments The reliability* of instrument is the consistency with which it measures the target attribute.
Data, Type and Methods of representation Dr Hidayathulla Shaikh.
Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms.
Measurements Statistics WEEK 6. Lesson Objectives Review Descriptive / Survey Level of measurements Descriptive Statistics.
Types of Data Dr.Lely Lubna Alaydrus Community Medicine Department Aimst University.
Dr.Shaikh Shaffi Ahamed Ph.D., Associate Professor Dept. of Family & Community Medicine.
Biostatistics Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
M. MASTAK AL AMIN The summary Table A summary table indicates the frequency, amount or percentage of items in a set of categories so that you can see differences.
2 NURS/HSCI 597 NURSING RESEARCH & DATA ANALYSIS GEORGE MASON UNIVERSITY.
Introduction to Biostatistics Lecture 1. Biostatistics Definition: – The application of statistics to biological sciences Is the science which deals with.
Data organization and Presentation. Data Organization Making it easy for comparison and analysis of data Arranging data in an orderly sequence or into.
SESSION 1 & 2 Last Update 15 th February 2011 Introduction to Statistics.
Pharmaceutical Statistics
INTRODUCTION AND DEFINITIONS
Introduction to biostatistics & Basic Concepts By Dr. S
Chapter 5 STATISTICS (PART 1).
Chapter 2 Describing Data: Graphs and Tables
Basic Statistical Terms
Sexual Activity and the Lifespan of Male Fruitflies
Types of Data.
Biostatistics College of Medicine University of Malawi 2011.
Displaying Data – Charts & Graphs
Chapter Nine: Using Statistics to Answer Questions
Experimental Design Experiments Observational Studies
Biostatistics Lecture (2).
Frequency Distribution and Graphs
Presentation transcript:

Dr.Shaikh Shaffi Ahamed Ph.D., Associate Professor Dept. of Family & Community Medicine

Statistics is the science of conducting studies to collect, organize, summarize, analyze, present, interpret and draw conclusions from data. Any values (observations or measurements) that have been collected

What Is Statistics? 1. Collecting Data e.g., Sample, Survey, Observe, Simulate 2. Characterizing Data e.g., Organize/Classify, Count, Summarize 3. Presenting Data e.g., Tables, Charts, Statements 4. Interpreting Results e.g. Infer, Conclude, Specify Confidence Why? Data Analysis Decision- Making © T/Maker Co.

(1) Statistics arising out of biological sciences, particularly from the fields of Medicine and public health. (2) The methods used in dealing with statistics in the fields of medicine, biology and public health for planning, conducting and analyzing data which arise in investigations of these branches.

 Summarize data  Generalize from a sample to a population  Compare groups  Test for relationships  Analyze multiple variables and their relationship to a given outcome

 Assess time to an event as an endpoint  Report the characteristics of a test  Combine results of several studies  Assess costs and consequences of treatment  Report decision analyses and clinical practice guidelines

BASIC CONCEPTS Data : Set of values of one or more variables recorded on one or more observational units (singular: Datum) Categories of data 1. Primary data: observation, questionnaire, interviews & survey 2. Secondary data: census, medical records, registry Sources of data 1. Routinely kept records 2. Surveys (census) 3. Experiments 4. External source

Dataset: Data for a set of variables collection in group of persons. Data Table: A dataset organized into a table, with one column for each variable and one row for each person. Datasets and Data Tables

OBSAGEBMIFFNUMTEMP( 0 F)GENDEREXERCISE LEVELQUESTION Typical Data Table

Definitions for Variables AGE: Age in years BMI: Body mass index, weight/height 2 in kg/m 2 FFNUM: The average number of times eating “fast food” in a week TEMP: High temperature for the day GENDER: 1- Female 0- Male EXERCISE LEVEL: 1- Low 2- Medium 3- High QUESTION: what is your satisfaction rating for this Biostatistics session ? 1- Very Satisfied 2- Somewhat Satisfied 3- Neutral 4- Somewhat dissatisfied 5- Dissatisfied

When collecting or gathering data we collect data from individuals cases on particular variables. A variable is a unit of data collection whose value can vary. Variables can be defined into types according to the level of mathematical scaling that can be carried out on the data. There are four types of data or levels of measurement: Types of variables and data 1. Nominal2. Ordinal 3. Interval4. Ratio

Scales of Measurement

Terminology  Categorical variables  Quantity variables  Nominal variables  Ordinal Variables  Binary data.  Discrete and continuous data.  Interval and ratio variables  Qualitative and Quantitative traits/ characteristics of data.

Categorical data  The objects being studied are grouped into categories based on some qualitative trait.  The resulting data are merely labels or categories.

Examples of categorical data  Eye color: blue, brown, black, green, etc.  Smoking status: smoker, non-smoker  Attitudes towards the death penalty: Strongly disagree, disagree, neutral, agree, strongly agree.

Categorical data Ordinal data Nominal data

Nominal data  A type of categorical data in which objects fall into unordered categories.  Studies measuring nominal data must ensure that each category is mutually exclusive and the system of measurement needs to be exhaustive.  Variables that have only two responses i.e. Yes or No, are known as dichotomies.

Examples of Nominal data  Type of Car  BMW, Mercedes, Lexus, Toyota, etc.,  Ethnicity  White British, Afro-Caribbean, Asian, Arab, Chinese, other, etc.  Smoking status  smoker, non-smoker

Binary Data  A type of categorical data in which there are only two categories. Examples:  Smoking status- smoker, non-smoker  Attendance- present, absent  Result of a exam- pass, fail.  Status of student- undergraduate, postgraduate.

Ordinal data is data that comprises of categories that can be rank ordered. Similarly with nominal data the distance between each category cannot be calculated but the categories can be ranked above or below each other. Ordinal data

Examples of Ordinal Data:  Grades in exam- A+, A, B+ B, C+, C,D, D+, and Fail.  Degree of illness- none, mild, moderate, acute, chronic.  Opinion of students about stats classes- Very unhappy, unhappy, neutral, happy, ecstatic!

Examples: Nominal data (Binary)& Ordinal data What is your gender? (please tick) Male Female Did you enjoy the teaching session ? (please tick) Yes No What is the level of satisfaction with the new curriculum at a medical school received? (please tick) Very satisfied Somewhat satisfied Neutral Somewhat dissatisfied Very dissatisfied

QUANTATIVE DATA  The objects being studied are ‘measured’ based on some quantitative trait.  The resulting data are set of numbers. Examples:  Pulse rate  Height  Age  Exam marks  Time to complete a statistics test  Number of cigarettes smoked

Quantitative data Continuous Discrete

Discrete Data Only certain values are possible (there are gaps between the possible values). Implies counting. Continuous Data Theoretically, with a fine enough measuring device. Implies counting.

26 Discrete data -- Gaps between possible values Continuous data -- Theoretically, no gaps between possible values Number of Children Hb

Examples of Discrete Data:  Number of children in a family  Number of students passing a stats exam  Number of crimes reported to the police  Number of bicycles sold in a day. Generally, discrete data are counts. We would not expect to find 2.2 children in a family or 88.5 students passing an exam or crimes being reported to the police or half a bicycle being sold in one day.

Example of Continuous Data:  Age ( in years)  Height( in cms.)  Weight (in Kgs.)  Sys.BP, Hb., etc., ‘Generally, continuous data come from measurements.

Variables Category Quantity Nominal Ordinal Discrete (counting) Continuous (measuring)

Interval variables Examples:  Fahrenheit temperature scale- Zero is arbitrary- 40 Degrees is not twice as hot as 20 degrees.  IQ tests. No such thing as Zero IQ. 120 IQ not twice as intelligent as 60.  Question- Can we assume that attitudinal data represents real, quantifiable measured categories? (ie. That ‘very happy’ is twice as happy as plain ‘happy’ or that ‘Very unhappy’ means no happiness at all). “Statisticians not in agreement on this”.

Ratio variables Examples:  Can be discrete or continuous data.  The distance between any two adjacent units of measurement (intervals) is the same and there is a meaningful zero point.  Income- someone earning SR20,000 earns twice as much as someone who earns SR10,000.  Height  Weight  Age

These levels of measurement can be placed in hierarchical order. Hierarchical data order

Nominal data is the least complex and give a simple measure of whether objects are the same or different. Ordinal data maintains the principles of nominal data but adds a measure of order to what is being observed. Interval data builds on ordinal by adding more information on the range between each observation by allowing us to measure the distance between objects. Ratio data adds to interval with including an absolute zero. Hierarchical data order

34 QUANTITATIVE DATA QUALITATIVE DATA wt. (in Kg.) : under wt, normal & over wt. Ht. (in cm.): short, medium & tall

35 A science called clinimetrics in which qualities are converted to meaningful quantities by using the scoring system. Examples: (1) Apgar score based on appearance, pulse, grimace, activity and respiration is used for neonatal prognosis. (2) Smoking Index: no. of cigarettes, duration, filter or not, whether pipe, cigar etc., (3) APACHE( Acute Physiology and Chronic Health Evaluation) score: to quantify the severity of condition of a patient

Why do we need to know what type of data we are dealing with? The data type or level of measurement influences the type of statistical analysis techniques that can be used when analysing data. Data types – important?

Frequency Distributions What is a frequency distribution? What is a frequency distribution? A frequency distribution is an organization of raw data in tabular form, using classes (or intervals) and frequencies. What is a frequency count? What is a frequency count? The frequency or the frequency count for a data value is the number of times the value occurs in the data set.

Frequency Distributions data distribution – pattern of variability. the center of a distribution the ranges the shapes simple frequency distributions grouped & ungrouped frequency distributions

Categorical or Qualitative Frequency Distributions What is a categorical frequency distribution? What is a categorical frequency distribution? A categorical frequency distribution represents data that can be placed in specific categories, such as gender, blood group, & hair color, etc.

Categorical or Qualitative Frequency Distributions -- Example Example: Example: The blood types of 25 blood donors are given below. Summarize the data using a frequency distribution. AB B A O B O B O A O O B O A O B O B B B B O B B B A O AB AB O A O AB AB O A B AB O A A B AB O A

Categorical Frequency Distribution for the Blood Types -- Example Continued Note: The classes for the distribution are the blood types. Note: The classes for the distribution are the blood types.

Quantitative Frequency Distributions -- Ungrouped What is an ungrouped frequency distribution? What is an ungrouped frequency distribution? An ungrouped frequency distribution simply lists the data values with the corresponding frequency counts with which each value occurs.

Quantitative Frequency Distributions – Ungrouped -- Example Example: 57, 57, 56, 57, 58, 56, 54, 64, 53, 54, 54, 55, 57, 55, 60, 58 Example: The at-rest pulse rate for 16 athletes at a meet were 57, 57, 56, 57, 58, 56, 54, 64, 53, 54, 54, 55, 57, 55, 60, and 58. Summarize the information with an ungrouped frequency distribution.

Quantitative Frequency Distributions – Ungrouped -- Example Continued Note: The (ungrouped) classes are the observed values themselves.

Example of a simple frequency distribution (ungrouped) f  f = 25

Relative Frequency Distribution Note: The relative frequency for a class is obtained by computing f/n.

Example of a simple frequency distribution f rel f  f = 25  rel f = 1.0

Cumulative Frequency and Cumulative Relative Frequency Note: Table with relative and cumulative relative frequencies.

Example of a simple frequency distribution (ungrouped) f cf rel f rel. cf  f = 25  rel f = 1.0

Quantitative Frequency Distributions -- Grouped What is a grouped frequency distribution? What is a grouped frequency distribution? A grouped frequency distribution is obtained by constructing classes (or intervals) for the data, and then listing the corresponding number of values (frequency counts) in each interval.

Patient No Hb (g/dl) Patient No Hb (g/dl) Patient No Hb (g/dl) Tabulate the hemoglobin values of 30 adult male patients listed below

Hb (g/dl)No. of patients 9.0 – – – – – – – Total30 Frequency distribution of 30 adult male Frequency distribution of 30 adult male patients by Hb

DIAGRAMS/GRAPHS Categorical data --- Bar diagram (one or two groups) --- Pie diagram Continuous data --- Histogram --- Frequency polygon (curve) --- Stem-and –leaf plot --- Box-and-whisker plot

Commonly Used Graphs Histogram Height of bars proportional to frequency Width proportional to class boundaries Bar Chart Height proportional to frequency Width not really significant Frequency Polygon Plot points then connect with straight lines

Graphic Presentation of Data the histogram (quantitative data) the bar graph (qualitative data) the frequency polygon (quantitative data)

Two-dimensional graphs: Basic Set-Up

Histograms

Frequency Polygons

Example data

Stem and leaf plot Stem-and-leaf of Age N = 60 Leaf Unit =

Bar Graphs The distribution of risk factor among cases with Cardio vascular Diseases Heights of the bar indicates frequency Frequency in the Y axis and categories of variable in the X axis The bars should be of equal width and no touching the other bars

HIV cases enrolment in USA by gender Bar chart

HIV cases Enrollment in USA by gender Stocked bar chart

Grouped Bar Graph

Pie diagram Pie diagram – depicts the percentage represented by each alternative as a slice of a circular pie; the larger the slice, the greater the percentage.

The prevalence of different degree of Hypertension in the population Pie Chart Circular diagram – total -100% Divided into segments each representing a category Decide adjacent category The amount for each category is proportional to slice of the pie

General rules for designing graphs A graph should have a self-explanatory legend A graph should help reader to understand data Axis labeled, units of measurement indicated Scales important. Start with zero (otherwise // break) Avoid graphs with three-dimensional impression, it may be misleading (reader visualize less easily

Thank You