SADC Course in Statistics Graphical summaries for quantitative data Module I3: Sessions 2 and 3.

Slides:



Advertisements
Similar presentations
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
Advertisements

Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Addition Facts
Chapter 12 Analysing quantitative data
Multistage Sampling Module 3 Session 9.
1 Questionnaire design Module 3 Session 3. 2 Overview (of Session) This session starts by introducing some aspects that need to be considered when designing.
1 Data processing and exporting Module 2 Session 6.
Collecting data for informed decision-making
1 Adding a statistics package Module 2 Session 7.
SADC Course in Statistics Basic summaries for demographic studies (Session 03)
SADC Course in Statistics Estimating population characteristics with simple random sampling (Session 06)
The Poisson distribution
SADC Course in Statistics Further ideas concerning confidence intervals (Session 06)
SADC Course in Statistics Trends in time series (Session 02)
SADC Course in Statistics Introduction to Non- Parametric Methods (Session 19)
SADC Course in Statistics Tests for Variances (Session 11)
Assumptions underlying regression analysis
Correlation & the Coefficient of Determination
SADC Course in Statistics Confidence intervals using CAST (Session 07)
SADC Course in Statistics Sampling design using the Paddy game (Sessions 15&16)
SADC Course in Statistics Organising data in a spreadsheet Module B2 Sessions 9 and 10.
SADC Course in Statistics Processing single and multiple variables Module I3 Sessions 6 and 7.
SADC Course in Statistics Session 4 & 5 Producing Good Tables.
SADC Course in Statistics Exploratory Data Analysis (EDA) in the data analysis process Module B2 Session 13.
SADC Course in Statistics Common complications when analysing survey data Module I3 Sessions 14 to 16.
SADC Course in Statistics Revision using CAST (Session 04)
SADC Course in Statistics Introduction to Statistical Inference (Session 03)
SADC Course in Statistics (Session 09)
SADC Course in Statistics Hierarchies of Units and non-traditional sampling approaches (Session 18)
SADC Course in Statistics General approaches to sample size determinations (Session 12)
SADC Course in Statistics To the Woods discussion (Sessions 10)
SADC Course in Statistics Introduction to the module and the sessions Module I4, Sessions 1 and 2.
SADC Course in Statistics Reporting on the web site Module I4, Sessions 14 and 15.
SADC Course in Statistics Review of ideas of general regression models (Session 15)
SADC Course in Statistics Case Study Work (Sessions 16-19)
Using a statistics package to analyse survey data Module 2 Session 8.
SADC Course in Statistics Modelling ideas in general – an appreciation (Session 20)
SADC Course in Statistics Producing a product portfolio Module I3 Session
The MDGs and School Enrolment: An example of administrative data
SADC Course in Statistics Handling Data Module B2.
SADC Course in Statistics Objectives and analysis Module B2, Session 14.
SADC Course in Statistics Numerical summaries for quantitative data Module I3 Sessions 4 and 5.
SADC Course in Statistics Risks and return periods Module I3 Sessions 8 and 9.
SADC Course in Statistics Revision on tests for means using CAST (Session 17)
SADC Course in Statistics Analysing Data Module I3 Session 1.
SADC Course in Statistics Revision on tests for proportions using CAST (Session 18)
Probability Distributions
SADC Course in Statistics Good graphs & charts using Excel Module B2 Sessions 6 & 7.
SADC Course in Statistics Excel for statistics Module B2, Session 11.
SADC Course in Statistics Module B2, Session3
SADC Course in Statistics Exploratory Data Analysis for single variables Module B2 Session 12.
ZMQS ZMQS
September 2005Created by Polly Stuart1 Analysis of Time Series For AS90641 Part 2 Extra for Experts.
ABC Technology Project
Chapter 5 Microsoft Excel 2007 Window
The x- and y-Intercepts
Addition 1’s to 20.
Key Stage 3 National Strategy Handling data: session 4.
25 seconds left…...
Week 1.
We will resume in: 25 Minutes.
SADC Course in Statistics Taking measurements Module I1, Session 17.
Tables and graphs for frequencies and summary statistics
SADC Course in Statistics Climate variability and climate change Module B1, Session 7.
SADC Course in Statistics Producing Good Tables In Excel Module B2 Sessions 4 & 5.
1 Statistical concepts Module 1, Session 2. 2 Objectives From this session participants will be able to: Define statistics Enter simple datasets once.
Presentation transcript:

SADC Course in Statistics Graphical summaries for quantitative data Module I3: Sessions 2 and 3

To put your footer here go to View > Header and Footer 2 Learning Objectives Students should be able to: Present data in a histogram –and interpret data when shown a histogram Present data in a boxplot –and interpret data in a boxplot Recognise advantages and limitations –of each method of presentation Explain what is gained and lost from data summary Interpret graphical summaries to answer questions –concerning proportions, –extremes, –medians –and quartiles Resolve simple problems with graphical displays –when real data (not text-book examples) are used

To put your footer here go to View > Header and Footer 3 Session Overview Activity 1: Presentation to introduce the sessions Activity 2: Demonstration on histograms in Excel Activity 3: Practical covering the same ideas –CAST Chapter 2.1 –Histograms –Population pyramids Activity 4: Practical on boxplots and percentage points –CAST Chapter 2.2 –Boxplots in Excel Activity 5: Presentation continued

To put your footer here go to View > Header and Footer 4 Activity 1- Case studies Case studies used in these sessions are: –Rice Survey –Zambia Rainfall Data –The Swaziland Crop Cutting Survey The were introduced in Modules B1 and B2 –so for many this will be a reminder. The rice survey and the Swaziland Crop Cutting –were already used in Session 1 of this Module.

To put your footer here go to View > Header and Footer 5 Rice survey Used repeatedly in slightly different forms In CAST as shown here In demonstration And in Excel Qualitative and quantitative variables to analyse

To put your footer here go to View > Header and Footer 6 Zambia Rainfall data Farmers are migrating from Southern Zambia –citing climate change as the reason –they can no longer grow the crops as before A local NGO –acknowledges climate change in general, –but believes improved farming practices is more important –wherever farmers locate They question the evidence –for climate change in the pattern of rainfall –as it affects farming strategy

To put your footer here go to View > Header and Footer 7 Data analysed in these sessions Total rainfall from 1 Jan to 31 March – called SeasonTot Number of rain days in the same period - Scount

To put your footer here go to View > Header and Footer 8 Original daily values are also available Not needed for analysis here, but used for checking Also useful if further questions posed

To put your footer here go to View > Header and Footer 9 Zambia rainfall continued: Here we use the annual data –on the final worksheet –preparing data for analysis is done in Module I2 We have access to the raw data –for checking purposes –and in case other variables are needed on a later occasion

To put your footer here go to View > Header and Footer 10 Swaziland Crop Cutting Survey Annual survey –Of agricultural holdings –And areas planted –Then yields from a crop-cutting exercise Data from 2005 made available

To put your footer here go to View > Header and Footer 11 Person-level data Ages will be analysed in these sessions

To put your footer here go to View > Header and Footer 12 Yield data The dry weights will be analysed Analysis overall and just for maize The zero yields cause a slight problem

To put your footer here go to View > Header and Footer 13 Activity 2 – demonstration of histograms Before the practical (which is activity 3) Follow the demonstration On histograms and population pyramids in Excel It shows Use of Excel But keeping control – you must remain in charge And not be limited by the software So it shows how to resolve problems As well as how to use Excel

To put your footer here go to View > Header and Footer 14 Activity 3 – practical with CAST and Excel Using CAST to understand histograms Then use Excel to construct them Being observant – as was shown in the demonstration And keeping control Remember the aims Are more to understand statistics Rather than to practice with Excel Excel is just the tool

To put your footer here go to View > Header and Footer 15 An example with CAST Practice with small data sets – as shown here and also with large data sets

To put your footer here go to View > Header and Footer 16 Population pyramid in CAST How close to this display can you get with Excel? Interpret this display

To put your footer here go to View > Header and Footer 17 Activity 4: Boxplots and data summary Now do the demonstration And these two practical exercises Then return to the remaining slides For a discussion of the key points

To put your footer here go to View > Header and Footer 18 Topics for the class discussion What was interesting? What did you discover What was difficult? What needs further discussion?

To put your footer here go to View > Header and Footer 19 Boxplots and histograms – CAST page Are you clear how boxplots and histograms relate?

To put your footer here go to View > Header and Footer 20 Boxplots show outliers – CAST page And this makes them very useful for data exploration as well as summary

To put your footer here go to View > Header and Footer 21 You can calculate percentage points The formula r * (n + 1)/100 for the rth % point Should now hold no fears For example –when there are 11 observations –and you want the median –use 50 * (11+1)/100 = 6 –the median is the 6 th highest in the sorted data If necessary look again at practical 2

To put your footer here go to View > Header and Footer 22 Practical problems with real data This always happens You saw a problem –with the rainfall data –and with the crop yields The solution always involves being observant You can follow guidelines for a good analysis But not always simply obey rules Become a data detective instead!

To put your footer here go to View > Header and Footer 23 Learning Objectives Are you now able to: Present data in a histogram –and interpret data when shown a histogram Present data in a boxplot –and interpret data in a boxplot Recognise advantages and limitations –of each method of presentation Explain what is gained and lost from data summary Interpret graphical summaries to answer questions –concerning proportions, –extremes, –medians –and quartiles Resolve simple problems with graphical displays –when real data (not text-book examples) are used

To put your footer here go to View > Header and Footer 24 These sessions were largely on graphical summaries The next sessions consider numerical summaries of the data