Statistics are ubiquitous “Statistics are generated today about nearly every activity on the planet. Never before have we had so much statistical information.

Slides:



Advertisements
Similar presentations
Data Quality Assurance and Dissemination International Workshop on Energy Statistics Aguascalientes, Mexico.
Advertisements

Using American FactFinder John DeWitt Project Manager Social Science Data Analysis Network Lisa Neidert Data Services Population Studies Center.
Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Chuck Humphrey Data Library University of Alberta.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
United Nations Oslo City Group on Energy Statistics 8 th Oslo Group Meeting, Baku, Azerbaijan September 2013 ESCM Chapter 8: Data Quality and Metadata.
CHAPTER 1 INTRODUCTION Prem Mann, Introductory Statistics, 8/E Copyright © 2013 John Wiley & Sons. All rights reserved.
15 de Abril de A Meta-Analysis is a review in which bias has been reduced by the systematic identification, appraisal, synthesis and statistical.
1. DATA ATTRIBUTES ; SUMMARY 1.1Introduction to biostatistics 1.2 The Mean 1.3Measures of Variability 1.4The Normal Distribution 1.5 Distribution; Data.
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 26, 2009.
Chuck Humphrey University of Alberta Digital Reference: Statistics & Data LIS 536 March 4, 2009.
Chuck Humphrey, Leah Vanderjagt and Anna Bombak University of Alberta The Winter Institute on Statistical Literacy for Librarians Demystifying statistics.
Anna Bombak, Chuck Humphrey, Lindsay Johnston and Leah Vanderjagt University of Alberta The Winter Institute on Statistical Literacy for Librarians Demystifying.
Chuck Humphrey & Lynne Robinson University of Alberta Surviving Statistics Strategies for dealing with statistical questions on the reference desk.
Anna Bombak, Chuck Humphrey, Lindsay Johnston, Angie Mandeville and Leah Vanderjagt Winter Institute on Statistical Literacy for Librarians, February 18-20,
Chuck Humphrey, Leah Vanderjagt and Anna Bombak University of Alberta The Winter Institute on Statistical Literacy for Librarians Demystifying statistics.
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library March 6, 2009.
Statistics and Data for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 27, 2008.
EAS 293 Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 14, 2008.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Organizing data in tables and charts: Criteria for effective presentation Jane E. Miller, Ph.D. Rutgers University.
CHAPTER 14, QUANTITATIVE DATA ANALYSIS. Chapter Outline  Quantification of Data  Univariate Analysis  Subgroup Comparisons  Bivariate Analysis  Introduction.
Statistics - Descriptive statistics 2013/09/23. Data and statistics Statistics is the art of collecting, analyzing, presenting, and interpreting data.
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
Anna Bombak, Chuck Humphrey, Angie Mandeville, Leah Vanderjagt and Amanda Wakaruk Winter Institute on Statistical Literacy for Librarians, February 23-25,
The Winter Institute on Statistical Literacy for Librarians Demystifying statistics for the practitioner Anna Bombak, Chuck Humphrey, Larry Laliberte,
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Organizing data in tables and charts: Different criteria for different tasks Jane.
Stat 1080 “Elementary Probability and Statistics” By Dr. AFRAH BOSSLY
Copyright © 2008 by Nelson, a division of Thomson Canada Limited Chapter 17 Part 5 Analysis and Interpretation of Data BASIC DATA ANALYSIS.
BUSINESS STATISTICS BQT 173
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
Topics Covered: Data Processing Data Processing Information Examples of data and information Examples of data and information Difference between data.
Data and Social Research Chuck Humphrey Data Library Rutherford North Library.
Chuck Humphrey, University of Alberta Atlantic DLI Training, 2008 DLI Orientation: Concepts A Framework for Thinking about Data and Statistics.
1 Course review, syllabus, etc. Chapter 1 – Introduction Chapter 2 – Graphical Techniques Quantitative Business Methods A First Course
DLI Workshop -- Mar Hosted by Dalhousie University March 2000 DLI Training Workshop.
Chapters 1 and 2 Week 1, Monday. Chapter 1: Stats Starts Here What is Statistics? “Statistics is a way of reasoning, along with a collection of tools.
Copyright © 2012 by Nelson Education Limited. Chapter 10 Hypothesis Testing IV: Chi Square 10-1.
GSIM implementation in the Istat Metadata System: focus on structural metadata and on the joint use of GSIM and SDMX Mauro Scanu
The Process of Conducting Research
Chuck Humphrey, University of Alberta Digital Reference: Statistics and Data LIS 536 March 5, 2008.
American Community Survey (ACS) 1 Oregon State Data Center Meeting Portland State University April 14,
The Practice of Social Research Chapter 14 – Quantitative Data Analysis.
Exploratory Research Design: Secondary Data. 4-2 Primary vs. Secondary Data Primary data are originated by a researcher for the specific purpose of addressing.
1 An Introduction to the Minnesota Epidemiological Profile Minnesota State Epidemiological Workgroup and Minnesota Institute of Public Health Fall Forums.
Chapter 1: Exploring Data Sec. 1.1 Analyzing Categorical Data.
5 Descriptive Statistics Chapter 5.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
A Survey of Probability Concepts
Aim: How do we analyze data with a two-way table?
1 Statistics Statistics can be found in all aspects of life:
POPULATION SURVEYS Evaluation the health status of a population (community diagnosis). Evaluation the health status of a population (community diagnosis).
Sociology 343 Chuck Humphrey Data Library University of Alberta.
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Data in context Chapter 1 of Data Basics. Frameworks Today, we will be presenting two frameworks for thinking about the content of data services. A.Statistics.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Probability Concepts. 2 GOALS 1. Define probability. 2. Explain the terms experiment, event, outcome, permutations, and combinations. 3. Define the terms.
Data and Statistics: As easy as 1-2-3? Carolyn DeLorey, MLIS St. Francis Xavier University Atlantic DLI Workshop UNB Fredericton April 28, 2015.
User Services Focus, value and attitude Vocabulary stories: wash & wear, circ & dingo Statistics and data.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter Three MaxIT WiMax.
Community &family medicine
Data Analysis for Two-Way Tables
Measuring Data Quality and Compilation of Metadata
Chapter 18: The Chi-Square Statistic
Presentation transcript:

Statistics are ubiquitous “Statistics are generated today about nearly every activity on the planet. Never before have we had so much statistical information about the world in which we live. Why is this type of information so abundant? For one thing, statistics have become a form of currency in today’s information society. Through computing technology, society has become very proficient in calculating statistics from the vast quantities of data that are collected. As a result, our lives involve daily transactions revolving around some use of statistical information.” Data Basics, page 1.1

Statistics: what are we talking about? Statistics and data are related but different

A statistic can’t be real without data A ‘real’ statistic requires a data source. If the publisher of a statistic can’t tell you the data source behind a statistic, you should question that the statistic is ‘real.’ After all, people do make up statistics. Classic example: a statistic in a 1986 Newsweek article claimed that a 40- year-old woman had a better chance of being killed by a terrorist than of getting married (2.6 percent). Twenty years later, Newsweek admitted that this “comparison wasn’t in the study.”

A statistic can’t be real without data A statistic may have been derived from poor quality data and, consequently, may be of questionable value. But nevertheless, it is a ‘real’ statistic. For example, a debate erupted over a Lancet article on the number of civilians deaths in Iraq following the first 18 months after the invasion.number of civilians deaths The desire is to have quality statistics that are derived from quality data.

Quality of data, quality of statistics Data producers use one set of criteria to ensure that the data are of high quality Producers of statistics also use criteria to ensure that the statistics are of high quality. This is contingent upon having accurate, complete metadata.

Statistics Canada’s criteria Statistics Canada uses the following criteria to define quality statistics or “fit for use” quality Relevance: addresses issues of important to users Accuracy: degree it describes what it was designed to measure Timeliness: the delay between when the information was collected and when it is made available Accessibility: the ease to which the information can be obtained by users Interpretability: access to metadata that facilitates interpretation and use Coherence: the fit with other statistical information through the use of standard concepts, classifications and target populations

How statistics and data differ Statistics Numeric facts & figures Derived from data, i.e, already processed Presentation-ready format Published Data Numeric files created and organized for analysis or processing Require processing Not display-ready Disseminated, not published

How statistics and data differ

Stories are told through statistics The National Population Survey used in this example had over 80,000 respondents in sample and the Canadian Community Health Survey in 2005 has over 130,000 cases. How do we tell the stories about each of these respondents? We create summaries of these life experiences using statistics.

Statistics are about definitions

Definitions and metadata Users of statistics require complete, accurate metadata to understand the statistics. All of the definitions and information that describe the unit of observation, the universe, the sampling method, the concepts and the variables are critical to understand both the data and the statistics derived from the data.

Dimensions of statistics Six dimensions or variables in this table The cells in the table are the number of estimated smokers. Geography Region Time Periods Unit of Observation Attributes Smokers Education Age Sex

Statistics involve classifications The definitions that shape statistics specify the metric of the data they summarize (for example, Canadian dollars) or the categories used to classify things if a statistic represents counts or frequencies. In this latter case, classification systems are used to identify categories of membership in a concept’s definition. Some classification systems are based on standards while others are based on convention or practice. For an example of a standard, see the North American Industrial Classification System (NAICS).NAICS

Statistics are presentation ready Tables and charts (or graphs) are typically used to display many statistics at once. You will find statistics sprinkled in text as part of a narrative describing some phenomenon; but tables and charts are the primary methods of organizing and presenting statistics.