Next on OPRAH – Bringing Data Out of the Closet Walter Giesbrecht, Data Librarian York University Jeff Moon, Head, Documents Unit Queen’s University OLA.

Slides:



Advertisements
Similar presentations
January OLA Super Conference 2009 S.Giles, D.Jakubek Ryerson University Census? Statistics? E-Stat can give you the answers! Ontario Library Association.
Advertisements

DLI Orientation: Concepts A Framework for Thinking about Statistical Information Train the Trainers Montreal, March 9, 2004 Chuck Humphrey Data Library.
Statistics means never having to say you are certain Working with Remote Numbers E. Hamilton Atlantic DLI Training February 28, 2003.
Environmental Statistics in E-STAT Tom Power Education Centre Library, Nipissing University/Canadore College Ontario DLI Training Guelph University, Guelph,
1 The DLI Contacts and Designates Survey: Ontario regional profile Gaëtan Drolet Train the Trainers February 23-25, 2010 Université de Montréal Montréal,
Data Access and Data Use: the Missing Link? Elizabeth Hamilton University of New Brunswick Chuck Humphrey University of Alberta Data and Knowledge Transfer.
Jeff Moon Data Librarian & Academic Director, Queen’s Research Data Centre Statistics & Data& Data An OverviewAn Overview
Chuck Humphrey Data Library University of Alberta.
First Year in Focus at Canadian Colleges and Universities.
2004 OLA - E-STAT Census and CANSIM data: Comparison of providers Presentation for OLA Conference 2004 “Discovering the World of Numbers: Statistics Canada’s.
Geo-referenced data and DLI aggregate data sources Chuck Humphrey University of Alberta September 29, 2008.
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 26, 2009.
Chuck Humphrey & Lynne Robinson University of Alberta Surviving Statistics Strategies for dealing with statistical questions on the reference desk.
Searching the University of Alberta Library’s Statistics Canada-based Websites 2001 Census of Canada Canadian Centre for Justice Statistics Canadian Business.
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library March 6, 2009.
Statistics and Data for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 27, 2008.
EAS 293 Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 14, 2008.
ISR Training February 12, 2010 Data Retrieval from Statistics Canada Surveys.
Geo-referenced data and DLI aggregate data sources Chuck Humphrey University of Alberta ACCOLEDS 2007.
PUBH 898: Health Economics Finding data and statistics.
The Census Quartet Finding Census Data E. Hamilton November 2003 ACCOLEDS Training December 2003.
NAICS? YIKES! (North American industry classification system (NAICS)? Yearly index of constant (k) dollar estimates (YIKES)!) Jeff Moon, Queens
1 OLA Conference February 2008 Session 1022 Jeff Moon Head, Maps, Data, & Government Information Centre (MADGIC) Queen’s University An Introduction to.
Census Transportation Planning Products (CTPP) Data Products June 18, 2010.
Merging census aggregate statistics with postal code-based microdata Laine Ruus University of Toronto. Data Library Service ,
The Field (California) Poll. What is the Field Poll? The Field Poll was established in 1947 by Mervin Field. An independent non-partisan survey of California.
CANSIM A look at 3 interfaces Ontario DLI Training University of Guelph April 12, 2006 Suzette Giles Data, Map and GIS Librarian Ryerson University Library.
PUBH 898: Health Economics Finding data and statistics.
Finding Data & GIS Files at the U of S Library Kiran Doranalli Lucy Li
Searching for Statistics Why can’t we find the data we need? Where should we even start?
Nesstar: A Web-based Data Extraction and Analysis System Richard Pinnell & Sandra Keys, University of Waterloo Libraries.
Doing data & statistics at the reference desk (some of) what you’ll need to know OLA Super Conference Walter W. Giesbrecht Data Librarian,
Data and Social Research Chuck Humphrey Data Library Rutherford North Library.
1 The 2001 Census PUMFS Odyssey Sponsored by HAL and PALS Presented by Chuck Humphrey.
DLI Workshop -- Mar Hosted by Dalhousie University March 2000 DLI Training Workshop.
NAICS? YIKES! Or North American industry classification system (NAICS)? Yearly index of constant (k) dollar estimates (YIKES)!
The Census of Canada and Immigration & Ethno-cultural Data Chuck Humphrey University of Alberta February 10, 2006.
DLI Boot Camp 2011 Finding Statistics: Tools and Techniques Jean Blackburn Vancouver Island University Library SDA.
POLS 328.3: Public Policy Analysis Finding data and statistics.
Using the Statistics Canada website for your MDM4U Culminating Project
Framework of Statistical Information. This is a typology of the categories or classes of statistical information. Remember the relationship between statistics.
October 2008 Getting to Know Data Sources SOC 3140 Prof. Sylvie Lafrenière Susan Mowers, GSG / Library.
ISR Training February 12,  Types of information you’ll find  Searching the website  Finding statistics using... ◦ Browse By Subject (Summary.
Soc : Principles of Research Design LONGITUDINAL DATA Sunny Kaniyathu, Data Services Librarian.
January 20089SOC4112 Getting to Know Data Sources Geographic, Statistical and Government Information Centre GSG Team Susan Mowers.
Beyond 20/20 for Beginners. Plan Who needs Beyond 20/20 anyway? ◦ What is Beyond 20/20, and what can we do with it? Pros and cons of using 20/20 How to.
Project? Microdata? Say what? TRY Conference May 5, 2008 Suzette Giles, Ryerson University Laine Ruus, University of Toronto.
Jeff Moon Data Librarian & Academic Director, Queen’s Research Data Centre Statistics & Data& Data An OverviewAn Overview
DATA and STATISTICS … at your service! S.Mowers & the GSG team ©2009, University of Ottawa.
RRM : Resource Data and Environmental Modeling DATA SOURCES Sunny Kaniyathu, Data Services Librarian.
Ontario Data Documentation, Extraction Service and Infrastructure.
Sociology 343 Chuck Humphrey Data Library University of Alberta.
Finding Data: Vital Statistics Geography 342.3; Community planning in Canada Kiran Doranalli Lucy Li Data & GIS Library Services, U of S Library
DLI and EQUINOX Question 1 How do I find out what survey datasets are available from Statistics Canada ?
How to be a happy data back up, impress your users and keep learning. Erin Alcock Memorial University of Newfoundland.
DLI Training - Ontario 16 April, 2015 Elizabeth Hill, Western University Survey of Household Spending.
2006 Census Products and Services Line: Proposed Directions April 10, 2006 DLI, Guelph, Ontario Charles Watson // Stuart Fyffe.
Accessing and Using NCHS Data: An Overview of Microdata Access Tools with SETS Demonstration Ann Aikin, Avay Dolberry, and Brady Hamilton 2004 Data Users.
Anticipating Great Things: A 2006 Census Preview June, 2006 DLI, Ottawa, ON Paul Schwets // Stuart Fyffe.
Hosted by the University of Regina Library December 1999 DLI Training Workshop Chuck Humphrey.
Soc 332.6: Principles of research design Finding statistics.
Getting the Whole Picture Using Numbers to Enhance Your Stories
Health Statistics 2016 DLI Atlantic Training
Rural Development Finding data and statistics.  Statistics Canada: Federal statistical agency  Data released under the Data Liberation Initiative (DLI)
ESTAT & CANSIM DLI Equinox <odesi> ICPSR
Short Product Review: Canadian Business Counts
2001 Census of Population Products and Services Presentation to ACCOLEDS December 6, 2001.
Getting the Whole Picture
An Example of Working with Data Documentation
University of Regina Library
Presentation transcript:

Next on OPRAH – Bringing Data Out of the Closet Walter Giesbrecht, Data Librarian York University Jeff Moon, Head, Documents Unit Queen’s University OLA SuperConference Friday, 1 February, 2002

Not this Data …

… but these kinds!

Before we get all shaken up about data and statistics, with warnings that such and such a percent of people get such and such a disease after following such and such a personal habit... … it is useful to note that: 80% of those who go insane drink coffee, tea, or beer 98% of those who commit suicide sleep indoors and darned near 100% of those injured in traffic accidents are people who move from one place to another!

Let’s take a look at Data and Statistical Analysis… have you ever seen the movie “Twins”?

Think of “Arnie” as the “Data” continuum… Tables, Charts, Graphs (from books, journals, the web, etc...) A ‘number’ Raw Survey Data # French Mother Tongue (1996) in Ontario Employment levels by occupation class Annual inflation rate from 1914 to present Aggregate Data Microdata Coded responses of surveyed individuals

Canada - Employment Telecommunication Equipment Industry 479,285 Aggregate Data: A Number Tables, Charts, GraphsTime Series

Sources of Aggregate Data… Statistics Canada is generally the first stop for Canadian Data: The Canada Year Book (print) The Daily (web) Canadian Social Trends (web/print) CANSIM / E-Stat (web) – time series… “Canadian Statistics” (web) Beyond 20/20 Files – multidimensional tables…

Survey Data (microdata): Statistical analysis software is used to generate meaningful results… e.g. SPSS, SAS. “variables” “respondents”

Sources of Survey Data… Once again, Statistics Canada is generally the first stop for Canadian Data: The “Data Liberation Initiative” (DLI) provides access to hundreds of publicly released survey data files. Polling Companies (Environics, CROP, etc.) produce microdata files as well. For US & International data, the “Inter-university Consortium for Political & Social Research” (ICPSR)

Survey Data Aggregate Data Postcard Camera “Fixed” “Flexible”

Think of “Danny” as the “Statistical Analysis” continuum… Percentages Counts Standard Deviations Tests of Significance Descriptive Statistics Averages Inferential Statistics

Significance testing PercentagesCountsStandard Deviations Averages Tables, Charts, Graphs A ‘number’ Raw Survey Data Data continuum… Statistical Analysis continuum… Aggregate / DescriptiveMicrodata / Inferential

To review… Data Aggregate & Survey Data (Microdata) Statistical Analysis Counts, Percentages, Averages, Standard Deviations, Cross- tabulations, t-tests, Regression, etc.

Reference Question Example: How many of you have had a patron arrive at the Reference Desk with a newspaper article reporting Statistics Canada data?

Globe & Mail, Dec 17, 2001, p A15 “…71% of 15- to 17-year-olds use online chat rooms, double the proportion of the only slightly older year-olds.”

First, note that the article says: “Statistics Canada, in a study released last week…” So… where do you go from here?

First… Let’s try:

Which leads you to the following:

Canadian Social Trends, Winter 2001 Which leads, in turn to: Here is the statistic quoted in the Globe… and here is the source…

So… how do we check out this source? General Social Survey, 2000 DLI Web Site (or Local Data Centre)

Documentation and Data…

So… going to your campus “Data Centre”

AGEGR5 less than or equal to 3

Results…

79.9 % 65.9 % 71 % 48 % vs Canadian Social Trends ? Our cross-tab

“An errata will be issued for the table appearing in CST because the table does not show percentages for those who used the Net in the last month but for those who used the Net in the last year.” “The difference in the numbers is because I used the variable H19 while your client is using the variable H20. H19 asked respondents who had used the Internet in the last year, if they had ever used the Internet to connect to an ONLINE CHAT SERVICE. H20 asked respondents how often they used the Internet to connect to an online chat service in the last month.” Reply from Statistics Canada… So… let’s try again with H19

So we need…

The numbers match! AND… you’ll note the table now says “last 12 months”

Original Table… Revised… Dec 2001 Jan 2002

So… We can use survey files to verify published results. But… We can also use survey files to expand on published results and explore new avenues of research. For example… 1.What is the influence of gender, education, or income on Internet use? 2.Are there differences between provinces? Between URBAN and RURAL dwellers? 3.Or any number of other “dimensions”… any question asked in the survey.

Survey Data Aggregate Data Postcard Camera “Fixed” “Flexible”

Sources of Aggregate Data… print –e.g., Canada Year Book, STC print publications CD-ROM –e.g., 1996 Census Profiles, LFHR, other DSP products Web-based –The Daily –“Canadian Statistics” –PDF versions of print publications –Beyond 20/20 Files – multidimensional tables… –CANSIM / E-Stat – time series

Beyond 20/20: what is it? Used to display multidimensional data, i.e., more than 3 dimensions or characteristics at once –e.g., age, sex (usually 3!), geography, date, etc.... allows user to customize the display of the data very useful for aggregate data, less so for microdata

Beyond 20/20: what is it used for/in? used in an increasing number of STC products, –many CD-ROM DSP products, e.g., LFHR, ITC, Profiles, Nation Series, Dimensions, etc. –one of available formats on E-Stat

CANSIM acronym for CANadian Socio-Economic Information Management System time-series data available –direct from STC ($) –via E-Stat (free to registered institutions) –via DLI (from UofT)

CANSIM II via E-Stat

Dealing with data really isn’t that hard...

Don’t be afraid to ask for help!