Navigating Your Way Through the EFT, Nesstar and Beyond 20/20 (WDS)

Slides:



Advertisements
Similar presentations
DLI Orientation: Concepts
Advertisements

DLI Training Nesstar Workshop
DLI Orientation: Concepts A Framework for Thinking about Statistical Information Train the Trainers Montreal, March 9, 2004 Chuck Humphrey Data Library.
Anne Etheridge Economic and Social Data Service IASSIST May 2006 METADATA MANAGEMENT THE FORGOTTEN WORLD OF THE BACK OFFICE.
The Economic and Social Data Service (ESDS) Karen Dennison UK Data Archive Improving access to government datasets 18 January 2007.
Data Access and Data Use: the Missing Link? Elizabeth Hamilton University of New Brunswick Chuck Humphrey University of Alberta Data and Knowledge Transfer.
Chuck Humphrey Data Library University of Alberta.
Geo-referenced data and DLI aggregate data sources Chuck Humphrey University of Alberta September 29, 2008.
Searching the University of Alberta Library’s Statistics Canada-based Websites 2001 Census of Canada Canadian Centre for Justice Statistics Canadian Business.
Citing Statistics and Data : Toward an accepted standard Gaëtan Drolet Data Liberation Initiative Communications and Library Services Division Statistics.
Quantitative Evidence for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library March 6, 2009.
Statistics and Data for Marketing Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 27, 2008.
EAS 293 Data Library, Rutherford North 1 st Floor Chuck Humphrey Data Library October 14, 2008.
STATISTICS CANADA SURVEY LIFECYCLE WOLFVILLE, APRIL 2008 SURVEY LIFECYCLE Michel B. Séguin Atlantic DLI Training.
ISR Training February 12, 2010 Data Retrieval from Statistics Canada Surveys.
Geo-referenced data and DLI aggregate data sources Chuck Humphrey University of Alberta ACCOLEDS 2007.
PUBH 898: Health Economics Finding data and statistics.
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
South Africa Data Warehouse for PEPFAR Presented by: Michael Ogawa Khulisa Management Services
Searching for Statistics Why can’t we find the data we need? Where should we even start?
DLI Training April 2004 Kingston Ontario. DDI What, Why, How?
Data and Social Research Chuck Humphrey Data Library Rutherford North Library.
Health Data Sources Sunny Kaniyathu 03 February 2011.
Framework of Statistical Information. This is a typology of the categories or classes of statistical information. Remember the relationship between statistics.
Soc : Principles of Research Design LONGITUDINAL DATA Sunny Kaniyathu, Data Services Librarian.
Creating Something from Nothing: Synthetic and Dummy files Bo Wandschneider University of Guelph Chuck Humphrey University of Alberta DLI Training: Ottawa,
DATA and STATISTICS … at your service! S.Mowers & the GSG team ©2009, University of Ottawa.
RRM : Resource Data and Environmental Modeling DATA SOURCES Sunny Kaniyathu, Data Services Librarian.
Ontario Data Documentation, Extraction Service and Infrastructure.
The Data Documentation Initiative: more discussion Chuck Humphrey University of Alberta Atlantic DLI Workshop 2005, Acadia University.
National Boot camp Vancouver Heather Dryburgh and Michel B. Séguin May 31 st, 2011 Survey Life cycle.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Handling Reference Questions DLI Orientation Session Kingston, Ontario April 5, 2004.
DLI and EQUINOX Question 1 How do I find out what survey datasets are available from Statistics Canada ?
DLI Training - Ontario 16 April, 2015 Elizabeth Hill, Western University Survey of Household Spending.
Role of the IMDB in the CBA and IM Strategy Presented to Information Management Committee Standards Division June
Soc 332.6: Principles of research design Finding statistics.
OVERVIEW OF THE DATA LIBERATION: Licence, Products, & Services Mike Sivyer Ontario DLI Training, April 5, 2004.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Health Statistics 2016 DLI Atlantic Training
Rural Development Finding data and statistics.  Statistics Canada: Federal statistical agency  Data released under the Data Liberation Initiative (DLI)
Essex Insight Introduction to Essex Insight Training Guide Source: Research and Analysis Unit v4.
Data Access North of the (US) Border
Small Area Data and Geography For the 2017 DLI Training Workshop
“Data from national surveys: access, analysis, and sharing”
Geo-referenced data and DLI aggregate data sources
Accessing data – a user’s perspective
Data Liberation Initiative – Statistics Canada
DLI Website.
Creating Something from Nothing: Working with Synthetic Files
General Social Survey Enquête sociale générale
Contents Introducing the GSBPM Links to other standards
Beyond 20/20 for Beginners.
General Social Survey Enquête sociale générale
Susan Mowers, Data Librarian, GSG Centre - UOttawa
Presentation 2b 2018 Census Products & Services Engagement.
ESDS resources for managing and analysing data
DDI for the Uninitiated

Getting the Whole Picture
Enhancing ICPSR metadata with DDI-Lifecycle
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
Agenda Context of the BR Redesign Redesign Objectives Redesign changes
Collecting Data Online
Capitalising on Metadata
Data Liberation Initiative (DLI)
Technical Coordination Group, Zagreb, Croatia, 26 January 2018
Exploring the DLI Product line
Palestinian Central Bureau of Statistics
Creating Something from Nothing: Working with Synthetic Files
Presentation transcript:

Navigating Your Way Through the EFT, Nesstar and Beyond 20/20 (WDS) Jen Scharf April 29, 2016

DLI’s data products The DLI’s data are kept in the following products: The EFT (Electronic File Transfer) service Nesstar Beyond 20/20, WDS (Web Data Server) Statistics Canada • Statistique Canada 14/04/2018

So what’s what? THE EFT The Electronic File Transfer (EFT) service by Statistics Canada is a corporate solution for exchanging files Requires that each user have their own unique user ID and password. Houses the data files (PUMFs), and aggregate data, for the majority of StatCan surveys that are available to the DLI. The DLI EFT site is the primary method used to disseminate the DLI Collection. Only the DLI contact has access to the EFT. Statistics Canada • Statistique Canada 2016-04-29

So what’s what? NESSTAR Nesstar is a web-based exploration, extraction and analysis tool for social science data. Provides access to the Data Liberation Initiative’s (DLI) collection of public use microdata files (PUMFs). Allows authorized users to access both PUMFs and metadata for master files. Uses the Data Documentation Initiative (DDI) standard which has controlled vocabulary, consistent terminology, is highly accessible, reusable, and is detailed metadata. The data for the Masterfile are available in RDS (research data centre). Statistics Canada • Statistique Canada 14/04/2018

So what’s what? BEYOND 20/20 (WDS) Web-based multi-dimensional table viewer. Dissemination of multiple data products in a variety of formats (PDF, Excel, IVT, Word). User friendly: No knowledge of SPSS, SAS or other complicated Statistical programs required. Contains aggregate data only! Statistics Canada • Statistique Canada 14/04/2018

But what do those terms means? Master file - "Pure" data set created by the author division. All variables and cases are available for analysis in the master file. Metadata - Defines and describes the structure and meaning of information resources, and the context and systems in which they exist. It is used to support efficient and effective management of these information resources over time. PUMF - A master file that has undergone modification to minimize the possibility of disclosing a respondent's identity. Aggregate data – Information derived directly from microdata files or from statistical aggregate files. As opposed to microdata files, aggregate statistics do not record information at the level of individual units of observation. In other words, they are the result of grouping data at an aggregate or macro level (eg., persons in a specific region). Statistics Canada • Statistique Canada 14/04/2018

Where do I find the data I need? Step 1: Determine the type of data you are interested in: Master files  RDC PUMFs  Nesstar/EFT Generic tables/aggregate data  WDS/EFT Step 2: Check the website! Many surveys on the Restricted page are defined as being PUMF (through Nesstar) or Tables (through WDS) Check the Daily, summary tables and CANSIM. Masterfiles are only for advance users. Statistics Canada • Statistique Canada 14/04/2018

The DLI “Restricted Access Web Site” Links directly to the WDS Links directly to Nesstar Provide the link or how to get to this page. Statistics Canada • Statistique Canada 14/04/2018

Let’s break it down! The EFT 166 surveys comprised of data files, PUMFs, tables, etc. Essentially the “warehouse” for the DLI files. All files received from subject matter are placed in the EFT, PUMFs are staged and sent to Nesstar. Excel tables/IVT tables are sent directly (as is) to the WDS. Statistics Canada • Statistique Canada 14/04/2018

Where are the files? “Most” files received from subject matter are placed in the “other” folder Michael is working on a proposal to re-organise the file structures. Statistics Canada • Statistique Canada 14/04/2018

Let’s break it down! Nesstar Contains PUMFs Master files that have been decompressed for confidentiality and coded using the DDI standard. Essentially, takes the original (staged) PUMF provided by subject matter and makes it more user friendly. Contains a listing of master files (metadata) that are available through the RDC’s. Statistics Canada • Statistique Canada 14/04/2018

Where are the files? PUMFs are coded using the DDI standard to produce easy-to-understand results Statistics Canada • Statistique Canada 14/04/2018

Where are the files? Nesstar also contains a listing of all Master Files available through the RDCs Statistics Canada • Statistique Canada 14/04/2018

Let’s break it down! Beyond 20/20 (WDS) Aggregate data only. No PUMFs or metadata. Geography – maps  the WDS contains thousands! Excel tables, IVT tables, Word documents, PDF files. Statistics Canada • Statistique Canada 14/04/2018

Where are the files? Files are organized by category Need a map? The WDS has thousands! Statistics Canada • Statistique Canada 14/04/2018

Where does DLI’s data go? Files received from Subject Matter areas PUMFs Staged through the EFT DDI coded and placed in Nesstar (data and metadata) Excel tables/IVT tables/PDFs/Word documents Uploaded “as-is” to the EFT Renamed and added to the WDS Statistics Canada • Statistique Canada 14/04/2018

Quick reference guide!  Survey Name EFT (all files) Nesstar (PUMFs) WDS (aggregate files) Aboriginal Peoples Survey  Canadian Business Patters Canadian Community Health Survey Census of Population General Social Survey Geography maps Homicide Survey Inter-corporate Ownership Labour Force Survey National Household Survey Survey of Household Spending Statistics Canada • Statistique Canada 14/04/2018

Need more info? The DLI’s online Survival Guide is a great tool! Check out Section 6: “Accessing and Citing DLI Data” for a more information! http://www.statcan.gc.ca/eng/dli/guide/toc/3000279 Statistics Canada • Statistique Canada 14/04/2018

In conclusion Although there is no “rule” as to which surveys can be found where, the easiest thing to remember is to start by determining WHAT KIND of data you are looking for FIRST. You will only find PUMFs and metadata in Nesstar, and aggregate data in the WDS. The EFT houses most files that come from subject matter, typically in “data only” form (SAS files). Statistics Canada • Statistique Canada 14/04/2018

Questions? Statistics Canada • Statistique Canada 2016-04-29