IPUMS-International Steven Ruggles Minnesota Population Center.

Slides:



Advertisements
Similar presentations
Multiple Indicator Cluster Surveys Data Interpretation, Further Analysis and Dissemination Workshop Basic Concepts of Further Analysis.
Advertisements

How IPUMS Harmonizes Microdata Data Sources and Bibliography Data Sources: Original census data are contributed to the IPUMS- International project by.
Introduction to the State-Level Mitigation 20/20 TM Software for Management of State-Level Hazard Mitigation Planning and Programming A software program.
IPUMS workshop * * * Robert McCaa, Professor of Population History University of Minnesota additional information.
Census 2000 symposium, session 4 paper 261 Archiving Census Documentation and Microdata: Preserving Memory, Increasing Stakeholders * * * Wendy L. Thomas.
Using a restricted-access web-site of anonymized, integrated census microdata (for 1, 2, 3, 4,
IPUMS in the classroom experience from Brazil Bernardo L. Queiroz Cedeplar-UFMG-Brasil
1 Assortative Mating Patterns in the Developing World Albert Esteve* and Robert McCaa** Presented by: Sula Sarkar** * Centre d ’ Estudis Demogr à fics.
IPUMS-International Integration Process Matt Sobek Minnesota Population Center
St. Lucia Country Report By Edwin St Catherine Director, Central Statistical Office Presented to IPUMS Workshop August 24 th, 2007.
The MetaDater Model and the formation of a GRID for the support of social research John Kallas Greek Social Data Bank National Center for Social Research.
6. Managing access to IPUMS integrated census microdata “extracts” (13 slides)
IPUMS-International Integration Process Matt Sobek Minnesota Population Center
Hist.umn.edu/~rmccaa/ipums-europe1 Sister-project: IPUMS-Latin America: 17 countries, ~500 million pop., 5 census rounds 80+ samples, 100+ million person.
Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Steven Ruggles Minnesota Population Center.
5. Integration of Microdata and Metadata (9 slides)
Users and Uses of IPUMS International Data Presented by Dr. Miriam King.
Original dataOriginal data. (various) Reformat dataReformat data: structural issues draw sample confidentiality (general tools) Data dictionary. (txt/pdf)
Archiving our Social Science Digital History ECURE 2005 March 1, 2005.
Labor Statistics in the United States Grace York March 2004.
MONGOLIA COUNTRY REPORT National Statistical Office IPUMS-Global Workshop, Lisbon, Portugal, August 22-26, 2007.
Census Processing Procedures Matt Sobek Funded by the National Science Foundation Minnesota Population Center.
IPUMS-EurAsia, : Changing Patterns of Microdata Use * * * Robert McCaa, Professor of Population History University.
Building Historical Social Science Infrastructure: Data Integration Projects of the Minnesota Population Center Robert McCaa and Steven Ruggles Minnesota.
IPUMS-International Integration Process Matt Sobek Minnesota Population Center
Harnessing the Power of Microdata Standards, tools and best practices for microdata dissemination and management International Household Survey Network.
The International Household Survey Network IHSN IHSN Secretariat PARIS21 Steering Committee, 14 November 2007.
United Nations CensusInfo User Application Training Workshop, Cairo, Egypt, October World Population and Housing Census Programme United.
Harmonizing the World’s Census Microdata: The IPUMS Project Matt Sobek Minnesota Population Center
Country Paper on: Census Data Accessibility, Confidentiality and Copyright Policy: Ethiopia’s Experience Seminar United Nations Regional Seminar on Census.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
IPUMS to IHSN: Leveraging structured metadata for discovering multi-national census and survey data Wendy L. Thomas 4 th Conference of the European Survey.
Hist.umn.edu/~rmccaa/ipums-europe1 IPUMS-Europe, : Restricted-access, anonymized microdata for scientific and policy research * * * Robert McCaa,
Labor Market Information in the Americas: the United States Workshop On Labor Migration and Labor Market Information Systems Inter-American Network for.
1 International Comparative Data for Research and Policy on Aging James P. Smith.
Design and Use of the IPUMS-International Data Series
Statistical Coherence: Census Hub Hypercubes and IPUMS Microdata UNECE Expert Group on Population and Housing Censuses Geneva, September 2014 Lara.
Data archive in developing countries: preservation and dissemination of microdata as an instrument for better development results Olivier Dupriez Senior.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
Integrating ACS with the World’s Census Data: ACS Microdata and the IPUMS Presented at the Pre-ALAP ACS/IPUMS Workshop November 16, 2010 Trent Alexander.
TerraPop Vision An organizational and technical framework to preserve, integrate, disseminate, and analyze global-scale spatiotemporal data describing.
Design and Use of the IPUMS-International Data Serieshttp://international.ipums.org Matt Sobek Minnesota Population Center
IPUMS-International Methods Matt Sobek Minnesota Population Center
Population census micro data for research: the case of Slovenia Danilo Dolenc Statistical Office of the Republic of Slovenia Ljubljana, First Regional.
* IPUMS-International * Using Integrated unit records for demographic and health research: Local, regional, national, and international * * * Robert McCaa,
IPUMS-International Free census samples (microdata) for researchers and policy makers: * * * Robert McCaa, Minnesota Population.
Documenting and disseminating census and survey data sets Ilpo Survo, United Nations ESCAP, Bangkok, for UNECE.
Trans-Border access to Census Microdata: The IPUMS-IECM partnership * * * Robert McCaa and Albert Esteve Palós “You have to.
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
United Nations Regional Seminar on Census Data Dissemination and Spatial Analysis Amman - Jordan 16 – 19 May 2011 Determination of the scope and form of.
IPUMS Microdata Relation to head Marital status Literacy Occupation.
 Background Data harmonization Data output  Web: Variable documentation system  Web: Data extract system IPUMS Dissemination System.
Integrated Public Use Microdata Series IPUMSwww.ipums.org Matt Sobek Minnesota Population Center
Archiving microdata Standards and good practices United Nations Statistics Commission New York, February 26, 2009 Olivier Dupriez World Bank, Development.
The Integrated Public Use Microdata Series database IPUMSwww.ipums.org Lab 1 Background on the IPUMS and SPSS.
ANALYSIS OF CENSUS RESULTS FOR EVIDENCE – BASED DECISION MAKING “2009 KENYA POPULATION AND HOUSING CENSUS RESULTS”
Challenges of Census Data Harmonization: IPUMS-International Matt Sobek Minnesota Population Center
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Integrated Public Use Microdata Series IPUMS Internationalwww.ipums.org Matt Sobek Minnesota Population Center
Integrated Public Use Microdata Series IPUMSwww.ipums.org.
1. Introduction 2. Background 3. Funding framework 4. EU participation 5. Timetable 6. Progress report 7. Future plans I ntegrating the E uropean C ensus.
Data access and development: The IPUMS perspective United Nations Commission on Population and Development The data revolution in action: National and.
DATA FOR EVIDENCE-BASED POLICY MAKING Dr. Tara Vishwanath, World Bank.
Lessons Learned from the production of Gridded Population of the World Version 4 (GPW4) Columbia University, CIESIN, USA EFGS October 2014.
Matt Sobek Minnesota Population Center
IPUMS-International Integration Process
and the Future of Historical Family Demography
2. Applying for Access (10 slides)
hist.umn.edu/~rmccaa/ipums-europe
The role of metadata in census data dissemination
Presentation transcript:

IPUMS-International Steven Ruggles Minnesota Population Center

What is IPUMS-International? The IPUMS-International project is creating an integrated global database of over 150 censuses from at least 44 countries. It will be the world’s largest public-use population database, with multiple samples from each country enabling analyses across time and space. The microdata and accompanying documentation will be freely available for scholarly and educational research through a web-based data dissemination system.

The Problem A vast body of raw census microdata covering much of the world over the past four decades survives in machine-readable form. In most countries, these census microdata are either unavailable to researchers or difficult to obtain. These data are at constant risk of destruction because of technological obsolescence, physical aging of computer tapes, and loss of institutional memory and documentation

Why it matters In the few countries where census microdata are readily available to researchers, they have become an indispensable part of social science infrastructure. –In the journal Demography, the leading U.S. journal of population, census microdata are used three times as often as any other source for studies of the U.S. or Canada. No alternate source offers comparable sample sizes, chronological depth, or widespread availability across countries.

Advantages of Census Microdata Samples Many more cases than any alternative datasets Enable study of relatively small populations Allows analysis of effects of local conditions on behavior Large Long-term Data usually available for multiple decades Flexible Tabulations can be customized to research problem Multivariate analysis feasible Harmonization is possible, allowing analyses that cross borders and time periods

Cross-National Harmonization and Open Access: National Academy of Science recommendations “National and international funding agencies should establish mechanisms that facilitate the harmonization of data collected in different countries.” “Cross national studies conducted within a framework of comparable measurement can be a substantially more useful tool for policy analysis than studies of single countries.” “The scientific community, broadly construed, should have widespread and unconstrained access to the data.” Source: Preparing for an Aging World: The Case for Cross-National Research (National Academy, 2001)

The Model: IPUMS-USA Project to harmonize U.S. Census microdata for the period : NSF-funded IPUMS project harmonized samples using composite codes, documented comparability; 250,000 transformations, 3,000 pages of printed documentation : Another NSF project funded an online data access system with integrated hypertext documentation

Success of IPUMS-USA User friendly access, harmonized codes, and integrated comprehensive hypertext documentation led to flood of historical census-based research: 12,000 users, 75,000 custom data extracts Currently distributing an average of 638 MB/hr, 24/7 1,300 publications and working papers –IPUMS-based research is concentrated in the top U.S. journals: the most common venues are Demography, American Economic Review, Journal of Political Economy, American Sociological Review, Social Forces, and Quarterly Review of Economics

IPUMS-International After 1960, most censuses around the world were tabulated by computer McCaa decided that IPUMS model should be applied to other countries Began with a project for Columbia, then in 1999 NSF Infrastructure grant to add six more countries : new HSD grant to increase database to 44 countries NICHD is also assisting with funding

IPUMS-International samples: First release

IPUMS-International Users Prospective users must sign confidentiality agreement and provide an abstract explaining need for the data Through 9/1/05 we had 980 applicants to use the database, of which 582 were approved (59 percent) Users represent 40 countries and 250 institutions, including many international organizations (e.g., ILO, WHO, World Bank, Inter-American Development Fund)

Early results National Academy of Sciences panel (2005) used data from Colombia, Kenya, Mexico, and Vietnam to analyze changing outcomes such as schooling, work, fertility, and marriage as a function of age, gender, and household characteristics.

Early results Cynthia Feliciano (2005) compared the education of immigrants to the United States with those who remained behind to understand patterns of selectivity

Other topics include: Changing living arrangements of the aged Concentration of mortality within families Impact of rainfall on health and economic welfare Female labor-force participation and educational attainment Regional inequality differentials Brain drain from developing countries Effects of emigration on labor markets Relationship between divorce and family composition Relationship between disease factors and education Relationship between educational attainment and cohort size. Effect of NAFTA on educational attainment and school enrollment by region within Mexico

Number of countries requested by IPUMS-International users (percent distribution) 1 country39 2 countries24 3 countries10 4 countries6 5-8 countries20 Most users request multiple countries

IPUMS-International Tasks Inventory and preservation of data and documentation Processing Documentation (especially comparability) Dissemination—obtain licenses that allow us to disseminate data for educational and scholarly use, and set up secure web-based dissemination system

IPUMS-International Tasks Inventory and preservation of data and documentation Processing Documentation (especially comparability) Dissemination—obtain licenses that allow us to disseminate data for educational and scholarly use, and set up secure web-based dissemination system

UN Demographic Center for Latin America (CELADE, Santiago, Chile) ~3000 microdata tapes recovered and metadata (documentation) IPUMS-International Preservation Initiatives

Status of Data Acquisition dark green = disseminating medium green = data received light green = negotiating

Current IPUMS-International Partners Current funding for 44 countries by 2009 Next data release late spring 2006

Current IPUMS-International Partners Current funding for 44 countries by 2009 Next data release late spring 2006

IPUMS-International Tasks Inventory and preservation of data and documentation Processing Documentation (especially comparability) Dissemination—obtain licenses that allow us to disseminate data for educational and scholarly use, and set up secure web-based dissemination system

Processing 1.Standardize format 2.Correct format errors 3.Draw samples 4.Add confidentiality protections 5.Harmonize codes 6.Edit and allocate missing or inconsistent data 7.Add standard constructed variables

PernumRelationshipAgeSexMarstChborn 1head53femaleseparated6 2child28malesinglen/a 3child22malesinglen/a 4child21malesinglen/a 5child25femalemarried2 6child-in-law28malemarriedn/a 7grandchild3malesinglen/a 8grandchild1malesinglen/a 9non-relative32femaleseparated2 10non-relative10malesinglen/a 11non-relative5femalesinglen/a Location Spouse’sFather’sMother’s Constructed Variables: IPUMS Family Interrelationship Pointers

IPUMS-International Tasks Inventory and preservation of data and documentation Processing Documentation (especially comparability) Dissemination—obtain licenses that allow us to disseminate data for educational and scholarly use, and set up secure web-based dissemination system

Documentation 1.Translate codebooks, enumeration forms, and enumeration instructions into English 2.Standardize format and add xml tags 3.Write documentation identifying comparability problems across countries, and within countries, across time periods 4.Assemble and scan ancillary documentation (e.g. census maps, post-enumeration survey results, and additional information on post-enumeration processing).

Variable Description: Literacy (International)

IPUMS-International Tasks Inventory and preservation of data and documentation Processing Documentation (especially comparability) Dissemination—obtain licenses that allow us to disseminate data for educational and scholarly use, and set up secure web-based dissemination system

Dissemination Uniform perpetual agreements with national statistical agencies allows us to disseminate anonymized microdata to researchers who agree to a web-based confidentiality agreement MPC staff assess research proposals for feasibility Disputes with agencies, if they arise, will be settled by the International Court of Arbitration in Paris Data dissemination occurs exclusively through the IPUMS-International web-based data access system