HISTORICAL CENSUS RESCUE PROJECT Historical Census Rescue Project at UC DATA IASSIST 2003 Conference, June 28, 2003, Ottawa Canada Project Management.


HISTORICAL CENSUS RESCUE PROJECT
Historical Census Rescue Project at UC DATA
IASSIST 2003 Conference, June 28, 2003, Ottawa, Canada
Project Management: Fredric C. Gey, Ilona Einowski, University of California, Berkeley
Students:
–Natalia Perelman, Sungman Cho, Tien-hao Lan
Work performed under grant from California Digital Library Counting California project

HISTORICAL CENSUS RESCUE PROJECT
Between 1972 and 1988, the Lawrence Berkeley Laboratory of the University of California acquired most known population counts in machine-readable form from the 1970 and 1980 decennial censuses, at levels of geography down to the census enumeration district and block group. It also acquired auxiliary files from the Census Bureau and other sources, such as the consolidated County and City Data Book and mortality detail files from NCHS. This data includes unique files that do not seem to be held at ICPSR, such as 1960 population by county (1000 items) and the 1970 census second count (single years of age down to the census tract level of geography). Also included are the 1970 census tract boundary files used to produce the Urban Atlas series of map portfolios. Before the last running computer containing this unique database failed in 2000, a complete dump of the data was made by the Census Bureau and sent to UC DATA on DLT tape (34 gigabytes). The final uncompressed version of these datasets should exceed 100 gigabytes in size.

Lawrence Berkeley Laboratory SEEDIS System
Lawrence Berkeley Laboratory constructed an information system called SEEDIS (Socio-Economic-Environmental-Demographic Information System) to store and retrieve this data, with the following characteristics:
–150 databases organized by geography (State → County → Tract → BG/ED)
–Geographic join across databases with common geography
–Data extraction for selected geography and data elements to SPSS, SAS, CODATA (self-documenting data files)
–Charting
–Mapping
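The geographic join and data extraction listed above can be illustrated with a minimal Python sketch. This is not SEEDIS code: the table layout, the `(state, county, tract)` key, the function names `geographic_join` and `extract`, and all sample values are assumptions made up for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical geography key: (state code, county code, tract code).
GeoKey = Tuple[str, str, str]
Table = Dict[GeoKey, Dict[str, int]]


def geographic_join(left: Table, right: Table) -> Table:
    """Inner-join two tables on their shared geography key."""
    joined: Table = {}
    for key, left_row in left.items():
        if key in right:
            # Merge the data elements of both databases for this area.
            joined[key] = {**left_row, **right[key]}
    return joined


def extract(table: Table, state: str, items: List[str]) -> Table:
    """Keep only rows in one state and only the requested data elements."""
    return {
        key: {name: row[name] for name in items}
        for key, row in table.items()
        if key[0] == state
    }


# Made-up sample data standing in for two databases with common geography.
population: Table = {
    ("06", "001", "4001"): {"pop_total": 3200},
    ("06", "001", "4002"): {"pop_total": 2100},
    ("32", "003", "0001"): {"pop_total": 900},
}
housing: Table = {
    ("06", "001", "4001"): {"housing_units": 1400},
    ("32", "003", "0001"): {"housing_units": 410},
}

joined = geographic_join(population, housing)
selection = extract(joined, "06", ["pop_total", "housing_units"])
```

Keying every database by the same geographic identifier is what makes the join possible without any per-database matching logic; an extraction step like this would then feed an export to SPSS, SAS, or CODATA.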

Lawrence Berkeley Laboratory SEEDIS System: generations of Mass Storage
Photodigital chip store ( )
GSS 6250 BPI tape robot ( )
8mm Exabyte tape jukeboxes
Two generations of hardware
–Control Data supercomputer
–Digital Equipment VAX-VMS

Databases in the SEEDIS System:
1960 Census Population (county)
1970 Census Population files (state, county, place, MCD, tract)
–Second Count (100% population by single years of age)
–Fourth Count (Sample, 1178 items by 5 race-ethnic groups)
–Fifth Count (Sample, includes housing, BG/Enumeration District)
1980 Census Population files (all geographies)
City-County Data Book (state, county, place)
NCHS Mortality Summary files ( )
NCHS Cancer Mortality
EPA Air Quality monitoring stations ( )
1970 MED-X population centroid (latitude/longitude)
1970 Census Tract Boundary Files (polygon format)

HISTORICAL CENSUS: Challenges to Rescue
Archiving format
–GSS on Control Data Supercomputer
–VAX Backup on Digital Equipment VAX machines
Compressed Data Format (up to 99 percent compression)
–Run-length encoding
–Nibble (half byte) smallest unit of storage
–Computer architecture independent
Metadata transformation
–SEEDIS data definition files (DDF, EDF)
–Qwick Qwery Dictionaries
–Eye readable dictionaries
–NEED TO TRANSFORM TO DDI
Code: 100,000 lines of FORTRAN
–Selective recoding in C
–New PERL code for transformation of metadata to DDI
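To make the compression idea concrete, here is a minimal Python sketch of run-length encoding with the nibble as the unit of storage. The slides do not describe the actual SEEDIS layout, so the scheme below (one byte per run, high nibble holding a run length of 1 to 15, low nibble holding the repeated value) and the function names `rle_encode` and `rle_decode` are assumptions for illustration only.

```python
from typing import List


def rle_encode(nibbles: List[int]) -> bytes:
    """Encode 4-bit values as (run length, value) pairs, one pair per byte."""
    out = bytearray()
    i = 0
    while i < len(nibbles):
        value = nibbles[i]
        if not 0 <= value <= 15:
            raise ValueError("each item must fit in a nibble (0-15)")
        run = 1
        # A run length must itself fit in a nibble, so cap it at 15.
        while i + run < len(nibbles) and nibbles[i + run] == value and run < 15:
            run += 1
        out.append((run << 4) | value)
        i += run
    return bytes(out)


def rle_decode(data: bytes) -> List[int]:
    """Expand (run length, value) bytes back into the original nibbles."""
    nibbles: List[int] = []
    for byte in data:
        run, value = byte >> 4, byte & 0x0F
        nibbles.extend([value] * run)
    return nibbles


# Sparse tabulations contain long runs of zeros, which is where RLE pays off.
sample = [0] * 40 + [7, 7, 3]
encoded = rle_encode(sample)
decoded = rle_decode(encoded)
```

In this example, 43 nibbles (22 bytes if simply packed two per byte) shrink to 5 bytes. Because the stream is read one byte at a time, the sketch does not depend on the word size or byte order of the machine, which is one way a format can be computer architecture independent.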

HISTORICAL CENSUS: Current Status
Decompression working on UNIX, Windows
County Data Book almost done
1960 Census almost done
1970 Fifth Count MCD/Tract/ED/BG available for FTP
1970 Second Count in GSS format
–Hung up on VAX machine assembly code
–DE, DC previously de-archived
Many “tapes” not scanned
Feeding to Counting California (SAS-based web interface)