Presentation is loading. Please wait.

Presentation is loading. Please wait.

Public Use Microdata Samples Using PDQ Explore Software Grace York University of Michigan Library May 2004.

Similar presentations


Presentation on theme: "Public Use Microdata Samples Using PDQ Explore Software Grace York University of Michigan Library May 2004."— Presentation transcript:

1 Public Use Microdata Samples Using PDQ Explore Software Grace York University of Michigan Library May 2004

2 2000 Census Data Tabulations Summary Files 1-4, Equal Employment Opportunity, School District Data, and Work Flow data are TABULATED data American Factfinder EXTRACTS the tabulated data

3 Public Use Microdata Samples Copies of the original questionnaires with identifying information edited out Create your own cross tabulations of census data

4 Typical PUMS Questions Single years of age by sex for teachers in Michigan (e.g. when will they retire?) Race of those with Arab ancestry (no, they are not all white) Demographic characteristics of immigrants from Senegal (age, sex, education, occupation, income, citizenship for a social survey) Age, race and sex of automotive industry employees (campaign for organ donations)

5 PUMS Software Programs FTP data from Census Bureau (and manipulate with SAS or SPSS) http://www.census.gov/Press- Release/www/2003/PUMS5.html Census Bureau CD-ROMS (Beyond 20/20 software) http://www.census.gov/mp/www/Tempcat/ PUMS.html SDA Software for Michigan (UMich Only) http://nds.umdl.umich.edu/n/nds/ PDQ Explore http://www.pdq.com

6 PDQ Explore Software Easy interface to – –Public Use Microdata Samples, 1 and 5%, 1980-2000 – –IPUMS, edited PUMS, 1850-1880, 1900- 1920, 1940-1990 – –Current Population Survey, 1991+ – –Mortality Schedules Permits users to tabulate their own variables

7 Access to PDQ Librarians may request free Ids, passwords, and software from PDQ Send e-mail to info@pdq.cominfo@pdq.com – –You are a librarian who talked to Grace York – –Requesting ID and password for using PDQ Explore – –Want to download software for the PDQ Toolbox, Expert Edition http://www.pdq.com

8 Software Download the software per instructions to your hard drive To begin searching, open the icon on your desktop

9 Before Beginning … Choose File Two PUMS files – 1% and 5% sample 1% has data for the nation, states, MSAs and super-Pumas (areas of 400,000) 5% has data for the nation, states, MSAs and Pumas (areas of 100,000)

10 Before Beginning… Define the data you want in terms of a spreadsheet. The longer part should be defined as rows rather than columns. I want single years of age by sex for all Vietnam-era veterans in the United States Universe = Vietnam-era veterans in the U.S. Column=sex (not very wide) Row=single years of age (could be long)

11 Before Beginning… Consult Chapter 7 of the PUMS codebook if you want to check the possible variables and the appendices for place/language/ancestry and occupation codes http://www.census.gov/prod/cen2000/doc/pums.pdf Chapter 7 is also available on the University of Michigan web site at: http://www.lib.umich.edu/govdocs/census2/pums2000/pums7.pdf

12 Before Beginning… Housing Record All geographic codes (state, MSA, PUMA) All housing records Some population records Population Record All population variables Ok to combine with geographic codes in housing Ask for help for other population/housing combinations at: info@pdq.cominfo@pdq.com

13 Before Beginning… Variable Codes for the Question in the Technical Documentation Data Dictionary AGE Single Years of Age SEX Male or Female VPS5 Veteran’s Period of Service 5: On active duty during the Vietnam Era (Aug. 1964 to Apr. 1975) http://www.lib.umich.edu/govdocs/census2/pums2000/pums7.pdf

14 Logging On Enter the subscriber name and password that you were given by the PDQ staff

15 Logging On Press OK to close the message of the day

16 Defining Workspace To conduct a new search, create a new workspace Press Finish or return twice

17 Defining Workspace Name your file on your hard drive and save.

18 Defining Workspace At the next screen, use the top menu to choose Workspace; then Add a Data Set

19 Defining Workspace Browse data sets; highlight ipums, pums, cps, or mortality file; Open

20 Defining Variables Once you choose a data set, its codebook will open up Click on the plus button to get a list of variables, their alphabetic symbols, and any numeric values

21 Defining Variables Determine the alphanumeric variables you want (e.g. Vietnam-era veteran: yes is VPS5=1) Use Top Menu to Choose Query/Setup New Expert Query (Access the codebook later through a tab on the desktop toolbar)

22 Expert Query Form 1. 1.Make sure you have the correct data set 2. 2.Determine if you want a tabulation (counts or numbers) 3. 3.Name your file

23 Expert Query Form Enter the code for UNIVERSE (what you’re counting) in the Universe box (e.g. vps5=1 are Vietnam-era veterans for the entire U.S.)

24 Expert Query Form Enter the code for the variables in the ROW box (age = single years of age; age/5 would be five year age groups) Enter the code for the variables in the COLUMN box (e.g. sex) Press RESULTS to run the query

25 Search Results Search results appear in spreadsheet format

26 Saving Results Click on File/Export Query Results You can save as CSV, tab delimited and several other formats. CSV (WYSIWIG) recommended for use with Excel Use SETUP button to return to query or icon at bottom to review the codebook

27 Geographic Codes Geographic codes are found in the Housing documentation Limit files to Michigan with the code state=26 Click on Query/New Expert Query to continue

28 Narrowing the Universe Narrow the universe by using & newcode (e.g. vps5=1 & state=26)

29 Logical Operators in PDQ http://www.lib.umich.edu/govdocs/census2/pdqop.pdf http://www.lib.umich.edu/govdocs/census2/pdqop.pdf & is one of numerous operators used in PDQ Operator Name Example/Comment X:a..b range age:15..44 unary + plus sex=+1 (never needed) unary - minus income4 greater than age>64 = greater than or equal age>=65 = or == equal age=23 != or <> not equal income!=0 & or && and race=2 & looking=1 ^ exclusive or bit-wise--use with caution | or || or age =65

30 Altering the Spreadsheet Tabulations Once you have a spreadsheet, click on Options to create totals or percentages for tables or columns

31 Adding More Parameters Expand the table detail by repeating the row and column data for another parameter (e.g. race) as shown in Dimension 3

32 Altering Spreadsheet Appearance The default shows separate tables for each of the values in the third dimension (e.g. separate spreadsheets for white and black) Change Axis3 tab to FOREACH everything on same spreadsheet

33 Calculating Means or Averages Calculate averages by changing the query type to summary statistics (e.g. mean or average) at the top Fill in the new Describe Expression box at the bottom with a variable code (e.g. age, income)

34 Complex Table Mean income of white male Vietnam-era veterans in Michigan by age, whether or not they have earnings You can respecify only veterans with earnings

35 Altering Mean Income Add & incws > 0 to universe to count only Vietnam-era veterans who are earning more than $0

36 Complex Table Mean income is higher when data limited to wage-earning veterans

37 Small Area Geography Data from the PUMS 5% file is available for states, metropolitan areas, and Public Use Microdata Areas (PUMAS) of 100,000 You can identify a PUMA or group of PUMAs using – –Maps in American Factfinder (http://factfinder.census.gov/)http://factfinder.census.gov/ – –PDF maps on the Census Bureau web site (http://www.census.gov/geo/www/maps/puma5pct.htm)http://www.census.gov/geo/www/maps/puma5pct.htm – –Mable/Geocorr Search Engine (http://mcdc2.missouri.edu/websas/geocorr2k.html)http://mcdc2.missouri.edu/websas/geocorr2k.html

38 Small Area Geography This map shows Detroit as PUMAs 3701-3708

39 PUMA Codes for Michigan Ann Arbor3200 Detroit3701-3708 Flint2200 Grand Rapids1300 Lansing1800 PUMA to Place http://www.lib.umich.edu/govdocs/census2/pumapl00.txt Place to PUMA http://www.lib.umich.edu/govdocs/census2/plpuma00.txt

40 Codebook and PUMAS The Explore Codebook shows PUMA5 as term for 5% PUMA boundaries

41 Small Area Geography and Ranges When creating data sets for PUMAS, be sure to include the correct state as the universe (e.g. state=26)

42 Small Area Geography and Ranges Puma5: 3701..3708 will list the data for each individual area

43 Small Area Geography and Ranges Search result for each individual PUMA

44 Small Area Geography for Ranges To get the total for the area, list it in the universe as puma5 >3700 & puma5 <3709 & state=26

45 Small Area Geography for Ranges To get a listing of single years of age between 65 and 85, list column as age: 65..85

46 Calculating Totals To calculate the most spoken languages by 65-85 year olds as a group Click on Options/Total Options/Row

47 Complex Result Spanish and Polish are two most popular languages spoken by seniors 65-85 in Detroit

48 Access to PDQ Librarians may request free Ids, passwords, and software from PDQ Send e-mail to info@pdq.cominfo@pdq.com – –You are a librarian who talked to Grace York – –Requesting ID and password for using PDQ Explore – –Want to download software for the PDQ Toolbox, Expert Edition http://www.pdq.com

49 Contacts for Research Assistance Initial Queries Grace York, Documents Center, 203 Hatcher graceyor@umich.edugraceyor@umich.edu or 936-2378 graceyor@umich.edu JoAnn Dionne, Numeric and Spatial Data Services, 825 Hatcher, jdionne@umich.edu, jdionne@umich.edu 763-9408 Complex Data Sets Lisa Neidert, Population Studies Center, 426 Thompson, lisan@umich.edu, 763-2163 lisan@umich.edu PDQ Staff, 310 Depot Street, Suite C, Ann Arbor 48104, info@pdq.com info@pdq.com


Download ppt "Public Use Microdata Samples Using PDQ Explore Software Grace York University of Michigan Library May 2004."

Similar presentations


Ads by Google