
1 Doing data & statistics at the reference desk (some of) what you’ll need to know OLA Super Conference 2003 2003.02.01 Walter W. Giesbrecht Data Librarian, York University

2 not this kind of Data...

3 … but these kinds!

4 what’s on the menu
how to deal with numeric panic
definitions
  – types of data & statistics, analysis
things to learn about data and the reference interview
sources of data & statistics
  – tools required

5 what’s (mostly) not on the menu
geographic data files
  – not qualified to deal with it in great detail
  – those interested will have attended Friday’s session (“GIS and Digital Map Reference for Non-Map Librarians”)
details on 2001 Census of Canada
  – general overview only
  – those interested will already have attended Thursday’s session (“Get Familiar With Canada!”)

6 numeric panic!
related conditions are numerophobia, arithmophobia, statistophobia
in librarians, a condition brought on by a request for a statistical fact, figure, table or data
symptoms include
  – a blank mind
  – feeling of a clenched fist in your stomach
  – urge to run from the reference desk

7 how to deal with numeric panic?
ask the right questions
search the right sources
spread it around!
  – know who to turn to for help
  – train colleagues so the load doesn’t fall only on you

8 what are data?
facts or figures from which conclusions can be drawn
numeric files created and organized
  – for analysis, or to create a new table
includes geographic data
  – (to make maps)

9 what data are not "The plural of anecdote is not data." -- Roger Brinner

10 what are statistics?
type of information obtained through mathematical operations on numerical data
statistics are processed data, or data that have been analyzed in some way
generally used to support an argument or position in a study or report

11 statistics
in print form, typically found in statistical abstracts, census and other government publications (monograph or serial)
in digital form, found on CD-ROM or in online databases

12 data vs. statistics
difference between looking at a photograph and taking the photograph yourself
statistics are like a photograph or postcard
  – a captured image of the data chosen by someone else
data are like the view through a camera
  – you choose the view you want

13 the data continuum
raw survey data <--> tables, charts, graphs <--> a ‘number’
Microdata
  – coded responses of surveyed individuals
Aggregate Data
  – employment levels by occupation class
  – annual inflation rate from 1914 to present
a ‘number’
  – # French Mother Tongue (1996) in Ontario

14 aggregate data
data that have been grouped or summarized in some way
  – e.g., by geography or age group
boundary between aggregate data and statistics sometimes blurry
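To make "grouped or summarized" concrete, here is a minimal sketch of turning individual records into an aggregate table. The records, field names, and the helper `aggregate_counts` are invented for illustration; they do not come from any Statistics Canada file.

```python
from collections import defaultdict

# Hypothetical microdata: one record per surveyed individual (made-up values).
records = [
    {"province": "Ontario", "age_group": "15-24"},
    {"province": "Ontario", "age_group": "25-44"},
    {"province": "Ontario", "age_group": "25-44"},
    {"province": "Quebec", "age_group": "15-24"},
    {"province": "Quebec", "age_group": "45-64"},
]


def aggregate_counts(rows, key):
    """Group individual records by `key` and count the records in each group."""
    counts = defaultdict(int)
    for row in rows:
        counts[row[key]] += 1
    return dict(counts)


# The same records summarized two ways: by geography and by age group.
by_province = aggregate_counts(records, "province")
by_age_group = aggregate_counts(records, "age_group")
print(by_province)
print(by_age_group)
```

The individual rows are gone from the output; only the group totals remain, which is what a patron sees in a published table.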

15 aggregate data structure
time
  – e.g., time series data from CANSIM, Labour Force Historical Review, multiple Census years
geography
  – e.g., Census data
  – neighbourhood --> national
social content
  – e.g., injury data from Health Indicators Database

16 Beyond 20/20 table

17 microdata
unsummarized data
  – often samples of actual responses to surveys
two types of microdata files
  – master file -- raw data, usually directly available only to STC employees and authorized researchers
  – PUMF (public-use microdata file) -- anonymized version of master file

18 excerpt from NPHS microdata file
column 8 -- sex of respondent
column 13 -- pets?
columns 42-44 -- # visits to eye specialist
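A microdata record like this is a fixed-column line of coded digits, so reading it means slicing by position. The sketch below assumes the column positions named on the slide, counted from 1. The record itself is fabricated with `make_record`, and the function names and code values are hypothetical; the real NPHS codebook defines what each code means.

```python
def make_record(sex, pets, eye_visits, width=44):
    """Build a made-up fixed-column record; every other column is filler '0'."""
    chars = ["0"] * width
    chars[7] = sex                         # column 8 (1-based) is index 7
    chars[12] = pets                       # column 13 is index 12
    chars[41:44] = f"{eye_visits:03d}"     # columns 42-44 are indexes 41..43
    return "".join(chars)


def parse_record(line):
    """Slice one record by column position, leaving the codes undecoded."""
    return {
        "sex": line[7],
        "pets": line[12],
        "eye_visits": int(line[41:44]),
    }


line = make_record(sex="2", pets="1", eye_visits=3)
parsed = parse_record(line)
print(line)
print(parsed)
```

Without the codebook, the values `"2"` and `"1"` are just digits, which is one reason microdata questions usually end in a referral.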

19 the analysis continuum
Descriptive Statistics (aggregate data?)
  – Counts
  – Percentages
  – Averages
  – Standard Deviations
Inferential Statistics
  – Tests of Significance
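The descriptive end of this continuum can be shown with a few lines of Python's standard `statistics` module. The visit counts below are invented, and `describe` is a hypothetical helper, so treat this as a sketch rather than a recipe for any particular survey.

```python
import statistics

# Made-up number of eye-specialist visits reported by five respondents.
visits = [0, 1, 1, 2, 6]


def describe(values):
    """Return basic descriptive statistics for a list of numbers."""
    at_least_one = sum(1 for v in values if v > 0)
    return {
        "count": len(values),
        "percent_with_visit": 100 * at_least_one / len(values),
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
    }


summary = describe(visits)
print(summary)
```

Tests of significance go a step further by asking whether a difference seen in a sample is likely to hold in the whole population, and that normally requires the microdata and statistical software.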

20 the two continua together
Data continuum: A ‘number’ … Tables, Charts, Graphs … Raw Survey Data
Statistical analysis continuum: Counts … Percentages … Averages … Standard Deviations … Significance testing
Aggregate / Descriptive <--> Microdata / Inferential

21 aggregate data vs. microdata in the reference interview
aggregate data is what you’ll be working with at the reference desk (most of the time)
microdata usually requires referral to data librarian or Statistics Canada, except when...

22 examples of Web interfaces to microdata
QWIFS (Queen's Web Interface For SPSS)
TriUniversity Data Resources

23 data at the desk: the reference interview
proper reference interview will help you tremendously
makes referrals more efficient

24 reference interview -- one view

25 another view
[flowchart; labels: intended use; few; report; numbers; many; analysis; OTHER; exists in print? (YES / NO); exists as data? (YES / NO); print source; data source]

26 essential factors in data reference interview
geography
  – determines jurisdiction, reporting agency
time
  – current / historical / both (time series)
level of observation
intended use
format

27 how to know where to look
know your users
know your sources
  – don’t ignore print sources
know your limitations
know who to ask for help!

28 jurisdiction & reporting agency
Canada
  – Federal: National Accounts, Census, Trade
  – Provincial: Health, Education
International
  – United Nations, OECD, IMF, World Bank, Eurostat
United States
  – Federal Departments: Commerce, Labor, Justice, Agriculture

29 Canadian data
Statistics Canada is generally the first stop for Canadian data
search tools:
  – the Daily
  – Online Catalogue
  – Thesaurus
  – CANSIM
  – E-STAT

30 The Daily

42 Beyond 20/20
application used by STC to display many of their data tables
easily handles large tables with multiple dimensions
user can easily manipulate the data to get the desired presentation
data can also be exported to other formats

43 STC online catalogue

48 STC thesaurus

54 STC publications on the Web
two ways to get them
  – free, direct from Statistics Canada
  – free (to eligible institutions) via DSP

55 CANSIM
premier source of Canadian time-series data
available through
  – subscription via UofT (DLI only)
  – E-STAT (educational institutions, DLI & DSP)
  – STC: same interface as E-STAT, but updated continuously; $3/time series

56 E-STAT
intended for use by education community, and DSP libraries
provides “free” access to CANSIM
  – CANSIM on E-STAT only updated once a year
census data from 1986-2001, and selected censuses from 1665-1871
data can be mapped/exported

57 map generated in E-STAT

58 2001 Census
lots of material available on STC website, and much more to come
  – much more than for 1996 census
two levels of access
  – level 1: general population
  – level 2: DLI & DSP institutions

59 information available from STC

60 training & instruction
ask your data person for a training session
take advantage of training offered by CAPDU/DLI
get to know the most heavily-used sources
if you find a really good source, tell somebody!

61 training, etc.
create your own web page(s) of favourite and/or heavily-used sources
  – York
  – UofT
  – “cheat sheets”
DON’T BE AFRAID TO ASK FOR HELP!

62 sources of help
CAPDU: Canadian Association of Public Data Users
DLILIST: Data Liberation Initiative
INFODEP: Depository Services Program
Don’t be afraid to ask questions; all the stupid ones have already been asked -- by “experts”!

63 http://www.yorku.ca/walterg/ola2003/ Walter W. Giesbrecht Data Librarian, York University OLA Super Conference 2003 2003.02.01

