Peter Granda Archival Assistant Director / Data Archives and Data Producers: A Cooperative Partnership.

What I plan to discuss
- Reasons to share social science data
- Obstacles to sharing social science data
- Role of data archives
- Best practices for preparing data for archiving
- New developments to facilitate the archiving process and improve cooperation between data archives and data producers

Why should data producers share research data?
Data sharing achieves many important goals for the scientific community, such as:
- reinforcing open scientific inquiry
- encouraging diversity of analysis and opinion
- promoting new research, testing of new or alternative hypotheses and methods of analysis
- supporting studies on data collection methods and measurement
- facilitating education of new researchers
- enabling the exploration of topics not envisioned by the initial investigators
- permitting the creation of new datasets by combining data from multiple sources

Obstacles/Challenges to data sharing and archiving: Reasons not to share
- Costs to producer in creating public-use files
- Maintaining respondent confidentiality
- Use by potential competitors
- Less credit given for archiving data than for continually collecting new data, especially within the academic community
- Unexpected duplication of effort possible in using public-use files for research

Result of potential conflict between sharing data and the difficulties of doing so
- Even in places where there is a tradition of archiving social science survey and aggregate data, archiving is not always done, or is not done correctly
- Funds are not always available or, more commonly, all of the funds are spent on the data collection process
- Insufficient thought is given, throughout the data "life-cycle" process, to preparing materials that other researchers could easily use

ROLE OF DATA ARCHIVE
- Assist data producers by providing advice regarding procedures to use when archiving their data and documentation
- Consult with data producers regarding respondent confidentiality
- Discuss best strategy and location to preserve the data in perpetuity

Methods of Data Sharing - Versioning
- Importance of this issue for replication: e.g., users need to know which version of a data file was used in publications
- Increasing trend: data files stored on data producer Web sites:
  - Greater number of interim or "early release" versions now appearing
  - Need to have a "versioning" system in place if data files are updated
- Archives/data depositories usually preserve "final" versions of data files and also have systems in place to record the history of each data collection they receive
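One way to picture the "versioning" system mentioned on this slide is a small history log keyed by a checksum of each released file. The sketch below is only an illustration, not a description of any archive's actual system: the function names, the record fields, and the sample file contents are all hypothetical.

```python
import hashlib


def fingerprint(data: bytes) -> str:
    """Return a SHA-256 hex digest that identifies this exact file content."""
    return hashlib.sha256(data).hexdigest()


def register_version(history, label, data, note):
    """Append a new history entry unless the content matches the latest entry."""
    digest = fingerprint(data)
    if history and history[-1]["sha256"] == digest:
        return history[-1]  # same content as the latest release: no new version
    entry = {
        "version": len(history) + 1,
        "label": label,
        "sha256": digest,
        "note": note,
    }
    history.append(entry)
    return entry


def find_version(history, data):
    """Tell a user which recorded version a file in hand corresponds to."""
    digest = fingerprint(data)
    for entry in history:
        if entry["sha256"] == digest:
            return entry["version"]
    return None


# Hypothetical file contents standing in for an early release and a final file.
early_release = b"id,income\n1,42000\n"
final_release = b"id,income\n1,42000\n2,18000\n"

history = []
register_version(history, "early release", early_release, "interim file")
register_version(history, "early release", early_release, "re-upload, unchanged")
register_version(history, "final", final_release, "added remaining cases")
```

With a log like this, a researcher replicating a publication can check a downloaded file against the history and cite the version number it matches.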

Public Files to Restricted Files (in order of increasing confidentiality concerns)
- Available to all users
- Available to general research community
- Available to members of a specific research team
- Accessible only through a formal application process
- Accessible only at a specific location under very restricted conditions

Defining Best Practices – General Goals
- Maintain respondent confidentiality while releasing the maximum amount of data publicly
- Archive materials in a format that will ensure long-term preservation
- Provide sufficient information so that users who are not expert in the subject matter of the data collection can still use it effectively

Best Practices – Confidentiality
- Dangers of direct identifiers and potential dangers of indirect identifiers
- Solutions: removal, bracketing, top-coding, collapsing and/or combining variables, sampling, swapping, disturbing
- Restricted-use files or licenses
- Data enclaves
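Two of the solutions listed above, top-coding and bracketing, can be sketched in a few lines. This is a minimal illustration under assumed thresholds: the cap of 150000, the age edges, and the sample values are invented for the example and are not recommendations from the presentation.

```python
def top_code(values, cap):
    """Top-coding: replace every value above `cap` with `cap` itself."""
    return [min(v, cap) for v in values]


def bracket(value, edges):
    """Bracketing: map a number to a range label.

    `edges` are ascending lower bounds; values below the first edge
    fall into an "under ..." category.
    """
    label = f"under {edges[0]}"
    for i, lower in enumerate(edges):
        if value >= lower:
            if i + 1 < len(edges):
                label = f"{lower}-{edges[i + 1] - 1}"
            else:
                label = f"{lower}+"
    return label


# Hypothetical respondent values.
incomes = [18000, 42000, 250000, 1200000]
ages = [17, 34, 67, 91]

top_coded_incomes = top_code(incomes, 150000)
age_brackets = [bracket(a, [18, 30, 45, 65]) for a in ages]
```

In both cases the unusual values that could single out a respondent (a very high income, a very advanced age) are folded into a broader category while the rest of the distribution stays usable.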

Best Practices – Data Formats
Options:
- ASCII data files and record layouts
- ASCII data plus setup files
- Software-specific system files
- Portable software-specific files
- Online analysis-ready files
*** IMPORTANCE OF ASCII AS A PRESERVATION FORMAT ***
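The first option, an ASCII data file accompanied by a record layout, stays readable without any particular statistical package. The sketch below shows the idea, assuming a hypothetical three-variable layout with 1-based, inclusive column positions; the variable names and sample records are invented for illustration.

```python
# Hypothetical record layout: (variable name, start column, end column),
# with columns counted from 1 and both ends inclusive.
RECORD_LAYOUT = [
    ("case_id", 1, 4),
    ("age", 5, 6),
    ("region", 7, 7),
]


def parse_record(line, layout):
    """Slice one fixed-width ASCII record into named fields."""
    return {name: line[start - 1:end].strip() for name, start, end in layout}


# Two sample records as they might appear in a plain ASCII data file.
raw_data = "0001342\n0002671\n"

records = [parse_record(line, RECORD_LAYOUT) for line in raw_data.splitlines()]
```

Because the layout is plain text too, both pieces can be preserved together and reread decades later with whatever tools are then current.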

Best Practices – Documentation
- Project description
- Sample and sampling procedures
- Weighting
- Date, geographic location of data collection, and time period covered
- Data source(s)
- Unit(s) of analysis/observation
- Variables
- Technical information on files
- Data collection instruments
- Interviewer guide, recode logic, coding instrument

Best Practices: Yes, in theory, but What is the Real Situation?
- Even on well-funded projects archiving is often given little attention
- It is not unusual that the vast majority of project funds are spent on data collection
- Documentation is often prepared hastily with insufficient thought given to how other researchers might use it

Best Practices: Yes, in theory, but What is the Real Situation?
Experience from the Archival Perspective:
- Full compliance with submission requirements is often the exception rather than the rule

New Project between ICPSR and School of Information at the University of Michigan
- Purpose: to identify barriers and develop incentives for data producers to deposit "archive-ready" datasets
- Archive-ready: data and documentation files that are supplied to the archive in a format based on a specific agreement with the data producer

What are some of these barriers?
- Archiving process requires time, resources, and attention to detail
- In academic settings, researchers are rewarded for publishing, not for archiving
- Few "formal" professional rewards for depositing "archive-ready" datasets

What rewards and incentives are now offered to data producers?
- Appeal to self-interest
- Appeal to altruism
- Reputation effects
- Archive services
- Professional norms

What rewards and incentives might be offered to data producers in the future?
- Make it easier to collect and report information about uses of the data collection by other researchers ("reputation through citation")
- Scoring rule: how "archive-ready" was the collection submitted?
- Enhanced service from the archive
- Publications: "reviews" of datasets

Implementation in different social science research environments
- Archive resources may vary, affecting how much guidance and assistance archives can provide to data producers
- Technical standards could also vary: in some places, the importance of certain data formats (e.g., SPSS files) may be paramount
- The key: what is most important for local researchers?

Thank you!