REPORT BACK FROM THE DDI QUALITATIVE WORKING GROUP


LOUISE CORTI, AROFAN GREGORY
EUROPEAN DDI MEETING, UTRECHT, 8-9 DEC 2010

UK DATA ARCHIVE

DOES CURRENT DDI SUIT QUALITATIVE DATA?
- DDI 2 is fine for describing the study and giving an overview of a whole data collection
- good down to the individual file level (e.g. a single interview)
- but cannot describe the content of files, e.g. the structure of textual interview data, or how files relate to each other
- working on a descriptive standard to ensure holistic and detailed description of complex data collections
- need the power to relate data, parts of data and annotations to each other

PREVIOUS WORK ON QUALITATIVE SCHEMAS
- Data Exchange Tools (DExT) project – UK Data Archive and ODaF
- sought to define a schema that would:
  - describe complex collections of data
  - capture relationships between data files
  - preserve references to annotations performed on data
- purpose:
  - for longer term preservation
  - for data exchange
  - providing an open source intermediate format

AN EXAMPLE OF A COMPLEX QUALI COLLECTION
- data collection:
  - 50 audio recorded interviews – 200 mp3 files
  - 50 interview transcripts – 50 Word files
  - 45 summaries – 45 Word files
  - 100 photos – 100 tiff files
- annotated and coded data in a CAQDAS package, e.g. NVivo:
  - transcripts classified by some key variables
  - codes attached to segments
  - memos linked to data, discussing features of the data
  - assertion - links between parts of data
- interview level metadata useful, and collected in various ways

ADDING META INFORMATION ABOUT AN INTERVIEW TO THE HEADER OF A WORD DOCUMENT

COMPILED ‘DATALIST’ OF INTERVIEWS IN A COLLECTION

ADDING METADATA THROUGH MARK-UP OF XML DOCUMENTS

AUTOMATED MARK-UP - NAMED ENTITY RECOGNITION

ANNOTATING DATA IN CAQDAS
- data are loaded into software and classification and annotation of data is done “in situ”
- classification variables may be attached to whole documents, e.g. of 20 interviews, 10 are male and 10 are female
- codes are normally attached to a segment of text with a start and end point
- these reference points (e.g. character 1 to character 200) or offsets are usually stored in the software’s database
- or linked to an audio segment
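The offset-based coding described above can be sketched in a few lines of Python. This is only an illustration of the idea, not how any particular CAQDAS package stores its data: the `CodedSegment` class, its field names and the sample transcript are all hypothetical. The point it shows is that the transcript text is never modified; each code simply points at a character range, and ranges may overlap.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class CodedSegment:
    """A code attached to a character range of a document (hypothetical model)."""
    document_id: str
    start: int  # offset of the first character, 0-based
    end: int    # offset one past the last character
    code: str


def segment_text(text: str, segment: CodedSegment) -> str:
    """Return the stretch of text that a coded segment points at."""
    if not 0 <= segment.start <= segment.end <= len(text):
        raise ValueError("segment offsets fall outside the document")
    return text[segment.start:segment.end]


def codes_at(position: int, segments: list[CodedSegment]) -> list[str]:
    """List every code whose range covers a character position."""
    return [s.code for s in segments if s.start <= position < s.end]


# Invented sample data. Classification variables attach to the whole document,
# codes attach to character ranges inside it.
transcript = "I moved to the village in 1962. Work was hard to find then."
document_variables = {"int01": {"gender": "female"}}

segments = [
    CodedSegment("int01", 0, 31, "migration"),
    CodedSegment("int01", 32, 59, "employment"),
    CodedSegment("int01", 26, 30, "date"),
]

for seg in segments:
    print(seg.code, "->", segment_text(transcript, seg))
```

Because the offsets live outside the text, they only stay valid while the transcript is unchanged, which is one reason such annotations are hard to move between packages.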

TRANSCRIPTS ASSIGNED TO GROUPS AND CODED IN ATLAS.TI

IS THIS ANNOTATED DATA IMPORTANT?
- researchers have classified data - it's subjective but may be useful - social tagging is becoming increasingly acceptable as we allow our own classifications to be shared
- teaching with data, where students can scrutinise or critique coding schemes and compare them against their own classifications
- sharing team data in a research repository. Having some relationships between data already defined can be very useful for exploring a very large collection in, for example, a CAQDAS package, to show existing classifications and codings
- providing context for data by gaining insight into researchers’ reflections (memos)

CURRENT EXCHANGEABILITY IN QUAL SOFTWARE
- minimal, and only in last 6 months
- import data into a system and it gets locked into its proprietary databases
- ATLAS.ti and NVivo export XML, though not using an agreed schema
- ATLAS.ti first vendor to pioneer data exchange by exporting annotations in XML (MUHR,
- also allow import of 1-2 other proprietary packages only for market leaders
- ideal would be an exchange format, but vendors not overly keen!

QuDEx SCHEMA
- QuDEx schema V3 published in 2006
- core features
- various refinements
- basic viewer available

QuDEx ELEMENTS AND DEFINITIONS

Top level elements, sub elements and definitions:

(root element)
  Sub elements: resourceCollection, segmentCollection, codeCollection, memoCollection, categoryCollection, relationCollection
  Definition: The root element; a 'wrapper' for all other elements of the QuDEx Schema. Each top level element in QuDEx is defined as a ‘collection’ and must appear in the order outlined below

resourceCollection
  Sub elements: sources, memoSources, documents
  Definition: The resourceCollection section lists and locates all content available to the QuDEx file. A source points to the original location of the resource while each author working on the QuDEx file is assigned a surrogate document which points to the relevant source. The child elements sources and memoSources contain direct references to the files under analysis; the documents section contains their surrogates

segment
  Sub elements: text, audio, video, xml, image
  Definition: The parent element for all segments, which is a subset of a document (text, audio, video or image) under analysis defined in a manner appropriate to the format (text, audio, video, image or xml). Segments may overlap and multiple memos and codes may be assigned to a segment. Start and end points can be formally assigned to segments of text, and audio visual materials in other document

code
  Definition: The parent element for all codes. A code is a short alphanumeric string, usually a single word; may be assigned to a segment or document though assignment is not required. A code may optionally be taken from a controlled vocabulary defined authority
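To make the ordering rule concrete, here is a minimal Python sketch that builds an empty skeleton with the six collections in the listed order and checks that order. Only the collection names, the child element name `sources`, and the ordering requirement come from the slide. The root element name `qudex`, the `source` child, the `id` and `href` attributes and the file path are assumptions made for illustration, and namespaces are left out, so the output should not be expected to validate against the published schema.

```python
import xml.etree.ElementTree as ET

# The six top-level collections, in the order listed on the slide.
COLLECTION_ORDER = [
    "resourceCollection",
    "segmentCollection",
    "codeCollection",
    "memoCollection",
    "categoryCollection",
    "relationCollection",
]


def build_skeleton(root_name: str = "qudex") -> ET.Element:
    """Create a root 'wrapper' holding one empty element per collection.

    The root name is an assumption; namespaces are omitted for brevity.
    """
    root = ET.Element(root_name)
    for name in COLLECTION_ORDER:
        ET.SubElement(root, name)
    return root


def collections_in_order(root: ET.Element) -> bool:
    """Check that the collections present appear once each, in the required order."""
    present = [child.tag for child in root if child.tag in COLLECTION_ORDER]
    expected = [name for name in COLLECTION_ORDER if name in present]
    return present == expected


skeleton = build_skeleton()
resources = skeleton.find("resourceCollection")
# 'source', 'id' and 'href' are hypothetical names used only for this sketch.
sources = ET.SubElement(resources, "sources")
ET.SubElement(sources, "source", {"id": "src1", "href": "interviews/int01.rtf"})

print(ET.tostring(skeleton, encoding="unicode"))
print("collections in order:", collections_in_order(skeleton))
```

A check like `collections_in_order` is no substitute for schema validation; it only illustrates the one constraint the slide states explicitly.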

QuDEx ELEMENTS AND DEFINITIONS

memo
  Sub elements: memoDocumentRef, memoText
  Definition: The parent element for all memos; these may be pure text and embedded in the QuDEx file (inline memo) or may refer to external files. A memo is a text string internal to the document (inline memo) or an externally held document (external memo) which may be assigned to a segment, code, document, category or to another

category
  Definition: The parent element for all categories. A category is an alphanumeric string (stored assigned to one or more documents. Categories may be hierarchically nested. Documents contained within a category are referenced Nested categories are referenced

objectRelation
  Definition: The parent element for all relationships between objects. For the purposes of a relation all of the following are considered to be ‘objects’:
  - A document: surrogate of a source or memoSource
  - A segment within a document
  - An assigned value: code, memo, category, relation
  A relation is a link between two objects in a QuDEx file. Each object is either the start or end point of a relation (source vs target). Every relation may, optionally, have a name
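The relation definition above amounts to a small directed-link data model, which can be sketched in Python as follows. This is an illustration under stated assumptions, not the QuDEx schema itself: the class names `ObjectRef` and `Relation`, the field names, and all identifiers such as `m1` and `s1` are hypothetical. What it takes from the slide is that a relation joins a source object to a target object, that the name is optional, and that a relation is itself an object and so can be the end point of another relation.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional

# Kinds of object that may take part in a relation, per the definitions above.
OBJECT_KINDS = {"document", "segment", "code", "memo", "category", "relation"}


@dataclass(frozen=True)
class ObjectRef:
    """A typed pointer to an object in the file (hypothetical model)."""
    kind: str
    ref_id: str

    def __post_init__(self) -> None:
        if self.kind not in OBJECT_KINDS:
            raise ValueError(f"unknown object kind: {self.kind}")


@dataclass(frozen=True)
class Relation:
    """A directed link from a source object to a target object."""
    relation_id: str
    source: ObjectRef
    target: ObjectRef
    name: Optional[str] = None  # naming a relation is optional


def relations_from(obj: ObjectRef, relations: list[Relation]) -> list[Relation]:
    """Return every relation that starts at the given object."""
    return [r for r in relations if r.source == obj]


memo = ObjectRef("memo", "m1")
segment = ObjectRef("segment", "s1")

relations = [
    Relation("r1", source=memo, target=segment, name="comments on"),
    # A relation is itself an object, so it can be the start of another relation.
    Relation("r2", source=ObjectRef("relation", "r1"), target=ObjectRef("code", "c1")),
]

for rel in relations_from(memo, relations):
    print(rel.relation_id, rel.name, "->", rel.target.kind, rel.target.ref_id)
```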

DDI QUALITATIVE WORKING GROUP
- set up in late 2009
- first meeting April 2010 via Skype
- 21 members across 17 locations and 9 countries
- collected use cases for complex qualitative collections from group members
- mapped use cases to the DDI Lifecycle stage elements (thanks to Larry)

USE CASES REFERENCING LIFE CYCLE ELEMENTS

USE CASE REFERENCES TO LIFE CYCLE STAGES

WORKING WITH QuDEx
- taking QuDEx as the starting point, as it is useful for both whole document and ‘part of document’ description
- some members worried about the complexity of QuDEx, but reassured that it can do very basic file level description
- demonstrator work by some members on DDI-QuDEx based FEDORA ingest systems

FEDORA INGEST WORK
Two projects using QuDEx metadata in FEDORA:
- Ensemble project, Liverpool. Martinez and Gregory. QuDEx Repository: FEDORA based framework
- Timescapes archive, Leeds. Ben Ryan. Conversion of existing Digitool archive to FEDORA

QuDEx REPOSITORY BASIC FUNCTIONS
- ingest of files from a collection along with associated metadata at study and file level
- load/transform/index the data and metadata ingested, making it available as a set of objects in the FEDORA repository, and exposing it for use as RDF
- dissemination to the end user through various tools
- search repository for studies and files
- locate and use contents in various applications

PILOT DELIVERABLES
- spreadsheet-based tool for capturing metadata about qualitative studies and files. Used for ingest into the FEDORA repository
- metadata display tool using Exhibit browser (a Simile widget for maps, timelines and faceted browsing)
- tool for harvesting study level DDI 1/2 and DC metadata from XML instances (qual and quant)
- interface for web-based searches through the repository, designed to be integrated into own websites. Uses Lucene
- automatically populated and managed Mulgara triple-store mirroring the contents of the Repository; exposes the contents as RDF in a SPARQL end-point and as Exhibit compatible JSON
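As a rough illustration of how file level metadata captured in a spreadsheet could end up as Exhibit compatible JSON, the Python sketch below converts CSV rows into a dictionary with a top-level `items` list, giving each item a `label` and a `type`. It deliberately skips the FEDORA repository and the triple-store that sit between those two steps in the pilot, and it is not the project's tool: the column names, the sample rows and the `File` type are all invented for the example.

```python
import csv
import io
import json

# Hypothetical spreadsheet export; the column names are assumptions.
SAMPLE_SHEET = """file_name,study_id,file_type,interviewee_gender
int01.rtf,study-001,transcript,female
int01.mp3,study-001,audio,female
int02.rtf,study-001,transcript,male
"""


def rows_to_exhibit(sheet_text: str) -> dict:
    """Turn CSV rows into a dictionary shaped like an Exhibit data file."""
    reader = csv.DictReader(io.StringIO(sheet_text))
    items = []
    for row in reader:
        # Each item gets a label and a type; other columns become plain properties.
        item = {"label": row["file_name"], "type": "File"}
        item.update({k: v for k, v in row.items() if k != "file_name" and v})
        items.append(item)
    return {"items": items}


exhibit_data = rows_to_exhibit(SAMPLE_SHEET)
print(json.dumps(exhibit_data, indent=2))
```

Properties such as `file_type` or `interviewee_gender` are the kind of values a faceted browser could filter on.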

High level arch diagram here?

OTHER METADATA SCHEMA
- DDI3, of course!
- TEI for annotated texts. New P5 version uses similar stand off mark-up notation
- Open Annotation Consortium standards, for annotating video
- FOXML and METS in FEDORA work

OTHER METADATA CREATION USING DDI/QuDEx
Qualidata Publisher
- UK Data Archive has built and is testing an open-source Flex-based system for processing textual interview data
- uses DDI2 study level data and TEI document level elements
- could easily use QuDEx elements
- produces user-ready format (.rtf) and XML versions of transcripts and documents
- item level redaction and access restrictions

NEXT STEPS FOR GROUP
- evaluate DDI3 definitions to see which physical instances might be relevant, e.g. areas: Data Collection Events and Instruments; Codes and Categories, and how these might work with QuDEx
- select most popular use cases to focus on
- translate into technical use cases
- bring to TIC; choose most appropriate tools
- development based on most suitable metadata
- Dagstuhl workshop next year, proposed for next autumn