Click to edit Master subtitle style JISC XYZ Project Principal Investigator: Peter Murray-Rust Project Team: Nick England, Brian Brooks Unilever Centre,

Slides:



Advertisements
Similar presentations
A centre of expertise in digital information management Approaches To The Validation Of Dublin Core Metadata Embedded In (X)HTML Documents Background The.
Advertisements

Issues in methods and reuse for hypermedia ethnography Presented at QUADS Showcase day September 28, 2006 Louise Corti.
S.J. Coles a*, M.B. Hursthouse a, R.A. Stephenson a, P. Cliff b, E. Lyon b, M. Patel b J. Downing c & P. Murray-Rust.
The Electronic Office Some supplementary information Corporate websites Office automation Company intranet.
A Toolbox for Blackboard Tim Roberts
Electronic Theses and Dissertations: Benefits, Issues, and the University of Waterloo Approach
MICROFORMATS Ioana B ă rb ă nan Semantic Web developer.
XHTML & CSS 2 By Trevor Adams. Last week XHTML eXtensible HyperText Mark-up Language The beginning – HTML Web Standards Concept and syntax Elements (tags)
Digital Workbooks Options and Guide. Microsoft Office - Publisher If you use PC’s rather than Macs then ‘Publisher’ is part of the Microsoft Office Software.
Faculty of Electrical Engineering University of Belgrade Predrag Radenković 10/3237 Predrag Radenković 3237/10.
Publishing Scientific Data for Electronic Books: Challenges and Opportunities Expanding the Accessibility of Critically Evaluated Data Dr. Joan Fuller,
The Web of data with meaning... By Michael Griffiths.
COMP 6703 eScience Project Commercial Semantic Web of Digital Library  Student : Yin Chen  Client/Technical Supervisor : Tom Worthington  Academic Supervisor.
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
Technical Tips and Tricks for User Support Mike Gardner
Records Management Network Digital Archiving Workshop 19 March 2015.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
Chapter 9 Introduction to the Document Object Model (DOM) JavaScript, Third Edition.
The digital scholar’s workbench Ian Barnes ELPUB 2007 Vienna — 13th to 15th June 2007.
Before class begins… Help us to assess this session and plan for future workshops Please complete the Advanced Refworks Pre-learning assessment at:
Computer Literacy BASICS: A Comprehensive Guide to IC 3, 5 th Edition Lesson 14 Sharing Documents 1 Morrison / Wells / Ruffolo.
Document Delivery Formats for the Web and Legal Digital Collections Kevin Reiss June 18 th, 2004 Law Library Rutgers-Newark School of Law.
PLUG INS flash, quicktime, java applets, etc. Browser Plug-ins Netscape wanted a method to extend features of the browser became an unofficial standard.
Create a Website on the CWU network Find “How to Post a Web Page with a PC”
Research Services Introduction to research data management - a humanities case study Slides provided by DaMaRO Project, University of Oxford.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
CP2022 Multimedia Internet Communication1 HTML and Hypertext The workings of the web Lecture 7.
The Semantic Web and Microformats. The Semantic Web Syntax = how you say something – Letters, words, punctuation Semantics = meaning behind what you say.
Organizing Internet Resources OCLC’s Internet Cataloging Project -- funded by the Department of Education -- from October 1, 1994 to March 31, 1996.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
The DiVA System: Current Status and Ongoing Development Uwe Klosa Electronic Publishing Centre, Uppsala University, Sweden Eva Müller.
EBank UK: linking scientific data, scholarly communication and learning Michael Day and Rachel Heery UKOLN, University of Bath
1 Reference Linking in Project Euclid …with some thoughts on the preservation of digital collections. A presentation at the Workshop on Linking and searching.
Data Management Practices for Early Career Scientists: Closing Robert Cook Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN.
Introduction to Interactive Media Interactive Media Components: Text.
A bad case of content reuse Validator Website to Validate License Violations Validator – Only requires the URI of the site to check for a license violation.
Future Learning Landscapes Yvan Peter – Université Lille 1 Serge Garlatti – Telecom Bretagne.
RDFa, Microformats, and Atom Semantic Web Presented by: Anuradha Kandula Instructor: Steven Seida.
LOGO A comparison of two web-based document management systems ShaoxinYu Columbia University March 31, 2009.
Semantic Web Technologies Brief Readings Discussion Class work: Research topics and Project discussion Research Presentation Topics assigned Building lightweight.
Breakout session OAI The future of scholarly communication: Enhanced Publications Saskia Woutersen University of Amsterdam.
PLUG INS flash, quicktime, java applets, etc. Browser Plug-ins Netscape wanted a method to extend features of the browser became an unofficial standard.
Gurleen Ahluwalia Lecturer in Communication Skills BBSBEC, Fatehgarh Sahib Punjab.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Metadata Input Tool for CADIS Scientists and Data Managers by D. Stott August 8, 2007.
Future Web Trends Brian Kelly UK Web Focus UKOLN University of Bath UKOLN is funded by Resource: The Council for Museums, Archives.
Incentives for Biodiversity Data Publishing June 2011.
Cascading Style Sheets (CSS) EXPLORING COMPUTER SCIENCE – LESSON 3-5.
Microsoft Access 4 Database Creation and Management.
Introduction to the World Wide Web & Internet CIS 101.
Greater Visibility, Greater Access QSpace QSpace Queen’s University Research & Learning Repository.
Electronic Theses and Dissertations: The bepress Approach Ben Hermalin Interim Dean, Haas School of Business, UC Berkeley & Co-Founder, bepress.
ColdFusion MX 7 “Blackstone” Macromedia, Inc. macromedia 2005 Living With Today’s Internet Chronic problems continue to exist for users and developers.
1 Lesson 14 Sharing Documents Computer Literacy BASICS: A Comprehensive Guide to IC 3, 4 th Edition Morrison / Wells.
 A content management system ( CMS ) is a system providing a collection of procedures used to manage work flow in a collaborative environment. These.
Advanced Uses of RSS Lisa Rogers ticTOCs and Gold Dust.
Chapter 04 Semantic Web Application Architecture 23 November 2015 A Team 오혜성, 조형헌, 권윤, 신동준, 이인용.
Here are some things you can do while you wait 1.Open your omeka.net site in your browser (e.g. 2.Open.
University of Colorado at Denver and Health Sciences Center Department of Preventive Medicine and Biometrics Contact:
7th Annual Hong Kong Innovative Users Group Meeting
The importance of being Connected
Markup of Educational Content
Presented at Archives Records 2016, session 510
Jenn Riley Metadata Librarian Digital Library Program
Microsoft Office Illustrated
Lesson 14 Sharing Documents
Submitted By: Usha MIT-876-2K11 M.Tech(3rd Sem) Information Technology
Jenn Riley Metadata Librarian Digital Library Program
Manage Sourcing - Supplier
Presentation transcript:

Click to edit Master subtitle style JISC XYZ Project Principal Investigator: Peter Murray-Rust Project Team: Nick England, Brian Brooks Unilever Centre, Department of Chemistry, University of Cambridge Managing Research Data Workshop Birmingham 28/29 March 2011

Publishing scientific data Challenge: How can scientists be encouraged to provide data in support of their papers? Academic papers: – Publication record is important to academics – Papers rarely have supporting data files – Asking for data post-publication not optimal XYZ project – ask for data up-front when a paper is submitted – Provision of data is a condition of acceptance of the paper

Academic Publisher Review Published No data Submission Data post-publication (not reviewed) Data submitted with paper

Acta Cryst E IUCr = International Union of Crystallography – Strong supporters of Open Data Acta Cryst E is primarily for publishing crystallographic data XYZ project working with IUCr Building a data journal (i.e. a publication which contains data)

Acta Cryst E Example of Best Practise Data submitted with paper Automatic validation of data Data and validation report available to reviewers. Data available for download when the paper is published

Advantages of Data?

What format should data be stored in? PDF? XML? RDF? XHTML? Other…? PDF: – Commonly used – Used for long-term archiving – Great for printing, reading; not good for data retrieval – 100% people-oriented - Horrible for machines to read, text mine XML: – Content only – Not easy to author – Not compelling for users; good for machines RDF: – Semantic format – Web3 – Resource Description Framework – Good for machines; not good for users

Formats for data (contd) HTML: – Created in early ‘90’s – Combines content and formatting – Pervasive; good for humans, good for browsers – Not ideal for data storage & use XHTML: – Extensible HTML – next-generation HTML – Store data within HTML as XML or other format – Good for humans; good for computers to use content RDFa: – Resource Description Framework-in-attributes – Inclusion of data into XHTML in RDF format – Deliver data in RDF format along with the HTML page

Microformats A set of simple, open data formats built upon existing and widely adopted standards Designed for humans first and machines second Instead of throwing away what works today, microformats solve simpler problems first by adapting to current behaviours and usage patterns (e.g. XHTML, blogging)

Examples of Microformats

Scholarly-HTML (ScHTML) Started by Peter Sefton – Beyond-The-PDF meeting, California, January 2011 – Hackfest, Cambridge, March 2011 Goal for ScHTML is to make things: – Improve the scholarly document – Without putting too much extra burden on the author – Metadata, using a linked data approach.

Desirable properties of ScHTML Easy to create/store Easily accessible Easy to extract data Store any dataset Maintain the digital format of the data (i.e. allow the “filetype” to be conserved) Add semantics to storage of data & metadata Usable by variety of applications/viewers Packaging – able to encapsulate a variety of display and/or data objects in a way that facilitates distribution/ communication of those objects

Principles for ScHTML Data is stored in HTML pages – Can be part of a more general page, or the page could be purely data Data is stored as RDFa Use redirection technique to allow different applications to process the same content

ScHTML Example

Demo Pete Sefton’s demonstration of the potential of ScHTML The demonstration: – Uses a OpenOffice word document – User simply adds a link to the datafile – Place the file and datafile in a dropbox folder – That's it!

XYZ Project Build an Acta Cryst E data journal In ScHTML Data in semantic form, online Convert 250K existing crystal structures from Crystaleye – Plus new crystal structures

References Peter Murray-Rust blog Background to ScHTML Peter Sefton blog – ScholarlyHTML – CrystalEye