INSPIRE A new information system for High-Energy Physics

Presentation transcript:

INSPIRE: A new information system for High-Energy Physics. Lessons learnt.
Salvatore Mele, CERN, on behalf of the INSPIRE collaboration
OR2010 | Madrid | July 6th 2010

~15’000 High Energy Physics (HEP) scientists smash stuff at the speed of light to produce new stuff

~15’000 HEP theorists scratch their heads to make sense of all that stuff and then some more

…and they write 10k papers/year ’bout it (Krause et al., CERN-OPEN-2007-014)
90% of papers… …on theory … 3 authors

The “preprint culture” (L. Goldschmidt-Clermont, 1965, http://eprints.rclis.org/archive/00000445/02/communication_patterns.pdf)
Scientific journals of the ’60s too slow for HEP
Mass-mail preprints to institutes worldwide
Ante litteram (institute-pays) Open Access
Leading libraries catalogue & serve preprints
From the ’70s, SPIRES: e-Catalogue
CERN Library, circa 1960

Something “vague but exciting” @CERN (T. Berners-Lee at CERN, early ‘90s)

1st Web Site in the U.S.

1991: arXiv.org
Discovery and first plateaus
Steady state & constant output
Conference contributions
WWW

Why don’t we need mandates? Why don’t we need advocacy?

What do scientists want ?

Visibility Acceleration Impact

Visibility

Where do HEP scientists look for info? (Gentil-Beccot et al., arXiv:0804.2701)
Survey of 2’000+ scientists (10% of community)
OA tools answer scientists’ information needs
Google as proxy of arXiv, SPIRES, publishers

Acceleration

Ten years in the life of a HEP article
SPIRES counts: citations to/from preprints/articles
Citations peak at publication
Scientific discourse proceeds on the discipline repository

Impact

Citation augmentation
Discipline repository yields immense advantage
Five times more citations for articles in arXiv
20% of 2-year citations occur before publication

By the way, do they read journals?

97% of HEP journals’ content is in arXiv

Given a choice, only one in five (maybe ten) scientists goes to the published version of an article (Gentil-Beccot et al., arXiv:0906.5418)
∼30,000 clicks (choice between arXiv and journal): publisher server 18%, arXiv 82%
(As many scientists as analyzed here go straight to arXiv)
Still, peer-review and journals are indispensable
SCOAP3.org will solve this conundrum

Enter INSPIRE What is it? What does it do? What will it do? Why shall I care?

INSPIRE in a nutshell
Joint project of CERN, DESY, Fermilab and SLAC
Inspired by a survey of HEP scientists (2000+ respondents) expecting the future from the ageing SPIRES infrastructure
Unify DESY/Fermilab/SLAC SPIRES content with the CERN Invenio Open Source Digital Library platform
860’000 records / 500’000 full-text / all that matters in HEP
40 years of manual curation; arXiv and publishers’ feeds

Invenio: CERN Open Source Digital Library solution
http://invenio-software.org
Designed for scientific libraries with 0.2-10M records
Now also powering the OpenAIRE “Orphan” repository (listen to S. Kaplun’s talk today @1500, Room 2)

Who do we know?
80K names (20K affiliation histories, 25K e-mails)
860K papers with authors and affiliations
22M ‘signatures’ on papers
Automatic disambiguation (Henning Weiler, PhD@CERN): “Who was where when and wrote on what with whom?”
E.g. scan 5.4M signatures: attribute 5.3M to 250 authors!
E.g. 963 papers by “Chen, G”: 98% attributed to 21 authors
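The slide does not say how the automatic disambiguation works internally. As a rough illustration of the “who was where when and wrote … with whom” idea only, here is a toy sketch: signatures carrying the same name string are merged when they share an affiliation or a coauthor. The `Signature` record, the `disambiguate` function, the merge rule and all sample data are hypothetical and are not INSPIRE's actual method.

```python
from collections import defaultdict
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Signature:
    """One author name as it appears on one paper (hypothetical layout)."""
    paper_id: str
    name: str
    affiliation: str
    coauthors: frozenset


def disambiguate(signatures):
    """Group signatures that plausibly belong to the same person.

    Toy rule (an assumption, not INSPIRE's algorithm): two signatures with
    the same name string are merged when they share an affiliation or at
    least one coauthor. Merging is transitive, implemented with union-find.
    """
    parent = list(range(len(signatures)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    def union(i, j):
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[rj] = ri

    # Only signatures with an identical name string are ever compared.
    by_name = defaultdict(list)
    for idx, sig in enumerate(signatures):
        by_name[sig.name].append(idx)

    for indices in by_name.values():
        for i, j in combinations(indices, 2):
            a, b = signatures[i], signatures[j]
            if a.affiliation == b.affiliation or a.coauthors & b.coauthors:
                union(i, j)

    # Collect the paper ids of each resulting cluster (one per person).
    clusters = defaultdict(list)
    for idx, sig in enumerate(signatures):
        clusters[find(idx)].append(sig.paper_id)
    return sorted(sorted(papers) for papers in clusters.values())


# Made-up sample: four papers signed "Chen, G".
signatures = [
    Signature("p1", "Chen, G", "Inst A", frozenset({"Li, H", "Rossi, M"})),
    Signature("p2", "Chen, G", "Inst B", frozenset({"Rossi, M"})),
    Signature("p3", "Chen, G", "Inst C", frozenset({"Kim, S"})),
    Signature("p4", "Chen, G", "Inst C", frozenset({"Novak, P"})),
]
clusters = disambiguate(signatures)
print(clusters)
```

In this made-up sample, p1 and p2 are linked by a shared coauthor and p3 and p4 by a shared affiliation, so the four signatures fall into two presumed authors. A real system would need far more evidence (dates, topics, e-mail addresses) and a way to handle name variants, which this sketch ignores.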

Coming soon
Content, metadata and feature enhancement

Back to the users
Personal libraries, alerts
Claim-my-papers (with arXiv and ORCID)
Submit theses and old non-arXiv material
Attach non-text material (high-level data files)
OCR of library holdings (with D4Science-II)
Advanced feeds (with ADS, arXiv, publishers)

Full-text search

Figure extraction

Coming later
Holistic recommender system (logs, cites, text)
Crowdsourcing of keywording (tagging)
Semantic layer (did-you-mean and classification)
ORE’ish aggregation of cite/people/data/papers
(Semantic) image search
Platform for high-level data preservation
NOTE: Looking for PhDs/Post-Docs, e-mail me

Lessons learnt
Understand users’/authors’ drivers and barriers
Invest in value-added services for scientists
Harvesting and curation
Visibility, accessibility, efficiency
Co-operation, collaboration, partnerships
Aim: ACCELERATE SCIENCE

Thank you!
Salvatore.Mele@cern.ch
http://inspirebeta.net
Additional resources:
R. Heuer et al., Innovation in Scholarly Communication: Vision and Projects from High-Energy Physics, http://arxiv.org/abs/0805.2739
A. Gentil-Beccot et al., Information Resources in High-Energy Physics: Surveying the Present Landscape and Charting the Future Course, http://arxiv.org/abs/0804.2701
A. Gentil-Beccot et al., Citing and Reading Behaviors in HEP: How a Community Stopped Worrying about Journals and Learned to Love Repositories, http://arxiv.org/abs/0906.5418