H.B. O'Connell HEP Info Summit DESY May 2008

Slides:



Advertisements
Similar presentations
How do High Energy Physics scholars search their information? Anne Gentil-Beccot, CERN – 11 December 2007, GL9 conference.
Advertisements

50 Years of Experience in Making Grey Literature Available Matching the Expectations of the Particle Physics Community Carmen ODell.
Who’s who? Author identification in INSPIRE -Heath O’Connell, Fermilab November 2012AAHEP61.
1 2 HEP aims to understand how our Universe works: -Experimental HEP : builds the largest scientific instruments ever to reach.
Maximizing the benefit of research information in Particle Physics *** A user-driven story Anne Gentil-Beccot, CERN. EuroCris. 11 May 2010.
Realizing the Dream of a Global Digital Library in High-Energy Physics Annette Holtkamp, Salvatore Mele, Tibor Simko, Tim Smith CERN, Geneva DML 2010 –
Information-Seeking Behavior in the High-Energy Physics Community Tamar Sadeh School of Informatics, City University, London Ex Libris HCI conference,
JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot CERN Library GS/SIS The Library behind the scene Opportunities for Scientific.
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot.
Engineering Village ™ ® Basic Searching On Compendex ®
How to fill an institutional repository - winning scientists over – the example from CERN Joanne Yeomans CERN Scientific Information Group Geneva - Switzerland.
Introduction to Information Retrieval Got a question concerning literature? Ask! Marion Bierhahn (4630) Where is the library? Bldg:1d.
Rosalind Moore Entrepreneurship and Small Business Department LIS 620.
Information systems for HEP: INSPIRE, arXiv and more Annette Holtkamp CERN ASP 2012 Kumasi, Ghana, Aug 3, 2012.
OARE Module 5B: Searching for Scientific Research Using Environmental Issues and Policy Index (EBSCO)
Welcome to the Web of Science tutorial By the end of this tutorial you should be able to: Do a basic search to find references Use search techniques to.
Cambridge Journals Online – CJO Redesign 2010 Slides of key pages: 1. CJO homepage 2 & 3. Journal homepage 4. Abstract.
INSPIRE Travis Brooks (SLAC) Tibor Simko (CERN). SPIRES’ History Index to HEP literature for 35 years Via terminal login Via Via web (1st U.S. Website/1st.
Summer students lecture, 06 July 2011T. Basaglia, A. Gentil-Beccot - GS-SIS The CERN Library: an Accelerator of Science.
CERN – IT Department CH-1211 Genève 23 Switzerland t CERN Open Source Collaborative tools: Digital Library Software Tim Smith CERN/IT.
Thomson Scientific October 2006 ISI Web of Knowledge Autumn updates.
OARE Module 5A: Scopus (Elsevier). Table of Contents About Scopus (Elsevier) Using Scopus Search Page Results/Refine Search Pages Download, PDF, Export,
Summer students pres., 03 July 2013T. Basaglia - GS-SIS The CERN Library: an Accelerator of Knowledge.
WISER : OxLIP+ Workshops in Information Skills and Electronic Research Oxford Libraries Information Platform Craig Finlay Gillian Beattie.
CERN - IT Department CH-1211 Genève 23 Switzerland t INSPIRE A Global Digital Library for HEP 14 th February 2011 Tim Smith on behalf of.
1 OSTI - Accelerating Science Information Dr. Walter L. Warnick Director U.S. Department of Energy Office of Scientific and Technical Information Federal.
WISER Social Sciences: SOLO (Search Oxford Libraries Online) Angela Carritt User Education Coordinator.
A Global Digital Library for High-Energy Physics Annette Holtkamp CERN-UNESCO School on Digital Libraries – Rabat, Nov 2010.
Three indexes: Social Science Citation Index Index to Legal Periodicals Index to Foreign Legal Periodicals.
1 The next generation HEP information system. HEP scientists love community services 2 What is the primary source of information for HEP scientists? From.
Oxlip+. What is Oxlip+? A tool for finding & linking to databases – Online collections of (scholarly) materials – Includes full text / indexes / range.
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
Inspire Status Library Group Meeting 2010/03/31. Meetings Mar workshop at CERN –Preparation of beta release –Travis, Joe, Zaven –Many small working.
Searching for Scientific Research Using Environmental Index (EBSCO)
Jacynthe Touchette, MSI JGH Health Sciences Library
Scopus - Elsevier (Advanced Course Module 8)
OARE Module 5A: Scopus (Elsevier)
Summon - HINARI Search (Basic Course Module 7)
The High Energy Physics information platform: Introduction
LMEvents SharePoint Portal How-to Guide
What is a Blog? short for Weblog journal on a website
1.
Tim Smith CERN Geneva, Switzerland
Compilation of SCOAP supported papers
Elsevier Activity Range
Introduction to Information Retrieval
Introduction to Information Retrieval
Summon – Hinari Search Part B (Basic Course Module 7)
Summon - HINARI Search (Basic Course Module 7 Part B)
ICOTS Helpdesk Training
Publications and Research Data – crosslinking repositories
Scopus - Elsevier (Advanced Course Module 8)
Eric Sieverts University Library Utrecht Institute for Media &
WISER Finding stuff: Journal Articles
Literary reference center
New Features Update Web of Knowledge : Discovery Starts Here
Gwyn P. Williams and Kim Kindrew Pizza Seminar, September 18, 2013
Review Key Teaching Points
Introduction of KNS55 Platform
Hands-on Introduction and Refresher Course
Science Reference Center
WISER Humanities: Keeping up to date
USER MANUAL - WORLDSCINET
Building an open library without walls : Archiving of particle physics data and results for long-term access and use Joanne Yeomans CERN Scientific Information.
Summon - HINARI Search (Basic Course: Module 7 Part B)
ProQuest Databases.
Scopus - Elsevier (Advanced Course: Module 8)
DESY Documentation: Status + projects
USER MANUAL - WORLDSCINET
Search for Article Citation
Presentation transcript:

H.B. O'Connell HEP Info Summit DESY 20-21 May 2008 Goals for HEP Heath O’Connell H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

What do physicists want from us? Access to full-text (strongest interest). Search accuracy: find the right information. Coverage: a central place with everything. Citation analysis: co-citation, author citation, etc Conference proceedings. Experimental and Theoretical Results All instances of a result (notes, conf, article). Access to the data in tables and figures. Computer codes Published comments and replies H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

How the achieve this? Three corners of triangle Scientific Community inSPIRE using portal-role to bring involvement of scientific community. Author identification, metadata sharing (CrossRef), peer review, published articles Traditional information curation. Publishers: APS Elsevier Springer IEEE Information Resources: inSPIRE arXiv ADS PDG Sliding scale along this axis H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

User generated content Poll: If a simple web interface would show you an article and offer a set of categories to which it could belong, how much time would you spend in this tagging system to give a service to the community? Willing scientist FTE > current library staff. Incredible community response with incomparable scientific knowledge. How to harness this important resource? Web 2.0! We would share this information with anyone. H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

User-generated content for SPIRES 1998-2008 12,000 HEPNAMES records verified. 10,000 article reference lists added. 10,000 articles added. 1,300 job listings added. 3 FTE years from scientists (5 min each). 1/2 FTE year in staff time just to cut-and-paste this information from email to database (1 min. each). H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

Harnessing the hidden workforce Tagging/correcting/updating records must be easy, automatic, standardized Simple login interface, restricted to community Drop down menus to, e.g., authority files such as author names, experiments, institutions, journal names and keywords. Obvious and rational conventions. Commenting, ranking and enriching records. H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

A Web 2.0 Approach for Records H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

A Web 2.0 Approach (continued) H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

H.B. O'Connell HEP Info Summit DESY 20-21 May 2008 Access to Full-text Poll: 85% responded that access to full-text “very important” (highest rated issue). Older preprints, might require scanning: Time intensive, but worthy Immediate interest in Fermilab’s scanning of 1950’s MURA reports. SLAC scanned with permission “The Two Mile Accelerator” and posted it, huge number of hits. CERN put 12,000 CERN-TH on the web Import from KEK scans Scanning at CERN from collection Emailing authors for copies Asking publishers for permission to scan papers Import from publishers via mutual agreement. H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

Full-text: A role for authors Many authors have pre-arXiv TeX files or paper preprints they could scan. Potential for thousands of papers to be uploaded to the web. Personal web pages not archivally stable. What is the most sensible way to do this? H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

Older preprints are not being deposited at arXiv Submission date-stamp and “new” list of arXiv is important feature for precedence, distributing latest scholarship. Authors reluctant to submit older papers. 50K+ hep-ph papers, only 29 before 1991. 40K+ hep-th papers, only 37 before 1991. Also a problem for Ph.D. theses. H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

Need a place for older preprints inSPIRE will establish a drop box for authors older preprints and other unpublished material. Would need an automatic way to link to record in INSPIRE or create a record in INSPIRE if one didn’t exist. “Click here to upload full-text of paper.” H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

Better organizing the data Scientists use limited search criteria: Author, title, citation Difficult to find all papers on a topic Powerful classification is non-trivial: Hundreds of thousands of terms and articles Assignment cannot be done by non-scientists A taxonomy is required to enable, e.g.: Automatic classification from full-text Improved search tools and display of results Finding related articles Assessing article relevance Finding reviews. H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

Different schemes exist PACS Field Codes: corresponding to arXiv classification Keywords: author keywords (non standardized) DESY-KW other KW-taxonomies exist, e.g. INSPEC all implemented already in INSPIRES H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

H.B. O'Connell HEP Info Summit DESY 20-21 May 2008 HEP Taxonomy evolving from DESY thesaurus, contains related / narrower / broader / composite descriptions / comments / synonyms enables automated keywording by text-mining recommendation system (already implemented) proposes KWs selected and supplemented by physicists H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

H.B. O'Connell HEP Info Summit DESY 20-21 May 2008 KW Example symmetry breaking alt: broken symmetry related: violation narrow: dynamical symmetry breaking spontaneous symmetry breaking symmetry related: invariance narrow: asymmetry hidden symmetry horizontal symmetry kappa symmetry supersymmetry symmetry breaking composite: violation alt: violat\w+ non[-\s]*conserv\w+ related: symmetry breaking symmetry: Becchi-Rouet-Stora symmetry: Lorentz symmetry: O(N) symmetry: SU(3) x SU(2) x U(1) x U(1) symmetry: chiral time: symmetry …: … …: … a wealth of information esp. in composite KWs 100 composites with ‘symmetry’ H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

Improving the Implementation scientists help through Web 2.0 automated assignment of Field Codes linguistic algorithms intelligent search tools H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

H.B. O'Connell HEP Info Summit DESY 20-21 May 2008 Towards common goals Currently SPIRES is taking feeds from the APS, arXiv, Elsevier, IEEE, IOP and Springer. 85% of scientists use community resources to find an article and access the full-text (Google another 12%, probably full-text searching). Key role here for publishers and libraries work together in ensuring system is comprehensive and articles can be found easily by scientists. inSPIRE will receive lots of metadata from authors, that we want to share with others. How do we do this without duplication? H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

H.B. O'Connell HEP Info Summit DESY 20-21 May 2008 Who’s Who of Science Increasing problems of identifying authors. Collaborations with hundreds of people. Ambiguous names, many languages. APS has recently introduced innovative system for Chinese names. Want to find single author’s papers to judge scientific output. Clear need for author ID system. HEPNAMES – 12,000 verified records. Requires constant updating and help from scientific community. H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

HEPNAMES record: author identified, community standard ID needed H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

Data from plots and figures Refined, published data, such as plot of cross-section v. energy, easily understood. Useful for fitting with phenom. models. Could be uploaded with paper as text file. Otherwise software can accurately extract numbers from figures. Durham REACTIONS database has over 5,000 records for papers with data. We want to develop “Google Table” and “Google Plot” (like Google Images). H.B. O'Connell HEP Info Summit DESY 20-21 May 2008

H.B. O'Connell HEP Info Summit DESY 20-21 May 2008 Conclusion: Unification of HEP literature in all forms. Author names, etc standardized with ID. New standardized taxonomy organizes the literature. Web 2.0 harnesses willing scientists. Putting resources in common to advance the common good, publishers, inSPIRE, ADS and arXiv work together, develop synergies to present the scientific literature to the community. Future plans evolve with the HEP community. H.B. O'Connell HEP Info Summit DESY 20-21 May 2008