Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Slides:



Advertisements
Similar presentations
Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,
Advertisements

Finding Academic Literature in Physics Richard Holmes, Durham University Library.
Web of Science Search and Navigation in the Web of Knowledge
Maria Grazia Pia, INFN Genova 1 Part V The lesson learned Summary and conclusions.
1 2 HEP aims to understand how our Universe works: -Experimental HEP : builds the largest scientific instruments ever to reach.
Shou Ray Information Service Co., Ltd.
Information Skills for Computer Scientists June 2007.
11/18/02Travis Brooks-ASIST The Unpublishing of High Energy Physics Travis Brooks SPIRES Scientific Databases Manager Stanford Linear Accelerator.
Grid Architecture Grid Canada Certificates International Certificates Grid Canada Issued over 2000 certificates Condor G Resource TRIUMF.
Using Web of Science as a Research Tool : Experience at HKUST Library Steve Yip Electronic Information Librarian.
How to Read a Technical Paper Locking and Consistency 10/7/05.
Maria Grazia Pia Systematic validation of Geant4 electromagnetic and hadronic models against proton data Systematic validation of Geant4 electromagnetic.
How the University Library can help you with your term paper Computer Science SC Hester Mountifield Science Library x 8050
NURSING 475 Step Five: RESEARCH APPLICATION. STEP FIVE: The Assignment: n Select a nursing intervention you performed on this patient. What are some of.
How to Research for an Essay and Avoid Plagiarism
Comparison of data distributions: the power of Goodness-of-Fit Tests
Summer students lecture, 06 July 2011T. Basaglia, A. Gentil-Beccot - GS-SIS The CERN Library: an Accelerator of Science.
IR1IMEM YEAR ONE RESEARCH METHODOLOGY L2: Reviewing Literature, Formulating Research Problem, Variables DR. JAVED-VASSILIS KHAN, Drs. Allerd Peters, Frank.
Finding and Citing Articles from 36 Database PowerSearch: A Step-by-Step Approach Dr. Jun Wang San Joaquin Delta College.
BME1450: Biomaterials and Biomedical Research Michelle Baratta Engineering & Computer Science Library Maria Buda Dentistry Library.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Networking the World TM k:June21charlotte,ppt IEEE’s Online Products ICOLC Meeting Hickory Ridge Conference Center October 1, 1999 Jonathan Dahl,
Kathleen Padova INFO 861 January 20, Emerged in different disciplines, academically Continued to develop in different disciplines in practice Information.
P. Saracco, M.G. Pia, INFN Genova An exact framework for Uncertainty Quantification in Monte Carlo simulation CHEP 2013 Amsterdam, October 2013 Paolo.
Library Information and Services CSE Librarian: Jason Neal Phone: Office: B 03 E Nedderman Hall UTA.
IEEE Nuclear Science Symposium and Medical Imaging Conference Short Course The Geant4 Simulation Toolkit Sunanda Banerjee (Saha Inst. Nucl. Phys., Kolkata,
HERA/LHC Workshop, MC Tools working group, HzTool, JetWeb and CEDAR Tools for validating and tuning MC models Ben Waugh, UCL Workshop on.
IEEE Nuclear Science Symposium and Medical Imaging Conference Short Course The Geant4 Simulation Toolkit Sunanda Banerjee (Saha Inst. Nucl. Phys., Kolkata,
Maria Grazia Pia, INFN Genova Test & Analysis Project aka “statistical testing” Maria Grazia Pia, INFN Genova on behalf of the T&A team
Nightly System Growth Graphs Abstract For over 10 years of development the ATLAS Nightly Build System has evolved into a factory for automatic release.
3 June 2004GridPP10Slide 1 GridPP Dissemination Sarah Pearce Dissemination Officer
Summer students pres., 03 July 2013T. Basaglia - GS-SIS The CERN Library: an Accelerator of Knowledge.
Maria Grazia Pia, INFN Genova 1 Scholarly literature and the press: scientific impact and social perception of physics computing M. G. Pia 1, T. Basaglia.
IL Step 3: Using Bibliographic Databases Information Literacy 1.
Databases E. Leonardi, P. Valente. Conditions DB Conditions=Dynamic parameters non-event time-varying Conditions database (CondDB) General definition:
1 Information Literacy Program Module 1 Resources The Emalus Campus Library Emalus Campus.
Research Methodology Workshop Session 2: Effective Literature Review Ashish Anand & Vijaya Saradhi Department of Computer Science and Engineering IIT Guwahati.
Metadata Workshop Rick St. Denis Glasgow University April 26-28, 2004.
Writing software or writing scientific articles? Maria Grazia Pia INFN Genova, Italy T. Basaglia (CERN), Z. Bell (ORNL), P. Dressendorfer (IEEE), A. Larkin.
IEEE Nuclear Science Symposium and Medical Imaging Conference Short Course The Geant4 Simulation Toolkit Sunanda Banerjee (Saha Inst. Nucl. Phys., Kolkata,
Maria Grazia Pia, INFN Genova Update on the Goodness of Fit Toolkit M.G. Pia B. Mascialino, A. Pfeiffer, M.G. Pia, A. Ribon, P. Viarengo
Database collection evaluation An application of evaluative methods S519.
Les Les Robertson LCG Project Leader High Energy Physics using a worldwide computing grid Torino December 2005.
Precision analysis of Geant4 condensed transport effects on energy deposition in detectors M. Batič 1,2, G. Hoff 1,3, M. G. Pia 1 1 INFN Sezione di Genova,
LHCbComputing Manpower requirements. Disclaimer m In the absence of a manpower planning officer, all FTE figures in the following slides are approximate.
Writing software or writing scientific articles? Maria Grazia Pia INFN Genova, Italy T. Basaglia (CERN), Z. Bell (ORNL), P. Dressendorfer (IEEE), A. Larkin.
Open Archive Workshop, CERN th March 2001 Peer Review - the HEP View Mick Draper, CERN ETT Division
Maria Grazia Pia, INFN Genova Statistics Toolkit Project Maria Grazia Pia, INFN Genova AIDA Workshop.
CiteSearch: Multi-faceted Fusion Approach to Citation Analysis Kiduk Yang and Lokman Meho Web Information Discovery Integrated Tool Laboratory School of.
Oxlip+. What is Oxlip+? A tool for finding & linking to databases – Online collections of (scholarly) materials – Includes full text / indexes / range.
K.POMMÈS (CERN), B. LANGE (RIO), V. PAVANI NETO(RIO),B. VIEIRA AROSA (RIO) ATLAS Glance Since this week our offices are moved to building 3150.
Susanna Guatelli Geant4 in a Distributed Computing Environment S. Guatelli 1, P. Mendez Lorenzo 2, J. Moscicki 2, M.G. Pia 1 1. INFN Genova, Italy, 2.
Maria Grazia Pia, INFN Genova and CERN1 Geant4 highlights of relevance for medical physics applications Maria Grazia Pia INFN Genova and CERN.
Bodleian Social Science Library Michaelmas, 2011 Post-induction session for Anthropologists Finding key information resources Sarah Rhodes Forced Migration,
Library presentation, 28 September 2010T. Basaglia, A. Gentil-Beccot - GS-SI What can CERN Library do for TE department?
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
My Jobs at CERN April 2015 My Jobs at CERN2
CERN - IT Department CH-1211 Genève 23 Switzerland t Grid Reliability Pablo Saiz On behalf of the Dashboard team: J. Andreeva, C. Cirstoiu,
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES The Common Solutions Strategy of the Experiment Support group.
Publication Pattern of CA-A Cancer Journal for Clinician Hsin Chen 1 *, Yee-Shuan Lee 2 and Yuh-Shan Ho 1# 1 School of Public Health, Taipei Medical University.
Using Google Scholar Ronald Wirtz, Ph.D.Calvin T. Ryan LibraryDec Finding Scholarly Information With A Popular Search Engine Tool.
BME1450: Biomaterials and Biomedical Research
(Prague, March 2009) Andrey Y Shevel
Ming-Yao Chen#, Wen-Ta Chiu and Yuh-Shan Ho*
Introductory Course PTB, Braunschweig, June 2009
Indication of Publication Pattern of Scientometrics
Short Course Siena, 5-6 October 2006
Introductory Course ORNL, May 2008
Building an open library without walls : Archiving of particle physics data and results for long-term access and use Joanne Yeomans CERN Scientific Information.
G. A. P. Cirrone1, G. Cuttone1, F. Di Rosa1, S. Guatelli1, A
Presentation transcript:

Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova, Italy 2 CERN, Geneva, Switzerland 3 ORNL, Oak Ridge, TN, USA 4 IEEE, Piscataway, NJ, USA CHEP 2012, NYC

Maria Grazia Pia, INFN Genova 2 Analysis topics General tools −Geant4 −ROOT HEP experiments −LEP  ALEPH, DELPHI, L3, OPAL −BaBar −LHC  ALICE, ATLAS, CMS, LHCb, TOTEM Grid computing −LCG What they publish How much Where Citations Technology vs physics Software vs hardware Software/DAQ-trigger Representative Not exhaustive

Maria Grazia Pia, INFN Genova 3 Data sources Thomson-Reuters: ISI Web of Knowledge −CERN subscription: since 1970, conference database not included −Search by keywords, collaboration name Journal web sites −IEEE TNS −NIM, Comp. Phys. Comm. (Elsevier) −JINST (IOP/SISSA) ➤ Full-text searches CERN databases −CERN Document System −Greybook Years: (LEP), (BaBar, LHC) −Reproducible sample

Maria Grazia Pia, INFN Genova 4 Data sample Contamination −Non-pertinent entries in the data sample Omission −Pertinent papers are not included in the data sample ➩ Cross-checks −WoS/CDS, WoS/publishers’ web sites WoS inconsistencies and errors −Total number of citations includes Conference database −Proceedings papers: false classifications and omissions ➩ Manually corrected whenever possible Automated analysis (whenever possible) Manual evaluation: abstracts and full-text papers −Some degree of subjectivity

Maria Grazia Pia, INFN Genova 5 S. Agostinelli et al. Geant4: a simulation toolkit NIM A, vol. 506, no. 3, pp , 2003 J. Allison et al. Geant4 Developments and Applications IEEE Trans. Nucl. Sci., vol. 53, no. 1, pp , citations (14 May 2012 ) 2026 citations excluding proceedings Most cited CERN publication in WoS (excluding Rev. Part. Properties) 574 citations (14 May 2012 ) 381 citations excluding proceedings Many papers cite the NIM paper, but they omit citing the TNS one, even though both are indicated in Many papers that use Geant4 do not cite either reference Citation analysis: until 2011 (reproducibility)

Maria Grazia Pia, INFN Genova 6 75% citations (plot) LHC HEP Other 16% citations (plot) 19% citations from collaborations Born from LHC experimental requirements Multidisciplinary sources of citations

Maria Grazia Pia, INFN Genova 7 R. Brun and F. Rademakers ROOT - An object oriented data analysis framework NIM A, vol. 389, no. 1-2, pp , 1997 I. Antcheva et al. ROOT - A C++ framework for petabyte data storage, statistical analysis and visualization Comp. Phys Comm., vol. 180, no. 12, pp , citations (14 May 2012 ) 347 citations excluding proceedings 27 citations (14 May 2012 ) 20 citations excluding proceedings AIHENP Workshop proceedings paper Citation analysis: until 2011 (reproducibility)

Maria Grazia Pia, INFN Genova 8 75% citations 8% of all citations from collaborations Geant4 %ROOT % Technology Physics BioMedical Field of citing journals

Maria Grazia Pia, INFN Genova 9 HEP experiments LEP ALEPH DELPHI L3 OPAL BaBar LHC ALICE ATLAS CMS LHCb TOTEM LEP: 1989 BaBar: 1999 LHC: 2008 Start of run

Maria Grazia Pia, INFN Genova 10 Time distribution LEP: 1989 BaBar: 1999 LHC: 2008 Run start Publication year Rescaled w.r.t. year of start run

Maria Grazia Pia, INFN Genova 11 Time distribution LEP: 1989 BaBar: 1999 LHC: 2008 Run start Same as previous slide, rescaled by the number of experiment members

Maria Grazia Pia, INFN Genova 12 Publications Share of hardware, software and DAQ-trigger publications

Maria Grazia Pia, INFN Genova 13 Physics publications LEP experiments completed their life-cycle LHC experiments: at an early stage of their physics production

Maria Grazia Pia, INFN Genova 14 Technological publications Roughly constant trends, once the number of publications is normalized to the number of collaborators

Maria Grazia Pia, INFN Genova 15 Software vs. hardware Hardware publications: approximately 4 times more than software DAQ-trigger publications: approximately 1.3 times more than software Hardware publications: approximately 4 times more than software DAQ-trigger publications: approximately 1.3 times more than software

Maria Grazia Pia, INFN Genova 16 Journals hardware DAQ-trigger software TNSTNS NIMANIMA JINSTJINST

Maria Grazia Pia, INFN Genova 17 Journals: LEP and LHC Still dominated by technological publications LHC LEP Dominated by physics publications

Maria Grazia Pia, INFN Genova 18 Journals: pre- and post-2000 IEEE TNS is the most popular journal for HEP technological publications in recent years

Maria Grazia Pia, INFN Genova 19 Citations The most cited papers are often the general reference papers about the detector published by each experiment Citations of the most cited paper ALEPH: 340 DELPHI: 309 L3: 509 OPAL: 473 BaBar: 859 ALICE: 116 CMS:129 LHCb:101 TOTEM: 35 ATLAS: ATLAS pixel detector electronics and sensors: citations: 4% 0 citations: 17% 0 citations: 27% 0 citations: 25%

Maria Grazia Pia, INFN Genova 20 More references more citations References Physics papers cite more references than technological papers Bibliographical entries in software papers are often web sites

Maria Grazia Pia, INFN Genova 21 Pages The number of pages of a paper depends on the format of the journal −1 page TNS ≈ 2.5 pages JINST Different journal formats in the same category Evolutions of the format of some journals (e.g. NIM)

Maria Grazia Pia, INFN Genova 22 Sources of citations to physics papers LEP Samples in plots account for >90% of citations Citations to HEP physics papers mostly come from journals specialized in HEP and a few related fields (astroparticle and nuclear physics)

Maria Grazia Pia, INFN Genova 23 Sources of citations to technological papers Citations from HEP physics and technology journals LHC LEP

Maria Grazia Pia, INFN Genova More refined analysis of technological papers published since start of LHC run

Maria Grazia Pia, INFN Genova 25 TNS NIM A Citations Self-citations Outside citations

Maria Grazia Pia, INFN Genova 26 LCG – LHC Computing Grid Sakamoto, H Data grid deployment for high energy physics in JapanCPC Shiers, J The Worldwide LHC Computing Grid (worldwide LCG)CPC Belov, S et al. LCG MCDB - a knowledgebase of Monte-Carlo simulated eventsCPC Yin, Fet al. Grid resource management policies for load-balancing and energy-saving by vacation queuing theory CPC Malawski, M et al. Invocation of operations from script-based Grid applicationsFut. Gen. Comp. Syst Huedo, E et al. A modular meta-scheduling architecture for interfacing with pre-WS and WS Grid resource management services Fut. Gen. Comp. Syst Agarwal, A et al. GridX1: A Canadian computational gridFut. Gen. Comp. Syst Chytracek, R et al. POOL development status and production experienceTNS Hatlo, M et al. Developments of mathematical software libraries for the LHC experimentsTNS Pfeiffer, A et al. The LCG PI project: Using interfaces for physics data analysisTNS Munro, C et al. Measurement of the LCG2 and gLite File Catalogue's performanceTNS Li, H Realistic Workload Modeling and Its Performance Impacts in Large-Scale eScience Grids IEEE Trans. Par. Distr. Syst Andreeva, J et al. High-Energy Physics on the Grid: the ATLAS and CMS ExperienceJ. Grid Comp Munoz, VM et al. A Decentralized Deployment Strategy and Performance Evaluation of LCG File Catalog Service J. Grid Comp Hou, S et al. PacCAF: a Grid Portal in Pacific Asia for the CDF ExperimentJ. Grid Comp Kim, BK et al. A Composition of Monitoring Services for the LHC Computing GridJ. Grid Comp WoS

Maria Grazia Pia, INFN Genova 27 LCG Small sample of publications Hard to perform any statistical analysis

Maria Grazia Pia, INFN Genova 28 Conclusions Software is largely underrepresented in HEP scholarly literature w.r.t. hardware Publication patterns appear similar in the LEP and LHC era Citation patterns are different for publications by HEP experiments and about general software tools Publish! …and don’t forget to cite