Publication patterns in HEP computing

M. G. Pia (1), T. Basaglia (2), Z. W. Bell (3), P. V. Dressendorfer (4)
(1) INFN Genova, Genova, Italy
(2) CERN, Geneva, Switzerland
(3) ORNL, Oak Ridge, TN, USA
(4) IEEE, Piscataway, NJ, USA

CHEP 2012, NYC

Maria Grazia Pia, INFN Genova





Analysis topics

General tools
- Geant4
- ROOT
HEP experiments
- LEP: ALEPH, DELPHI, L3, OPAL
- BaBar
- LHC: ALICE, ATLAS, CMS, LHCb, TOTEM
Grid computing
- LCG

- What they publish
- How much
- Where
- Citations
- Technology vs physics
- Software vs hardware
- Software/DAQ-trigger

Representative, not exhaustive

Data sources

Thomson-Reuters: ISI Web of Knowledge
- CERN subscription: since 1970, conference database not included
- Search by keywords, collaboration name
Journal web sites
- IEEE TNS
- NIM, Comp. Phys. Comm. (Elsevier)
- JINST (IOP/SISSA)
- Full-text searches
CERN databases
- CERN Document System
- Greybook
Years: 1982-2011 (LEP), 1992-2011 (BaBar, LHC)
Reproducible sample

Data sample

Contamination
- Non-pertinent entries in the data sample
Omission
- Pertinent papers are not included in the data sample
Cross-checks
- WoS/CDS, WoS/publishers' web sites
WoS inconsistencies and errors
- Total number of citations includes the Conference database
- Proceedings papers: false classifications and omissions
- Manually corrected whenever possible
Automated analysis (whenever possible)
Manual evaluation: abstracts and full-text papers
- Some degree of subjectivity
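The cross-checks between sources listed above can be pictured as set comparisons between two record lists. The following is only a minimal sketch, assuming each source (for example WoS and CDS) has been exported as a list of normalized paper identifiers. The function name `cross_check` and the sample identifiers are hypothetical and are not taken from the talk.

```python
def cross_check(source_a, source_b):
    """Compare two collections of paper identifiers.

    Returns the records found in both sources and the records found in
    only one of them. Records found in a single source are candidates
    for manual review: they may be omissions in the other source or
    contamination in this one.
    """
    a, b = set(source_a), set(source_b)
    return {
        "in_both": sorted(a & b),
        "only_in_a": sorted(a - b),
        "only_in_b": sorted(b - a),
    }


# Hypothetical identifiers, for illustration only.
wos_records = ["paper-001", "paper-002", "paper-003"]
cds_records = ["paper-002", "paper-003", "paper-004"]

report = cross_check(wos_records, cds_records)
print("Only in first source:", report["only_in_a"])
print("Only in second source:", report["only_in_b"])
```

A set difference only flags candidates. Deciding whether a flagged record is pertinent still requires the manual evaluation of abstracts and full texts mentioned in the slide.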


Geant4 citations

S. Agostinelli et al., Geant4: a simulation toolkit, NIM A, vol. 506, no. 3, pp. 250-303, 2003
- 2934 citations (14 May 2012)
- 2026 citations excluding proceedings
- Most cited CERN publication in WoS (excluding Rev. Part. Properties)

J. Allison et al., Geant4 Developments and Applications, IEEE Trans. Nucl. Sci., vol. 53, no. 1, pp. 270-278, 2006
- 574 citations (14 May 2012)
- 381 citations excluding proceedings

Many papers cite the NIM paper but omit the TNS one, even though both are indicated at http://cern.ch/geant4
Many papers that use Geant4 do not cite either reference
Citation analysis: until 2011 (reproducibility)

[Plots: sources of Geant4 citations by category (LHC, HEP, Other). Plot labels: 75% citations; 16% citations; 19% citations from collaborations]

Born from LHC experimental requirements
Multidisciplinary sources of citations
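The share of citations that come from proceedings follows directly from the two counts quoted for each Geant4 reference paper (total, and total excluding proceedings). The sketch below does that arithmetic. The function name `proceedings_share` is an illustrative choice, and the interpretation of the difference as "citations from proceedings" is an assumption based on the wording of the slide.

```python
def proceedings_share(total, excluding_proceedings):
    """Fraction of citations attributed to proceedings papers.

    Assumes the difference between the two counts is entirely due to
    citations from proceedings.
    """
    return (total - excluding_proceedings) / total


# Counts as quoted in the slide (14 May 2012).
geant4_counts = {
    "NIM A 2003": (2934, 2026),
    "IEEE TNS 2006": (574, 381),
}

shares = {
    label: proceedings_share(total, excl)
    for label, (total, excl) in geant4_counts.items()
}

for label, share in shares.items():
    print(f"{label}: {100 * share:.1f}% of citations from proceedings")
```

Under this assumption, roughly one third of the citations to each paper come from proceedings.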

ROOT citations

R. Brun and F. Rademakers, ROOT - An object oriented data analysis framework, NIM A, vol. 389, no. 1-2, pp. 81-86, 1997
- 540 citations (14 May 2012)
- 347 citations excluding proceedings
- AIHENP Workshop proceedings paper

I. Antcheva et al., ROOT - A C++ framework for petabyte data storage, statistical analysis and visualization, Comp. Phys. Comm., vol. 180, no. 12, pp. 2499-2512, 2009
- 27 citations (14 May 2012)
- 20 citations excluding proceedings

Citation analysis: until 2011 (reproducibility)

[Plot labels: 75% citations; 8% of all citations from collaborations]

Field of citing journals

Field         Geant4 %   ROOT %
Technology    30.3       49.6
Physics       29.9       18.2
BioMedical    13.9        6.0

HEP experiments

Experiments                              Start of run
LEP: ALEPH, DELPHI, L3, OPAL             1989
BaBar                                    1999
LHC: ALICE, ATLAS, CMS, LHCb, TOTEM      2008

Time distribution

Run start: LEP 1989, BaBar 1999, LHC 2008

[Plot: number of publications vs. publication year, rescaled with respect to the year of start of run]

[Plot: same as the previous slide, rescaled by the number of experiment members]
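The two rescalings described for the time distributions (shifting publication years by the start-of-run year, then dividing by the number of experiment members) can be sketched as follows. The run-start years come from the slides. The function name, the publication years and the collaboration size in the example are hypothetical, chosen only for illustration.

```python
from collections import Counter

# Start-of-run years as given in the slides.
RUN_START = {"LEP": 1989, "BaBar": 1999, "LHC": 2008}


def rescaled_distribution(publication_years, run_start, members=1):
    """Publications per year relative to start of run, per member.

    Keys are offsets in years from the start of run (negative values
    are publications before data taking); values are publication
    counts divided by the number of collaboration members.
    """
    counts = Counter(year - run_start for year in publication_years)
    return {offset: n / members for offset, n in sorted(counts.items())}


# Hypothetical publication years and collaboration size.
years = [2006, 2008, 2008, 2010]
dist = rescaled_distribution(years, RUN_START["LHC"], members=2)
print(dist)
```

Shifting by the run-start year puts experiments from different eras on a common axis, and dividing by the number of members allows collaborations of very different sizes to be compared.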

Publications

Share of hardware, software and DAQ-trigger publications

Physics publications
- LEP experiments completed their life-cycle
- LHC experiments: at an early stage of their physics production

Technological publications
- Roughly constant trends, once the number of publications is normalized to the number of collaborators

Software vs. hardware
- Hardware publications: approximately 4 times more than software
- DAQ-trigger publications: approximately 1.3 times more than software
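The two ratios quoted above can be turned into approximate shares of technological publications. This is a rough sketch only: it assumes that hardware, DAQ-trigger and software are the only three categories and that they do not overlap, which the slide does not state. The variable names are illustrative.

```python
# Publication counts relative to software (software = 1), from the
# approximate ratios quoted in the slide.
ratios = {"hardware": 4.0, "DAQ-trigger": 1.3, "software": 1.0}

# Assumption: the three categories are exhaustive and disjoint.
total = sum(ratios.values())
category_shares = {name: value / total for name, value in ratios.items()}

for name, share in category_shares.items():
    print(f"{name}: {100 * share:.0f}% of technological publications")
```

Under these assumptions software accounts for about one sixth of the technological publications, hardware for a little under two thirds.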

Journals

[Plots: hardware, DAQ-trigger and software publications by journal (TNS, NIM A, JINST)]

Journals: LEP and LHC
- LHC: still dominated by technological publications
- LEP: dominated by physics publications

Journals: pre- and post-2000
- IEEE TNS is the most popular journal for HEP technological publications in recent years
- More balanced distribution of physics papers over various journals

Citations

The most cited papers are often the general reference papers about the detector published by each experiment

Citations of the most cited paper
- ALEPH: 340
- DELPHI: 309
- L3: 509
- OPAL: 473
- BaBar: 859
- ALICE: 116
- CMS: 129
- LHCb: 101
- TOTEM: 35
- ATLAS: 185 (ATLAS pixel detector electronics and sensors)

[Plots: fraction of papers with 0 citations in the four panels: 4%, 17%, 27%, 25%]

References
- More references, more citations
- Physics papers cite more references than technological papers
- Bibliographical entries in software papers are often web sites

Pages

The number of pages of a paper depends on the format of the journal
- 1 page in TNS corresponds to 2.5 pages in JINST
- Different journal formats in the same category
- Evolution of the format of some journals (e.g. NIM)

Sources of citations to physics papers

[Plot: LEP]
- Samples in plots account for >90% of citations
- Citations to HEP physics papers mostly come from journals specialized in HEP and a few related fields (astroparticle and nuclear physics)

Sources of citations to technological papers

[Plots: LHC, LEP]
- Citations from HEP physics and technology journals

2008-2011

More refined analysis of technological papers published since the start of the LHC run

[Plots: citations 2008-2011 to TNS and NIM A papers, split into self-citations and outside citations]

LCG: LHC Computing Grid

Author | Title | Journal | Year
Sakamoto, H | Data grid deployment for high energy physics in Japan | CPC | 2007
Shiers, J | The Worldwide LHC Computing Grid (worldwide LCG) | CPC | 2007
Belov, S et al. | LCG MCDB - a knowledgebase of Monte-Carlo simulated events | CPC | 2008
Yin, F et al. | Grid resource management policies for load-balancing and energy-saving by vacation queuing theory | CPC | 2009
Malawski, M et al. | Invocation of operations from script-based Grid applications | Fut. Gen. Comp. Syst. | 2010
Huedo, E et al. | A modular meta-scheduling architecture for interfacing with pre-WS and WS Grid resource management services | Fut. Gen. Comp. Syst. | 2007
Agarwal, A et al. | GridX1: A Canadian computational grid | Fut. Gen. Comp. Syst. | 2007
Chytracek, R et al. | POOL development status and production experience | TNS | 2005
Hatlo, M et al. | Developments of mathematical software libraries for the LHC experiments | TNS | 2005
Pfeiffer, A et al. | The LCG PI project: Using interfaces for physics data analysis | TNS | 2005
Munro, C et al. | Measurement of the LCG2 and gLite File Catalogue's performance | TNS | 2006
Li, H | Realistic Workload Modeling and Its Performance Impacts in Large-Scale eScience Grids | IEEE Trans. Par. Distr. Syst. | 2010
Andreeva, J et al. | High-Energy Physics on the Grid: the ATLAS and CMS Experience | J. Grid Comp. | 2008
Munoz, VM et al. | A Decentralized Deployment Strategy and Performance Evaluation of LCG File Catalog Service | J. Grid Comp. | 2011
Hou, S et al. | PacCAF: a Grid Portal in Pacific Asia for the CDF Experiment | J. Grid Comp. | 2009
Kim, BK et al. | A Composition of Monitoring Services for the LHC Computing Grid | J. Grid Comp. | 2009

Source: WoS

LCG
- Small sample of publications
- Hard to perform any statistical analysis

Conclusions

- Software is largely underrepresented in HEP scholarly literature with respect to hardware
- Publication patterns appear similar in the LEP and LHC eras
- Citation patterns differ between publications by HEP experiments and publications about general software tools

Publish! And don't forget to cite