Geant4 citations
Publication patterns in HEP computing
M.G. Pia (1), T. Basaglia (2), Z.W. Bell (3), P.V. Dressendorfer (4)
(1) INFN Genova, Genova, Italy
(2) CERN, Geneva, Switzerland
(3) ORNL, Oak Ridge, TN, USA
(4) IEEE, Piscataway, NJ, USA
CHEP 2012, NYC
Maria Grazia Pia, INFN Genova

Analysis topics
General tools
  Geant4
  ROOT
HEP experiments
  LEP: ALEPH, DELPHI, L3, OPAL
  BaBar
  LHC: ALICE, ATLAS, CMS, LHCb, TOTEM
Grid computing
  LCG
What they publish
How much
Where
Citations
Technology vs physics
Software vs hardware
Software/DAQ-trigger
Representative, not exhaustive

Data sources
Thomson-Reuters: ISI Web of Knowledge
  CERN subscription: since 1970, conference database not included
  Search by keywords, collaboration name
Journal web sites
  IEEE TNS
  NIM, Comp. Phys. Comm. (Elsevier)
  JINST (IOP/SISSA)
  Full-text searches
CERN databases
  CERN Document Server (CDS)
  Greybook
Years: 1982-2011 (LEP), 1992-2011 (BaBar, LHC)
Reproducible sample

Data sample
Contamination: non-pertinent entries in the data sample
Omission: pertinent papers are not included in the data sample
Cross-checks: WoS/CDS, WoS/publishers' web sites
WoS inconsistencies and errors
  Total number of citations includes Conference database
  Proceedings papers: false classifications and omissions
  Manually corrected whenever possible
Automated analysis (whenever possible)
Manual evaluation: abstracts and full-text papers
  Some degree of subjectivity

S. Agostinelli et al., "Geant4: a simulation toolkit", NIM A, vol. 506, no. 3, pp. 250-303, 2003
  2934 citations (14 May 2012)
  2026 citations excluding proceedings
  Most cited CERN publication in WoS (excluding Rev. Part. Properties)
J. Allison et al., "Geant4 Developments and Applications", IEEE Trans. Nucl. Sci., vol. 53, no. 1, pp. 270-278, 2006
  574 citations (14 May 2012)
  381 citations excluding proceedings
Many papers cite the NIM paper but omit the TNS one, even though both are indicated at http://cern.ch/geant4
Many papers that use Geant4 do not cite either reference
Citation analysis: until 2011 (reproducibility)

[Plots: sources of Geant4 citations; categories: LHC, HEP, Other; labels: 75% citations, 16% citations]
19% citations from collaborations
Born from LHC experimental requirements
Multidisciplinary sources of citations

R. Brun and F. Rademakers, "ROOT - An object oriented data analysis framework", NIM A, vol. 389, no. 1-2, pp. 81-86, 1997
  540 citations (14 May 2012)
  347 citations excluding proceedings
  AIHENP Workshop proceedings paper
I. Antcheva et al., "ROOT - A C++ framework for petabyte data storage, statistical analysis and visualization", Comp. Phys. Comm., vol. 180, no. 12, pp. 2499-2512, 2009
  27 citations (14 May 2012)
  20 citations excluding proceedings
Citation analysis: until 2011 (reproducibility)
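
The gap between the two counts quoted for each reference paper is the number of citations coming from proceedings. A minimal sketch of that arithmetic follows, using only the figures quoted on the slides. The dictionary labels and the function name are illustrative, not part of the original analysis.

```python
# Citation counts quoted on the slides (WoS, 14 May 2012):
# (all citations, citations excluding proceedings).
# The dictionary keys are illustrative labels only.
CITATIONS = {
    "Geant4, NIM A 2003": (2934, 2026),
    "Geant4, IEEE TNS 2006": (574, 381),
    "ROOT, NIM A 1997": (540, 347),
    "ROOT, CPC 2009": (27, 20),
}


def proceedings_share(total, excluding_proceedings):
    """Fraction of all citations that come from proceedings papers."""
    return (total - excluding_proceedings) / total


for paper, (total, excluding) in CITATIONS.items():
    share = proceedings_share(total, excluding)
    print(f"{paper}: {100 * share:.1f}% of citations from proceedings")
```

For all four papers roughly a quarter to a third of the citations come from proceedings, which is why the two counts are reported separately.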

[Plot: sources of ROOT citations; label: 75% citations]
8% of all citations from collaborations

Field of citing journals
              Geant4 %   ROOT %
Technology      30.3      49.6
Physics         29.9      18.2
BioMedical      13.9       6.0

HEP experiments
LEP: ALEPH, DELPHI, L3, OPAL
BaBar
LHC: ALICE, ATLAS, CMS, LHCb, TOTEM
Start of run
  LEP: 1989
  BaBar: 1999
  LHC: 2008

Time distribution
Run start: LEP: 1989, BaBar: 1999, LHC: 2008
[Plot; x axis: publication year, rescaled w.r.t. the year of the start of run]

Time distribution
Run start: LEP: 1989, BaBar: 1999, LHC: 2008
Same as previous slide, rescaled by the number of experiment members
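
The two rescalings used for the time distributions (publication year relative to the start of run, and counts divided by the number of experiment members) can be sketched as below. This is only an illustration under stated assumptions: the function name, the example publication years and the member count are hypothetical, and only the run-start years come from the slides.

```python
from collections import Counter

# Run-start years quoted on the slides.
RUN_START = {"LEP": 1989, "BaBar": 1999, "LHC": 2008}


def rescaled_distribution(pub_years, run_start, members=1):
    """Count publications per year relative to the start of run.

    Keys are offsets (publication year minus run-start year);
    values are publication counts divided by `members`.
    """
    counts = Counter(year - run_start for year in pub_years)
    return {offset: n / members for offset, n in sorted(counts.items())}


# Hypothetical publication years and member count, for illustration only.
example_years = [2006, 2008, 2008, 2009, 2010, 2010, 2010]
distribution = rescaled_distribution(example_years, RUN_START["LHC"], members=100)
print(distribution)
```

Using an offset from the start of run puts LEP, BaBar and LHC experiments on a common time axis, and dividing by the number of members keeps large collaborations from dominating the comparison.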

Publications
[Plot: share of hardware, software and DAQ-trigger publications]

Physics publications
LEP experiments completed their life-cycle
LHC experiments: at an early stage of their physics production

Technological publications
Roughly constant trends, once the number of publications is normalized to the number of collaborators

Software vs. hardware
Hardware publications: approximately 4 times more than software
DAQ-trigger publications: approximately 1.3 times more than software
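
The two ratios above can be turned into approximate shares of the technological publications. The sketch below assumes that "4 times more" and "1.3 times more" mean ratios of 4:1 and 1.3:1 relative to the software count, and that the three categories do not overlap. Both are assumptions of this illustration, not statements from the slides.

```python
# Publication counts relative to software (software = 1), from the slide.
# Reading "N times more than software" as an N:1 ratio is an assumption.
RELATIVE_COUNTS = {"software": 1.0, "DAQ-trigger": 1.3, "hardware": 4.0}

total = sum(RELATIVE_COUNTS.values())
shares = {category: count / total for category, count in RELATIVE_COUNTS.items()}

for category, share in shares.items():
    print(f"{category}: {100 * share:.0f}% of technological publications")
```

Under these assumptions software accounts for about one sixth of the technological publications and hardware for a little under two thirds.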

Journals
[Plots: hardware, DAQ-trigger and software publications by journal: TNS, NIM A, JINST]

Journals: LEP and LHC
LHC: still dominated by technological publications
LEP: dominated by physics publications

Journals: pre- and post-2000
IEEE TNS is the most popular journal for HEP technological publications in recent years
More balanced distribution of physics papers over various journals

Citations
The most cited papers are often the general reference papers about the detector published by each experiment
Citations of the most cited paper
  ALEPH: 340
  DELPHI: 309
  L3: 509
  OPAL: 473
  BaBar: 859
  ALICE: 116
  CMS: 129
  LHCb: 101
  TOTEM: 35
  ATLAS: 185 (ATLAS pixel detector electronics and sensors)
[Plots, four panels; labels: 0 citations: 4%, 0 citations: 17%, 0 citations: 27%, 0 citations: 25%]

References
More references, more citations
Physics papers cite more references than technological papers
Bibliographical entries in software papers are often web sites

Pages
The number of pages of a paper depends on the format of the journal
1 TNS page corresponds to 2.5 JINST pages
Different journal formats in the same category
Evolution of the format of some journals (e.g. NIM)
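
Because page counts depend on the journal format, they have to be brought to a common unit before papers in different journals can be compared. A minimal sketch follows, assuming the slide's figure means that one TNS page holds about as much as 2.5 JINST pages. The function name and the choice of TNS as the reference format are hypothetical, and no factor is assumed for journals the slide does not quantify.

```python
# Journal pages equivalent to one TNS page. Only the TNS/JINST
# factor is quoted on the slide; other journals are left out.
PAGES_PER_TNS_PAGE = {"TNS": 1.0, "JINST": 2.5}


def tns_equivalent_pages(pages, journal):
    """Convert a page count to TNS-equivalent pages (hypothetical helper)."""
    return pages / PAGES_PER_TNS_PAGE[journal]


# A 25-page JINST paper expressed in TNS-equivalent pages.
print(tns_equivalent_pages(25, "JINST"))
```

A single factor per journal ignores changes of format over time, such as the one noted above for NIM, so a real analysis would need a factor per journal and period.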

Sources of citations to physics papers
[Plots: LEP]
Samples in plots account for >90% of citations
Citations to HEP physics papers mostly come from journals specialized in HEP and a few related fields (astroparticle and nuclear physics)

Sources of citations to technological papers
Citations from HEP physics and technology journals
[Plots: LHC, LEP]

2008-2011
More refined analysis of technological papers published since start of LHC run

Citations 2008-2011
[Plots: TNS, NIM A; self-citations and outside citations]

LCG: LHC Computing Grid
Source: WoS
Sakamoto, H, "Data grid deployment for high energy physics in Japan", CPC, 2007
Shiers, J, "The Worldwide LHC Computing Grid (worldwide LCG)", CPC, 2007
Belov, S et al., "LCG MCDB - a knowledgebase of Monte-Carlo simulated events", CPC, 2008
Yin, F et al., "Grid resource management policies for load-balancing and energy-saving by vacation queuing theory", CPC, 2009
Malawski, M et al., "Invocation of operations from script-based Grid applications", Fut. Gen. Comp. Syst., 2010
Huedo, E et al., "A modular meta-scheduling architecture for interfacing with pre-WS and WS Grid resource management services", Fut. Gen. Comp. Syst., 2007
Agarwal, A et al., "GridX1: A Canadian computational grid", Fut. Gen. Comp. Syst., 2007
Chytracek, R et al., "POOL development status and production experience", TNS, 2005
Hatlo, M et al., "Developments of mathematical software libraries for the LHC experiments", TNS, 2005
Pfeiffer, A et al., "The LCG PI project: Using interfaces for physics data analysis", TNS, 2005
Munro, C et al., "Measurement of the LCG2 and gLite File Catalogue's performance", TNS, 2006
Li, H, "Realistic Workload Modeling and Its Performance Impacts in Large-Scale eScience Grids", IEEE Trans. Par. Distr. Syst., 2010
Andreeva, J et al., "High-Energy Physics on the Grid: the ATLAS and CMS Experience", J. Grid Comp., 2008
Munoz, VM et al., "A Decentralized Deployment Strategy and Performance Evaluation of LCG File Catalog Service", J. Grid Comp., 2011
Hou, S et al., "PacCAF: a Grid Portal in Pacific Asia for the CDF Experiment", J. Grid Comp., 2009
Kim, BK et al., "A Composition of Monitoring Services for the LHC Computing Grid", J. Grid Comp., 2009

LCG
Small sample of publications
Hard to perform any statistical analysis

Conclusions
Software is largely underrepresented in HEP scholarly literature w.r.t. hardware
Publication patterns appear similar in the LEP and LHC era
Citation patterns are different for publications by HEP experiments and about general software tools
Publish! And don't forget to cite