Patterns of research collaboration in a digital library for economics Nisa Bakkalbasi Electronic Collections Librarian Yale University ASIS&T 2006 Annual Meeting November 3-8, 2006 Austin, Texas Thomas Krichel College of Information and Computer Science Long Island University
20
Embed
Patterns of research collaboration in a digital library for economics Nisa Bakkalbasi Electronic Collections Librarian Yale University ASIS&T 2006 Annual.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Patterns of research collaboration in a digital library
for economics
Nisa BakkalbasiElectronic Collections LibrarianYale University
ASIS&T 2006 Annual MeetingNovember 3-8, 2006
Austin, Texas
Thomas KrichelCollege of Information and Computer ScienceLong Island University
Introduction
• This paper analyzes the patterns of authorships and incidence of collaborative relationships in a digital library for economics.
• We study co-authorship using social network analysis.
Background
• Studies on scientific productivity suffer from the multiple names that can be given to the same author, making identification difficult. For example:
•Phillips, P. C. B•Peter C. B. Phillips•Peter Phillips
• For scientific collaboration studies, the issue becomes worse as the error in unique identification of one author extends across the whole network.
Background
• To be precise, most collaboration studies study small networks:
• All authors are known and can be identified “by hand.”• Issues of computation are simple.
• This study examines a large co-authorship network where all authors are uniquely identified. • The dataset comes from the RePEc digital library.
RePEc: Research Papers in Economics
• A digital library for economics and related disciplines.
• Provides access to 362,000 items of interest such as working papers, journal articles, software components, and instructional datasets.
• All RePEc data are freely available online.
• Data is contributed by academic departments, institutions involved in economics research (e.g. central banks), publishers, and individuals.
• A collaborative effort of hundreds of volunteers in 51 countries.
RAS: RePEc Author Service
• The RePEc author service is a site where authors registers and creates a professional profile. • See http://ras.repec.org• The author provides contact information, affiliation, and publications. • The development of the software for the RePEc author service was supported by the Open Society Institute.• For more information see http://acis.openlib.org.
Record from the RAS databaseTemplate-Type: ReDIF-Person 1.0Name-First: ChristianName-Last: ZimmermannName-Full: Christian ZimmermannWorkplace-Organization: RePEc:edi:deuctusEmail: [email protected]: http://ideas.repec.org/zimm/Author-Paper: repec:cre:crefwp:33Author-Paper: repec:mtl:montde:2000-05Author-Software: repec:dge:qmrbcd:99Author-Software: repec:dge:qmrbcd:97Author-Paper: repec:uct:uconnp:2005-01Author-Article: repec:eee:jcecon:v:33:y:2005:i:1:p:88-106Author-Article: repec:eee:jmacro:v:26:y:2004:i:4:p:637-659Author-Paper: repec:sce:scecf5:372Author-Paper: repec:red:sed005:561Short-Id: pzi1Handle: repec:per:1964-12-14:christian_zimmermannLast-Login-Date: 2005-11-21 15:25:20 -0500Registered-Date: 2004-02-29 17:36:09 –0600
Screenshot of a web page that renders this data on the web
How complete is RAS?
• RAS has been in use since 1999.
• When we did the study:• 1/3 papers had been claimed by at least one registered author.• 1/4 authorships are covered in RAS.
• RAS expands over time, but RePEc expands too, so these ratios only move up slowly.
• We conjecture that there is a tendency for prolific authors to register.
Distribution of the number of authors per paper in RePEc and RAS
Number of authors Number of papers
RePEc RAS
1 180,716 (49.91%) 99,562 (80.00%)
2 129,638 (35.80%) 22,315 (17.93%)
3 42,427 (11.72%) 2,425 (1.95%)
4 7,021 (1.94%) 130 (0.10%)
5 1,338 (0.37%) 9 (0.01%)
6 425 (0.12%) 4 (0.00%)
7 193 (0.05%) 1 (0.00%)
8 99 (0.03%) 1 (0.00%)
Summary statistics for RAS registrants
# of RAS registrants 12,381
# of registrants who did not claim a paper 3,715
# of registrants who claim at least one paper 8,666
# of authorships 152,072
Average number of papers/author 17.55
Authors ranked according to the number of co-authors
Rank Author Co-authors Papers
1 Randall Wright 27 106
2 Joseph Stiglitz 26 320
3 Clive Granger 25 165
4 James Stock 23 111
5 Pierre Chiappori 23 91
6 Martin Feldstein 22 259
7 Philip Franses 22 163
8 Robert Hubbard 22 116
9 Francis Diebold 21 189
10 Stephen Jenkins 21 138
Frequency distribution of authors by number of documents
87%76%
69% 63% 58% 54% 50%42%
30%20%
10%
2 3 4 5 6 7 8 11 17 26 45
Number of documents
Per
cent
ege
of a
utho
rs
Summary statistics for RAS authors and co-authorship networks
Number of authorships by co-authors 137,550
Number of authors with at least one co-author 5,661
Number of authorships with at least one co-author 109,924
Average number of collaborators/co-author 2.05
Size of the largest component 4,659
Number of components 382
Network diameter 22
Component size distribution
18 (0.002 %)Others ≤ 12(11%)
4659 (53%)
3138 (36%)
Largest component 2nd largest component Smaller components ≤ 12 No co-authors
Degree centrality distribution
Distribution of degree
0
500
1000
1500
2000
2500
0 10 20 30 40 50 60
Degree
Num
ber
of a
utho
rs
Only a few authors have a high degree of connection while many others have a low degree.
Authors ranked according to centrality measure
RankDegree Betweenness Closeness
1 Randall Wright 54 Joseph Stiglitz 903758.86 Joseph Stiglitz 4.8199
2 Joseph Stiglitz 52 F. Schiantarelli 700949.47 Olivier Blanchard 4.8952
3 Clive Granger 50 J. von Hagen 699927.26 James Stock 4.9594
4 P. Chiappori 46 Costas Meghir 626284.35 F. Schiantarelli 4.9972
5 James Stock 46 Clive Granger 587076.57 Martin Feldstein 5.0004
6 M. Feldstein 44 Gert Wagner 579692.04 J. von Hagen 5.0453
7 Philip Franses 44 Mark Taylor 551873.68 Costas Meghir 5.0459
8 R. Hubbard 44 O. Blanchard 541855.20 B. Eichengreen 5.0711
9 F. Diebold 42 Pierre Chiappori 530045.41 Marcus Miller 5.0805
10 S. Jenkins 42 K. Zimmermann 504285.85 Alison Booth 5.0893
Conclusions
• Authors who have written a large number of papers tend to register with RAS.
• The 80/20 Rule (i.e., 80% of the information productivity is generated by 20% of the information resources), does not apply to RAS authors.
• RAS registrants appear to have a broad range of coauthors, with most having only a few coauthors, whereas a few having many.
• RAS population is made up of highly active academics.
Further Work
• RePEc also identifies institutions.• Therefore work on institutional collaboration can be done quite easily.• It is also possible to compute various rankings of