Presentation on theme: "Patterns of research collaboration in a digital library for economics Nisa Bakkalbasi Electronic Collections Librarian Yale University ASIS&T 2006 Annual."— Presentation transcript:
Patterns of research collaboration in a digital library for economics Nisa Bakkalbasi Electronic Collections Librarian Yale University ASIS&T 2006 Annual Meeting November 3-8, 2006 Austin, Texas Thomas Krichel College of Information and Computer Science Long Island University
Introduction This paper analyzes the patterns of authorships and incidence of collaborative relationships in a digital library for economics. We study co-authorship using social network analysis.
Background Studies on scientific productivity suffer from the multiple names that can be given to the same author, making identification difficult. For example: Phillips, P. C. B Peter C. B. Phillips Peter Phillips For scientific collaboration studies, the issue becomes worse as the error in unique identification of one author extends across the whole network.
Background To be precise, most collaboration studies study small networks: All authors are known and can be identified by hand. Issues of computation are simple. This study examines a large co-authorship network where all authors are uniquely identified. The dataset comes from the RePEc digital library.
RePEc: Research Papers in Economics A digital library for economics and related disciplines. Provides access to 362,000 items of interest such as working papers, journal articles, software components, and instructional datasets. All RePEc data are freely available online. Data is contributed by academic departments, institutions involved in economics research (e.g. central banks), publishers, and individuals. A collaborative effort of hundreds of volunteers in 51 countries.
RAS: RePEc Author Service The RePEc author service is a site where authors registers and creates a professional profile. See The author provides contact information, affiliation, and publications. The development of the software for the RePEc author service was supported by the Open Society Institute. For more information see
Record from the RAS database Template-Type: ReDIF-Person 1.0 Name-First: Christian Name-Last: Zimmermann Name-Full: Christian Zimmermann Workplace-Organization: RePEc:edi:deuctus Homepage: Author-Paper: repec:cre:crefwp:33 Author-Paper: repec:mtl:montde: Author-Software: repec:dge:qmrbcd:99 Author-Software: repec:dge:qmrbcd:97 Author-Paper: repec:uct:uconnp: Author-Article: repec:eee:jcecon:v:33:y:2005:i:1:p: Author-Article: repec:eee:jmacro:v:26:y:2004:i:4:p: Author-Paper: repec:sce:scecf5:372 Author-Paper: repec:red:sed005:561 Short-Id: pzi1 Handle: repec:per: :christian_zimmermann Last-Login-Date: :25: Registered-Date: :36:09 –0600
Screenshot of a web page that renders this data on the web
How complete is RAS? RAS has been in use since When we did the study: 1/3 papers had been claimed by at least one registered author. 1/4 authorships are covered in RAS. RAS expands over time, but RePEc expands too, so these ratios only move up slowly. We conjecture that there is a tendency for prolific authors to register.
Distribution of the number of authors per paper in RePEc and RAS Number of authorsNumber of papers RePEcRAS 1180,716 (49.91%)99,562 (80.00%) 2129,638 (35.80%)22,315 (17.93%) 3 42,427 (11.72%)2,425 (1.95%) 4 7,021 (1.94%) 130 (0.10%) 5 1,338 (0.37%) 9 (0.01%) (0.12%) 4 (0.00%) (0.05%) 1 (0.00%) 8 99 (0.03%) 1 (0.00%)
Summary statistics for RAS registrants # of RAS registrants12,381 # of registrants who did not claim a paper 3,715 # of registrants who claim at least one paper 8,666 # of authorships 152,072 Average number of papers/author17.55
Authors ranked according to the number of co-authors RankAuthorCo- authorsPapers 1Randall Wright Joseph Stiglitz Clive Granger James Stock Pierre Chiappori2391 6Martin Feldstein Philip Franses Robert Hubbard Francis Diebold Stephen Jenkins21138
Frequency distribution of authors by number of documents
Summary statistics for RAS authors and co- authorship networks Number of authorships by co-authors137,550 Number of authors with at least one co-author5,661 Number of authorships with at least one co-author109,924 Average number of collaborators/co-author2.05 Size of the largest component4,659 Number of components382 Network diameter22
Component size distribution
Degree centrality distribution Only a few authors have a high degree of connection while many others have a low degree.
Authors ranked according to centrality measure RankDegreeBetweennessCloseness 1 Randall Wright54Joseph Stiglitz Joseph Stiglitz Joseph Stiglitz52F. Schiantarelli Olivier Blanchard Clive Granger50J. von Hagen James Stock P. Chiappori46Costas Meghir F. Schiantarelli James Stock46Clive Granger Martin Feldstein M. Feldstein44Gert Wagner J. von Hagen Philip Franses44Mark Taylor Costas Meghir R. Hubbard44O. Blanchard B. Eichengreen F. Diebold42Pierre Chiappori Marcus Miller S. Jenkins42K. Zimmermann Alison Booth5.0893
Conclusions Authors who have written a large number of papers tend to register with RAS. The 80/20 Rule (i.e., 80% of the information productivity is generated by 20% of the information resources), does not apply to RAS authors. RAS registrants appear to have a broad range of coauthors, with most having only a few coauthors, whereas a few having many. RAS population is made up of highly active academics.
Further Work RePEc also identifies institutions. Therefore work on institutional collaboration can be done quite easily. It is also possible to compute various rankings of authors institutions journals using citation and download data.