Presentation on theme: "Patterns of research collaboration in a digital library for economics Nisa Bakkalbasi Electronic Collections Librarian Yale University ASIS&T 2006 Annual."— Presentation transcript:
Patterns of research collaboration in a digital library for economics Nisa Bakkalbasi Electronic Collections Librarian Yale University ASIS&T 2006 Annual Meeting November 3-8, 2006 Austin, Texas Thomas Krichel College of Information and Computer Science Long Island University
Introduction This paper analyzes the patterns of authorships and incidence of collaborative relationships in a digital library for economics. We study co-authorship using social network analysis.
Background Studies on scientific productivity suffer from the multiple names that can be given to the same author, making identification difficult. For example: Phillips, P. C. B Peter C. B. Phillips Peter Phillips For scientific collaboration studies, the issue becomes worse as the error in unique identification of one author extends across the whole network.
Background To be precise, most collaboration studies study small networks: All authors are known and can be identified by hand. Issues of computation are simple. This study examines a large co-authorship network where all authors are uniquely identified. The dataset comes from the RePEc digital library.
RePEc: Research Papers in Economics A digital library for economics and related disciplines. Provides access to 362,000 items of interest such as working papers, journal articles, software components, and instructional datasets. All RePEc data are freely available online. Data is contributed by academic departments, institutions involved in economics research (e.g. central banks), publishers, and individuals. A collaborative effort of hundreds of volunteers in 51 countries.
RAS: RePEc Author Service The RePEc author service is a site where authors registers and creates a professional profile. See http://ras.repec.org The author provides contact information, affiliation, and publications. The development of the software for the RePEc author service was supported by the Open Society Institute. For more information see http://acis.openlib.org.
Screenshot of a web page that renders this data on the web
How complete is RAS? RAS has been in use since 1999. When we did the study: 1/3 papers had been claimed by at least one registered author. 1/4 authorships are covered in RAS. RAS expands over time, but RePEc expands too, so these ratios only move up slowly. We conjecture that there is a tendency for prolific authors to register.
Distribution of the number of authors per paper in RePEc and RAS Number of authorsNumber of papers RePEcRAS 1180,716 (49.91%)99,562 (80.00%) 2129,638 (35.80%)22,315 (17.93%) 3 42,427 (11.72%)2,425 (1.95%) 4 7,021 (1.94%) 130 (0.10%) 5 1,338 (0.37%) 9 (0.01%) 6 425 (0.12%) 4 (0.00%) 7 193 (0.05%) 1 (0.00%) 8 99 (0.03%) 1 (0.00%)
Summary statistics for RAS registrants # of RAS registrants12,381 # of registrants who did not claim a paper 3,715 # of registrants who claim at least one paper 8,666 # of authorships 152,072 Average number of papers/author17.55
Authors ranked according to the number of co-authors RankAuthorCo- authorsPapers 1Randall Wright27106 2Joseph Stiglitz26320 3Clive Granger25165 4James Stock23111 5Pierre Chiappori2391 6Martin Feldstein22259 7Philip Franses22163 8Robert Hubbard22116 9Francis Diebold21189 10Stephen Jenkins21138
Frequency distribution of authors by number of documents
Summary statistics for RAS authors and co- authorship networks Number of authorships by co-authors137,550 Number of authors with at least one co-author5,661 Number of authorships with at least one co-author109,924 Average number of collaborators/co-author2.05 Size of the largest component4,659 Number of components382 Network diameter22
Degree centrality distribution Only a few authors have a high degree of connection while many others have a low degree.
Authors ranked according to centrality measure RankDegreeBetweennessCloseness 1 Randall Wright54Joseph Stiglitz903758.86Joseph Stiglitz4.8199 2 Joseph Stiglitz52F. Schiantarelli700949.47Olivier Blanchard4.8952 3 Clive Granger50J. von Hagen699927.26James Stock4.9594 4 P. Chiappori46Costas Meghir626284.35F. Schiantarelli4.9972 5 James Stock46Clive Granger587076.57Martin Feldstein5.0004 6 M. Feldstein44Gert Wagner579692.04J. von Hagen5.0453 7 Philip Franses44Mark Taylor551873.68Costas Meghir5.0459 8 R. Hubbard44O. Blanchard541855.20B. Eichengreen5.0711 9 F. Diebold42Pierre Chiappori530045.41Marcus Miller5.0805 10 S. Jenkins42K. Zimmermann504285.85Alison Booth5.0893
Conclusions Authors who have written a large number of papers tend to register with RAS. The 80/20 Rule (i.e., 80% of the information productivity is generated by 20% of the information resources), does not apply to RAS authors. RAS registrants appear to have a broad range of coauthors, with most having only a few coauthors, whereas a few having many. RAS population is made up of highly active academics.
Further Work RePEc also identifies institutions. Therefore work on institutional collaboration can be done quite easily. It is also possible to compute various rankings of authors institutions journals using citation and download data.