11 Presented to the Science of Science Measurement Workshop December 3, 2010 Robin M. Wagner* Katy Börner** National Institutes of Health * Indiana University.

Presentation transcript:

1 Introducing the Science of Science (Sci2) Tool to the Reporting Branch at the National Institutes of Health
Presented to the Science of Science Measurement Workshop, December 3, 2010
Robin M. Wagner* and Katy Börner**
* National Institutes of Health
** Indiana University

2 Research Team
Cyberinfrastructure for Network Science Center, School of Library and Information Science, Indiana University
– Katy Börner, PhD
– Nianli Ma, MS
– Joseph R. Biberstine, BS
Reporting Branch, Office of Extramural Research, Office of the Director, National Institutes of Health (NIH)
– Robin M. Wagner, PhD, MS
– Rediet Berhane, MUPPD
– Hong Jiang, PhD
– Susan E. Ivey, MA
– Katrina Pearson
– Carl McCabe, PhD

3 Two Key Contributions
Discussion of Socio-Technical Challenges when introducing science of science tools to an agency
– What context, insight needs exist?
– How to select the best tool (and improve it continuously)?
– How to best transfer expertise: tutorials or close collaboration?
Answering Research Questions with the new tools
– What fields of science are covered by publications that acknowledge NIH extramural grant funding and how have the fields evolved from ?
– What is the time lag between NIH grant awards being made and papers being published and what is the probability distribution for the number of papers per project?

5 Background and Motivation
Scholars and policy makers have long sought to evaluate the long-term societal impacts of research
This task is particularly daunting for large portfolios
– Large portfolios may be linked to thousands of researchers and millions of research outputs, outcomes and impacts, appearing in multiple and often unlinked data sources and databases
– Data sources may be inconsistent, inaccurate or incomplete
Increased digitization of scientific information, improved electronic search and linkage tools and capabilities, and new methods and tools have created new opportunities to evaluate large research enterprises
The U.S. federal government has mandated: “Agencies should support the development and use of ‘science of science policy’ tools that can improve management of their R&D portfolios and better assess the impact of their science, technology, and innovation investments.” (Orszag et al., 2010)

6 Available Scientometrics Tools
Many tools are available to analyze, model, and visualize publication, patent, funding or other science and technology datasets
Highly specialized tools, e.g.,
– BibExcel and Publish or Perish support bibliometric data acquisition and analysis
– HistCite and CiteSpace address specific needs, from studying the history of science to identifying scientific research frontiers
More general tools, e.g.,
– Science and Technology Dynamics Toolbox provides many algorithms commonly used in scientometrics research and bridges to other tools
– Pajek and UCINET are very versatile, powerful network analysis tools widely used in social network analysis
– Cytoscape is optimized for visualizing biological network data
For review of 20 scientometrics tools, see

7 Expanding Visualization Tool Capabilities
Cyberinfrastructure for Network Science (CNS) Center
– Has conducted research on the structure and dynamics of science for 10 years
– Curates international Mapping Science exhibit (
– Develops large scale scholarly databases and open source tools to study science by scientific means
CNS Center has developed the Science of Science (Sci2) and other tools, with significant advantages
– Based on open source, free software
– Contain some of the most advanced analysis algorithms
– Use the industry standard, Open Services Gateway Initiative, to build modular software so new algorithms can be easily added by non-computer scientists, tailored to specific agency needs
– Support data preprocessing, e.g., data cleaning, de-duplication, filtering, and network extraction, essential for high quality analyses
– Generate easy to read visualizations, many with fixed reference systems, automatic legend design, and audit trail documentation
– Have extensive publicly available documentation

8 Using the Scholarly Database and the Sci2 Tool
Scholarly Database
– Supports free cross-search and bulk download of 25 million MEDLINE papers, USPTO patents, NSF and NIH awards (
Science of Science (Sci2) Tool
– This NSF SciSIP funded, OSGi/CIShell powered tool has 150+ algorithm plug-ins and is compatible with Epidemics, NWB, and TextTrend.org tools (

9 Bringing Sci2 Tool to NIH
Reporting Branch
– Branch conducts analyses of NIH-supported research projects and investigators to support NIH policy development and to communicate the impact of NIH’s research investment, ≈ $30 billion/year
– Branch sought new visualization tools to provide new insights into how NIH-supported research and investigators contribute to biomedical knowledge and improving health
– Branch invited Dr. Börner to NIH for one month (July 2010) to provide training and collaborate on research

10 12 Tutorials in 12 Days at NIH
1. Science of Science Research
2. Information Visualization
3. CIShell Powered Tools: Network Workbench and Science of Science (Sci2) Tool
4. Temporal Analysis: Burst Detection
5. Geospatial Analysis and Mapping
6. Topical Analysis & Mapping
7. Network Analysis
8. Network Analysis cont.
9. Extending the Sci2 Tool
10. Using the Scholarly Database at IU
11. VIVO National Researcher Networking
12. Future Developments
1st Week, 2nd Week, 3rd Week, 4th Week

11 Questions Federal Agencies Can Answer with Sci2 Tools
– How did the number of grants and total award dollars given to various fields of biomedical science change over time? (Temporal Analysis)
– Where are agency research collaborators located worldwide? (Geospatial Analysis)
– To what degree do agency-funded researchers publish in the areas in which they were funded to do research, and does this differ for more basic versus applied research? (Topical Analysis)
– What are the co-author networks on publications citing agency funding? (Network Analysis)
– In what areas of science does the agency pioneer funding and in which areas does it follow the initial funding by other agencies? (Scholarly Database)
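
As a rough illustration of the network analysis question on this slide, the sketch below builds a weighted co-author edge list from publication records. The author lists are invented placeholders, and this is plain Python written for illustration, not the Sci2 Tool's own network extraction algorithm.

```python
from collections import Counter
from itertools import combinations

def coauthor_edges(publications):
    """Count how often each pair of authors appears on the same paper."""
    edges = Counter()
    for authors in publications:
        # Sort and de-duplicate so (A, B) and (B, A) map to the same edge.
        for pair in combinations(sorted(set(authors)), 2):
            edges[pair] += 1
    return edges

# Hypothetical author lists, one per publication citing agency funding.
papers = [
    ["Author A", "Author B", "Author C"],
    ["Author A", "Author B"],
    ["Author C", "Author D"],
]

edges = coauthor_edges(papers)
for (first, second), weight in sorted(edges.items()):
    print(first, "-", second, "papers together:", weight)
```

The edge weights (number of shared papers) are the kind of quantity a network layout would typically map to line thickness.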

12 First Post-Tutorial Collaboration: MEDLINE Publication Output by NIH
Discussion of Socio-Technical Challenges when introducing science of science tools to an agency
– What context, insight needs exist?
– How to select the best tool (and improve it continuously)?
– How to best transfer expertise: tutorials or close collaboration?
Answering Research Questions with the new tools
– What fields of science are covered by publications that acknowledge NIH extramural grant funding and how have the fields evolved from ?
– What is the time lag between NIH grant awards being made and papers being published and what is the probability distribution for the number of papers per project?

13 Methods
Extracted public information on NIH grants using electronic tool, Research Portfolio Online Reporting Tools Expenditures and Results (RePORTER) on NIH RePORT website (
– Includes MEDLINE publications whose authors cite NIH grant support and can be linked with automated tool, SPIRES
– Chose all grants with budget start date in fiscal years (10/1/2000-9/30/2009) and linked publications published in budget start date year or later (1/1/ /31/2009)
– For analyses of new grants, applied time lag of 3 months for those awarded in first 3 months of fiscal year (10/1-12/31)
Evaluated data in 3 time periods examined individually and cumulatively: , ,
– To answer Q1, evaluated number and growth rate of publications linked to all grants by discipline over time, plotted on the University of California, San Diego (UCSD) Map of Science
– To answer Q2, evaluated time lag between new grant awards and linked publications
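
The selection rules on this slide can be sketched in a few lines. The sketch below rests on one assumed reading of the 3-month lag: a grant whose budget starts between 10/1 and 12/31 only counts publications from the following calendar year, which makes the first counted publication year equal to the grant's fiscal year. The function names and the sample grant are hypothetical, and publication dates are simplified to years.

```python
from datetime import date

def fiscal_year(budget_start):
    """U.S. federal fiscal year: FY N runs from Oct 1 of N-1 to Sep 30 of N."""
    return budget_start.year + 1 if budget_start.month >= 10 else budget_start.year

def counted_publication_years(budget_start, pub_years, last_year):
    """Keep publication years from the first counted year through last_year.

    Assumption: Oct-Dec budget starts are shifted by the 3-month lag into the
    next calendar year, so the first counted year equals the fiscal year.
    """
    first_year = fiscal_year(budget_start)
    return [y for y in pub_years if first_year <= y <= last_year]

# Hypothetical grant with a budget start in the first 3 months of the fiscal year.
start = date(2000, 11, 15)
kept = counted_publication_years(start, [2000, 2001, 2003, 2010], last_year=2009)
print(fiscal_year(start), kept)
```

If the lag was instead applied as an exact 3-month offset from each budget start date, the filter would need full publication dates rather than years.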

14 Methods (cont.)
UCSD Map of Science
– Map based on 7,200,000 publications in 16,000 journals, proceedings and series from Thomson Scientific and Scopus from
– Contains 554 individual areas of science representing groups of journals comprising 13 major disciplines plus interdisciplinary “Multiple Categories”
– Publications are plotted on map based on their journal names
– Advantages
  Most comprehensive, accurate base map of science at paper level
  Stable base map enables comparing different analyses generated within or across different agencies
  Avoids burden of having to create a new semantic topic space for each new analysis
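
Plotting publications "based on their journal names" amounts to a lookup from journal to discipline followed by a count. The sketch below shows that idea with an invented three-journal lookup table; the real map's journal assignments are not reproduced here, and the normalization step (trimming and lower-casing names) is an assumption about how names might be matched.

```python
from collections import Counter

# Hypothetical journal-to-discipline lookup. The actual map groups journals
# into 554 areas of science under 13 disciplines plus "Multiple Categories".
JOURNAL_TO_DISCIPLINE = {
    "journal a": "Infectious Diseases",
    "journal b": "Brain Research",
    "journal c": "Brain Research",
}

def count_by_discipline(journal_names, lookup=JOURNAL_TO_DISCIPLINE):
    """Count publications per discipline, keyed on normalized journal name."""
    counts = Counter()
    for name in journal_names:
        key = name.strip().lower()
        # Journals missing from the base map fall into "Unrecognized".
        counts[lookup.get(key, "Unrecognized")] += 1
    return counts

counts = count_by_discipline(["Journal A", "journal b ", "Journal C", "New Journal Z"])
for discipline, n in counts.most_common():
    print(discipline, n)
```

A journal-based lookup like this is why publications in newer journals cannot be placed until the base map is updated.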

15 Results
147,541 NIH grants (“base projects”) from % of projects (94,074) had at least 1 linked publication
After applying time lags, identified 499,322 publications from all grants (Q1)
– 122,660 papers published 1/1/ /31/2003
– 171,393 papers published 1/1/ /31/2006
– 205,269 papers published 1/1/ /31/2009
From new grant analyses (Q2), identified
– 171,920 papers published linked to grants
– 104,842 papers published linked to grants
– 27,415 papers published linked to grants
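
The Q1 counts on this slide can be cross-checked with simple arithmetic. The sketch below sums the three period counts and computes period-over-period growth; labeling the three counts as Periods I, II, and III is an assumption based on the later growth table, whose "Total" row reports 40%, 20%, and 67%.

```python
# Publication counts for the three consecutive time periods reported on the slide.
period_counts = {"I": 122_660, "II": 171_393, "III": 205_269}

def growth_pct(earlier, later):
    """Percentage growth from an earlier count to a later count."""
    return 100 * (later - earlier) / earlier

total = sum(period_counts.values())
growth_i_ii = growth_pct(period_counts["I"], period_counts["II"])
growth_ii_iii = growth_pct(period_counts["II"], period_counts["III"])
growth_i_iii = growth_pct(period_counts["I"], period_counts["III"])

# Share of base projects with at least one linked publication.
share_with_publication = 100 * 94_074 / 147_541

print(total)                                                   # 499322
print(round(growth_i_ii), round(growth_ii_iii), round(growth_i_iii))  # 40 20 67
print(round(share_with_publication, 1))
```

The three period counts add up to the 499,322 publications reported for all grants.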

16 Cumulative Growth of Publications Citing NIH Grants Over Time by Scientific Area Biostatistics See

17 Publications Citing NIH Grant Support by Discipline,

18 Publication Growth by Discipline and Time Period
Growth columns: Growth from Period I to II ( to ), Growth from Period II to III ( to ), Growth from Period I to III ( )

DISCIPLINE                                  TOTAL # Publications  I to II  II to III  I to III
Total                                       499,322               40%      20%        67%
Humanities                                  53                    200%     61%        383%
Chemical, Mechanical, & Civil Engineering   1,074                 119%     95%        327%
Math & Physics                              2,117                 110%     38%        190%
Electrical Engineering & Computer Science   3,496                 98%      45%        187%
Social Sciences                             10,960                72%      38%        137%
Biotechnology                               13,995                77%      31%        132%
Chemistry                                   14,616                75%      22%        114%
Health Professionals                        67,962                52%      24%        89%
Biology                                     8,672                 48%      15%        69%
Medical Specialties                         99,121                36%      14%        55%
Brain Research                              66,194                36%      11%        52%
Earth Sciences                              42                    36%      7%         45%
Infectious Diseases                         140,115               30%      9%         41%
Multiple Categories                         48,234                17%      4%         22%
Unrecognized                                22,681                208%     181%       764%
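
The three growth columns are related: growth from Period I to III is the compound of the two step-wise rates, (1 + g1)(1 + g2) - 1. The sketch below checks that relationship for three rows of the table. Because the published percentages are rounded, an exact match is not expected; the 2-point tolerance is an arbitrary choice made for this illustration.

```python
def compound_growth_pct(g1_pct, g2_pct):
    """Combine two consecutive percentage growth rates into one overall rate."""
    return ((1 + g1_pct / 100) * (1 + g2_pct / 100) - 1) * 100

# (discipline, I to II, II to III, reported I to III), copied from the table.
rows = [
    ("Total", 40, 20, 67),
    ("Humanities", 200, 61, 383),
    ("Unrecognized", 208, 181, 764),
]

differences = {}
for name, g1, g2, reported in rows:
    implied = compound_growth_pct(g1, g2)
    differences[name] = abs(implied - reported)
    print(f"{name}: implied {implied:.1f}%, reported {reported}%")
```

Small gaps between implied and reported values are consistent with rounding in the step-wise columns.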

19 Publications Citing New NIH Grants Increased with Time from Initial Award
[Figure: # of publications citing each project (grant), summarized as Min, 25%, 50%, 75%, and Max, by 1st fiscal year the grant was funded]
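
The Min, 25%, 50%, 75%, and Max labels on this figure describe a five-number summary of publications per grant. A minimal sketch of that summary follows, using the standard library's `statistics.quantiles`; the publication counts are invented for illustration, and the "inclusive" quantile method is an assumption, since the slide does not say how the quartiles were computed.

```python
import statistics

def five_number_summary(values):
    """Return (min, 25th percentile, median, 75th percentile, max)."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return min(values), q1, median, q3, max(values)

# Hypothetical publication counts for grants first funded in one fiscal year.
pubs_per_grant = [0, 1, 1, 2, 3, 5, 8, 13, 40]

summary = five_number_summary(pubs_per_grant)
print(summary)  # (0, 1.0, 3.0, 8.0, 40)
```

Computing one such summary per first funding year would give one column of the figure for each cohort of new grants.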

20 Discussion
Analyses provide insight into the dynamics of knowledge outputs associated with NIH support
– NIH leadership can use these results to better understand the behavior of NIH-supported scientists, informing the development of future policies, e.g., NIH public access policy
The disciplines with the most publications (Infectious Diseases, Medical Specialties, Health Professionals, and Brain Research) coincide well with NIH’s large investments in grants in these areas
NIH’s contribution to scientific knowledge, measured by publication outputs, increased over last decade, but growth rate was higher in than in , compared to preceding time period
– Likely associated with doubling of NIH budget from , which increased # of grants awarded by NIH, from 43,259 in 2000 to a peak of 52,789 in 2004
– After 2003, NIH’s budget (and # of annually awarded grants) remained approximately level, which might account for the slower growth rate of publications in

21 Discussion (cont.)
NIH-supported investigators are efficient producers of research knowledge
– Amongst new grants which generated publications and had enough years of follow-up to observe the majority of publication outputs (5 years), about 2/3 were cited by papers published within the first 3 years of funding
Limitations
– UCSD map of science (based on journals) may not include emerging fields of science, and precludes mapping publications from newer journals (map update is in preparation)
– More recent grants have not had sufficient follow-up time to generate all expected publications
– Could not ascertain publications with missing, incomplete or incorrect grant number citations

23 Suggestions for Introducing Sci2 Skills and Tools to Agencies
Federal agency (Reporting Branch) perspective
– Intense tutorial schedule allowed frequent access to tools and resident scholar, but condensed a semester’s material into one month, making it challenging to absorb the material alongside competing Branch duties and heavy workload
– More time was needed to learn how to “read” these novel visualizations, e.g., networks, which are unfamiliar to many
– Other agencies embarking on a similar training arrangement might consider arranging for a semester sabbatical visit
Resident scholar perspective
– 12 days is a short time to become acquainted with new colleagues, adapt to a different work culture, obtain security clearance, gain access to and understand internal agency data, and develop, test, and document new workflows and algorithm plug-ins that address agency-specific needs

24 Suggestions for Introducing Sci2 Skills and Tools to Agencies (cont.)
These tools can be highly useful to agencies that do not opt for intensive training
– Other governmental agencies and private foundations have started to use the Sci2 Tool
– As organizations vary in data access, missions, and cultures, each is applying tools to suit its own needs and questions
– Some agencies have awarded small contracts for developing new specific functionality in the tools, resulting in new plug-ins, many freely shareable with the larger user community, detailed documentation of new functionality and workflows, and dissemination of new insights via peer-reviewed publications
– Several agencies have independently published peer-reviewed papers on insights gained using the new tools

25 Questions?
Dr. Katy Börner
Victor H. Yngve Professor of Information Science
Director, Cyberinfrastructure for Network Science Center, and Information Visualization Laboratory
School of Library and Information Science
Indiana University
Bloomington, IN
Dr. Robin M. Wagner
Chief, Reporting Branch
Division of Information Services
Office of Research Information Systems
Office of Extramural Research
Office of the Director
National Institutes of Health
Bethesda, MD
branch_brochure.pdf
This work is funded by the School of Library and Information Science and the Cyberinfrastructure for Network Science Center at Indiana University, the National Science Foundation under Grant No. SBE , and a James S. McDonnell Foundation grant. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.