Delineating the Citation Impact of Scientific Discoveries Chaomei Chen 1, Jian Zhang 1, Weizhong Zhu 1, Michael Vogeley 2 1 College of Information Science.

Slides:



Advertisements
Similar presentations
How Well Forecast Were the 2004 and 2005 Atlantic and U. S
Advertisements

The impact of Grey Literature in the web environment: A citation analysis using Google Scholar Rosa Di Cesare, Daniela Luzi, Roberta Ruggieri Consiglio.
1 HARNESSING THE POWER OF GREY Lindy C. Boggs International Conference Center New Orleans, Louisiana USA December 4-5, 2006 Knowledge Generation in the.
Using Citation Analysis to Study Changes in the Information Seeking Behavior of Medical Researchers Brian Bunnett UT Southwestern Library October 24, 2005.
28 April 2004Second Nordic Conference on Scholarly Communication 1 Citation Analysis for the Free, Online Literature Tim Brody Intelligence, Agents, Multimedia.
© 2006 POPULATION REFERENCE BUREAU Marlene A. Lee Senior Policy Analyst Domestic Programs 300 MILLION AND COUNTING Education and Workforce: The Critical.
1 Science and the NVO – Overview and Discussion Dave De Young NVO Project Scientist NOAO NVOSS Aspen September 2006.
HEART TRANSPLANTATION Pediatric Recipients ISHLT 2007 J Heart Lung Transplant 2007;26:
Uptake of a New Cervical Cancer Prevention Technology in a Medicaid Population Rebecca Anhang Price AcademyHealth Annual Research Meeting June 10, 2008.
T H O M S O N S C I E N T I F I C Ann Wescott April 8, 2008 Prous Science Update SLA DPHT Spring Meeting 2008.
Multinational Comparisons of Health Systems Data, 2009 Gerard F. Anderson and Patricia Markovich Johns Hopkins University November 2009 Support for this.
1Regional policy responses to demographic challenges, Bruxelles, January 2007 EUROSTAT regional population projections Giampaolo LANZIERI Eurostat.
Google Scholar and Web of Science: Similarities and Differences in Citation Analysis of Scientific Publications Maurella DELLA SETA, Rosaria CAMMARANO.
Statistical Significance and Population Controls Presented to the New Jersey SDC Annual Network Meeting June 6, 2007 Tony Tersine, U.S. Census Bureau.
AYP Changes for 2007 K-20 Videoconference June 11, 2007 Presented by: JoLynn Berge OSPI Federal Policy Coordinator.
Supported by ESRC Large Grant. What difference does a decade make? Satisfaction with the NHS in Northern Ireland in 1996 and 2006.
Construction Fatalities (Workers, Jan 1981 to March 2007 )
How can I find the number of times a work has been cited by other authors?
When the consumer price index rises, the typical family
1 Understanding Multiyear Estimates from the American Community Survey.
1 Black Holes in the Early Universe Accretion and Feedback Geoff Bicknell & Alex Wagner Research School of Astronomy & Astrophysics Australian National.
Trends in Conceptual Modeling: Citation Analysis of the ER Conference Papers ( ) Chaomei Chen, Il-Yeol Song, Weizhong Zhu
The basics for simulations
Structures in the Universe, Venice March 28, 2006 David Schade Canadian Astronomy Data Centre Herzberg Institute of Astrophysics National Research Council.
1 Exploration of Health Care Providers Behavior to Keep Their Revenues after Reduction of Payment Generosity --- A Case of Drug Payment in Taiwan Likwang.
TV CSG Review Round Robin Meetings. I.Review Previous TV CSG Policy Reviews II.CSG 101 III.Review CSG Funding & NFFS History IV.Discussion OUTLINE FOR.
Investigating the Connections between Oil and Gas Industry Affiliation and Climate Change Perceptions Abstract In discussions about climate change, it.
Education, Life Cycle and Mobility: A Latin American Perspective
How to do Bayes-Optimal Classification with Massive Datasets: Large-scale Quasar Discovery Alexander Gray Georgia Institute of Technology College of Computing.
Projecting Hospital Acute Bed Needs for Workshop organized by US Embassy and the Belgian Health Federal Public Service March 21, 2006 Prof. Dr.
Observing Dark Energy SDSS, DES, WFMOS teams. Understanding Dark Energy No compelling theory, must be observational driven We can make progress on questions:
CSE594 Fall 2009 Jennifer Wong Oct. 14, 2009
Development of China-VO ZHAO Yongheng NAOC, Beijing Nov
Group Meeting Presented by Wyman 10/14/2006
Static Equilibrium; Elasticity and Fracture
The Aging Population Source: U.S. Census Bureau Percent Growth in U.S. Population, by Age Bracket.
INFORMATION SOLUTIONS Citation Analysis Reports. Copyright 2005 Thomson Scientific 2 INFORMATION SOLUTIONS Provide highly customized datasets based on.
Public Opinion : Health Care Coverage, Costs, and Financing.
Research Astronomy In Southern NM: Insights From the Sloan Digital Sky Survey (SDSS) Jon Holtzman NMSU Department of Astronomy.
Galaxy Distributions Analysis of Large-scale Structure Using Visualization and Percolation Technique on the SDSS Early Data Release Database Yuk-Yan Lam.
Sloan Digital Sky Survey Astronomy April 2006 Margaret Flynn.
Astro-DISC: Astronomy and cosmology applications of distributed super computing.
GIANT TO DWARF RATIO OF RED-SEQUENCE GALAXY CLUSTERS Abhishesh N Adhikari Mentor-Jim Annis Fermilab IPM / SDSS August 8, 2007.
High Redshift Quasar Discoveries Scientific knowledge of the Universe’s genesis was advanced with the Sloan Digital Sky Survey’s discovery of three, new.
JOURNAL CITATION REPORTS ® – “THE JCR” FEBRUARY 2009 ENHANCEMENTS GSS – Thomson Reuters, Scientific Business, A&G January 2009.
Survey Science Group Workshop 박명구, 한두환 ( 경북대 )
Visual Analysis of Scientific Discoveries and Knowledge Diffusion Chaomei Chen 1,2 1 College of Information Science and Technology, Drexel University 2.
Chris Luszczek Biol2050 week 3 Lecture September 23, 2013.
Radio Galaxies and Quasars Powerful natural radio transmitters associated with Giant elliptical galaxies Demo.
The Evolution of Quasars and Massive Black Holes “Quasar Hosts and the Black Hole-Spheroid Connection”: Dunlop 2004 “The Evolution of Quasars”: Osmer 2004.
Lecture Outlines Astronomy Today 8th Edition Chaisson/McMillan © 2014 Pearson Education, Inc. Chapter 25.
Visualizing and Analyzing Scientific Literature with CiteSpace Chaomei Chen College of Information Science and Technology Drexel University
G. Miknaitis SC2006, Tampa, FL Observational Cosmology at Fermilab: Sloan Digital Sky Survey Dark Energy Survey SNAP Gajus Miknaitis EAG, Fermilab.
Advanced Stellar Populations Advanced Stellar Populations Raul Jimenez
Luminous Red Galaxies in the SDSS Daniel Eisenstein ( University of Arizona) with Blanton, Hogg, Nichol, Tegmark, Wake, Zehavi, Zheng, and the rest of.
DESIGNING AN ARTICLE Effective Writing 3. Objectives Raising awareness of the format, requirements and features of scientific articles Sharing information.
The dependence on redshift of quasar black hole masses from the SLOAN survey R. Decarli Università dell’Insubria, Como, Italy A. Treves Università dell’Insubria,
Scientific Data Analysis via Statistical Learning Raquel Romano romano at hpcrd dot lbl dot gov November 2006.
Chapter 25 Galaxies and Dark Matter. 25.1Dark Matter in the Universe 25.2Galaxy Collisions 25.3Galaxy Formation and Evolution 25.4Black Holes in Galaxies.
Visualizing and Analyzing Scientific Literature with CiteSpace Chaomei Chen College of Information Science and Technology Drexel University
Data Mining for Expertise: Using Scopus to Create Lists of Experts for U.S. Department of Education Discretionary Grant Programs Good afternoon, my name.
Bibliometric Analysis of Herbal Medicine Publications, 1991 to 2004
CSE5544 Final Project Interactive Visualization Tool(s) for IEEE Vis Publication Exploration and Analysis Team Name: Publication Miner Team Members:
CSE5544 Final Project Interactive Visualization Tool(s) for IEEE Vis Publication Exploration and Analysis Team Name: Publication Miner Team Members:
Yi-Chia Wang LTI 2nd year Master student
Performing CS Research
Mike Brotherton: HST Images of Post-Starburst Quasars
Put your name here Name of the Department, School or College
Put your name here Name of the Department, School or College
Put your name here Name of the Department, School or College
Presentation transcript:

Delineating the Citation Impact of Scientific Discoveries Chaomei Chen 1, Jian Zhang 1, Weizhong Zhu 1, Michael Vogeley 2 1 College of Information Science and Technology, Drexel University 2 Department of Physics, Drexel University This work is supported by the National Science Foundation under Grant No Thomson ISI provides the bibliographic data for the analysis.

As We May Think by Vannevar Bush There is a growing mountain of research. But there is increased evidence that we are being bogged down today as specialization extends. The investigator is staggered by the findings and conclusions of thousands of other workers conclusions which he cannot find time to grasp, much less to remember, as they appear. Yet specialization becomes increasingly necessary for progress, and the effort to bridge between disciplines is correspondingly superficial.

An Increasingly Strong Trend in Science Gray & Szalay 2004 massive scientific data are being collected by one group of scientists and being analyzed by another group of scientists. Two notable examples: 1. The SDSS project in astrophysics 2. The human genome project in biomedicine

Sloan Digital Sky Survey The most ambitious astronomical survey ever undertaken Sloan Survey Data June, 2006: Data Release Five: 8000 square degrees, 1,048,960 spectra. June, 2005: Data Release Four: 6670 square degrees, 806,400 spectra. September, 2004: Data Release Three: 5282 square degrees, 528,640 spectra. March, 2004: Data Release Two: 3324 square degrees, 367,360 spectra. April, 2003: Data Release One: 2099 square degrees, 186,240 spectra. June, 2001: Early Data Release: 462 square degrees, 52,896 spectra. There is an increasingly strong trend in science that massive scientific data are being collected by one group of scientists and being analyzed by another group of scientists (Gray & Szalay 2004). Two notable examples: the SDSS project in astrophysics and the human genome project in biomedicine. SDSS Literature Total number of articles: 1,478 Total citations: 47,282 June 18, 2007: H = 95 January 30, 2007: H = 89 Time SliceSpaceNodeLink

Integrating Microscopic and Macroscopic perspectives Connecting text-level patterns (microscopic) and paper-level citation impacts (macroscopic) –improve our understanding of science in the making –develop data mining and visual analytics algorithms

Figure 3. Prominent keywords assigned by authors and burst terms extracted from titles and abstracts ( ).

H c, H t Split Class I Class II

Fast-Growing SDSS Literature 1,400 papers 40,000 citations The total citation number doubled in the past 1.5 years. H-index of SDSS literature = 89 95

As of June 18, 2007, 95 SDSS papers have 95 or more citations. It was 89 in January 2007.

Measuring the Citation Impact S c discounts citations accumulated over a long period of time. –S c is adjusted for publication age. St measures the recent impact: –St gives heavier weights to relatively recent citations than earlier citations.

YearTitleCitesScSt 2004Cosmological parameters from SDSS and WMAP THE FIRST SURVEY - FAINT IMAGES OF THE RADIO SKY AT 20 CENTIMETERS Stellar population synthesis at the resolution of Evidence for reionization at z similar to 6: Detection of a Gunn- Peterson trough in a z=6.28 quasar The luminosity function of galaxies in SDSS commissioning data A survey of z > 5.7 quasars in the Sloan Digital Sky Survey. II. Discovery of three additional quasars at z > A survey of z > 5.8 quasars in the Sloan Digital Sky Survey. I. Discovery of three new quasars and the spatial density of luminous quasars at z similar to Evolution of the ionizing background and the epoch of reionization from the spectra of z similar to 6 quasars Composite quasar spectra from the Sloan Digital Sky Survey The three-dimensional power spectrum of galaxies from the Sloan Digital Sky Survey

H g Indices and Splits The 1,293 records –H-index = 65, including 3 papers have 65 citations –H c index =52 –H t index = 53 The H split –67 papers in the highly cited group –1,226 remaining papers in the second group

Class I Class II

Significant Noun Phrases 22,665 noun phrases identified by a part-of- speech tagging and pattern matching process. 290 of them are selected based on their log-likelihood ratios. Sc St Total terms: 22,665 A(Sc)G(Sc)A(Sc)G(Sc) Pivotal value #High #Low

Figure 4. An overview of a decision tree generated based on 216 terms selected by log-likelihood ratio values (p<0.01) and a geometric mean split (74.44% of classification accuracy). The tree should be read from the root downwards.

Figure 5. A part of the tree shown in Figure 4. The presence (>0) or absence (<=0) of a term is associated with a citation status group, i.e. highly and timely cited group.

Figure 6. An ADTree derived from the data selected with the same selection criteria with 70.55% of accuracy.

Figure 7. A decision tree of 95.82% classification accuracy derived from 721 terms and 1,267 records. n-

Figure 10. The citation history of timeliness papers shows recently published papers are moved up in the rankings.

Future Work Unsupervised ontology construction to smooth the feature space Incremental classification of incoming new data and scholarly publications Self-directed optimization of existing decision trees based on new evidence Full-text analysis that can model associative relations between hypotheses and evidence and between facts and opinions