Visualizing Japanese Co-authorship Data Gavin LaRowe & Katy Börner, Indiana University, USA Ryutaro Ichise, National Institute of Informatics, Japan Information Visualisation Conference 2007 Zurich, Switzerland

Motivation: Mapping Science See also the Places & Spaces: Mapping Science exhibit.

Scholarly Database: Web Interface Search across publications, patents, grants. Download records and/or (evolving) co-author, paper-citation networks.

Scholarly Database: # Records & Years Covered
Datasets available via the Scholarly Database (* future feature). Aim for comprehensive geospatial and topic coverage.
Columns: Dataset, # Records, Years Covered, Updated, Restricted Access
Medline 13,149, Yes
PhysRev 398, Yes
PNAS 16, Yes
JCR 59, , 1979, 1984, Yes
USPTO 3,179, Yes*
NSF 174, Yes*
NIH 1,043, Yes*
Total 18,021,

Network Workbench (NWB)
Investigators: Katy Börner, Albert-Laszlo Barabasi, Santiago Schnell, Alessandro Vespignani, Stanley Wasserman & Eric Wernert
Software Team: Lead: Weixia (Bonnie) Huang. Developers: Bruce Herr, Ben Markines, Santo Fortunato, Cesar Hidalgo, Ramya Sabbineni, Vivek S. Thakre & Russell Duhon
Goal: Develop a large-scale network analysis, modeling and visualization toolkit for biomedical, social science and physics research.
Amount: $1,120,926 NSF IIS award.
Duration: Sept Aug
Website:

NWB Tool: Interface Elements: Load Data; List of Data Models; Scheduler; Open Text Files; Console; Visualize Data; Select Preferences

NWB Tool 0.2.0: List of Algorithms (by category; implementation language in parentheses)
Preprocessing: Directory Hierarchy Reader (JAVA)
Modeling: Erdős-Rényi Random (FORTRAN); Barabási-Albert Scale-Free (FORTRAN); Watts-Strogatz Small World (FORTRAN); Chord (JAVA); CAN (JAVA); Hypergrid (JAVA); PRU (JAVA)
Visualization: Tree Map (JAVA); Tree Viz (JAVA); Radial Tree / Graph (JAVA); Kamada-Kawai (JAVA); Force Directed (JAVA); Spring (JAVA); Fruchterman-Reingold (JAVA); Circular (JAVA); Parallel Coordinates (demo) (JAVA)
Tool: XMGrace
Analysis: Attack Tolerance (JAVA); Error Tolerance (JAVA); Betweenness Centrality (JAVA); Site Betweenness (FORTRAN); Average Shortest Path (FORTRAN); Connected Components (FORTRAN); Diameter (FORTRAN); Page Rank (FORTRAN); Shortest Path Distribution (FORTRAN); Watts-Strogatz Clustering Coefficient (FORTRAN); Watts-Strogatz Clustering Coefficient Versus Degree (FORTRAN); Directed k-Nearest Neighbor (FORTRAN); Undirected k-Nearest Neighbor (FORTRAN); Indegree Distribution (FORTRAN); Outdegree Distribution (FORTRAN); Node Indegree (FORTRAN); Node Outdegree (FORTRAN); One-point Degree Correlations (FORTRAN); Undirected Degree Distribution (FORTRAN); Node Degree (FORTRAN); k Random-Walk Search (JAVA); Random Breadth First Search (JAVA); CAN Search (JAVA); Chord Search (JAVA)

Introduction This paper reports a bibliometric analysis of an evolving co-author network composed of 5,009 articles from the Transactions D. Information Systems journal of the Institute of Electronics Information and Communication Engineers (IEICE) for the years 1993 to ... Networks were subsequently generated from this data set, producing metrics used for further analysis. We were particularly interested in whether the characteristics of these networks were similar to or different from those of the often-cited co-authorship networks reported in the literature for other scientific disciplines.

Prior Research Most prior research on co-authorship networks in the Japanese literature was performed during the mid-1990s by public policy analysts focusing on academic collaboration. Recent studies by Professor Ichise and others have examined co-authorship networks in the context of data mining and information visualization. Other studies in Japan have used co-authorship networks to study the role conferences play in initiating and sustaining collaborations between researchers.

Method
Data Provider: National Institute of Informatics, Tokyo, Japan
Years:
Institute of Electronics Information and Communication Engineers (IEICE): Japanese analogue to IEEE
Four main journals: A. Fundamentals; B. Communications; C. Electronics; D. Information Systems
12,337 articles; 5,009 unique authors

Method: Data Processing
Transformation: converted initial data from EUC_JP to UTF-8.
For each year, unique authors were extracted using Japanese surnames.
Custom scripts were used to clean, identify, and disambiguate names.
Data status: < 3% transcription errors. Identifiable errors were cleaned manually.
Data were parsed into individual lexemes and proper names.
Data were placed into a relational database.
Functions in the database were used to build network tables in Pajek format.
R was used to generate time-series metrics.
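The processing steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' scripts (the slides say they used custom scripts, database functions, and R): the function names `decode_record`, `coauthor_edges`, and `to_pajek` are hypothetical, the sample author names are made up, and name disambiguation is skipped entirely.

```python
from collections import Counter
from itertools import combinations


def decode_record(raw: bytes) -> str:
    """Decode EUC-JP bytes to text; call .encode("utf-8") on the result to store it."""
    return raw.decode("euc_jp")


def coauthor_edges(papers):
    """Count how many papers each unordered pair of authors shares."""
    weights = Counter()
    for authors in papers:
        # sorted(set(...)) removes duplicate names and fixes the pair order
        for a, b in combinations(sorted(set(authors)), 2):
            weights[(a, b)] += 1
    return weights


def to_pajek(weights):
    """Serialize a weighted, undirected network as Pajek .net text."""
    names = sorted({name for pair in weights for name in pair})
    index = {name: i for i, name in enumerate(names, start=1)}  # Pajek ids start at 1
    lines = [f"*Vertices {len(names)}"]
    lines += [f'{index[name]} "{name}"' for name in names]
    lines.append("*Edges")
    lines += [f"{index[a]} {index[b]} {w}" for (a, b), w in sorted(weights.items())]
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical author lists for one year, stored as EUC-JP bytes.
    raw_papers = [
        ["鈴木".encode("euc_jp"), "田中".encode("euc_jp")],
        ["鈴木".encode("euc_jp"), "田中".encode("euc_jp"), "佐藤".encode("euc_jp")],
    ]
    papers = [[decode_record(name) for name in authors] for authors in raw_papers]
    print(to_pajek(coauthor_edges(papers)))
```

Authors who only ever publish alone never appear in a pair, so this sketch omits them from the vertex list; a fuller version would add them as isolated vertices.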

IEICE Co-authorship Networks Metrics

Analysis Results We computed centrality measures (degree, closeness, and betweenness) and their distributions for each year, and compared the yearly distributions using q-q plots to identify significant changes. The clustering coefficient and average path length were also computed for each year. The degree distribution does not deviate from that of other well-known co-authorship networks: it is fat-tailed. Changes in co-authorship pattern or paradigm were almost always reflected in the clustering coefficient and average path length. There were no significant increases in the average number of co-authors, etc.
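As a rough illustration of the per-year metrics named above, the following sketch computes node degree, the Watts-Strogatz clustering coefficient, and the average shortest-path length for one yearly network using only the Python standard library. The study itself used R and Pajek tables; the function names and the toy edge list here are assumptions made for the example.

```python
from collections import deque


def build_graph(edges):
    """Build an undirected adjacency map from (author, author) pairs."""
    adj = {}
    for a, b in edges:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    return adj


def degrees(adj):
    """Number of distinct co-authors per author."""
    return {node: len(nbrs) for node, nbrs in adj.items()}


def clustering_coefficient(adj):
    """Mean local clustering; nodes with fewer than two neighbours count as 0."""
    if not adj:
        return 0.0
    total = 0.0
    for nbrs in adj.values():
        k = len(nbrs)
        if k < 2:
            continue
        # count links among the neighbours, each unordered pair once
        links = sum(1 for u in nbrs for v in nbrs if u < v and v in adj[u])
        total += 2.0 * links / (k * (k - 1))
    return total / len(adj)


def average_path_length(adj):
    """Mean shortest-path length over all connected pairs (one BFS per node)."""
    total, pairs = 0, 0
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs if pairs else 0.0


if __name__ == "__main__":
    # Toy network for a single hypothetical year.
    adj = build_graph([("A", "B"), ("B", "C"), ("A", "C"), ("C", "D")])
    print(degrees(adj))
    print(clustering_coefficient(adj), average_path_length(adj))
```

Running these functions once per yearly edge list yields the time series that can then be compared across years. Unreachable pairs are simply ignored in the path-length average, which is one common convention for disconnected co-authorship networks.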

Analysis Results Q-q plots were computed for betweenness and closeness centrality by year. No significant deviation was found for any one year. Quantile distributions could also have been used.
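A q-q comparison of two years reduces to pairing matching quantiles of the two centrality samples; points lying close to the line y = x suggest similar distributions. The sketch below shows that idea in plain Python. It is not the authors' R code: `quantile`, `qq_points`, and `max_deviation` are hypothetical helpers, the linear-interpolation quantile rule is one of several in common use, and the sample values are invented.

```python
def quantile(sorted_values, q):
    """Quantile of a non-empty sorted list, using linear interpolation."""
    pos = q * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] * (1 - frac) + sorted_values[hi] * frac


def qq_points(sample_x, sample_y, n_points=9):
    """Pair the same quantiles of two samples; these are the points of a q-q plot."""
    xs, ys = sorted(sample_x), sorted(sample_y)
    probs = [(i + 1) / (n_points + 1) for i in range(n_points)]
    return [(quantile(xs, p), quantile(ys, p)) for p in probs]


def max_deviation(points):
    """Largest vertical distance of any q-q point from the line y = x."""
    return max(abs(x - y) for x, y in points)


if __name__ == "__main__":
    # Invented closeness-centrality values for two hypothetical years.
    year_a = [0.10, 0.12, 0.15, 0.20, 0.22, 0.30]
    year_b = [0.11, 0.13, 0.14, 0.21, 0.25, 0.29]
    points = qq_points(year_a, year_b)
    print(points)
    print(max_deviation(points))
```

The two samples may have different sizes, which is the usual situation when the number of authors changes from year to year.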

Largest Connected Component Transactions D. ( ): 3,961 nodes showing top eight collaborators. 12,337 articles 5,009 authors

Largest Component #2: IEICE Transactions D. ( ) *Ellipses indicate general affiliation. 12,337 articles 5,009 authors

Largest Component #1: IEICE Transactions D. ( ) *Ellipses indicate general affiliation. 12,337 articles 5,009 authors

Conclusions The IEICE Transactions D. network is very similar to SPIRES and other co-authorship data. Average path length and clustering coefficient are similar, again pointing to the significance of the degree distribution in relation to other metrics. $P(k) \propto k^{-\gamma}$ (power-law network). Scale-free behavior (small-world network).
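For readers who want to check a power-law claim of the form P(k) ∝ k^(-γ) on their own degree data, one widely used approach is the approximate maximum-likelihood estimate for discrete data, γ ≈ 1 + n / Σ ln(k_i / (k_min − 0.5)), taken over the n degrees with k_i ≥ k_min. The sketch below implements that formula; it is not the method used in this study (the slides do not say how the exponent was obtained), and the function name, the `k_min` parameter, and the sample degrees are assumptions.

```python
import math


def estimate_gamma(degree_values, k_min=1):
    """Approximate maximum-likelihood exponent of a discrete power law.

    Uses only degrees >= k_min. The approximation is known to be rough for
    very small k_min, so treat the result as indicative only.
    """
    tail = [k for k in degree_values if k >= k_min]
    if not tail:
        raise ValueError("no degrees at or above k_min")
    log_sum = sum(math.log(k / (k_min - 0.5)) for k in tail)
    return 1.0 + len(tail) / log_sum


if __name__ == "__main__":
    # Invented degree sequence: many low-degree authors, a few hubs.
    sample_degrees = [1, 1, 1, 1, 2, 2, 3, 5]
    print(estimate_gamma(sample_degrees, k_min=1))
```

A fat tail on its own does not establish a power law, so an estimate like this is best paired with a goodness-of-fit check before drawing conclusions.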

Acknowledgements We would like to thank the National Institute of Informatics, Tokyo, Japan, for funding this work through an MOU grant and for providing the data used in this study.