Demonstration: Tools for large scale bibliometric analysis André Somers | 1 June 25, 2009.

Slides:



Advertisements
Similar presentations
Overlay Maps of Science (2010 update)
Advertisements

In the Format section, we have activated the Bibliographic style drop down menu. From this page, you can choose a specific journal or format (e.g. BMC.
Comparison of BIDS ISI (Enhanced) with Web of Science Lisa Haddow.
Database Management Using Microsoft Access Xinhua Chen, Ph.D. Chinese Association of Professionals in Science and Technology March 23, 2003.
Transportation research Fall 2011 Engineering & Computer Science Library Cristina Sewerin.
Sophie Panagi Trainer / Product Specialist ISI January 2001 Principles of citation searching.
ANALYSING RESEARCH – A GLOBAL PERSPECTIVE Krzysztof Szymanski – Country Manager Thomson Reuters October 2009.
Springer.com Zentralblatt MATH Online The most complete and longest running reviewing service in MATH!
How to use ScienceDirect (SDOL) Effectively. Publishes over a quarter of the world's full text scientific, technical and medical (STM) articles – Journals.
INCITES PLATFORM NATIONAL OCEANIC AND ATMOSPHERIC ADMINISTRATION (NOAA)
Tuple – InfoVis Publication Browser CS533 Project Presentation by Alex Gukov.
Literature Review Week 3 Lecture 1. School of Information Technologies Faculty of Science, College of Sciences and Technology The University of Sydney.
1 Using metrics to your advantage Fei Yu and Martin Cvelbar.
Copyright © Allyn & Bacon (2007) Conducting Library Research Graziano and Raulin Research Methods: Appendix C This multimedia product and its contents.
Smart Subjects: Application Independent Subject Recommendations Tito Sierra NCSU Libraries Code4Lib 2007.
1 Urban Education Resources LIBRARY INSTRUCTION Jacqueline A. Gill Associate Professor Reference
SCOPUS AND SCIVAL EVALUATION AND PROMOTION OF UKRAINIAN RESEARCH RESULTS PIOTR GOŁKIEWICZ PRODUCT SALES MANAGER, CENTRAL AND EASTERN EUROPE KIEV, 31 JANUARY.
Web of Science. Copyright 2006 Thomson Corporation 2 Example: (bird* or avian) and (flu or influenz*) Enter your terms to be searched. Search fields are.
Bibliometric Analysis with Sci2: Choose Your Own Adventure Laura Ridenour School of Library and Information Science, Indiana University.
By: Dr. Hamid Alizade IranEssential Science Indicators is a web-based research tool that enables researchers and research evaluators to measure.
Pascal Visualization Challenge Blaž Fortuna, IJS Marko Grobelnik, IJS Steve Gunn, US.
Bibliometrics toolkit: ISI products Website: Last edited: 11 Mar 2011 Thomson Reuters ISI product set is the market leader for.
Bibliometrics and Impact Analyses at the National Institute of Standards and Technology Stacy Bruss and Susan Makar Research Librarians SLA Pharmaceutical.
Introduction to ArcGIS for Environmental Scientists Module 1 – Data Visualization Chapter 1 – GIS Basics.
Top 10 Organizations of Thailand with Research Papers from September 2013 Derived from ISI Web of Knowledge by Mongkon Rayanakorn
Rajesh Singh Deputy Librarian University of Delhi Measuring Research Output.
SCIENTIFIC SOLUTIONS Journal Citation Reports ® New Features of Version 4.0.
MARKETING STRATEGIES More information:
Thomson Scientific October 2006 ISI Web of Knowledge Autumn updates.
Welcome to Georgia Library Learning Online for K-12 Schools
5. Alternative Approaches. Strategic Bahavior in Business and Econ 1. Introduction 2. Individual Decision Making 3. Basic Topics in Game Theory 4. The.
1111 An Experimental Comparison of Bibliometric Mapping Techniques Nees Jan van Eck, Ludo Waltman, Rommert Dekker Erasmus University Rotterdam, The Netherlands.
SUMMON ® 2.0 DISCOVERY REINVENTED. What is Summon 2.0? A new, streamlined, modern interface New and enhanced features providing layers of contextual guidance.
Data visualization as a library service? Examples from Chalmers Library.
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
Click to edit Master title style European Molecular Biology Laboratory, Heidelberg, Germany EMBL Georgios Pavlopoulos TAC-2, 15 Nov 2007 Data integration.
1. 2 CIShell Features A framework for easy integration of new and existing algorithms written in any programming language. CIShell Sci2 Tool NWB Tool.
SciVal Spotlight Training for KU Huiling Ng, SciVal Product Sales Manager (South East Asia) Cassandra Teo, Account Manager (South East Asia) June 2013.
RESEARCH – DOING AND ANALYSING Gavin Coney Thomson Reuters May 2009.
ESSENTIAL SCIENCE INDICATORS (ESI) James Cook University Celebrating Research 9 OCTOBER 2009 Steven Werkheiser Manager, Customer Education & Training ANZ.
Last updated 30/03/05 ISI Web of Knowledge Service for UK Education Web of Science Version 7 - new features.
Web of Science: Citation Indexes on the Web Gary Wiggins 9/29/2004.
RSC Publishing Platform Amanda Sun
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
SCOPUS for Science and Medicine Gabriella Netting Gabriella Netting
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
Information Visualization, Human-Computer Interaction, and Cognitive Psychology: Domain Visualizations Kevin W. Boyack Sandia National Laboratories.
BIBSAM-konsortiet 13/01/2016 ICLC Paris 2009 Updates: the BIBSAM consortium, Sweden Technical conditions in licenses Anna Lundén, coordinator.
0 1 Focused web information Academic library sources 15,100 titles 4,000 publishers STM & Social sciences World’s Largest Abstract & Citation Database.
PubMed/How to Search, Display, Download & (module 4.1)
THOMSON SCIENTIFIC Web of Science 7.0 via the Web of Knowledge 3.0 Platform Access to the World’s Most Important Published Research.
Citation-Based Retrieval for Scholarly Publications 指導教授:郭建明 學生:蘇文正 M
上海海事大学信息工程学院 Unit 6 Introduction to Digital Signal Processing exercises.
InfoVis Cyberinfrastructure Shashikant Penumarthy, Bruce Herr & Katy Börner School of Library and Information Science sprao | bherr
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
IUB Libraries Faculty & Graduate Student Updates Web of Science: Citation Indexes on the Web Presented by Gary Wiggins
Topical Analysis and Visualization of (Network) Data Using Sci2 Ted Polley Research & Editorial Assistant Cyberinfrastructure for Network Science Center.
MARKO ZOVKO, ACCOUNT MANAGER STEPHEN SMITH, SOLUTIONS SPECIALIST JOURNALS & HIGHLY-CITED DATA IN INCITES V. OLD JOURNAL CITATION REPORTS. WHAT MORE AM.
RSC Publishing Platform -- 第九届电子资源上机培训 Amanda Sun 孙燕 Area Manager
الله الرحيم بسم الرحمن علیرضا صراف شیرازی دانشیار و مدیر گروه دندانپزشکی کودکان رئیس کتابخانه مرکزی و مرکز علم سنجی دانشگاه علوم پزشکی مشهد.
A Bibliographic Management Software NORSHUHADA SAIDIN REFERENCE & RESEARCH DIVISION PERPUSTAKAAN KEJURUTERAAN UNIVERSITI SAINS MALAYSIA.
SAINT: Tools for large scale bibliometric analysis André Somers | 1 September 3, 2009.
Bibliometrics toolkit: Thomson Reuters products
Clustering of Web pages
A procedure for field delineation
Introduction to R Programming with AzureML
Optimize your research performance using SciVal
Meet the speakers: Sergey Adonin
Content Coverage of PNAS in 1995 and 2001
Presentation transcript:

Demonstration: Tools for large scale bibliometric analysis André Somers | 1 June 25, 2009

Targets Large data sets Fast Flexible: structured database Easy to use Open (Source) Get it from: André Somers | 2 | Demonstration: tools for large-scale bibliometric analysis

June 25, 2009 Structured database Structured database: different queries possible Standard relational database: SQL for combining data Special tools for things that are impossible, hard or slow in SQL Currently: MS Access only Other backends soon! André Somers | 3 | Demonstration: tools for large-scale bibliometric analysis

June 25, 2009 Workflow 1.Harvest or structure data –Into a relational database –ISI Data Importer 2.Clean and refine the data –Word Splitter –Record Grouper, Subnetwork Identifier, Relation Calculator 3.Query construction –Use pre-defined or construct SQL 4.Output results –Matrix Builder André Somers | 4 | Demonstration: tools for large-scale bibliometric analysis

June 25, 2009 Structure data: ISI Data Importer Download set of articles from ISI Web of Knowledge Selected on keywords, journals, authors, years, … Import as many as you want Optionally filter by type Demo time… André Somers | 5 | Demonstration: tools for large-scale bibliometric analysis

June 25, 2009 Refine data: Word Splitter Split titles, abstracts, etc. into separate words Optionally use stop word lists Or even regular expressions Result: table with words, and tables with data on which word is used where Uses: Co-title word analysis, identify topics in a field, etc. Demo time… André Somers | 6 | Demonstration: tools for large-scale bibliometric analysis

June 25, 2009 Output data: Matrix Compiler Output data to a Pajek-readable format Based on the assumption that: One table or view/query contains the information on the relations you want to visualize in the network (edges or arcs) Optionally (but recommended!) another table or query contains information about the nodes, like the labels Different kinds of matrices supported Output to DL matrix format Output size limited by memory and disk space only Demo time… André Somers | 7 | Demonstration: tools for large-scale bibliometric analysis

June 25, 2009 Possible outputs Basically anything that is supported by the data is possible. Co-authorships Co-citation relations Clustering of authors based their keyword usage Clustering of Journals based on the authors that publish in them or vise versa … You come up with new ideas! Salton, Cosine, Jaccard indices All these can be expressed in SQL! André Somers | 8 | Demonstration: tools for large-scale bibliometric analysis

June 25, 2009 What are we displaying? A clustering of articles Based on Jaccard index Combination of title words and cited references Idea: Title words: content Cited references: context Demo time… André Somers | 9 | Demonstration: tools for large-scale bibliometric analysis

June 25, 2009 Result in Pajek André Somers | 10 | Demonstration: tools for large-scale bibliometric analysis

June 25, 2009 Many plans… There are already more tools, such as: Grouping records (like similar words, addresses, names…) Identifying subnetworks Importing other data sources Interact with BibTechMon Plans for extensions to existing tools: Matrix Compiler output to list format, and include attributes Have Record Grouper use Relation Calculator Have Relation Calculator use GPU for calculations (CUDA) New tools: Integrate into a shell, harvest book data, … André Somers | 11 | Demonstration: tools for large-scale bibliometric analysis

June 25, 2009 Open & Free Open source (GPL 3.0) Open issue tracker, your input is very welcome! Open source code repository (Git) Free as in beer, free as in freedom, but please cite… André Somers | 12 | Demonstration: tools for large-scale bibliometric analysis

June 25, 2009 Edwin Horlings and Peter van den Besselaar | 13 | Where is e-social science going? Title word – cited reference cooccurrence Title word-cited reference combinations Partitioned by domain using Pajek; top cluster, 814 nodes; Kamada Kawai, separate components, circular starting positions cellular automata models for traffic simulation game theory in physics and theoretical biology simulation in chemistry (lattice gas simulation) cellular automata in topics relating to computer science, chemistry, physics, biology, medicine applications of neural networks and genetic algorithms; also learning in neural network and machine learning interface between learning and agent- based modeling some geography papers interspersed in CA (urban studies; spatial dynamics; land use interface between learning and neural networks (neural learning and control) theoretical and technical heart of neural networks and genetic algorithms (math and computer science) cellular automata applied to animal and human behaviour (self- organisation) Image by Edwin Horlings

June 25, 2009 Edwin Horlings and Peter van den Besselaar | 14 | Where is e-social science going? Title word-cited reference combinations Partitioned by domain using Pajek; all connected clusters, 3,430 nodes; Kamada Kawai, separate components, circular starting positions clear geography cluster using CA, neural networks, multi-agent systems simulation in materials science social network analysis and game theory cellular automata models for traffic simulation, now including crowd behaviour learning meets game theory and multi-agent analysis applications of neural networks and genetic algorithms multi-agent systems Image by Edwin Horlings

June 25, 2009 Edwin Horlings and Peter van den Besselaar | 15 | Where is e-social science going? physics computer & information science biology, ecology economics psychology other social science Title word-cited reference combinations Partitioned by domain using Pajek; all connected clusters, 3,430 nodes; Kamada Kawai, separate components, circular starting positions Image by Edwin Horlings

June 25, 2009 Edwin Horlings and Peter van den Besselaar | 16 | Where is computational social science going? computer science physics fuzzy systems Nature and PNAS neuroscience psychology 1 psychology 2 psychology 3 mathematical computer modeling operational research statistics sociology geography finance management and organisation environmental economics game theory mathematical economics econometrics APPLICATION AREAS general areas and problem-specific niches TECHNICAL AND MATHEMATICAL FOUNDATIONS political science Journal citation environment 2007 Similarity between citation structures of journals mapped in 2D-space (Kamada-Kawai) J Math Sociol, J Math Econ, Math Soc Sci, J Math Psych, J Econ Dyn Control ISI, Journal Citation Reports, 0.5% threshold Image by Edwin Horlings

June 25, 2009 Database structure André Somers | 17 | Demonstration: tools for large-scale bibliometric analysis