A Geographical Analysis of Knowledge Production in Computer Science Guilherme Vale Menezes Nivio Ziviani Alberto H. F. Laender Virgílio Almeida Federal University of Minas Gerais - Brazil
LAboratory for Treating INformation (LATIN) – UFMG - Brazil Summary Introduction Data Gathering Results Conclusions
The Problem Study the characteristics of researchers of Computer Science graduate programs 30 graduate programs in 3 geographic regions Build collaboration social networks based on DBLP We use several metrics of collaboration social networks Giant Component Clustering Coefficient LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Steps Comparison between 30 programs in 3 regions Comparison between 30 Computer Science fields Study of the interrelationship between fields Temporal analysis of the 3 regions and the fields LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Collaboration Network Author Collaboration LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Collaborations in DCC-UFMG LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Collaborations in DCC-UFMG LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Summary Introduction Data Gathering Results Conclusions LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Data Gathering Part of our data came from Perfil-CC project Objective of Perfil-CC: study Brazilian Computer Science graduate programs A set of 30 programs was chosen Focus: comparison with North American programs Results supported public policies Data gathered in June 2007 LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Brazilian Programs 8 graduate programs 391 authors PUC-Rio, UFRJ, UFPE, UFMG, USP-SP, USP-SC, UNICAMP, UFRGS LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Canadian and US Programs 16 graduate programs 1,262 authors British Columbia, Toronto, Waterloo, Brown, CalTech, CMU, Cornell, Harvard, Illinois, MIT, Princeton, Stanford, UC Berkeley, UTexas Austin, Washington, Wisconsin LAboratory for Treating INformation (LATIN) – UFMG - Brazil
French, Swiss and UK Programs 6 graduate programs 611 authors ETH Zurich, Cambridge U., Imperial College, Oxford U., École Polytechnique, Paris VI LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Data Gathering Professors obtained from the departments websites Publications from DBLP Programs: 30 Professors: 2,007 Authors: 76,537 Papers: 352,766 Venues: 2,176 LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Venue Classification 2,176 were classified (by humans) into 30 fields The list of fields was obtained from a poll The brazilian Computer Science research community was consulted 312 researchers identified 30 different fields LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Computer Science Fields Algorithms and TheoryApplied ComputingArtificial IntelligenceBioinformatics Circuit ConceptionComp. ArchitectureComp. GraphicsComp. Networks and Distributed Systems Computer VisionData MiningDatabasesEmbedded and Real- Time Systems Formalisms, Logics and Semantics Games and Entertainment GeoinformaticsHuman-Comp. Interaction Informatics in Education Information RetrievalInformation SystemsMachine Learning Modelling and Simulation Natural Language Processing Operation SystemsOperational Research and Optimization Programming Languages Robotics, Automation and Control Security and PrivacySoftware Engineering Ubiquitous Comp.Web, Hypermedia, Multimedia LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Computer Science Fields Algorithms and TheoryApplied ComputingArtificial IntelligenceBioinformatics Circuit ConceptionComp. ArchitectureComp. GraphicsComp. Networks and Distributed Systems Computer VisionData MiningDatabasesEmbedded and Real- Time Systems Formalisms, Logics and Semantics Games and Entertainment GeoinformaticsHuman-Comp. Interaction Informatics in Education Information RetrievalInformation SystemsMachine Learning Modelling and Simulation Natural Language Processing Operation SystemsOperational Research and Optimization Programming Languages Robotics, Automation and Control Security and PrivacySoftware Engineering Ubiquitous Comp.Web, Hypermedia, Multimedia LAboratory for Treating INformation (LATIN) – UFMG - Brazil Algorithms and Theory
Computer Science Fields Algorithms and TheoryApplied ComputingArtificial IntelligenceBioinformatics Circuit ConceptionComp. ArchitectureComp. GraphicsComp. Networks and Distributed Systems Computer VisionData MiningDatabasesEmbedded and Real- Time Systems Formalisms, Logics and Semantics Games and Entertainment GeoinformaticsHuman-Comp. Interaction Informatics in Education Information RetrievalInformation SystemsMachine Learning Modelling and Simulation Natural Language Processing Operation SystemsOperational Research and Optimization Programming Languages Robotics, Automation and Control Security and PrivacySoftware Engineering Ubiquitous Comp.Web, Hypermedia, Multimedia LAboratory for Treating INformation (LATIN) – UFMG - Brazil Information Retrieval
Computer Science Fields Algorithms and TheoryApplied ComputingArtificial IntelligenceBioinformatics Circuit ConceptionComp. ArchitectureComp. GraphicsComp. Networks and Distributed Systems Computer VisionData MiningDatabasesEmbedded and Real- Time Systems Formalisms, Logics and Semantics Games and Entertainment GeoinformaticsHuman-Comp. Interaction Informatics in Education Information RetrievalInformation SystemsMachine Learning Modelling and Simulation Natural Language Processing Operation SystemsOperational Research and Optimization Programming Languages Robotics, Automation and Control Security and PrivacySoftware Engineering Ubiquitous Comp.Web, Hypermedia, Multimedia LAboratory for Treating INformation (LATIN) – UFMG - Brazil Bioinformatics
Summary Introduction Data Gathering Results Conclusions LAboratory for Treating INformation (LATIN) – UFMG - Brazil
General Statistics BrCa-USFr-Sw-UK Authors3571, LAboratory for Treating INformation (LATIN) – UFMG - Brazil
General Statistics BrCa-USFr-Sw-UK Authors3571, Papers4,40540,0398,764 LAboratory for Treating INformation (LATIN) – UFMG - Brazil
General Statistics BrCa-USFr-Sw-UK Authors3571, Papers4,40540,0398,764 Paper/Author LAboratory for Treating INformation (LATIN) – UFMG - Brazil
General Statistics BrCa-USFr-Sw-UK Authors3571, Papers4,40540,0398,764 Paper/Author Author/Paper LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Giant Component A connected component is a maximum connected subgraph LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Giant Component LAboratory for Treating INformation (LATIN) – UFMG - Brazil A connected component is a maximum connected subgraph
Giant Component Giant Component size = 5 / 11 = 0.45 = 45% LAboratory for Treating INformation (LATIN) – UFMG - Brazil A connected component is a maximum connected subgraph The largest connected component is the giant component
Giant Component BrCa-USFr-Sw-UK LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Giant Component inside Programs BrCa-USFr-Sw-UK PUC-Rio0,83Wisconsin0,71Oxford0,46 UFMG0,76Harvard0,70Imperial College0,40 USP-SC0,73Cornell0,66 École Polytechnique 0,30 UFRGS0,70Stanford0,66ETH Zurich0,26 UFPE0,68Washington0,65Cambridge0,25 UFRJ0,64Illinois0,63Paris IV0,12 UNICAMP0,62Princeton0,63-- USP-SP0,43B. Columbia0, U Texas0, CMU0, UC Berkeley0, Brown0, MIT0, Waterloo0, Caltech0,21-- Average0,67Average0,56Average0,30 LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Clustering Coefficient LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Clustering Coefficient Clustering coefficient of the network is the average clustering coefficient of its vertexes The clustering coefficient is a measure of transitivity LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Clustering Coefficient BrCa-USFr-Sw-UK LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Computer Science Fields Clustering Coefficient below the average (87%) for fields closely related to Mathematics Algorithms and Theory (79%) Operational Reaseach and Optimization (83%) Formalisms, Logics and Semantics (83%) LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Interrelationship between Fields LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Interrelationship between Fields LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Giant Component Evolution LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Giant Component Evolution LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Giant Component Evolution LAboratory for Treating INformation (LATIN) – UFMG - Brazil Increase in the number of graduate programs in 1990s
Giant Component Evolution LAboratory for Treating INformation (LATIN) – UFMG - Brazil Increase in government funding
Giant Component Evolution LAboratory for Treating INformation (LATIN) – UFMG - Brazil A shift in policy: more support to research groups instead of individuals
Giant Component Evolution LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Giant Component Evolution LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Giant Component Evolution LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Edges vs Vertices LAboratory for Treating INformation (LATIN) – UFMG - Brazil BrCa-USFr-Sw-UK Inclination
2 well-established fields Computer Architecture Databases 2 emerging fields Bioinformatics Geoinformatics Clustering Coefficient Evolution LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Clustering Coefficient Evolution LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Clustering Coefficient Evolution LAboratory for Treating INformation (LATIN) – UFMG - Brazil Densification
Summary Introduction Data Gathering Results Conclusions LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Conclusions Analysis of the characteristics of researchers of Computer Science graduate programs LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Analysis of the characteristics of researchers of Computer Science graduate programs Differences in the collaboration network of Br, Ca- US and Fr-Sw-UK Giant component Clustering coefficient Conclusions LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Analysis of the characteristics of researchers of Computer Science graduate programs Differences in the collaboration network of Br, Ca- US and Fr-Sw-UK Giant component Clustering coefficient Smaller clustering coefficient for areas more closely related to Mathematics Conclusions LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Conclusions Fast growth of the giant component in Brazil LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Conclusions Fast growth of the giant component in Brazil The number of edges grows faster than the number of vertices in the three regions; faster growth in Ca-US LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Conclusions Fast growth of the giant component in Brazil The number of edges grows faster than the number of vertices in the three regions; faster growth in Ca-US Densification of emerging fields LAboratory for Treating INformation (LATIN) – UFMG - Brazil
References Laender, Lucena, Maldonado, Souza e Silva, Ziviani. Assessing the Research and Education Quality of the Top Brazilian Graduate Programs. ACM SIGCSE Bulletin, 40: , June Martins, Gonçalves, Laender, Ziviani. Assessing the Quality of Scientific Conferences Based on Bibliographic Citations. Scientometrics, to appear LAboratory for Treating INformation (LATIN) – UFMG - Brazil
Guilherme Vale Menezes Nivio Ziviani Alberto H. F. Laender Virgílio Almeida Federal University of Minas Gerais - Brazil ?