By Chris Zachor.  Introduction  Background  Open Source Software  The SourceForge community and network  Previous Work  What can be done different?

Slides:



Advertisements
Similar presentations
1 Future strategy for e-submission as seen by industry Dr Michael Colmorgen, IFAH-Europe 2nd Veterinary Workshop on E-submission 4 Dec 2009, EMEA, London.
Advertisements

Complexity Settlement Simulation using CA model and GIS (proposal) Kampanart Piyathamrongchai University College London Centre for Advanced Spatial Analysis.
Scientific Literature Tutorial
Y.C. Tay National University of Singapore Data Generation for Application-Specific Benchmarking.
Suphakit Awiphan, Takeshi Muto, Yu Wang, Zhou Su, Jiro Katto
The structure of the Internet. How are routers connected? Why should we care? –While communication protocols will work correctly on ANY topology –….they.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Automatic Classification of Accounting Literature Nineteenth Annual Strategic and Emerging Technologies Workshop Vasundhara Chakraborty, Victoria Chiu,
An Open Source Community Christina K Pikas LBSC708P November 10, 2005.
Delivering Citizen Science Data to Citizens Paul Allen Cornell Lab of Ornithology.
Getting Started: Research and Literature Reviews An Introduction.
Towards Understanding: A Study of the SourceForge.net Community using Modeling and Simulation Yongqin Gao Greg Madey Computer Science & Engineering University.
CS 501 : An Introduction to SCM & GForge An Introduction to SCM & GForge Lin Guo
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
Supported in part by the National Science Foundation – ISS/Digital Science & Technology Analysis of the Open Source Software development community using.
An Introduction to the Hennepin County Hennepin County GIS Technical Advisory Group (eGTAG) 10/20/2009.
Thinking Processes By Marvi Matos. College of Engineering, UPR BS, Chem E My background.
"Being literate in contemporary society means being active, critical, and creative users not only of print and spoken language but also of the visual.
Science Fair Information Night Presented by: [Teacher Name] Courtesy of Science Buddies: Providing free science fair project ideas, answers, and tools.
Patrick Adam Wagstrom October 2004 Community Building in Open Source Software Ecosystems Patrick Adam Wagstrom Department.
ACT: 16 th Feb 2005OATS – Open Assistive Technology Software Open Source Software for Assistive Technology. ACE Centre Advisory Trust: Andrew Lysley, Jason.
Source Forge Phi Le Thanh Huynh Surinder Singh Benjamin Roppiyakuda.
The Urge to Merge Kathleen A. Hansen, Professor University of Minnesota School of Journalism and Mass Communication SLA, Toronto, June 8, 2005 Kathleen.
Analysis and Modeling of the Open Source Software Community Yongqin Gao, Greg Madey Computer Science & Engineering University of Notre Dame Vincent Freeh.
RPM Summer Institute 2012: Exploring Key Themes to Build a Model for Pre- College Math.
Research Meeting Seungseok Kang Center for E-Business Technology Seoul National University Seoul, Korea.
1.Knowledge management 2.Online analytical processing 3. 4.Supply chain management 5.Data mining Which of the following is not a major application.
OpenAlea An OpenSource platform for plant modeling C. Pradal, S. Dufour-Kowalski, F. Boudon, C. Fournier, C. Godin.
SINHGAD INSTITUTES MOZILLA CLUB A Talk On Introduction to Mozilla & Strenthing the club Presented by:-Varun Pailwan (Firefox Student Ambassador)
READING A PAPER. Basic Parts of a Research Paper 1. Abstract 2. Introduction to Technology (background) 3. Tools & techniques/Methods used in current.
Software Engineering CS3003
Source Forge Phi Le Thanh Huynh Surinder Singh Benjamin Roppiyakuda.
Seeking the Source Software Source Code as a Social and Technical Artifact Cleidson de Souza, Jon Froehlich, and Paul Dourish Jon Froehlich University.
MINING AND MODELING THE OPEN SOURCE SOFTWARE COMMUNITY
material assembled from the web pages at
By Chris Zachor.  Introduction  Background  Changes  Methodology  Data Collection  Network Topologies  Measures  Tools  Conclusion  Questions.
Hosted by Mrs. Davis Jeopardy VocabularyObservation or Inference? What is Science? Doing Science
The Team Science Toolkit is an interactive website that provides resources to help users support, engage in, and study.
7.1 and 7.2: Spanning Trees. A network is a graph that is connected –The network must be a sub-graph of the original graph (its edges must come from the.
Archivists' Toolkit - CRADLE Presentation, 10 Feb The Archivists’ Toolkit CRADLE Presentation 10 Feb
Presentation: A measure of betweenness centrality based on random walks M.E.J. Newman ELSEVIER Social Networks November 2004 A measure of betweenness centrality.
Topology and Evolution of the Open Source Software Community Advisors: Dr. Vincent W. Freeh Dr. Kevin Bowyer Supported in part by the National Science.
Exploring Evidence.
Agent 2004Scott Christley, Public Goods Theory of Open Source Community Public Goods Theory of the Open Source Development Community using Agent-based.
Version Control with SVN Images from TortoiseSVN documentation
© UNT in partnership with TEA1 7 th Grade Career Exploration Module Career Focus: Computer Software Engineer Vocabulary.
Yongqin Gao, Greg Madey Computer Science & Engineering Department University of Notre Dame © Copyright 2002~2003 by Serendip Gao, all rights reserved.
An Introduction to NETS*S Edmonds School District February 7, 2005.
Business Analysis Consulting: STS Realty By: Our group.
Enhancing e-learning experience with online social networks Published in IET Communications J.J.P.C. Rodrigues, F.M.R. Sabino, L. Zhou Reporter: Yu Chih.
Code.soundsoftware.ac.uk: A software repository for sustainable collaborative research Mark Plumbley, Chris Cannam, Luis Figueira Centre for Digital Music.
Science Project Information
Complexity Settlement Simulation using CA model and GIS (proposal) Kampanart Piyathamrongchai University College London Centre for Advanced Spatial Analysis.
De-anonymizing Genomic Databases Using Phenotypic Traits Humbert et al. Proceedings on Privacy Enhancing Technologies 2015 (2) :
Title Authors Introduction Text, text, text, text, text, text Background Information Text, text, text, text, text, text Observations Text, text, text,
1 Pioneer Investments Legal and Compliance System Assessment Weekly Status Update June 23, 2005.
A RESEARCH SUPPORT SYSTEM FRAMEWORK FOR WEB DATA MINING Jin Xu, Yingping Huang, Gregory Madey Department of Computer Science and Engineering University.
Collaborative Development Services Learning From the Open Source Agile Development Process Richard Kilmer, InfoEther LLC.
The Structure of Scientific Collaboration Networks by M. E. J. Newman CMSC 601 Paper Summary Marie desJardins January 27, 2009.
Notes 1.2 THE SCIENTIFIC METHOD. Steps to the Scientific Method Make observations – Come up with a problem Make observations – Come up with a problem.
A Research Collaboratory for Open Source Software Research Yongqin Gao, Matt van Antwerp, Scott Christley, Greg Madey Computer Science & Engineering University.
1 Discovering Web Communities in the Blogspace Ying Zhou, Joseph Davis (HICSS 2007)
Lake Arrowhead 2005Scott Christley, Understanding Open Source Understanding the Open Source Software Community Presented by Scott Christley Dept. of Computer.
Wei Wu, Georgia Southern University Raja R. A. Issa, University of Florida BIM Education for New Career Options.
Our Digital Showcase Scholars’ Mine Annual Report from July 2015 – June 2016 Providing global access to the digital, scholarly and cultural resources.
Introduction to #EdShare
Scientific Method Part 2.
Scientific Method Part 2.
Presentation Outline Science Fair.
Accelerating Regular Path Queries using FPGA
Presentation transcript:

By Chris Zachor

 Introduction  Background  Open Source Software  The SourceForge community and network  Previous Work  What can be done different?  Related Work  Conclusion  Questions

 Goal: Use network analysis to better understand the SourceForge community developers  Identify key developers and groups of developers who create popular open source applications

 Open Source (OS) Software continues to be a popular alternative to standard commercial software  Many OS alternatives to traditional closed source projects exist  osalt.com provides a convenient database for this

 A website to help promote collaboration between developers of OS projects  A repository for OS projects  Developers: revision control, bug tracking, donation system, etc.  Users: bug reporting, recommendations, commenting, etc.

 Multiple networks can be formed from the SourceForge community Project-Developer network Developer network Project network Lots of interesting data to be collected from the website such as total downloads of a project, length of developer membership, recommendations, etc.

 A bipartite graph with two groups of vertices: projects and developers  An edge indicates the developer works on that project

 A collaboration network  Edges are formed where one developer 1 has worked with developer 2

 An edge can represent a related project  An edge can represent projects that share a developer  Or perhaps an edge can represent a related project

 The open source group at Notre Dame  Used network analysis as a tool to understand the Open Source Software phenomenon and predict growth over time  Monthly data dumps directly from SourceForge.net

 The latest paper produced concerning network analysis was in 2007  The project count has more than doubled in size to ~250,000 projects (from ~90,000 in 2007).

 Their main concern was with how the network was evolving  Focus was on the change in measures from month to month  No interpretation of data

 M. E. J. Newman  Scientific Collaboration Networks  4 Major Databases spanning 5 years  Collaboration network using authors who have worked together on a single paper  Explored what fields were producing more papers, what fields collaborated more, etc.

 Obermeier et al.  University College Dublin  Co-authorship between departments at UCD  They wanted to understand the interdisciplinary publication culture within the University  Looked at brokerage individuals and how they play a part in their own departments  Found these brokerage individuals to be most central within their own departments

 Gao and Madey  Network analysis of SourceForge  Used as a tool to understand the open source movement  Documented the growth of the SourceForge community  Structural analysis, centrality analysis, path analysis  They did not interpret the data

 Xu, Christley, and Mady  Network analysis of the SourceForge community  Attempt to explain the success and efficiency of OS development practices  Noted that the SourceForge Network is a scale free network  Also noted the presence of the small world phenomenon within the community

 Xu, Christy, and Madey continued  Observed that co-developers and active users were a major factor in large scale projects  Meanwhile, project leads and core developers were largely involved in small projects

 While previous studies were focused on growth and why the process is a success, this study will focus on how key developers and groups play a part in creating popular software  Many attributes not looked at in previous studies

Anyone?