1 genSpace: Community-Driven Knowledge Sharing for Biological Scientists. Gail Kaiser’s Programming Systems Lab, Columbia University Computer Science.


2 Introduction
Scientists collaborating in the same lab on the same project share:
- Data: specimens, samples, materials, analyses
- Tools: instruments, software, hardware
- Knowledge: open discussion, whiteboard
However, there are temporal (time) and physical (space) constraints.
This model does not scale to communities of scientists who work on different projects but could learn from each other’s expertise, experience, etc.

3 CSCW Approaches
Most current-generation Computer-Supported Cooperative Work (CSCW) systems enable data sharing and/or tool sharing (e.g., PNNL Collaboratories, UIUC BioCoRE).
However, these systems support relatively limited knowledge sharing:
- how/when/where/why to use tools and data
Knowledge sharing is partially enabled through labor-intensive approaches: pubs, lists, wikis, chat, shared display, etc.
- may be outdated, requires active participation
We seek to enable automatic knowledge sharing, without requiring “extra work” by scientists.

4 Social Networking Metaphor
Some online social networking is a form of CSCW that is potentially enjoyable and profitable but requires “extra work”, with dynamism limited by explicit user participation:
- Facebook, MySpace, LinkedIn, Twitter, etc.
Other social networking automatically records, aggregates, data mines and disseminates what people do online in an enjoyable and profitable fashion, with no “extra work” required:
- Collaborative filtering: “people like you …”
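The “people like you” idea can be illustrated with a very small collaborative-filtering sketch. This is not taken from the slides: the `jaccard` and `recommend` functions, the user names, and the tool names are all hypothetical, and Jaccard similarity over sets of tools is just one simple choice of similarity measure.

```python
from collections import Counter


def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def recommend(usage, user, k=1):
    """Suggest tools used by the k most similar users but not yet by `user`.

    `usage` maps a user name to the set of tools that user has run.
    """
    mine = usage[user]
    # Rank the other users by how much their tool sets overlap with ours.
    neighbors = sorted(
        (other for other in usage if other != user),
        key=lambda other: jaccard(mine, usage[other]),
        reverse=True,
    )[:k]
    # Count how many neighbors use each tool that `user` has not tried.
    scores = Counter()
    for other in neighbors:
        scores.update(usage[other] - mine)
    return [tool for tool, _ in scores.most_common()]


# Hypothetical usage data; the tool names are placeholders.
usage = {
    "alice": {"Normalization", "T-Test"},
    "bob": {"Normalization", "T-Test", "Clustering"},
    "carol": {"Sequence Viewer"},
}
suggestions = recommend(usage, "alice")
```

No explicit action by the users is needed here beyond running their tools, which is the sense in which this style of social networking requires no “extra work”.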

5 genSpace
- We combine implicit and explicit social networking (and collective intelligence) concepts in our approach to knowledge sharing.
- Prototype implemented as a set of plugins for geWorkbench, MAGNet’s platform for analysis and visualization tools for integrated genomics.
- Records, aggregates, data mines and disseminates geWorkbench users’ activities with tools and tool sequences (workflows).
- Users can opt in or opt out.
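A minimal sketch of the recording step, assuming a simple in-memory log: each tool invocation is appended to a per-user session, a session's ordered tool list is treated as a workflow, and events from users who opted out are dropped. The `ActivityLog` class, its methods, and the tool names are hypothetical and are not the genSpace plugin API.

```python
from collections import defaultdict


class ActivityLog:
    """Hypothetical in-memory log of tool usage (not the genSpace API)."""

    def __init__(self):
        self._opted_out = set()
        # (user, session id) -> ordered list of tool names
        self._sessions = defaultdict(list)

    def opt_out(self, user):
        self._opted_out.add(user)

    def opt_in(self, user):
        self._opted_out.discard(user)

    def record(self, user, session, tool):
        """Store one tool invocation; return False if the user opted out."""
        if user in self._opted_out:
            return False
        self._sessions[(user, session)].append(tool)
        return True

    def workflows(self):
        """Return each recorded session as a tuple of tools in run order."""
        return [tuple(tools) for tools in self._sessions.values()]


log = ActivityLog()
log.record("alice", 1, "Normalization")
log.record("alice", 1, "T-Test")
log.opt_out("bob")
bob_recorded = log.record("bob", 1, "Clustering")  # dropped: bob opted out
```

In this sketch opting out only stops future recording; whether earlier events should also be withdrawn is a policy decision the slides do not specify.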

6 geWorkbench – A Platform for Integrated Genomics
Integrated genomics analysis application
- Support for gene expression data, sequences, pathways, structure.
- 50+ visualization and analysis modules.
- Access to local and remote data sources and analytical services.
- Integration with biological annotation sources.
Development platform
- Open source, Java-based.
- Component architecture, facilitating customization.

8 Questions genSpace Can Answer
- What do I do first?
- Which tools work well together?
- Where does this tool fit in a typical workflow?
- Who do I know who also uses this tool?
- How do I get help (from an expert who is online right now)?
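Two of these questions (“What do I do first?” and “Which tools work well together?”) can be approximated by simple counting over recorded workflows. The sketch below is an illustration under that assumption, not the mining method used in genSpace; the function names and sample workflows are hypothetical.

```python
from collections import Counter


def first_tool_counts(workflows):
    """Count how often each tool starts a workflow."""
    return Counter(wf[0] for wf in workflows if wf)


def transition_counts(workflows):
    """Count how often tool B is run immediately after tool A."""
    counts = Counter()
    for wf in workflows:
        counts.update(zip(wf, wf[1:]))
    return counts


def suggest_next(workflows, tool):
    """List the tools that most often directly follow `tool`, most common first."""
    followers = Counter()
    for (a, b), n in transition_counts(workflows).items():
        if a == tool:
            followers[b] += n
    return [name for name, _ in followers.most_common()]


# Hypothetical workflows; each tuple is one session's tools in run order.
workflows = [
    ("Normalization", "T-Test", "Clustering"),
    ("Normalization", "Clustering"),
    ("Filtering", "T-Test", "Clustering"),
]
most_common_start = first_tool_counts(workflows).most_common(1)[0][0]
after_t_test = suggest_next(workflows, "T-Test")
```

The same transition counts also indicate where a tool sits in a typical workflow: the tools that commonly precede and follow it.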

17 Contributions
- We investigate an approach to collaborative knowledge sharing that is based on data mining and social networking, requiring little or no “extra work” by scientists.
- We have developed a prototype implementation, genSpace, built on the geWorkbench platform.
- Logging, data mining, etc. of geWorkbench user activities, tool/workflow recommendation and visualization already included in local pre-release repository.
  - Planned for next external release

18 Future Work
- More precise monitoring: specific analysis parameters and options, visualization activities
- Privacy and confidentiality
  - Leverage collaborative networks to restrict dissemination
- Address “concept drift” as user participation, tool/workflow usage, and privacy settings change
- Scaling up to hundreds of users and hundreds of thousands of logs
  - Caching at client and server, incremental update, offline access
- genSpace APIs enabling easy port to other tool integration frameworks beyond geWorkbench
- Integration with pub “tagging” in Ken Ross lab
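One way to read “leverage collaborative networks to restrict dissemination” is that a log entry is shown only to its owner and to users who share at least one network with the owner. The sketch below illustrates that interpretation only; the `visible_logs` function, the network names, and the log format are assumptions, not a described genSpace design.

```python
def networks_of(user, networks):
    """Return the names of all networks that `user` belongs to."""
    return {name for name, members in networks.items() if user in members}


def visible_logs(logs, viewer, networks):
    """Keep entries owned by `viewer` or by someone sharing a network with them."""
    viewer_nets = networks_of(viewer, networks)
    visible = []
    for entry in logs:
        owner = entry["user"]
        if owner == viewer or viewer_nets & networks_of(owner, networks):
            visible.append(entry)
    return visible


# Hypothetical networks and log entries.
networks = {"lab-a": {"alice", "bob"}, "lab-b": {"carol"}}
logs = [
    {"user": "alice", "tool": "T-Test"},
    {"user": "carol", "tool": "Clustering"},
]
bob_view = visible_logs(logs, "bob", networks)
```

Because visibility is computed at query time, a change in network membership or privacy settings takes effect on the next lookup, which is relevant to the “concept drift” item above.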

19 Ross Lab
- Semantic Ranking and Result Visualization for PubMed Search
- Social Network Aware Search in Collaborative Tagging Sites
2 posters & demo (Julia Stoyanovich)

20 genSpace: Community-Driven Knowledge Sharing for the Discovery and Visualization of Workflows in geWorkbench Gail Kaiser