Training and Outreach Efforts BD2K-LINCS Webinar for the NIH BD2K Working Group on Training May 29, 2015 Avi Ma’ayan, PhD (Contact PI) Associate Professor.

Presentation transcript:

Training and Outreach Efforts
BD2K-LINCS Webinar for the NIH BD2K Working Group on Training
May 29, 2015
Avi Ma’ayan, PhD (Contact PI), Associate Professor
Sherry Jenkins, MS, Program Manager
Department of Pharmacology and Systems Therapeutics
Icahn School of Medicine at Mount Sinai, New York, New York
Data Coordination and Integration Center

LINCS Phase II: Library of Integrated Network-based Cellular Signatures
[Diagram: the DCIC and six panels, each labeled Assays, Cells, Pert. Panel labels, in source order: High Throughput Transcriptomics, L1000, Connectivity Map; NeuroLINCS, ALS, Imaging, Proteomics, Transcriptomics; Microenvironment Effects on Cancer Cells, Transcriptomics, Imaging, Proteomics; High Throughput Proteomics, P100, Epigenomics; Drug Combinations, Mitigating Side Effects, Proteomics, Transcriptomics; High Throughput Imaging, Proteomics, Phenotypes, Cancer Cells, Modeling Cell Signaling.]

BD2K-LINCS Data Coordination and Integration Center
[Organizational diagram. Components: DSR, IKE, CTO, CCA. Component labels, in source order: Internal & External Data Science Research Projects (eDSRs, iDSRs); Metadata, APIs, Visualization, Integration Tools; Training and Outreach; Coordination, Infrastructure. Names: Ma’ayan, Schurer, Medvedovic. Websites: lincs-dcic.org, lincsproject.org. Activities: Summer Research Training Program; Webinars; Mini-symposium, seminars and workshops; LINCS Working Groups.]

BD2K-LINCS Data Coordination and Integration Center – Scientific Objectives
- Understand how different layers of human cellular regulatory networks, i.e., transcriptomics and proteomics, correlate and interact.
- Develop methods to benchmark computational and experimental methods, in order to evaluate their quality objectively and extract more knowledge from the data.
- Understand the inherent biases within low- and high-content experiments, and develop methods to correct for such biases.
- Map the dimensionality of all possible global molecular states of human cells in normal physiology, in disease, and in response to perturbations by small molecules and genetic manipulations.
- Develop methods to connect cellular and organismal phenotypes with molecular cellular signatures.

BD2K-LINCS Data Coordination and Integration Center – Data Science Objectives
- Organize, curate, and serve for search and download the largest possible collection of annotated molecular cellular signatures, networks, and attribute tables.
- Develop novel data visualization methods for dynamically interacting with large genomics and proteomics datasets.
- Develop educational and outreach activities for training and engaging the next generation of data scientists.
- Develop ontologies and other methods for data integration across diverse sets of experimental data collected by different laboratories, centers, and large-scale projects utilizing different high-content profiling assays.

Community Training and Outreach (CTO)

Courses
- MOOCs on Coursera:
  1. Network Analysis in Systems Biology
  2. Big Data Science with the BD2K-LINCS DCIC
- ISMMS Graduate Courses:
  1. BD2K-LINCS DCIC - Programming for Big Data Biomedicine
  2. BD2K-LINCS DCIC - Data Mining in Systems Biology
- Big Data Biostatistics PhD Program (at the University of Cincinnati College of Medicine)
- Summer Research Training Program in Biomedical Big Data

DCIC Outreach Activities
- Data Science Research Webinars
- Crowdsourcing Projects Portal
- External Data Science Projects
- Mini-symposium, Seminars and Workshops
- Funding Opportunities

lincs-dcic.org

Network Analysis in Systems Biology (MOOC on Coursera)

Description: A graduate-level course that serves as an introduction to Big Data analysis in systems biology. It covers statistical methods for identifying differentially expressed genes, various types of enrichment analysis, and clustering algorithms.

Course Features:
- 8 weeks / 7 modules
- 34 short video lectures
- 24 auto-graded short quizzes
- Auto-graded final exam
- Crowdsourcing tasks
- Weekly overviews
- Discussion forum

Last session: January 5, 2015 – March 3, 2015

Network Analysis in Systems Biology: Course Analytics
[Charts: Engagement; Content]
~600 students passed the course and obtained a statement of accomplishment.

Big Data Science with the BD2K-LINCS Data Coordination and Integration Center (MOOC on Coursera)

Session: Sep 15 – Nov

Syllabus:
- Overview of the NIH Common Fund LINCS Program
- Overview of the Data and Signature Generation Centers (experiments and data)
- Meta-Data and Ontologies
- Data Normalization
- Unsupervised Learning Methods: Data Clustering
- Supervised Learning Methods
- Enrichment Analyses
- Bayesian Data Integration
- Network Analysis and Network Visualization
- Cheminformatics
- Serving data through RESTful APIs and JSON
- Interactive Data Visualization of LINCS Data

BD2K-LINCS DCIC: Programming for Big Data Biomedicine (ISMMS Graduate Course)

Ten-week mini-course taught by Avi Ma’ayan, PhD, and members of his research team within the BD2K-LINCS DCIC at the Icahn School of Medicine at Mount Sinai.

Topics:
- Agent Based Modeling with NetLogo
- Agent Based Modeling with MATLAB
- Python
- Python and MatPlotLib
- HTML and CSS
- JavaScript and PHP
- MySQL
- MongoDB
- Bootstrap Templates
- R
- Final Project

Spring 2015 Course Dates: Feb 24 – May

BD2K-LINCS DCIC: Data Mining in Systems Biology (ISMMS Graduate Course)

Ten-week mini-course taught by Avi Ma’ayan, PhD, and members of his research team within the BD2K-LINCS DCIC at the Icahn School of Medicine at Mount Sinai.

Topics:
- Self Organizing Maps
- Hierarchical Clustering
- PCA
- Linear Regression
- Decision Trees
- Graph Theory Concepts
- Support Vector Machines
- Final Project

Terms listed: Fall 2015; Fall 2014 (Course Dates: September 16 – December 2, 2014)

BD2K-LINCS DCIC Summer Research Training Program in Biomedical Big Data Science

Ten-week training program for undergraduate and master’s students interested in research projects aimed at solving data-intensive biomedical problems.

Summer 2015 Program Dates: June 1 – August 7

Summer 2015 | Training Sites
- Icahn School of Medicine at Mount Sinai (Ma’ayan Laboratory of Computational Systems Biology): Dynamic Data Visualization; Machine Learning; Data Harmonization
- University of Washington (Yeung / Computational Systems Biology Group): Machine Learning; Data Integration; Network Visualization Plugins
- Carnegie Mellon University (Bar-Joseph / Systems Biology Group): Machine Learning; Time Series Analysis; Transcriptional Regulatory Networks

2015 Cohort Summary
- 6 trainees
- 2 master’s / 3 undergraduate / 1 high school
- 4 women / 2 men
- All future plans include STEM graduate degrees

Cohort entries, as listed on the slide:
- Carnegie Mellon University, Biological Sciences (Bar-Joseph)
- Carnegie Mellon University, Computational Biology (Ma’ayan)
- University of Washington, Computer Engineering (Yeung)
- University of Washington, Computer Science (Ma’ayan)
- The City College of New York, Bioinformatics (Ma’ayan)
- Yorktown High School (Ma’ayan)

Data Science Research Webinars

Purpose / Target Audience
- Serve as a general forum to engage data scientists within and outside the LINCS project to work on problems related to LINCS data analysis and integration.
- Open to the data science research community.
- Advertised on the DCIC website, LINCS portal, Twitter, and Google group.
- Schedule and connection details are posted on the DCIC website and LINCS portal.
- Past webinar videos are posted on the DCIC’s YouTube channel.

BD2K-LINCS DCIC Crowdsourcing Portal

Community Science Project: Building a Database of Gene Expression Signatures Extracted from Single Gene Knockout/Knockdown Studies

Data Science Research Collaborations with the BD2K-LINCS DCIC

Mini-symposium, Seminars and Workshops

Mini-symposium | January 7, 2015
Symposium was co-sponsored by the BD2K-LINCS DCIC and Mount Sinai’s Knowledge Management Center for Illuminating the Druggable Genome.

Winter Invited Seminar Speakers
- December 5, 2014: Reverse Engineering a more Reliable Translational Pipeline with Patient-Derived iPSC Models of Neurodegenerative Disease, Robotic Longitudinal Single Cell Analysis and Deep Learning. Steven Finkbeiner, MD, PhD / NeuroLINCS Center
- January 14, 2015: The PAGE Study and Coordinating Center (Population Architecture using Genomics and Epidemiology). Tara Matise, PhD / PAGE Coordinating Center

Works in Progress Seminar Series
- January 15, 2015: Enrichr and GEO2Enrichr: Tools to Extract and Analyze Signatures. Gregory Gundersen and Matthew Jones / BD2K-LINCS DCIC

Outreach Session at the Society of Toxicology’s Annual Meeting | March 23, 2015
- BD2K-LINCS Outreach Session: Turning Big Data to Knowledge (BD2K-LINCS): A discussion of the NIH BD2K initiative and how it might advance the practice of Toxicology and Risk Assessment. John Reichard PhD, Mario Medvedovic PhD / BD2K-LINCS DCIC
- Poster Session: Big Data to Knowledge (BD2K) - A Graphical Approach for Data Coordination and Integration. J.F. Reichard, M. Medvedovic, S. Sivagas / BD2K-LINCS DCIC

Calendar of events on lincs-dcic.org

Genomic and Computational Approaches for Biomarker and Drug Discovery
Workshop hosted by the NIAAA

WORKSHOP | June 19, 2015
Location: Grand Hyatt, San Antonio, TX
Room: Travis C/D
Time: 2:00 – 5:00pm

Hands-on Session: Web Apps and Tools
- Enrichr: Search engine for gene lists and signatures
- GEO2Enrichr: Differential Expression Analysis Tool
- L1000CDS2: L1000 Characteristic Direction Signature Search Engine
- PAEA: Principal Angle Enrichment Analysis

Acknowledgements

The BD2K-LINCS DCIC is co-funded by BD2K and the NIH Common Fund.
NIH Grant Number: U54HL

Follow BD2K-LINCS DCIC: BD2K-LINCS, +BD2K-LINCS
BD2K-LINCS DCIC website: lincs-dcic.org
LINCS Consortium portal: lincsproject.org