IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop University of Hawaii at Manoa; December 10-11, 2012.

Slides:



Advertisements
Similar presentations
1 Is there an ? Is there an app for that ? Challenges in scalable analysis for Life sciences 1 Nirav Merchant UA BioComputing + iPlant Arizona Research.
Advertisements

Enabling Phenotypic Image Analysis Using Shared Cyberinfrastructure
The iPlant Collaborative Community Cyberinfrastructure for Life Science Nirav Merchant iPlant / University of Arizona
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
1 iPlant Data Store (iDS) Supporting the Lifecycle of Data Nirav Merchant 1.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Jason Williams Cold Spring Harbor Laboratory, iPlant
IPlant Collaborative Powering a New Plant Biology iPlant Collaborative Powering a New Plant Biology.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Arthropod Genomics Research in ARS Workshop Jason Williams Cold.
Customized cloud platform for computing on your terms !
The iPlant Collaborative Community Cyberinfrastructure for Life Science Roger Barthelson/Uwe Hilgert iPlant / University of Arizona.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Introduction to iPlant Dan Stanzione The iPlant Collaborative September 16th, 2013.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Jason Williams Cold Spring Harbor Laboratory Botany 2013, New Orleans, LA.
BISQUE: Enabling Cloud and Grid Powered Image Analysis Ramona Walls iPlant Collaborative
IPlant's Taxonomic Name Resolution Service Naim Matasci BIO5 / The iPlant Collaborative tnrs.iplantc.org.
Enabling Cloud and Grid Powered Image Phenotyping Nirav Merchant iPlant Collaborative
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Cloud Computing for Education and Research Customized cloud platform for computing on your terms ! CSUPERB symposium, Jan 3 rd 2013 Nirav Merchant
The iPlant Collaborative IBP Annual Meeting – June 1 st 2011 Steve.
1 iPlant: Cyberinfrastructure for Plant Sciences (and Beyond) Your Name Here 1.
IPlant Collaborative Bringing Together High Performance Computing and Biology.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Customized cloud platform for computing on your terms ! Nirav Merchant
The iPlant Collaborative Community Cyberinfrastructure for Life Science Jason Williams Cold Spring Harbor Laboratory, iPlant
The iPlant Collaborative Presented by Sheldon McKay Cold Spring Harbor Laboratory.
RNA-Seq 2013, Boston MA, 6/20/2013 Optimizing the National Cyberinfrastructure for Lower Bioinformatic Costs: Making the Most of Resources for Publicly.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
IPlant's Taxonomic Name Resolution Service Naim Matasci BIO5 / The iPlant Collaborative.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Jason Williams Cold Spring Harbor Laboratory, iPlant
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Enabling Cloud and Grid Powered Image Phenotyping Martha Narro iPlant Collaborative Adapted.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
My-Plant.org A Phylogenetically Structured Social Network Matthew R Hanlon November 13, 2010.
IPlant Collaborative Hands-on Cyberinfrastructure Workshop – Part 2 R. Walls University of Arizona Biodiversity Information Standards (TDWG) Sep. 29, 2015,
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop iPlant Data Store.
The iPlant Collaborative Using iPlant for sharing, managing, and analyzing ecological data Ramona Walls Presented at ESA 2014 – Ignite session August 12,
The iPlant Collaborative Community Cyberinfrastructure for Life Science Jason Williams Cold Spring Harbor Laboratory, iPlant.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Atmosphere.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of the iPlant Data Store.
IPlant Collaborative Hands-on Cyberinfrastructure Workshop - Part 1 R. Walls University of Arizona Biodiversity Information Standards (TDWG) Sep. 28, 2015,
The iPlant Collaborative Community Cyberinfrastructure for Life Science Jason Williams iPlant / Cold Spring Harbor Laboratory Texas A&M Tools and Services.
IPlant Genomics in Education Workshop Genome Exploration in Your Classroom.
The iPlant Collaborative Pollen RCN March 2 nd, 2013 The iPlant Collaborative Pollen RCN March 2 nd, 2013 Steve Goff BIO5 Institute.
Overview of Atmosphere
The iPlant Collaborative Community Cyberinfrastructure for Life Science Jason Williams Cold Spring Harbor Laboratory, iPlant.
IPlant Collaborative Bringing Together High Performance Computing and Biology.
Agenda iPG2P Steering Committee September 27, 2011 Welcome Fusheng Wei, Scientific Analyst Virginia Tech Workshop (Ruth) iPlant presentation to NSB (Martha)
Enabling Cloud and Grid Powered Image Phenotyping
داده های عظیم در دوران پساژنوم Big Data in Post Genome Era مهدی صادقی پژوهشگاه ملی مهندسی ژنتیک و زیست فناوری پژوهشکده علوم زیستی، پژوهشگاه دانش های بنیادی.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Jason Williams Cold Spring Harbor Laboratory, iPlant.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of the iPlant Discovery Environment.
The iPlant Collaborative iPToL Data Assembly Workshop November 21 st, 2009 Steve Goff, Sonya Lowry, Martha Narro, Dan Stanzione University of Arizona,
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of the iPlant Discovery Environment.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Transforming Science Through Data-driven Discovery Genomics in Education University of Delaware – February 2016 Jason Williams, Education, Outreach, Training.
Transforming Science Through Data-driven Discovery Tools and Services Workshop Atmosphere Joslynn Lee – Data Science Educator Cold Spring Harbor Laboratory,
Transforming Science Through Data-driven Discovery Tools and Services Workshop Data Store Overview.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Transforming Science Through Data-driven Discovery Workshop Overview Ohio State University MCIC Jason Williams – Lead, CyVerse – Education, Outreach, Training.
Transforming Science Through Data-driven Discovery Using CyVerse Cyberinfrastructure to Enable Data Intensive Research, Collaboration, and Education Joslynn.
Transforming Science Through Data-driven Discovery Using CyVerse Cyberinfrastructure to Enable Data Intensive Research, Collaboration, and Education Atmosphere.
CI Updates and Planning Discussion
CyVerse Tools and Services
Tools and Services Workshop
Customized cloud platform for computing on your terms !
Joslynn Lee – Data Science Educator
Tools and Services Workshop Overview of Atmosphere
Cyberinfrastructure for the Life Sciences
Presentation transcript:

iPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop University of Hawaii at Manoa; December 10-11, 2012

The iPlant Collaborative Cyberinfrastructure for the Plant Sciences

“BGI, based in China, is the world’s largest genomics research institute, with 167 DNA sequencers producing the equivalent of 2,000 human genomes a day. BGI churns out so much data that it often cannot transmit its results to clients or collaborators over the Internet or other communications lines because that would take weeks. Instead, it sends computer disks containing the data, via FedEx.” The Problem of Big Data in Biology

Human Genome: $2.7 Billion, 13 Years Human Genome: $900, 6 Hours 2012: Oxford Nanopore MiniION 2003: ABI 3730 Sequencer The Problem of Big Data in Biology A decade’s progress

The Problem of Big Data in Biology

High Throughput Phenotyping The large amount of sequence based data need balancing with equally powerful phenotypic data. Phytomorph Project (Univ. Wisconsin) $70K for 30 cameras 200 movies of root growth 4GB/day of images for processing

The Problem of Big Data in Biology

Data-intensive biology will mean getting biologists comfortable with new technology…

1973 Sharp, Sambrook, Sugden Gel Electrophoresis Chamber, $ Matt Meselson & Ultracentrifuge, $500,000 The Problem of Big Data in Biology hopefully comfortable enough to minimize the technology and focus on the biology.

The iPlant Collaborative Cyberinfrastructure for the Plant Sciences The iPlant CI is designed as infrastructure. This means it is a platform upon which other projects can build. Use of the iPlant infrastructure can take one of several forms: Storage Computation Hosting Web Services Scalability

For a challenge as broad as “plant science,” focus on specific applications/tools is a moving target, and never enough. Most important to build a *platform* that can support diverse and constantly evolving needs. “Cyberinfrastructure” is, in fact, infrastructure. The platform can lift all the apps, not select winners and losers. “The useful lifetime of our analysis tool chains is now 6 months” -Matthew Trunnel, Broad Institute The iPlant Collaborative Cyberinfrastructure for the Plant Sciences

We have designed iPlant to be consistent with the pillars of CIF21 High Performance Computing Data and Data Analysis Virtual Organization Learning and Workforce The iPlant Collaborative Cyberinfrastructure Philosophy

End Users Computational Users Teragrid XSEDE The iPlant Collaborative Cyberinfrastructure for the Plant Sciences

The iPlant Collaborative Ways to access iPlant Atmosphere: For virtual hosting of web apps, sites, databases. iPlant Data Storage: All data large and small The Discovery Environment: Integrated Web apps. MyPlant: Social Networking. DNASubway: Annotation and more Standalone Apps: TNRS, TreeViewer, PhytoBisque, etc The API: For programmers embedding iPlant CI capabilities Command line for experts (thru TeraGrid/XSEDE)

The iPlant Collaborative Practical Benefits Powerful computational resources (Data analysis and storage) Experimental verifiability, reproducibility, provenance Interconnected resources / multiple levels of access Facilitation of collaboration Scalability/extensibility

90,000 Compute Cores Up to 1TB shared memory Growing to ~500,000 cores by end of 2012 TACC Ranger PSC Blacklight TACC Corral EBI Web Services TACC Lonestar The iPlant Collaborative Scalable Computation for High Throughput Inquiry

Chris Pires, U. of Missouri – Assembly of Brassica Genomes on shared memory systems Haibo Tang, JCVI “ The resources available change your research landscape –the amounts and types of analyses that you do.” The iPlant Collaborative Scalable Computation for High Throughput Inquiry

A rich web client – Consistent interface to bioinformatics tools – Portal for users who won’t want to interact with lower level infrastructure An integrated, extensible system of applications and services – Additional intelligence above low level APIs – Provenance, Collaboration, etc. The iPlant Collaborative iPlant Discovery Environment

The iPlant Collaborative iPlant Discovery Environment

API-compatible implementation of Amazon EC2/S3 interfaces Virtualize the execution environment for applications and services Up to 12 core / 48 GB instances Access to Cloud Storage + EBS Run servers, CloudBurst desktop use cases. Big data and the desktop are co- local again! >60 hosted applications in Atmosphere today, including users from USDA, Forest Service, database providers, etc. (30 more for postdocs and grad students for training classes) The iPlant Collaborative Project Atmosphere™: Custom Cloud Computing

Fast data transfers via parallel, non-TCP file transfer Move large (>2 GB) files with ease Multiple, consistent access modes iPlant API iPlant web apps Desktop mount (FUSE/DAV) Java applet (iDrop) Command line Fine-grained ACL permissions Sharing made simple Access and a storage allocation is automatic with your iPlant account The iPlant Collaborative Data Store

A number of other applications are “Powered by iPlant” but developed by our team on top of the infrastructure. In response to specific grand challenge team requests for things that needed their own web presence. TNRS, My-Plant, and more. The iPlant Collaborative

Other major projects are beginning to adopt the iPlant CI as their underlying infrastructure (some completely, some in limited ways): CoGe (auth service, hosting) BioExtract (web service platform) CiPRES (computation) Gates Integrated Breeding Platform (hosting, development) Galaxy (storage, for now) The iPlant Collaborative

iPlant APIs Resources

UA TACC CSHL The iPlant Collaborative A virtual organization

Staff: Greg Abram Sonali Aditya Roger Barthelson Brad Boyle Todd Bryan Gordon Burleigh John Cazes Mike Conway Karen Cranston Rion Doodey Andy Edmonds Dmitry Fedorov Michael Gatto Utkarsh Gaur Cornel Ghiban Michael Gonzales Hariolf Häfele Matthew Hanlon MetadataDataToolsWorkflowsViz Executive Team: Steve Goff Dan Stanzione Faculty Advisors & Collaborators: Ali Akoglu Greg Andrews Kobus Barnard Sue Brown Thomas Brutnell Michael Donoghue Casey Dunn Brian Enquist Damian Gessler Ruth Grene John Hartman Matthew Hudson Dan Kliebenstein Jim Leebens-Mack David Lowenthal Robert Martienssen Students: Peter Bailey Jeremy Beaulieu Devi Bhattacharya Storme Briscoe Ya-Di Chen John Donoghue Steven Gregory Yekatarina Khartianova Monica Lent Amgad Madkour B.S. Manjunath Nirav Merchant David Neale Brian O’Meara Sudha Ram David Salt Mark Schildhauer Doug Soltis Pam Soltis Edgar Spalding Alexis Stamatakis Ann Stapleton Lincoln Stein Val Tannen Todd Vision Doreen Ware Steve Welch Mark Westneat Andrew Lenards Zhenyuan Lu Eric Lyons Naim Matasci Sheldon McKay Robert McLay Angel Mercer Dave Micklos Nathan Miller Steve Mock Martha Narro Praveen Nuthulapati Shannon Oliver Shiran Pasternak William Peil Titus Purdin J.A. Raygoza Garay Dennis Roberts Jerry Schneider Anthony Heath Barbara Heath Matthew Helmke Natalie Henriques Uwe Hilgert Nicole Hopkins Eun-Sook Jeong Logan Johnson Chris Jordan B.D. Kim Kathleen Kennedy Mohammed Khalfan Seung-jin Kim Lars Koersterk Sangeeta Kuchimanchi Kristian Kvilekval Aruna Lakshmanan Sue Lauter Tina Lee Bruce Schumaker Sriramu Singaram Edwin Skidmore Brandon Smith Mary Margaret Sprinkle Sriram Srinivasan Josh Stein Lisa Stillwell Kris Urie Peter Van Buren Hans Vasquez-Gross Matthew Vaughn Fusheng Wei Jason Williams John Wregglesworth Weijia Xu Jill Yarmchuk Aniruddha Marathe Kurt Michaels Dhanesh Prasad Andrew Predoehl Jose Salcedo Shalini Sasidharan Gregory Striemer Jason Vandeventer Kuan Yang Postdocs: Barbara Banbury Jamie Estill Bindu Joseph Christos Noutsos Brad Ruhfel Stephen A. Smith Chunlao Tang Lin Wang Liya Wang Norman Wickett The iPlant Collaborative

iPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Workshop Goals Demonstrate some of the ways iPlant CI can advance your science Familiarize you with iPlant tools and services Helping you add your “voice” to the iPlant user community Getting your feedback on the computation bottlenecks iPlant should tackle next