IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.

Slides:



Advertisements
Similar presentations
Distributed Data Processing
Advertisements

ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Adding scalability to legacy PHP web applications Overview Mario A. Valdez-Ramirez.
NWfs A ubiquitous, scalable content management system with grid enabled cross site data replication and active storage. R. Scott Studham.
Dell IT Innovation Labs in the Cloud “The power to do more!” Andrew Underwood – Manager, HPC & Research Computing APJ Solutions Engineering Team.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Understanding and Managing WebSphere V5
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
DNA Subway Green Line Overview. Growth of Sequence Read Archive (SRA) 2.2 Quadrillion bases Log Scale!
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
U.S. Department of the Interior U.S. Geological Survey David V. Hill, Information Dynamics, Contractor to USGS/EROS 12/08/2011 Satellite Image Processing.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Types of Operating System
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
IPlant Collaborative Powering a New Plant Biology iPlant Collaborative Powering a New Plant Biology.
Selecting and Implementing An Embedded Database System Presented by Jeff Webb March 2005 Article written by Michael Olson IEEE Software, 2000.
Trimble Connected Community
Customized cloud platform for computing on your terms !
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Chapter 6 Operating System Support. This chapter describes how middleware is supported by the operating system facilities at the nodes of a distributed.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
Software Architecture
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Contents HADOOP INTRODUCTION AND CONCEPTUAL OVERVIEW TERMINOLOGY QUICK TOUR OF CLOUDERA MANAGER.
Using Biological Cyberinfrastructure Scaling Science and People: Applications in Data Storage, HPC, Cloud Analysis, and Bioinformatics Training Scaling.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
EPE Release 2 IOC Review August 7, 2012 Ocean Observatories Initiative OOI EPE Release 2 Initial Operating Capability Review System Development Overview.
Customized cloud platform for computing on your terms ! Nirav Merchant
Engr. M. Fahad Khan Lecturer Software Engineering Department University Of Engineering & Technology Taxila.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Data Management BIRN supports data intensive activities including: – Imaging, Microscopy, Genomics, Time Series, Analytics and more… BIRN utilities scale:
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
CRISP & SKA WP19 Status. Overview Staffing SKA Preconstruction phase Tiered Data Delivery Infrastructure Prototype deployment.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Using Biological Cyberinfrastructure Scaling Science and People: Applications in Data Storage, HPC, Cloud Analysis, and Bioinformatics Training Scaling.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
The iPlant Collaborative Using iPlant for sharing, managing, and analyzing ecological data Ramona Walls Presented at ESA 2014 – Ignite session August 12,
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Atmosphere.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
IPlant Collaborative Hands-on Cyberinfrastructure Workshop - Part 1 R. Walls University of Arizona Biodiversity Information Standards (TDWG) Sep. 28, 2015,
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop GWAS/QTL Apps Overview.
GCRC Meeting 2004 BIRN Coordinating Center Software Development Vicky Rowley.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Windows Azure. Azure Application platform for the public cloud. Windows Azure is an operating system You can: – build a web application that runs.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
2012 Objectives for CernVM. PH/SFT Technical Group Meeting CernVM/Subprojects The R&D phase of the project has finished and we continue to work as part.
Distributed Data for Science Workflows Data Architecture Progress Report December 2008.
Building and managing production bioclusters Chris Dagdigian BIOSILICO Vol2, No. 5 September 2004 Ankur Dhanik.
IPlant Collaborative Tools and Services Workshop Overview of the iPlant Discovery Environment Sriram Srinivasan.
Scaling up R computation with high performance computing resources.
Resource Optimization for Publisher/Subscriber-based Avionics Systems Institute for Software Integrated Systems Vanderbilt University Nashville, Tennessee.
WP5 – Infrastructure Operations Test and Production Infrastructures StratusLab kick-off meeting June 2010, Orsay, France GRNET.
Introduction to Data Analysis with R on HPC Texas Advanced Computing Center Feb
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
CyVerse Workshop Discovery Environment Overview. Welcome to the Discovery Environment A Simple Interface to Hundreds of Bioinformatics Apps, Powerful.
Transforming Science Through Data-driven Discovery Workshop Overview Ohio State University MCIC Jason Williams – Lead, CyVerse – Education, Outreach, Training.
CI Updates and Planning Discussion
Tools and Services Workshop
Joslynn Lee – Data Science Educator
Types of Operating System
StratusLab Final Periodic Review
StratusLab Final Periodic Review
High-performance tracing of many-core systems with LTTng
Tools and Services Workshop Overview of Atmosphere
The Improvement of PaaS Platform ZENG Shu-Qing, Xu Jie-Bin 2010 First International Conference on Networking and Distributed Computing SQUARE.
Chapter 2: System Structures
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
Presentation transcript:

iPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant

iPlant is not a company! We’re colleagues We’re potential collaborators

Collaborating with iPlant Image from: Our goal is solve computational bottlenecks that impede research. Bottlenecks might be storage space, but it also might be finding a way to make tools easier to use. Doing this properly requires community input!

Collaborating with iPlant iPlant is Infrastructure. Like most infrastructure, you don’t always use it directly as an end user. Often, it is hidden underneath some other service.

Collaborating with iPlant Other major projects are beginning to adopt the iPlant CI as their underlying infrastructure (some completely, some in limited ways): – CoGe (auth service, hosting) – BioExtract (web service platform) – CiPRES (computation) – Gates Integrated Breeding Platform (hosting, development)

Collaborating with iPlant API The API (Application Programmer Interface) is the way bioinformatics tools and data get integrated with iPlant. The API is a straightforward web service. – You can use it with a simple curl command, e.g.: Paste example No different than embedding a Google API or map. You can use this for just authentication, for transferring files, running computation, looking up taxonomic names, etc.

Collaborating with iPlant API exemplar users Carol Lushbough, Bioextract BioExtract, a DBI funded portal, is currently being re- written to take advantage of the API. Carol is running jobs on Lonestar through the API. The API has also been adopted outside plants, by Apache Airavata, CyberGIS, and some XSEDE sites.

Collaborating with iPlant Stampede - High Level Overview Base Cluster (Dell/Intel/Mellanox): – Intel Sandy Bridge processors – Dell dual-socket nodes w/32GB RAM (2GB/core) – 6,400 nodes – 56 Gb/s Mellanox FDR InfiniBand interconnect – More than 100,000 cores, 2.2 PF peak performance Co-Processors: – Intel Xeon Phi “MIC” Many Integrated Core processors – Special release of “Knight’s Corner” (61 cores) – All MIC cards are on site at TACC more than 6000 installed final installation ongoing for formal summer acceptance – 7+ PF peak performance Max Total Concurrency: – exceeds 500,000 cores – 1.8M threads Entered production operations on January 7, 2013

iPlant is the first project to make most of the XSEDE resources available via a simple web service interface. iPlant can help you deploy an application on these resources. With a few lines of code, your program or web site can directly utilize some of the world’s largest systems, without you or your users ever logging in. Collaborating with iPlant API

Collaborating with iPlant Data Store iPlant now stores hundreds of millions of files, with nearly a petabyte of data under management. If you need to store or distribute a dataset, you can use the data store as your mechanism. The API is one way to do this, but there are also non-web methods for bulk data transfer. Once in the data store, a variety of interfaces can be used to access/retrieve the data – Indexing and analytics capabilities coming soon.

The current version supports multiple sites, but only full mirrors (all data at every site). The next version will allow new storage sites to enroll, but only keep data of interest. – With this option, iPlant could act as a mirror for specific projects/databases without that project hosting “unrelated” data. – Relevant data could be cached at local sites for faster access. – With a query capability, an internationally federated data store could be built where *no* site has all the data, but all sites can find it, allowing greater scaling. Collaborating with iPlant Data Store

Atmosphere is now offering (in beta) a “Metal as-a-service” mode. Servers running Linux are set up at a site, we install the Atmosphere back end. From the main Atmosphere site, you can route your VMs back to your local hardware. – Increase allocation – Control images and data locally. Collaborating with iPlant Atmosphere

If you *don’t* have your own web application, but you have a program, workflow, or dataset you would like to widely distributed, iPlant can be the platform. – Use the DE to share your workflow or program ( working on direct links that can be provided to use in publications or from your website). – Use the data stores public space to store and distribute your data (replicated, many interfaces) Collaborating with iPlant Distributing your science

CI as “Everyday” Science

iPlant Advanced Collaborative Support Provide a computing expert for an extended period of time to rebuild a popular tool for scalability, or other key functionality – Could be scaling, info/visualizations, or just information architecture help. – A typical engagement may run a month to 6 months. – An expert focused on making *your* particular tool or workflow run well on our systems. – Deal with the complexity of coming large scale systems (millions of threads).

Shaping iPlant to Suit Your Science Your Feedback is Critical iplantc.org/tswpost