North Carolina Bioinformatics Grid
Thom H. Dunning, Jr.
HPCC Division, MCNC; Chemistry, University of North Carolina



Genomics: A Compute- and Data-Intensive Science
* from TimeLogic

Data Explosion: Rapid Growth of GenBank
[Figure: Growth of GenBank; axis: No. Gbases]
- Number of base pairs increasing dramatically (exponentially)
- Growth in 2002 due to additions in just 21 days!

Data Explosion: Number and Diversity of Databases
Source: Nucleic Acids Research, 2002, Vol. 30, No. 1, Table 1 (Molecular Biology Database Collection)
- Major Public Sequence Repositories
  - DNA Data Bank of Japan (DDBJ): known nucleotide and protein sequences
  - …
- Varied Biomedical Content
  - …
  - VirOligo (http://viroligo.okstate.edu): Virus-specific oligonucleotides for PCR and …
- 333 Databases

Computing Explosion: Assembly and Analysis of Genomic Data
- Celera Genomics: Assembling the Genome
  - Compaq Alpha Clusters
  - Number of processors: ~750
  - Peak performance: 1 teraops
- NuTech Sciences: Mining the Genome
  - IBM p640 System
  - Number of processors: ~5,000
  - Peak performance: 7½ teraops
  - Total memory: 2½ terabytes
  - Total disk storage: 50 terabytes
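To put the two systems on a common footing, the slide's totals can be divided by the processor counts. The sketch below is only illustrative arithmetic on the figures quoted above: it assumes the approximate processor counts are exact, that "teraops" and "terabytes" are decimal units (1 tera = 1,000 giga), and the dictionary keys and function names are hypothetical.

```python
# Figures copied from the slide; processor counts are approximate ("~") there.
SYSTEMS = {
    "Celera Genomics (Compaq Alpha clusters)": {
        "processors": 750,
        "peak_teraops": 1.0,
    },
    "NuTech Sciences (IBM p640 system)": {
        "processors": 5000,
        "peak_teraops": 7.5,
        "memory_terabytes": 2.5,
    },
}


def gigaops_per_processor(processors: int, peak_teraops: float) -> float:
    """Peak performance per processor, assuming 1 teraops = 1,000 gigaops."""
    return peak_teraops * 1000.0 / processors


def gigabytes_per_processor(processors: int, memory_terabytes: float) -> float:
    """Memory per processor, assuming 1 terabyte = 1,000 gigabytes."""
    return memory_terabytes * 1000.0 / processors


if __name__ == "__main__":
    for name, spec in SYSTEMS.items():
        peak = gigaops_per_processor(spec["processors"], spec["peak_teraops"])
        print(f"{name}: about {peak:.2f} gigaops peak per processor")
        if "memory_terabytes" in spec:
            mem = gigabytes_per_processor(
                spec["processors"], spec["memory_terabytes"]
            )
            print(f"{name}: about {mem:.2f} gigabytes of memory per processor")
```

Under these assumptions the per-processor peak is similar for both systems, so the larger aggregate figure for the second system comes mainly from its processor count.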

Genomics: Meeting the Information Challenge
[Diagram labels: Grid Middleware, Data Storage, Computers, Network]

North Carolina Supercomputing Center

North Carolina Research and Education Network
[Map of network sites: Greensboro, Charlotte, Pembroke, Winston Salem, NCSU Centennial Campus, NCCU, Duke, UNC-CH, Wilmington, Elizabeth City, Asheville, Cullowhee, Fayetteville, Greenville, RTP MCNC, Boone, Morehead City, Rocky Mount; label: Qwest RTP RPoP]
NCREN3
- Increased bandwidth
- Increased reliability
- Increased resiliency

Grid Technologies
- Major New Computing Technology
  - Under development since mid-1990s
- Distinguishing Characteristics
  - “Middleware” to support efficient resource sharing in a distributed, heterogeneous computing and data storage environment
  - Focus on use of large-scale computing and data storage
- Some Major Grid Efforts
  - NASA IPG: Testbed linking selected NASA centers
  - DataGrid: International Grid being developed for high-energy physics (CERN)

Grid Technologies (cont’d)
- Some Major Grid Efforts (cont’d)
  - GriPhyN: Research in Grid technologies for physics applications (Argonne, Florida)
  - e-Science Grid: Major effort in UK to develop a Grid infrastructure for science and engineering research
  - BIRN: Data Grid focused on neuroimaging data (UCSD, SDSC)

North Carolina Genomics and Bioinformatics Consortium
- Goal
  - Provide a venue for Consortium members to share information and resources, plan strategic initiatives, and form alliances
- Distributed Across North Carolina
  - Concentration in Research Triangle, but extends across all of North Carolina
- Diverse Goals and Expertise
  - Human health, including animal models; agriculture and forestry; evolutionary biology basic research; tool development

Overall NC BioGrid Architecture
[Layered diagram, top to bottom]
- BioApp #1, BioApp #2, BioApp #3, …: Grid-aware, -enabled bioinformatics applications
- Grid Middleware: Globus, Legion, …
- Network: NCREN3
- Computing and Data Resources: NCSC plus Members’ Computing Centers
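The layering on this slide can be written down as a small data structure. The sketch below is one possible reading of the diagram, not the project's actual design: the bottom-to-top ordering is inferred from the slide's labels, and the `Layer` class, `NC_BIOGRID_STACK`, and `layers_below` are hypothetical names introduced only for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Layer:
    """One layer of the architecture diagram, with the examples the slide names."""
    name: str
    examples: tuple[str, ...]


# Ordered bottom to top; this ordering is an assumption read off the slide.
NC_BIOGRID_STACK = (
    Layer("Computing and Data Resources", ("NCSC", "Members' Computing Centers")),
    Layer("Network", ("NCREN3",)),
    Layer("Grid Middleware", ("Globus", "Legion")),
    Layer("Bioinformatics Applications", ("BioApp #1", "BioApp #2", "BioApp #3")),
)


def layers_below(layer_name: str) -> list[str]:
    """Return the names of the layers that the given layer sits on top of."""
    names = [layer.name for layer in NC_BIOGRID_STACK]
    return names[: names.index(layer_name)]


if __name__ == "__main__":
    for layer in reversed(NC_BIOGRID_STACK):
        print(f"{layer.name}: {', '.join(layer.examples)}")
    print("Middleware depends on:", layers_below("Grid Middleware"))
```

Read this way, the applications interact only with the middleware layer, which in turn hides the network and the individual computing centers beneath it.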

NC BioGrid Project
- Two Phases
  - Testbed Phase: test existing middleware, resolve issues, prepare detailed plan (12-18 months)
  - Production Phase: create and operate NC BioGrid
- Funding for Testbed from MCNC
- Project Manager
  - Phil Emer, MCNC, Chief Architect/NC BioGrid
- Project Oversight
  - MCNC Board of Directors
  - HPCC Advisory Board
  - NC BioGrid Technical Advisory Group