Report of the “DOE Workshop on Ultra High-Speed Transport Protocols.

Slides:



Advertisements
Similar presentations
TRAINING SERVICES NIGTEL-CS TRAINING SERVICE Mobile Telecommunications in Africa especially Nigeria has recorded rapid growth and expansion in the.
Advertisements

-Grids and the OptIPuter Software Architecture Andrew A. Chien Director, Center for Networked Systems SAIC Chair Professor, Computer Science and Engineering.
Network Resource Broker for IPTV in Cloud Computing Lei Liang, Dan He University of Surrey, UK OGF 27, G2C Workshop 15 Oct 2009 Banff,
All rights reserved © 2006, Alcatel Grid Standardization & ETSI (May 2006) B. Berde, Alcatel R & I.
U.S. Department of Energy’s Office of Science Basic Energy Sciences Advisory Committee Dr. Daniel A. Hitchcock October 21, 2003
Towards a Virtual European Supercomputing Infrastructure Vision & issues Sanzio Bassini
High Performance Computing Course Notes Grid Computing.
4.1.5 System Management Background What is in System Management Resource control and scheduling Booting, reconfiguration, defining limits for resource.
Sponsored by the U.S. Department of Defense © 2005 by Carnegie Mellon University 1 Pittsburgh, PA Dennis Smith, David Carney and Ed Morris DEAS.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
Optical communications & networking - an Overview
Summary Role of Software (1 slide) ARCS Software Architecture (4 slides) SNS -- Caltech Interactions (3 slides)
Telecom Italia GRID activities for 6th FP Program Maurizio Cecchi 3/4 October 2002.
EU-GRID Work Program Massimo Sgaravatto – INFN Padova Cristina Vistoli – INFN Cnaf as INFN members of the EU-GRID technical team.
Chapter 19: Network Management Business Data Communications, 4e.
DWDM-RAM: DARPA-Sponsored Research for Data Intensive Service-on-Demand Advanced Optical Networks DWDM RAM DWDM RAM BUSINESS WITHOUT BOUNDARIES.
NGNS Program Managers Richard Carlson Thomas Ndousse ASCAC meeting 11/21/2014 Next Generation Networking for Science Program Update.
An Architecture for Differentiated Services
Introduction to Grid Computing Ann Chervenak Carl Kesselman And the members of the Globus Team.
23 September 2004 Evaluating Adaptive Middleware Load Balancing Strategies for Middleware Systems Department of Electrical Engineering & Computer Science.
Computing in Atmospheric Sciences Workshop: 2003 Challenges of Cyberinfrastructure Alan Blatecky Executive Director San Diego Supercomputer Center.
COnvergence of fixed and Mobile BrOadband access/aggregation networks Work programme topic: ICT Future Networks Type of project: Large scale integrating.
Presentation title SUB TITLE HERE Intelligent 21st Century Strategies for Broadband and Cyber Infrastructures Security By Dr. Emmanuel Hooper, PhD, PhD,
Distributed Real-Time Systems for the Intelligent Power Grid Prof. Vincenzo Liberatore.
NORDUnet NORDUnet The Fibre Generation Lars Fischer CTO NORDUnet.
LIGHTNESS Introduction 10th Oct, 2012 Low latency and hIGH Throughput dynamic NEtwork infrastructureS for high performance datacentre interconnectS.
Tufts Wireless Laboratory School Of Engineering Tufts University “Network QoS Management in Cyber-Physical Systems” Nicole Ng 9/16/20151 by Feng Xia, Longhua.
Department of Electronic Engineering City University of Hong Kong EE3900 Computer Networks Introduction Slide 1 A Communications Model Source: generates.
Chapter 1. Introduction. By Sanghyun Ahn, Deot. Of Computer Science and Statistics, University of Seoul A Brief Networking History §Internet – started.
Applications Requirements Working Group HENP Networking Meeting June 1-2, 2001 Participants Larry Price Steven Wallace (co-ch)
End-to-end Quality of Service over heterogeneous networks José Enríquez Gabeiras Telefónica I+D.
A Framework for Internetworking Heterogeneous High-Performance Networks via GMPLS and Web Services Xi Yang, Tom Lehman Information Sciences Institute (ISI)
The roots of innovation Future and Emerging Technologies (FET) Future and Emerging Technologies (FET) The roots of innovation Proactive initiative on:
DataTAG Research and Technological Development for a Transatlantic Grid Abstract Several major international Grid development projects are underway at.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
ASCR/ESnet Network Requirements an Internet2 Perspective 2009 ASCR/ESnet Network Requirements Workshop April 15/16, 2009 Richard Carlson -- Internet2.
John D. McCoy Principal Investigator Tom McKenna Project Manager UltraScienceNet Research Testbed Enabling Computational Genomics Project Overview.
1 BRUSSELS - 14 July 2003 Full Security Support in a heterogeneous mobile GRID testbed for wireless extensions to the.
Office of Science U.S. Department of Energy ESCC Meeting July 21-23, 2004 Network Research Program Update Thomas D. Ndousse Program Manager Mathematical,
William Stallings Data and Computer Communications
Advanced Networks: The Past and the Future – The Internet2 Perspective APAN 7 July 2004, Cairns, Australia Douglas Van Houweling, President & CEO Internet2.
Group member: Kai Hu Weili Yin Xingyu Wu Yinhao Nie Xiaoxue Liu Date:2015/10/
O AK R IDGE N ATIONAL L ABORATORY U. S. D EPARTMENT OF E NERGY 1 Enabling Supernova Computations by Integrated Transport and Provisioning Methods Optimized.
TeraScale Supernova Initiative: A Networker’s Challenge 11 Institution, 21 Investigator, 34 Person, Interdisciplinary Effort.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Comprehensive Scientific Support Of Large Scale Parallel Computation David Skinner, NERSC.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
DOE UltraScience Net The Need –DOE large-scale science applications on supercomputers and experimental facilities require high-performance networking Petabyte.
TeraPaths: A QoS Enabled Collaborative Data Sharing Infrastructure for Petascale Computing Research The TeraPaths Project Team Usatlas Tier 2 workshop.
1 The ILC Control Work Packages. ILC Control System Work Packages GDE Oct Who We Are Collaboration loosely formed at Snowmass which included SLAC,
Challenges in the Next Generation Internet Xin Yuan Department of Computer Science Florida State University
30 November 2001 Advisory Panel on Cyber Infrastructure National Science Foundation Douglas Van Houweling November 30, 2001 National Science Foundation.
NORDUnet NORDUnet e-Infrastrucure: Grids and Hybrid Networks Lars Fischer CTO, NORDUnet Fall 2006 Internet2 Member Meeting, Chicago.
An Architectural Approach to Managing Data in Transit Micah Beck Director & Associate Professor Logistical Computing and Internetworking Lab Computer Science.
Company LOGO Network Management Architecture By Dr. Shadi Masadeh 1.
Tackling I/O Issues 1 David Race 16 March 2010.
1 Revision to DOE proposal Resource Optimization in Hybrid Core Networks with 100G Links Original submission: April 30, 2009 Date: May 4, 2009 PI: Malathi.
March 2004 At A Glance The AutoFDS provides a web- based interface to acquire, generate, and distribute products, using the GMSEC Reference Architecture.
Presented by DOE UltraScience Net: High-Performance Experimental Network Research Testbed Nagi Rao Computer Science and Mathematics Division Complex Systems.
Office of Science U.S. Department of Energy High-Performance Network Research Program at DOE/Office of Science 2005 DOE Annual PI Meeting Brookhaven National.
Internet2 Members Meeting Washington, DC 1 Advanced Networking Infrastructure and Research (ANIR) Aubrey Bush Division Director, ANIR National Science.
BDTS and Its Evaluation on IGTMD link C. Chen, S. Soudan, M. Pasin, B. Chen, D. Divakaran, P. Primet CC-IN2P3, LIP ENS-Lyon
Advancing National Wireless Capability Date: March 22, 2016 Wireless Test Bed & Wireless National User Facility Paul Titus Department Manager, Communications.
Grid Optical Burst Switched Networks
Clouds , Grids and Clusters
DOE Facilities - Drivers for Science: Experimental and Simulation Data
The ILC Control Work Packages
RESEARCH, EDUCATION, AND TRAINING FOR THE SMART GRID
Optical communications & networking - an Overview
Presentation transcript:

Report of the “DOE Workshop on Ultra High-Speed Transport Protocols and Dynamic Provisioning for Large-Scale Science Applications” April 10-11, 2003, Argonne, IL Panel on Future Directions in Networking International Conference on Network Protocols November 5-7, 2003, Atlanta, GA Nagi Rao (Nageswara S.V. Rao) Computer Science and Mathematics Division Oak Ridge National Laboratory Transport Protocols and Dynamic Provisioning for Large-Scale Science Applications

We engineered the Internet, and it works fine for and web; but to do “world-class” scientific research needed in DOE scientific applications, we need to develop a science of networking that delivers usable performance to the applications Allyn Romanow, Cisco Systems

Introduction Organization Details Provisioning Group Notes Transport Group Notes Dynamics and Stabilization of Network Transport Conclusions Outline Opinions expressed in this presentation belong to the author and are not necessarily the official positions of US Department of Energy, Oak Ridge National Laboratory or UT-Battelle LLC.

Next generation of DOE scientific breakthroughs critically depend on large multi-disciplinary geographically dispersed research teams: – high energy physics, climate simulation, fusion energy, genomics, astrophysics, spallation neutron source, and others These applications are Inherently distributed in: Data – archival or on-line Computations – supercomputers or clusters Research teams – experts in different domains Experimental facilities – one of the kind user facilities – they all need to be seamlessly networked Networking for DOE large-science applications

DOE Large-Scale Science Applications and Numerous Other Science Applications - need extreme and acute networking Detailed account of the needs were identified and discussed at DOE High-Performance Network Planning Workshop, August 13-15, 2002, Science Areas Current End2End Throughput 5 years End2End Throughput 5-10 Years End2End Throughput General Remarks High Energy Physics0.5 Gbps E2E100 Gbps E2E1.0 Tbpshigh throughput Climate Data & Computations 0.5 Gbps E2E Gbpsn Tbpshigh throughput SNS NanoScience does not exist1.0 Gbps steady state Tbps & control channels remote control & high throughput Fusion Energy500MB/min (Burst) 500MB/20sec (burst) n Tbpstime critical transport Astrophysics1TB/weekN*N multicast1TB+ & stable streams computational steering & collaborations Genomics Data & Computations 1TB/day100s usersTbps & control channels high throughput & steering Need more than bulk bandwidth

Science Objective: Understand supernova evolutions –Teams of field experts across the country collaborate on computations Experts in hydrodynamics, fusion energy, high energy Universities and national labs –Massive computational code Terabyte in days are generated currently Archived at nearby HPSS Visualized locally on clusters – only archival data Desired capability –Archive and supply massive amounts of data –Collaboratively visualize archival or on-line data –Monitor, visualize and steer computations into regions of interest Astrophysics Computations Visualization channel Control channel

Data Movement Operations –Experimental and computational data Stored across the country Terabytes of data per day –Between users, archives and computers Molecular Dynamics Computations –Supercomputers or clusters –Monitor, visualize, and steer computations Genomics Networking Needs data channel visualization channel steering channel

Experimental Setups and Monitoring of Expensive Facilities (SNS – billion$) –Setup parameters and start experiments –Adjust parameters as needed; stop when necessary Data Movements –Archive and access massive amounts of experimental data Neutron Facilities – SNS, HFIR

DOE faces unique or acute challenges: Small user base with extreme needs –large data transfers at application-level – rates much higher than current backbones –highly controlled end-to-end data streams – unprecedented agility and stability –capabilities must be available to science users – not just to network experts with special networks Commercial and other networks will not adequately meet these acute requirements –Not large enough user base –Very limited business case New advances in Transport and Provisioning hold enormous promise, if suitably fostered and integrated –Flexible and powerful routers/switches, ultra high bandwidth links, new transport protocols can get us partway there –But, need several critical technologies and expertise: end-to-end dynamic provisioning of paths with guaranteed performance transport methods that optimally provide to user applications Current Network Capabilities: Transport and Provisioning

Workshop Goal Address the research, design, development, testing and deployment aspects of transport protocols and network provisioning as well as the application-level capability needed to build operational ultra-speed networks to support emerging DOE distributed large-scale science applications over the next 10 years.

Workshop Focus: Ultra High-Speed Networks to support DOE Large-Science Applications –not a general network research workshop addressing Internet problems Formulate DOE roadmap in the specific areas: –Transport and Provisioning two very critical subareas of network research needed to meet DOE large-science requirements Not in other areas such as security, wireless networks “Working” workshop –Discussions on very specific problems, methods, potential solutions in transport and provisioning areas –Very short introductory presentations –Not just primarily informational or educational

Balanced participation from universities, industry and national laboratories to represent the needs, technologies, research and business aspects Total: 32 National Laboratories: 10 ORNL:3; ANL:2: LANL:2; PNNL:1; SLAC:1; ESnet: 1 Universities: 11 UMass; GaTech(2), Uva, UIC,Indiana U, U Va, U Tennessee, UC Davis, PSC, CalTech Industry:8 Celion, Cienna, Cisco, Juniper, Level3, Lightsand, MCNC, Qwest DOE Headquarters: 3 Working Groups: Provisioning: 14 Transport:15 Participants

Need Focused Efforts in Develop a scalable architecture for fast provisioning Circuit Switched Network Build an application-centric circuit-switched cross country test-bed Coordination and Graceful integration with Applications and Middleware Transport and OS Developers Legacy and evolutionary networks Summary: Provisioning for DOE Large-Science Networks

Recommendation 1: Agile Optical Network infrastructure: –A scalable architecture for fast provisioning of circuit switched dedicated channels specified on-demand by the applications. Recommendation 2: Hybrid Switched Networks: –High capacity (Tbps) switchable channels for Petabyte data transport, a combination of requirements to accommodate burst, real-time streams as well as lower priority traffic, multi-point or shared use, for large file and data transfers, and for low latency and low jitter. Recommendation 3: Dynamically Reconfigured Channels: –Provisioning of dynamically specified end-to-end quality paths for computational steering and time-constrained experimental data analysis. Recommendation 4: Multi-Resolution Quality of Service: –Channels with various types of Quality of Service (QoS) parameters must be supported at various resolutions using GMPLS, service provisioning and channel sharing technologies. Recommendation 5: Experimental Test-Bed Provisioning Recommendations

Limited deployment of ultra-long haul DWDM links; Lack of support for striped/parallel transport both at the core and application levels; Lack of high-speed circuit-switched infrastructure with network control-plane design and synchronous NICs with high-speed and on-demand reconfigurability; and Lack of well-developed methods and application interfaces for scheduling/reserving, allocation, and initiation. DOE applications do not follow the commercial scaling model of large number of users each with smaller bandwidth requirements; Lack of security paradigms for dedicated paths and the infrastructure that to manage them; Lack of a robust multi-cast solution efficiently supported on dedicated channels; High cost of equipment, including the costs of links, routers/switches and other equipment as well as deployment and maintenance; Lack of field-hardening of optical components such as memory/buffer, high-speed switches, Reamplification, Reshaping and Retiming (RRR) equipment, and lambda conversion gear; Lack of effective contention resolution methods for the allocation of channel pools; and Limited interoperability with other data networks, particularly legacy networks. Provisioning: Barriers

Current transport methods are massively inadequate –Unattained Throughput: wizards can achieve several Gbps for certain durations But throughputs are needed at application user level –Cannot provide sustained and stable streams for control operations –TCP has complicated dynamics – hard to use in finer control operations Need focused efforts in: –Optimal transport methods to exploit provisioning to meet requirements Transport -Tbps throughputs Support stable and agile control channels –Comprehensive theory of transport: synergy and extensions of traditional disciplines Stochastic control, non-linear control, statistics, optimization, protocol engineering –Strict algorithmic design Modular, autonomic, adaptive, composable Integration and interactions: –DOE deployment, wider adoption, legacy integration –Experiment and test-bed –Instrumentation and Diagnostics tools web100/Net100 Statistical Inference and optimized data collection Summary: Transport for DOE Large-Science Networks

Scientists (must) view the network as they view a computer as a resource But they are becoming (not always willingly) network experts – “wizard gap” at all levels but “gray matter” tax must be low Advance the state of network protocols to make them plug-and- play for the application users -need significant effort Time-to-solution in networking area is currently too high – TCP tuning for Gbps throughputs took years Peak is not enough – need sustained throughputs at application level Transport Group Notes

Recommendation 1: Transport Protocols and Implementations –Transport methods for dedicated channels and IP networks for achieving high throughput, steering and control. The transport methods include TCP-, UDP- and SAN- based methods together with newer approaches. Recommendation 2: Transport Customization and Interfacing –Transport methods optimized to single and multiple hosts as well as channels of different modes. –Transport methods suitably interfaced with storage methods to avoid impedance mismatches that could degrade the end-to-end transport performance. Recommendation 3: Stochastic Control Methods –Stochastic control theoretic methods to design protocols with well-understood and/or provable stability properties. Recommendation 4: Monitoring and Estimation Methods –Monitoring and statistical estimation techniques to monitor the critical transport variables and dynamically adjust them to ensure transport stability and efficiency. Recommendation 5: Experimental Test-Bed Transport Group Recommendations: 1-5 years

Recommendation 1: Modular Adaptive Composable and Optimized Transport Modules: –Highly dynamic and adaptive methods to dynamically compose transport methods to match the application requirements and the underlying provisioning. Recommendation 2: Stochastic and Control Theoretic Design and Analysis: –Stochastic control theoretic methods for composable transport methods to analyze them as well as to guide their design to ensure stability and effectiveness Recommendation 3: Graceful Integration with Middleware and Applications: –Application data and application semantics must be mapped into transport methods to optimally meet application requirements –boundary between middleware and transport must be made transparent to applications. Recommendation 4: Vertical Integration of Applications, Transport and Provisioning: –Vertical integration of resource allocation policies (cost and utility) with transport methods to present a unified view and interface to the applications. Transport Group Recommendations: 5-10 years

There is a need for systematic scientific approaches to the design, analysis and implementation of the transport methods and to network provisioning. On-Demand Bandwidth and Circuit Optimization: –Dynamic optimization and scheduling methods to allocate the bandwidth pipes to applications. –A comprehensive approach for on-line estimation and allocation of the “bandwidths” –Signaling to provide the required timeliness and reliability of the allocated channels. –Scientific, systematic understanding to integrate the components for bandwidth allocation, channel scheduling, channel setup and teardown, and performance monitoring. Comprehensive Theory of Transport: –Rigorous transport design methods tailored to the underlying provisioning modes. –A synergy and extensions of a number of traditional disciplines. –New stochastic control methods may be required to design suitable transport control methods. –Non-linear control theoretic methods to analyze delayed feedback. –Statistical theory for designing rigorous measurements and tests. –Optimization theory to obtain suitable parameters for tuning protocols. Strict Algorithmic Design and Implementation: –Strict algorithmic design methods to efficiently implement the designed protocols. –Implementations must be modular, autonomic, adaptive, and composable. Statistical Inference and Optimized Data Collection: –Due to the sheer data volumes, it is inefficient to collect measurements from all nodes all the time for the purposes of diagnosis, optimization and performance tuning. –Systematic inferencing methods to identify the critical and canonical sets of measurements needed. –Statistical design of experiments to ensure that the measurements are strategic and optimal. Science of High-Performance Networking

High-Performance Network Test-beds: Recommended by both groups State-of-the-art Components: software and hardware networking components, including routers/switches, high bandwidth long-haul links, protocols and application interface modules. Integrated Development Environments: mechanisms to integrate a wide spectrum of network technologies including high throughput protocol, dynamic provisioning, interactive visualization and steering, and high performance cyber security measures Smooth Technology Transition: transition of network technologies from research stages to production stages by allowing them to mature in such an environment. Characteristics of ultra high-speed network test-bed: 1.Interconnection of at least three science facilities with large-scale science applications; 2.Geographical coverage adequate to capture optical characteristics, transport protocols dynamics, and application behaviors comparable to that of real-word applications; 3.Integration with appropriate middleware; 4.Scalable network measurement tools; and 5.Well-defined technology transfer plan.

Integration, Interaction, and Interfacing Applications are empowered to “tune” the network Application 1Application 2Application 3 IP provisioning Dynamic lambda switching Net100 modules Non-TCP protocols Stabilize module Control modules UltraNet Middleware Protocols Network-Aware Applications Supernova: Large data stream Control stream Molecular dynamics visualization HEP data transfers

While not on the original agenda, security issues have significant impact on application performance – DOE sites have very strict firewalls Securing Operational and Development Environments: –authentication, validation and access controls –data speeds of multiple tens of Gbps or higher –new security methods for on-demand dedicated channels. Effects of Security Measures on Performance: –impact of security measures on application performance. –graceful interoperation of science applications under secured network environments. Proactive Countermeasures: –protect bandwidth allocation, and signaling to setup and tear down the paths –vulnerability of new transport protocols to certain attacks Network Security Issues