Lambda Station Matt Crawford, Fermilab co-PI: Don Petravick, Fermilab co-PI: Harvey Newman, Caltech.

Slides:



Advertisements
Similar presentations
LambdaStation Phil DeMar Don Petravick NeSC Oct. 7, 2004.
Advertisements

Storage System Integration with High Performance Networks Jon Bakken and Don Petravick FNAL.
Barracuda Link Balancer Link Reliability and Bandwidth Optimization.
Lambda Station: On-demand flow based routing for data intensive GRID applications over multitopology networks GridNets 2006, San Jose,CA, October 1 –
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 4: Routing Concepts Routing Protocols.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Addressing the Network – IPv4 Network Fundamentals – Chapter 6.
Module 5 - Switches CCNA 3 version 3.0 Cabrillo College.
11 TROUBLESHOOTING Chapter 12. Chapter 12: TROUBLESHOOTING2 OVERVIEW  Determine whether a network communications problem is related to TCP/IP.  Understand.
Module 5: Configuring Access for Remote Clients and Networks.
Traffic Management - OpenFlow Switch on the NetFPGA platform Chun-Jen Chung( ) SriramGopinath( )
(part 3).  Switches, also known as switching hubs, have become an increasingly important part of our networking today, because when working with hubs,
Lawrence G. Roberts CEO Anagran September 2005 Advances Toward Economic and Efficient Terabit LANs and WANs.
Questionaire answers D. Petravick P. Demar FNAL. 7/14/05 DLP -- GDB2 FNAL/T1 issues In interpreting the T0/T1 document how do the T1s foresee to connect.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Addressing the Network – IPv4 Network Fundamentals – Chapter 6.
TeraPaths : A QoS Collaborative Data Sharing Infrastructure for Petascale Computing Research USATLAS Tier 1 & Tier 2 Network Planning Meeting December.
TeraPaths: A QoS Collaborative Data Sharing Infrastructure for Petascale Computing Research Bruce Gibbard & Dantong Yu High-Performance Network Research.
INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.
Firewall and Internet Access Mechanism that control (1)Internet access, (2)Handle the problem of screening a particular network or an organization from.
ACM 511 Chapter 2. Communication Communicating the Messages The best approach is to divide the data into smaller, more manageable pieces to send over.
TeraPaths TeraPaths: establishing end-to-end QoS paths - the user perspective Presented by Presented by Dimitrios Katramatos, BNL Dimitrios Katramatos,
Traffic Management - OpenFlow Switch on the NetFPGA platform Chun-Jen Chung( ) Sriram Gopinath( )
MPLS and Traffic Engineering Ji-Hoon Yun Computer Communications and Switching Systems Lab.
Module 4: Designing Routing and Switching Requirements.
Lambda Station: On-demand flow based routing for data intensive GRID applications over multitopology networks Fermi National Accelerator Laboratory Don.
Lambda Station Project Andrey Bobyshev; Phil DeMar; Matt Crawford ESCC/Internet2 Winter 2008 Joint Techs January 22; Honolulu, HI
NETWORKING COMPONENTS AN OVERVIEW OF COMMONLY USED HARDWARE Christopher Johnson LTEC 4550.
US LHC Tier-1 WAN Data Movement Security Architectures Phil DeMar (FNAL); Scott Bradley (BNL)
ESnet Site Coordinators Committee (ESCC): IPv6 Activities & Directions Phil DeMar (ESCC Chair) HEPix IPv6 Workshop (CERN) June 22, 2011.
1 Version 3.1 Module 1 WANs and Routers. 2 Version 3.1 WANS WAN operates at the physical layer and the data link layer of the OSI reference model. Provide.
GrangeNet Dr. Greg Wickham APAN NOC 25 August 2005.
TeraPaths TeraPaths: Establishing End-to-End QoS Paths through L2 and L3 WAN Connections Presented by Presented by Dimitrios Katramatos, BNL Dimitrios.
TeraPaths The TeraPaths Collaboration Presented by Presented by Dimitrios Katramatos, BNL Dimitrios Katramatos, BNL.
Module 1: Configuring Routing by Using Routing and Remote Access.
Switch Features Most enterprise-capable switches have a number of features that make the switch attractive for large organizations. The following is a.
Terapaths: MPLS based Data Sharing Infrastructure for Peta Scale LHC Computing Bruce Gibbard and Dantong Yu USATLAS Computing Facility DOE Network Research.
TeraPaths: A QoS Enabled Collaborative Data Sharing Infrastructure for Petascale Computing Research The TeraPaths Project Team CHEP 06.
Setup and Management for the CacheRaQ. Confidential, Page 2 Cache Installation Outline – Setup & Wizard – Cache Configurations –ICP.
OSCARS Roadmap Chin Guok Feb 6, 2009 Energy Sciences Network Lawrence Berkeley National Laboratory Networking for the Future of.
Introducing a New Concept in Networking Fluid Networking S. Wood Nov Copyright 2006 Modern Systems Research.
ALCF Argonne Leadership Computing Facility GridFTP Roadmap Bill Allcock (on behalf of the GridFTP team) Argonne National Laboratory.
SDN and OSCARS how-to Evangelos Chaniotakis Network Engineering Group ESCC Indianapoilis, July 2009 Energy Sciences Network Lawrence Berkeley National.
TeraPaths: A QoS Enabled Collaborative Data Sharing Infrastructure for Petascale Computing Research The TeraPaths Project Team Usatlas Tier 2 workshop.
1 IEX8175 RF Electronics Avo Ots telekommunikatsiooni õppetool, TTÜ raadio- ja sidetehnika inst.
+ Routing Concepts 1 st semester Objectives  Describe the primary functions and features of a router.  Explain how routers use information.
DICE: Authorizing Dynamic Networks for VOs Jeff W. Boote Senior Network Software Engineer, Internet2 Cándido Rodríguez Montes RedIRIS TNC2009 Malaga, Spain.
Company LOGO Network Architecture By Dr. Shadi Masadeh 1.
Chapter 6.  Internet and characteristics of Internet.  Various Internet protocols  Static IP addressing and Dynamic IP addressing Prepared by Saher.
-1- ESnet On-Demand Secure Circuits and Advance Reservation System (OSCARS) David Robertson Internet2 Joint Techs Workshop July 18,
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 9: Subnetting IP Networks Introduction to Networks.
Supporting Advanced Scientific Computing Research Basic Energy Sciences Biological and Environmental Research Fusion Energy Sciences High Energy Physics.
IPv6 Security Issues Georgios Koutepas, NTUA IPv6 Technology and Advanced Services Oct.19, 2004.
Run - II Networks Run-II Computing Review 9/13/04 Phil DeMar Networks Section Head.
TeraPaths: A QoS Enabled Collaborative Data Sharing Infrastructure for Petascale Computing Research The TeraPaths Project Team Usatlas Tier 2 workshop.
Fermilab Cal Tech Lambda Station High-Performance Network Research PI Meeting BNL Phil DeMar September 29, 2005.
GGF 17 - May, 11th 2006 FI-RG: Firewall Issues Overview Document update and discussion The “Firewall Issues Overview” document.
Configuring Network Devices
Lab A: Planning an Installation
Fermi National Accelerator Laboratory
Grid Optical Burst Switched Networks
CONNECTING TO THE INTERNET
Lambda Station Matt Crawford, Fermilab co-PI: Don Petravick, Fermilab
Introduction to Data Management in EGI
Establishing End-to-End Guaranteed Bandwidth Network Paths Across Multiple Administrative Domains The DOE-funded TeraPaths project at Brookhaven National.
Chapter 4: Routing Concepts
Introduction to Networking
Chapter 9: Subnetting IP Networks
Chapter 9: Subnetting IP Networks
Virtual Private Network
Chapter 10: Advanced Cisco Adaptive Security Appliance
Network Architecture By Dr. Shadi Masadeh 1.
Presentation transcript:

Lambda Station Matt Crawford, Fermilab co-PI: Don Petravick, Fermilab co-PI: Harvey Newman, Caltech

HEP Computing Labs plus University Community Vast ensembles of commodity equipment Something like a petabyte of IDE disk Storage system to storage system transfer Refresh of 200 TB of state at universities Structured production, “chaotic” analysis

HEP Networking Office of High Energy Physics funds LHCnet, (OC192 triangle Starlight  CERN  MANLAN) Interested in switched optical networking –UltraLight (Caltech) –UltraScience Net (ORNL) –OSCARS MPLS tunnels (ESnet: FNAL  BNL, etc) –FNAL-CERN 875 MB/Sec SS-SS service challenge Interest, testing, and following of improvements to TCP at high bandwidth  delay Given the directions of HEP computing, the ends of “pipes” are likely to be locally, competently engineered networks.

Problem statement Experiments and applications now running, or starting soon, will benefit from data movement capabilities now available only on bleeding-edge networks. These systems are connected to production site networks. Duplicating site infrastructure to connect them to special- purpose networks is an expense to be avoided if possible. Multihoming the endpoints to multiple networks is complicated and expensive and it (nearly) precludes graceful failover when one path is lost. Applications (and operating systems) should not have to be re- customized for every new network technology or high- performance path.

Additional complications Rates are not predictable for real data sources and sinks. –Memory-to-memory is somewhat deterministic, but disk-to- disk has several uncontrolled variables. Applications may use multiple streams for maximum exploitation of high-speed links. Lambda Station must be able to deal in aggregates. Straggler flows persist after bulk of transfer has completed, and continued use of high-volume path may be wasteful at that point. Aggressive protocols for the wide area may have negative impacts on the last mile (site or site’s “uplink”) network.

Lambda Station Function –Schedule use of one or more reservable network paths –Arrange for traffic to be forwarded onto such paths

Interfaces to other systems To application (or to manual request system) To authentication/authorization infrastructure To site’s internal network (dynamic reconfiguration of packet forwarding rules) –Operate at any granularity, down to single flows Site’s border/connection point to reservable path Peer site’s Lambda Station Talk to advanced WANs, through network operator- defined setup protocol, as needed* Monitoring, accounting, status reporting

Block Diagram

Client application interface Application describes the traffic which is to be routed over an alternative path. –Traffic selectors: 6-tuples [ IP version, {src cidr(s)}, {dst cidr(s)}, protocol, {src port(s)}, {dst ports(s)} ] –Transfer rate, total volume, duration, direction –Earliest desired start LS and host agree on packet-selection method - we lean toward DSCP. LS informs application of actual BW allocated and setup status. Host or LS should inform the other of early termination, if it occurs.

Site network interface Configure local site’s internal routing to divert traffic to the alternate path. Graceful teardown – resume normal internal routing before WAN path is torn down. Different version of this module will deal with different varieties of site network. –Each site might plug in its own scripts.

Site-edge router interface Graceful setup – Enable the reserved WAN path before internal routing directs traffic onto it. ACL may be in effect on this device to prevent unauthorized use. ACL very likely to be in effect with respect to incoming traffic from the WAN. –At some sites, this is a path which bypasses firewalls!

LS-to-LS protocol Exchange traffic selectors Coordinate setup & teardown Verify path continuity –Implies that LS can communicate simultaneously over reserved and commodity network paths. Inform of early traffic termination

Interface to advanced WANs Multiple flavors of high-performance WANs are anticipated. –Some WANs may require forwarding state to be created before use. –Some may have their own reservation system, which end systems need not learn to use if it reserves through Lambda Station instead. Lambda Station’s WAN module will parameterize and adapt to each sort of WAN, providing an abstract view. –DOE UltraScience Net, ESNET, LHCNet, UltraLight.

Requirements for Production Robustness –LS must enable production systems to make trial use of advanced networks, and cleanly restore default forwarding behavior upon completion or path failure. Monitoring –Lambda Station must present its own state and history. –Currently it serves this info through its web server. –Investigating MonaLisa (OSG component). Accounting –In many environments, different sub-organizations share the network resource. LS must gather usage information to support accounting.

Provide sample integration With Storage Systems that are components of the USCMS software and computing project. Currently are: –Managed storage elements. SRM / GridFTP protocols. Now implementing LS client calls in SRM/dCache.

Current status Release 1.0 – today. –A stable, usable snapshot of a work in progress. –Based on Perl with SOAP::Lite –Dynamically reconfigures site routers to send traffic over alternate paths –End system applied DSCP tags to special- treatment flows. –Traffic path varied cleanly – unnoticed by application; hiccups in throughput at each change.

Path switching effects

Deployment Scenarios Client capabilities: identifying high-impact traffic Specify src & dst address groups, but no more. 2. Specify src and/or dst ports as well as addresses. 3. Apply DSCP label selected by client 4. Apply DSCP label as directed by Lambda Station. Client capabilities: Lambda Station integration level Lambda Station called manually via web interface 2. SOAP call by wrapper around client application 3. SOAP calls from within the client application Site network capabilities Static router config w/ fixed PBR based on DSCP 2. Router ACLs activated and inactivated by LS 3. Lambda Station constructs and applies ACLs for PBR

Directions Next version being built on Apache Axis –probably will use jClarens –WSDL is sure to evolve IPv6 support is mere placeholder as yet Adding support for Force10 site routers Looking forward to speaking to your lightpath setup directly!

Summary Lambda Station’s role in data-intensive science is to dynamically connect production end-systems to advanced high-performance wide-area networks. –Bring the systems to the network –Bring the network to the systems Prototyping has shown the feasibility of using dynamically selected network paths for traffic between production site networks.

Intriguing sidelights Make asymmetric reservations when possible? (What’s an appropriate bandwidth reservation for an ACK stream?) Does LS set up bandwidth policing? –Would apply when multiple concurrent reservations are active. Is the access authorization a site-local decision? Is there a VO role in this? Does LS have knowledge of QoS facilities available in the site network or the WAN?