Presentation on theme: "Infrastructure to Application Information Exposure and Communications (i2aex) -- DC Case IETF i2aex BoF Mar. 26, 2012."— Presentation transcript:
1Infrastructure to Application Information Exposure and Communications (i2aex) -- DC Case IETF i2aex BoFMar. 26, 2012
2Based on Input from Many People Kamil Bajda-PawlikowskiFlorin BalusNabil BitarHarry LiuHui-Lan LuRamki GummadiVijay GurbaniEnrico MaroccoDavid McDysanTom NadeauPing PanMircea PisicaSabine RandriamasyAlexander TianAndreas VoellmyYe WangHenderickx WimRichard Yang (coordinator)
3ScopeLimited to applications with significant components that are (or could be) deployed in data centers (DC)Limited to infrastructure -> application info flowcould be query/response, but info bits are from infra -> appfocus on information thatapplications require (or benefit in a significant way)cannot be made available easily or through existing mechanisms in a practical wayNot limited to information that infrastructure already hasassume that if there is a strong need, infrastructure can collectUse Case/actual projects driven
5Entities Example Data Center 1 Data Center 2 VM VM Data Center 1 AppDataCenter 1Infrastructure AData Center 2MBoxVMVMLUNL1/2/3 VPNSVMLUNInternetInfrastructure BVMLUNDataCenter 1Data Center 2Consumers
6Why Infrastructure Info Exposure Discovery: App/other infrastructure could monitor its current inventory, but does not know the invisible (resources/policies)/could-be-availableAggregation/service: The infrastructure is already monitoring, reduce App complexity and provide (monitoring) information as a serviceCoordination/Joint Optimization (JO): Observe across Apps, signaling for joint optimization
7Challenges of Infrastructure Info Expo ConsistencyThe infrastructure info could be highly dynamic.Security and privacyThe infrastructure may not want to reveal some info, in particular, if across different administrative domains.InterdomainInformation may come from multiple domains.TransparencyExposed info may remove infrastructure flexibility (e.g., VM migration); note that invisible actions from infrastructure may violate app constraints/expectation or lead to the need of notification.HeterogeneityDiverse infrastructure technologies and construction.In addition to other considerations such as scalability
8Use Case: Network Rack/Location Awareness Example project: Hadoop/MapReduceSetting and goal: app uses topology awareness forblock placement: multiple copies of same block at different racks for (1) reliability, (2) flexibility in task schedulingtask placement: place a task close to block, and/or close to communicating tasksCurrent I2A API: A RackID resolver API to map from node IP/DNS name to a rack IDe.g., > /dc1/rack2Info type: App entity DC location discoveryRelationship w/ ALTO:ALTO can implement the API using network map, and cost map can be more general than the tree distance assumptionClassification: discovery of otherwise invisible L2/reliability
9Use Case: Hybrid Cloud Bandwidth On Demand Example: Hybrid cloudSetting and goal:Discover topology/bandwidth/latency between two infrastructures (e.g., a private cloud and a public (virtual private) cloud)Potential I2A: (WAN) topology/bandwidth/latency between/among infrastructures’ boundariesInfo type: Infrastructure interconnect capacity discovery/serviceRelationship w/ ALTO: potential extension to handle changes in interconnection state.
10Use Case: DC Hosted Virtual Desktop Example project: ATIS Cloud Service Forum (CSF) for hosted virtual desktop services for enterprisesSetting and goal:A virtual desktop (VD) is mapped to a VM in a DCThe VM should be close to the end userFederation of VD providers to choose close-by VDPotential I2A: QoS between end user and candidate VDInfo type: Cross-domain resource/location discoveryRelationship w/ ALTO: ALTO appears to provide the basic abstractions; Inter-server communication (Cross-Domain Coordination) can make the topology and cost map available across domains.
11Use Case: Network QoS Awareness Example project: QoSaaS in the context of Microsoft LyncSetting and goal: provide QoS metrics (e.g., delay, loss) between end hosts and media servers deployed at data centers, fordiagnosis,user QoS expectation (indication of QoS bars), andapp adaptation (e.g., choosing the right media gateway)Current I2A info: QoS prediction between entitiesInfo type: Aggregation/serviceRelationship w/ ALTO: ALTO appears to provide the basic abstractions; can it handle the dynamic info required? will a sub/pub framework better for such a service?Classification: service
12Use Case: Inter-DC Bulk Transfer Example project: NetStitcherSetting and goal:many large organizations run backup/replication among multiple sites (DCs), e.g., Google inter-DC copy serviceapp: leveraging delay elasticity of such apps to rescue non-peak bwPotential I2A: leftover bw prediction at different locations, timeInfo type: Coordination/Joint OptimizationRelationship w/ ALTO: ALTO cost map may carry left over bw, but it does not have the time dimension
13Use Case: Cloud Resource Monitoring Example project: Amazon CloudWatchSetting and goal: monitoring predefined/user defined metrics on infrastructure resources, allows alert, connection to infrastructure-provided auto-scaling actionCurrent I2A: retrieve/report metrics/simple statistics; specify some actions on metricsInfo type: Aggregation/serviceRelationship w/ ALTO: Do we want to substantially expand the current schema? Add sub/pub/triggering?