Emulab: Recent Work, Ongoing Work Jay Lepreau University of Utah www.emulab.net DETER Community Meeting January 31, 2006.

Slides:



Advertisements
Similar presentations
Data Mining Challenges for Network Management Nick Feamster, Georgia Tech Dave Andersen, CMU (joint with Jay Lepreau and Emulab)
Advertisements

Overview of local security issues in Campus Grid environments Bruce Beckles University of Cambridge Computing Service.
1 Computational Asset Description for Cyber Experiment Support using OWL Telcordia Contact: Marian Nodine Telcordia Technologies Applied Research
SSRS 2008 Architecture Improvements Scale-out SSRS 2008 Report Engine Scalability Improvements.
SOFTWARE MAINTENANCE 24 March 2013 William W. McMillan.
4.1.5 System Management Background What is in System Management Resource control and scheduling Booting, reconfiguration, defining limits for resource.
Adding scalability to legacy PHP web applications Overview Mario A. Valdez-Ramirez.
Transparent Checkpoint of Closed Distributed Systems in Emulab Anton Burtsev, Prashanth Radhakrishnan, Mike Hibler, and Jay Lepreau University of Utah,
1 Web Server Performance in a WAN Environment Vincent W. Freeh Computer Science North Carolina State Vsevolod V. Panteleenko Computer Science & Engineering.
Lecture 8: Testbeds Anish Arora CIS788.11J Introduction to Wireless Sensor Networks Material uses slides from Larry Peterson, Jay Lapreau, and GENI.net.
1 Failure Recovery for Priority Progress Multicast Jung-Rung Han Supervisor: Charles Krasic.
Distributed Processing, Client/Server, and Clusters
Toward Replayable Research in Networking and Systems Eric Eide University of Utah, School of Computing May 25, 2010.
1 Modeling and Emulation of Internet Paths Pramod Sanaga, Jonathon Duerig, Robert Ricci, Jay Lepreau University of Utah.
Emulab.net: An Emulation Testbed for Networks and Distributed Systems Jay Lepreau and many others University of Utah Intel IXA University Workshop June.
1 Federation of Emulabs and Relevant New Development Jay Lepreau with Rob Ricci, Mike Hibler, Leigh Stoller University of Utah USC/ISI Federation Workshop.
1 GENI: Global Environment for Network Innovations Jennifer Rexford Princeton University
1 Cluster or Network? An Emulation Facility for Research Jay Lepreau Chris Alfeld David Andersen (MIT) Mac Newbold Rob Place Kristin Wright Dept. of Computer.
1 GENI: Global Environment for Network Innovations Jennifer Rexford On behalf of Allison Mankin (NSF)
1 Flexlab: A Realistic, Controlled, and Friendly Environment for Evaluating Networked Systems Jonathon Duerig, Robert Ricci, Junxing Zhang, Daniel Gebhardt,
An Experimentation Workbench for Replayable Networking Research Eric Eide, Leigh Stoller, and Jay Lepreau University of Utah, School of Computing NSDI.
Integrated Scientific Workflow Management for the Emulab Network Testbed Eric Eide, Leigh Stoller, Tim Stack, Juliana Freire, and Jay Lepreau and Jay Lepreau.
© , Michael Aivazis DANSE Software Issues Michael Aivazis California Institute of Technology DANSE Software Workshop September 3-8, 2003.
Student Projects in Computer Networking: Simulation versus Coding Leann M. Christianson Kevin A. Brown Cal State East Bay.
1 A Large-Scale Network Testbed Jay Lepreau Chris Alfeld David Andersen Kevin Van Maren University of Utah September.
Introspective Replica Management Yan Chen, Hakim Weatherspoon, and Dennis Geels Our project developed and evaluated a replica management algorithm suitable.
How To Use It...  Submit ns script via web form  Relax while emulab …  Generates config from script & stores in DB  Maps specified virtual topology.
Emulab Federation Preliminary Design Robert Ricci with Jay Lepreau, Leigh Stoller, Mike Hibler University of Utah USC/ISI Federation Workshop December.
Loupe /loop/ noun a magnifying glass used by jewelers to reveal flaws in gems. a logging and error management tool used by.NET teams to reveal flaws in.
Bandwidth DoS Attacks and Defenses Robert Morris Frans Kaashoek, Hari Balakrishnan, Students MIT LCS.
Network Topologies.
Windows Server MIS 424 Professor Sandvig. Overview Role of servers Performance Requirements Server Hardware Software Windows Server IIS.
2006 NSF CRI-PI Meeting1 ns-3 Project Plan Tom Henderson and Sumit Roy, University of Washington Sally Floyd, ICSI Center for Internet Research George.
1 The SpaceWire Internet Tunnel and the Advantages It Provides For Spacecraft Integration Stuart Mills, Steve Parkes Space Technology Centre University.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
Dynamic Network Emulation Security Analysis for Application Layer Protocols.
Research & Development Roadmap 1. Outline A New Communication Framework Giving Bro Control over the Network Security Monitoring for Industrial Control.
PrimoGENI Tutorial Miguel Erazo, Neil Goldman, Nathanael Van Vorst, and Jason Liu Florida International University Other project participants: Julio Ibarra.
Introduction and Overview Questions answered in this lecture: What is an operating system? How have operating systems evolved? Why study operating systems?
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
DEP351 Windows ® Rights Management (Part 2): Enterprise Readiness & Deployment Marco DeMello Group Program Manager Windows Trusted Platforms & Infrastructure.
Overcast: Reliable Multicasting with an Overlay Network CS294 Paul Burstein 9/15/2003.
1 SEG4912 University of Ottawa by Jason Kealey Software Engineering Capstone Project Tools and Technologies.
Distributed Software Engineering Lecture 1 Introduction Sam Malek SWE 622, Fall 2012 George Mason University.
Virtual Private Ad Hoc Networking Jeroen Hoebeke, Gerry Holderbeke, Ingrid Moerman, Bard Dhoedt and Piet Demeester 2006 July 15, 2009.
Access Control for Federation of Emulab-based Network Testbeds Ted Faber, John Wroclawski 28 July 2008
Maintaining and Updating Windows Server Monitoring Windows Server It is important to monitor your Server system to make sure it is running smoothly.
Copyright © cs-tutorial.com. Overview Introduction Architecture Implementation Evaluation.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
1 Testbeds Breakout Tom Anderson Jeff Chase Doug Comer Brett Fleisch Frans Kaashoek Jay Lepreau Hank Levy Larry Peterson Mothy Roscoe Mehul Shah Ion Stoica.
Microsoft Management Seminar Series SMS 2003 Change Management.
Large-scale Virtualization in the Emulab Network Testbed Mike Hibler, Robert Ricci, Leigh Stoller Jonathon Duerig Shashi Guruprasad, Tim Stack, Kirk Webb,
GCRC Meeting 2004 BIRN Coordinating Center Software Development Vicky Rowley.
Presenting By CH . MADHURI(12QU1D5806) Under the supervision of
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 Based upon slides from Jay Lepreau, Utah Emulab Introduction Shiv Kalyanaraman
1 Wide Area Network Emulation on the Millennium Bhaskaran Raman Yan Chen Weidong Cui Randy Katz {bhaskar, yanchen, wdc, Millennium.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Marcelo R.N. Mendes. What is FINCoS? A set of tools for data generation, load submission, and performance measurement of CEP systems; Main Characteristics:
IPS Infrastructure Technological Overview of Work Done.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Towards an Information System Product Team.
Central Management of 300 Firewalls and Access-Lists Fabian Mauchle TNC 2012 Reykjavík, 21-May-2012.
SDN challenges Deployment challenges
University of Maryland College Park
Alternatives to Mobile Agents
CHAPTER 3 Architectures for Distributed Systems
Unit 27: Network Operating Systems
Software Defined Networking (SDN)
Pong: Diagnosing Spatio-Temporal Internet Congestion Properties
EE 122: Lecture 22 (Overlay Networks)
Presentation transcript:

Emulab: Recent Work, Ongoing Work Jay Lepreau University of Utah DETER Community Meeting January 31, 2006

2 Theme Evolve Emulab to be the network-device- independent control and integration center for experimentation, research, development, debugging, measurement, data management, and archiving. –Collaboratory: leverage Emulab’s project abstraction –Workbench: leverage-- and massively extend-- Emulab’s experiment abstraction –Device-independent: leverage and extend Emulab’s builtin abstractions for all things network-related

3 Outline Collaboratory (New Work I) Major Current Initiatives 1.Workbench and Datapository 2.Time travel and stateful swapout 3.PElab : PlanetLab + Emulab New Work II

4 Collaboratory Motivations, Genesis, … –“Sourceforge plus Emulab would be the perfect development environment.” –An Emulab “project” is the perfect scope for membership, access, and naming. Leverage it. Approach –Use standard, familiar systems –Under the covers, transparently do authentication, authorization and membership mgmt: “single signon” –Use separate server for information and resource security and management –Support flexible access policies: default is project-private, but project leader can change, per-subsytem Private, public read-only, public read/write

5 Collaboratory Subsystems “My Wikis” Mailing list(s) Bug database Source repository –CVS, Subversion Chat/IM, chatroom management More probably coming…. Tie in with Moodle? Enormous potential here…

6 Collaboratory Experience “Just works” is enormously handy Useful simply for collaboration! Auth/auth mechanism useful for access to other federated resources, eg. Datapository Should and will convert to a better & more popular Wiki system, probably MediaWiki. But, substantial work…

7 1. Experimentation Workbench Convergence of opportunity and demand Four types: –Workflow management (processes), including Measurement and feedback steps mandatory pipelines. Eg, –Enforce trace data anonymization based on user privileges –Just-in-time decryption of malware –Experiment management –Data management –Analyses “Scientific Workflow” … with many differences

8 A Different Domain, A Different Approach Our domain, our expertise. –“A systems viewpoint” Existing “experiment” model: pervasive Implicit vs. explicit specification History-based views Incremental adoption Pragmatic approach

9 Micro demo Short p aper in submission:

10 Related: “Datapository” for network-oriented measurement data Collaborative CMU (Dave Andersen) and Georgia Tech (Nick Feamster) effort to create an (Inter)net measurement “data repository” Currently running at datapository.net Federated with Emulab Temporarily using 16 TB file server at Utah Proposal under review Short paper in submission:

11 2. “Time Travel” and Stateful Swapout Time-travel of distributed systems for debugging –Generalize disk image format and handling (done) –Periodic disk checkpointing (prototyped, MS thesis) –Full state-save on swapout (prototyped) –Xen-based virtual machines (in progress) –Challenge: network state (packets in flight) Ignore Consistent checkpointing Pragmatic middle ground: quiesce senders, flush buffers Stateful swapout/swapin [easier] –Allows transparent pre-emption experiment Related to workbench: history, tree traversal –Can share some mechanisms, some UI

12 3. “Pelab” Motivation: –PlanetLab (sort of) sees the “real Internet” But its hosts are hugely overloaded, unpredictable Internet and host variabiity ==> Takes many many runs to get statistical significance, and … ==> Hard to debug –Emulab provides predictable, dedicated host resources and a controlled, repeatable environment in every way But its network model is completely fake

13 Approach Goal: get the best of both worlds –Actually, better than the best of each world today Extreme formulation: Application runs on Emulab with its NICs on PlanetLab hosts

14 Possible Approaches Internet- and Model-oriented 1.Measure the Internet over a long time 2.Develop a model 3.Make a super-Dummynet Drawbacks: 1 and 2 are very hard. “Rare events” are difficult to model and measure Delta to above: –Send real time Internetconditions into Emulab “Modeling and Emulating the Internet”

15 Possible Approach #2: View the Internet “through the PlanetLab lens” PlanetLab- and Model-oriented –Measure PlanetLab paths over a long time Much more tractable than the whole Internet –Develop a model –Develop a super-Dummynet Additions to above: –Mirror real-time PlanetLab conditions onto Emulab –Use “stub” on Plab, peered with each Emulab node, sending that node’s traffic into Plab. Needed if app’s traffic evokes a reactive response from the Internet “Projecting PlanetLab into Emulab”: Net -> Net’ Drawbacks: Still hard in many ways, other…

16 Approach #3: Use the application traffic itself as the measurement traffic PlanetLab- and application- and realtime- oriented –Chosen Plab nodes peered with Emulab nodes –App starts up on Emulab –App-traffic gen and measurement stubs start up on Plab (TCP tracing) –Send real time network conditions to Emulab –Develop a super-Dummynet (done; useful separately) –Develop and continuously run adaptive Plab path-condition monitor Pour results into Datapository Use for initial conditions or when app goes idle on certain pairs App -> App’

17 Pelab design

18 New Work (II)

19 1. Exploring a New “Assign” Exploring the “Comet” domain-specific language for combinatorial optimization using constraint-based local search From Brown Univ (van Hentenryck, Michel); there’s a book. Goals: –Easier to understand and extend, especially by non-experts –More flexible –Should enable easy comparison of completely different optimization techniques (simulated annealing, other). Probably primarily of research interest. Have basic prototype implemented My instinct says it will be time-consuming and hard to match assign’s current level of performance and robustness

20 2. Security-related Improvements Secure “Experiment tear down” improved –Cleaned up, fixed some vulnerabilities –Added the MFS bootblock zapper program –enabled it for all firewalls Identified some holes in the control-net firewall rules and will be re-doing Switched to ssh2 keys Zeroing disks: support added to DB, not hooked in to UI Writing up a tech report on Emulab’s security-related design and implementation

21 3. Automatic Online Validation Emulab is: an ongoing research and dev project, it’s big, and it’s complex –Bugs are likely –Bugs arise from subtle interactions: we’ve found that separate regression tests are insufficient … a public scientific facility: Stakes are high Approach: –Validate network config of every experiment –Make it so quick that this approach is acceptable

22 Online Validation (cont.) Uses an entirely separate code path from Emulab configure path –No DB, no XML, no perl scripts, no nothing… A new state in experiment life cycle: –Invoked transparently as part of expt swapin, after all nodes up, but before “time 0”.

23 Automatic Online Validation (cont.) A validation program, linktest, runs after each swapin, modify, or upon user request –Validates the network configuration using end-to-end tests Linktest validates the following: –Duplex, simplex, and LAN links –Symmetric and asymmetric traffic shaping Latency, loss, bandwidth –Static routing –Running invisibly in beta test, ~2 months

24 4. Major Cluster Expansion 160 high-end nodes, 3.0 GHz, 2GB, 6 Gbit NICs, 2 x 146G disks 2 new switches; 1 is very high bandwidth 360 total; back of envelope potential: 10,000 – 20,000 vnodes But: had significant bringup and scaling challenges Enormous boss/ops stability problems when they were moved to the new hardware. OS tweaks/fixes required.

25 New Work (cont’d) “Optimized” (realistic) IP assignment for net topologies –Jon Duerig, Rob Ricci, John Byers (BU), Jay Lepreau –TR: –Automatically used for large topologies Link monitoring and tracing –Integrated, transparent-- like Dummynet nodes –“monitor nodes” run tcpdump with flexible spec. “loghole” to reliably, scalably collect and manage log data UI improvements –Searchable “Knowledge Base” –AJAX-ification improved several Web pages –New Java applet interface for wireless and mobile

26 More good stuff New internal error logging and analysis framework –Reduce operator and user load of error/warning mail –Provide more clear and specific diagnoses –“Root cause” analysis –Prototype in beta Frisbee –Runs as a proxy, support for “delta” images Assign –Heterogeneous links, “fixing” links to interfaces, XML support Emulab in Emulab –WinXP support, allow adding nodes, separate FS machine

27 Wow, there’s more!? Images –New framework for automated testing of images –New: Fedora Core 4, FreeBSD 6 –“Generic” Windows image in progress Good: not tied to hardware Not so good: takes longer to boot while it self-configures Installation –Better automation of initial proj/group setup –Prototype docs for Emulab in Emulab Robots and Motes –Lots and lots of stuff Updated and improved and working PlanetLab interface Fixes, fixes, fixes…

28 Conclusion Moving Emulab to be the control & integration center for all network-related activities Three major projects –Workbench, Time-travel, P/Elab Many medium projects Many small projects and maintenance …. and we support a huge load, 24/7