Artemis Logs Database View Data Collectio n GUI Dryad Overview Data collection Distributed system Plug-ins GUI Plug-ins Hunting for Bugs with Artemis System.

Slides:



Advertisements
Similar presentations
Introduction to Java 2 Programming Lecture 10 API Review; Where Next.
Advertisements

Distributed Data-Parallel Programming using Dryad Andrew Birrell, Mihai Budiu, Dennis Fetterly, Michael Isard, Yuan Yu Microsoft Research Silicon Valley.
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Cluster Computing with Dryad Mihai Budiu, MSR-SVC LiveLabs, March 2008.
Test test Please press the F5 key to begin. (Then, press the Page Up or Page Down keys to move through the following 3 slides.)
MAP REDUCE PROGRAMMING Dr G Sudha Sadasivam. Map - reduce sort/merge based distributed processing Best for batch- oriented processing Sort/merge is primitive.
Machine Learning in DryadLINQ Kannan Achan Mihai Budiu MSR-SVC, 1/30/
JTX Overview Overview of Job Tracking for ArcGIS (JTX)
LIBRA: Lightweight Data Skew Mitigation in MapReduce
UC Berkeley Spark Cluster Computing with Working Sets Matei Zaharia, Mosharaf Chowdhury, Michael Franklin, Scott Shenker, Ion Stoica.
Big Data Platforms Mihai Budiu, Oct My work Ph.D. from Carnegie Mellon, 2003 Hardware synthesis Reconfigurable hardware Compilers and computer.
Bookshelf.EXE - BX A dynamic version of Bookshelf –Automatic submission of algorithm implementations, data and benchmarks into database Distributed computing.
DryadLINQ A System for General-Purpose Distributed Data-Parallel Computing Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Budiu, Úlfar Erlingsson, Pradeep.
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
Nectar: Efficient Management of Computation and Data in Data Centers Lenin Ravindranath Pradeep Kumar Gunda, Chandu Thekkath, Yuan Yu, Li Zhuang.
Optimus: A Dynamic Rewriting Framework for Data-Parallel Execution Plans Qifa Ke, Michael Isard, Yuan Yu Microsoft Research Silicon Valley EuroSys 2013.
Exploring Microsoft Excel 2002 Chapter 7 Chapter 7 List and Data Management: Converting Data to Information By Robert T. Grauer Maryann Barber Exploring.
Monitoring and Debugging Dryad(LINQ) Applications with Daphne Vilas Jagannath, Zuoning Yin, Mihai Budiu University of Illinois, Microsoft Research SVC.
Distributed Computations
Moodle web portal with Moodle Using Moodle as a Student Portal with Gerrard Shaw Redbridge Institute of Adult Education Screenshots Resources.
Fall 2007cs4251 Distributed Computing Umar Kalim Dept. of Communication Systems Engineering 31/10/2007.
CompuNet Grid Computing Milena Natanov Keren Kotlovsky Project Supervisor: Zvika Berkovich Lab Chief Engineer: Dr. Ilana David Spring, /
Usage of the Python Programming Language in the CMS Experiment Rick Wilkinson (Caltech), Benedikt Hegner (CERN) On behalf of CMS Offline & Computing 1.
U of R eXtensible Catalog Team MetaCat. Problem Domain.
Software Architecture Patterns (2). what is architecture? (recap) o an overall blueprint/model describing the structures and properties of a "system"
Distributed Computations MapReduce
Lecture 2 – MapReduce CPE 458 – Parallel Programming, Spring 2009 Except as otherwise noted, the content of this presentation is licensed under the Creative.
State of Connecticut Core-CT Project Query 4 hrs Updated 1/21/2011.
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
Linux Operations and Administration
1 DAN FARRAR SQL ANYWHERE ENGINEERING JUNE 7, 2010 SCHEMA-DRIVEN EXPERIMENT MANAGEMENT DECLARATIVE TESTING WITH “DEXTERITY”
Image Processing Image Processing Windows HPC Server 2008 HPC Job Scheduler Dryad DryadLINQ Machine Learning Graph Analysis Graph Analysis Data Mining.NET.
Programming clusters with DryadLINQ Mihai Budiu Microsoft Research, Silicon Valley Association of C and C++ Users (ACCU) Mountain View, CA, April 13, 2011.
Cloud Computing Other High-level parallel processing languages Keke Chen.
Understanding and Predicting Graded Search Satisfaction Tang Yuk Yu 1.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Training Kinect Mihai Budiu Microsoft Research, Silicon Valley UCSD CNS 2012 RESEARCH REVIEW February 8, 2012.
1 Dryad Distributed Data-Parallel Programs from Sequential Building Blocks Michael Isard, Mihai Budiu, Yuan Yu, Andrew Birrell, Dennis Fetterly of Microsoft.
The WinMine Toolkit Max Chickering. Build Statistical Models From Data Dependency Networks Bayesian Networks Local Distributions –Trees Multinomial /
Input Output Excel Internet/ Etc.Grab Bag
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
MapReduce M/R slides adapted from those of Jeff Dean’s.
A Novel Local Patch Framework for Fixing Supervised Learning Models Yilei Wang 1, Bingzheng Wei 2, Jun Yan 2, Yang Hu 2, Zhi-Hong Deng 1, Zheng Chen 2.
MS Access: Introduction 1Database Design. MS Access: Overview MS Access A Database Management System (DBMS) designed to create applications that organize,
Nova Media Center Anthony Romano Christopher Nolan Mr. Edward Char Anthony Romano Christopher Nolan Mr. Edward Char.
Experiment Management System CSE 423 Aaron Kloc Jordan Harstad Robert Sorensen Robert Trevino Nicolas Tjioe Status Report Presentation Industry Mentor:
INFSO-RI Enabling Grids for E-sciencE Graphical User Interface. for Charon Extension Layer System. and Application Dashboards Jan.
© 2006, National Research Council Canada © 2006, IBM Corporation Solving performance issues in OTS-based systems Erik Putrycz Software Engineering Group.
David Adams ATLAS DIAL: Distributed Interactive Analysis of Large datasets David Adams BNL August 5, 2002 BNL OMEGA talk.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 – The Ganga Evolution Andrew Maier.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Large-scale Machine Learning using DryadLINQ Mihai Budiu Microsoft Research, Silicon Valley Ambient Intelligence: From Sensor Networks to Smart Environments.
1 Visualizer- Manager’s View. 2  Graphical presentation and analysis of Firewall data  Graphical presentation and analysis of Audit data  Immediate.
Navigation Framework using CF Architecture for a Client-Server Application using the open standards of the Web presented by Kedar Desai Differential Technologies,
Active-HDL Server Farm Course 11. All materials updated on: September 30, 2004 Outline 1.Introduction 2.Advantages 3.Requirements 4.Installation 5.Architecture.
SmartCenter for Pointsec - MI
Lesson 5-Exploring Utilities
Some slides adapted from those of Yuan Yu and Michael Isard
Distributed Programming in “Big Data” Systems Pramod Bhatotia wp
MS Access Introduction Database Design.
CSCI5570 Large Scale Data Processing Systems
MVC and other n-tier Architectures
Software Design and Architecture
Deploying and Configuring SSIS Packages
Introduction to Informer
DryadInc: Reusing work in large-scale computations
By Rajanikanth B Eclipse IDE Overview By Rajanikanth B
Map Reduce, Types, Formats and Features
Presentation transcript:

Artemis Logs Database View Data Collectio n GUI Dryad Overview Data collection Distributed system Plug-ins GUI Plug-ins Hunting for Bugs with Artemis System Architecture Conclusio ns

pptPlex Section Divider Hunting for Bugs with Artemis The slides after this divider will be grouped into a section and given the label you type above. Feel free to move this slide to any position in the deck.

Hunting for Bugs with Artemis Gabriela F. Creţu-Ciocârlie Mihai Budiu Moises Goldszmidt Microsoft Research, Silicon Valley WASL 2008 This presentation is built and should be viewed with pptPlex:

Artemis Goal One-stop shop for performance analysis of distributed systems

Principles 1) Modular: Separate generic from application specific parts 2) Extensible: add new analyses via plug-ins 3) Interactive: human expert part of the analysis loop

pptPlex Section Divider System Architecture The slides after this divider will be grouped into a section and given the label you type above. Feel free to move this slide to any position in the deck.

Logs Database View Data collection Distributed system Plug-ins GUI Distributed Local

Logs Database View Data collection Distributed system Plug-ins GUI Application- Specific Generic

pptPlex Section Divider Dryad Overview The slides after this divider will be grouped into a section and given the label you type above. Feel free to move this slide to any position in the deck.

grep sed sort awk perl grep sed sort awk Input files Vertices Output files ChannelsStage Dryad Application Structure

Dryad System Architecture data plane job schedule control plane Serv V VV Job managercluster

pptPlex Section Divider Data Collection The slides after this divider will be grouped into a section and given the label you type above. Feel free to move this slide to any position in the deck.

TextBinaryXMLPerfmon TextBinaryXMLPerfmon Data Persisted data Copy Parse Filter Aggregate DryadLINQ application 10GB-1TB 100MB-1GB TextBinaryXMLPerfmon

pptPlex Section Divider GUI The slides after this divider will be grouped into a section and given the label you type above. Feel free to move this slide to any position in the deck.

pptPlex Section Divider Plug-ins The slides after this divider will be grouped into a section and given the label you type above. Feel free to move this slide to any position in the deck.

Machine Utilization Plug-in

Complex statistics: HiLighter plug-in 22 Metrics Binary search over logistic regression with L1 regularization Key Performance Indicator Correlated metrics

Interactive Analysis Feature Computation Visualization Hilighter KPI Selection

pptPlex Section Divider Conclusions The slides after this divider will be grouped into a section and given the label you type above. Feel free to move this slide to any position in the deck.

Raw data Summarization Feature extraction Statistical analyses Automatic diagnosis Distributed system Artemis today Goal Conclusions