Experiments in computer science Emmanuel Jeannot INRIA – LORIA Aleae Kick-off meeting April 1st 2009.

Slides:



Advertisements
Similar presentations
A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Advertisements

7 april SP3.1: High-Performance Distributed Computing The KOALA grid scheduler and the Ibis Java-centric grid middleware Dick Epema Catalin Dumitrescu,
Cache Coherence “Can we do a better job of supporting cache coherence?” Ross Daly Chan Kim.
Christian Delbe1 Christian Delbé OASIS Team INRIA -- CNRS - I3S -- Univ. of Nice Sophia-Antipolis November Automatic Fault Tolerance in ProActive.
Combining Statistical and Symbolic Simulation Mark Oskin Fred Chong and Matthew Farrens Dept. of Computer Science University of California at Davis.
Virtualization and Cloud Computing. Definition Virtualization is the ability to run multiple operating systems on a single physical system and share the.
ALEAE : Handling Uncertainties in Large-Scale Distributed Systems Emmanuel Jeannot LORIA - INRIA - CNRS ALEAE Kick-off April 1st 2009.
On Large-Scale Peer-to-Peer Streaming Systems with Network Coding Chen Feng, Baochun Li Dept. of Electrical and Computer Engineering University of Toronto.
A system Performance Model Instructor: Dr. Yanqing Zhang Presented by: Rajapaksage Jayampthi S.
March 18, 2008SSE Meeting 1 Mary Hall Dept. of Computer Science and Information Sciences Institute Multicore Chips and Parallel Programming.
Automated Analysis and Code Generation for Domain-Specific Models George Edwards Center for Systems and Software Engineering University of Southern California.
Experimental Evaluation in Computer Science: A Quantitative Study Paul Lukowicz, Ernst A. Heinz, Lutz Prechelt and Walter F. Tichy Journal of Systems and.
How to do high quality research on middleware? Frank Eliassen professor University of Oslo and Simula Research Laboratory Oslo, Norway.
The Education of a Software Engineer Mehdi Jazayeri Presented by Matthias Hauswirth.
Paper Title Your Name CMSC 838 Presentation. CMSC 838T – Presentation Motivation u Problem paper is trying to solve  Characteristics of problem  … u.
1 A Performance Study of Grid Workflow Engines Alexandru Iosup and Dick Epema PDS Group Delft University of Technology The Netherlands Corina Stratan Parallel.
Chapter 6: Database Evolution Title: AutoAdmin “What-if” Index Analysis Utility Authors: Surajit Chaudhuri, Vivek Narasayya ACM SIGMOD 1998.
Monitoring OSGi platforms with ProActive Virginie Legrand OASIS Team - INRIA Sophia Antipolis.
Statistical Methods in Computer Science Why? Ido Dagan.
Strategic Directions in Real- Time & Embedded Systems Aatash Patel 18 th September, 2001.
Experimental Evaluation in Computer Science: A Quantitative Study Paul Lukowicz, Ernst A. Heinz, Lutz Prechelt and Walter F. Tichy Journal of Systems and.
July 13, “How are Real Grids Used?” The Analysis of Four Grid Traces and Its Implications IEEE Grid 2006 Alexandru Iosup, Catalin Dumitrescu, and.
Euro-Par 2008, Las Palmas, 27 August DGSim : Comparing Grid Resource Management Architectures Through Trace-Based Simulation Alexandru Iosup, Ozan.
© 2005, it - instituto de telecomunicações. Todos os direitos reservados. System Level Resource Discovery and Management for Multi Core Environment Javad.
Scientific Computing Department Faculty of Computer and Information Sciences Ain Shams University Supervised By: Mohammad F. Tolba Mohammad S. Abdel-Wahab.
IE 594 : Research Methodology – Discrete Event Simulation David S. Kim Spring 2009.
Class 6_1 Today’s topic: More on engineering analysis and modeling Your “computer modeling” assignment.
10 -1  The Term Project demands in-depth research and investigated reporting. All reported contents, figures, and tables must be originally generated.
G RID R ESOURCE BROKER FOR SCHEDULING COMPONENT - BASED APPLICATIONS ON DISTRIBUTED RESOURCES Reporter : Yi-Wei Wu.
Fine Grain MPI Earl J. Dodd Humaira Kamal, Alan University of British Columbia 1.
Computer Science Perspective Ludek Matyska Faculty of Informatics, Masaryk University, Brno and also CESNET, Prague.
“Early Estimation of Cache Properties for Multicore Embedded Processors” ISERD ICETM 2015 Bangkok, Thailand May 16, 2015.
CS492: Special Topics on Distributed Algorithms and Systems Fall 2008 Lab 3: Final Term Project.
The MicroGrid: A Scientific Tool for Modeling Grids Andrew A. Chien SAIC Chair Professor Department of Computer Science and Engineering University of California,
Research in Computing สมชาย ประสิทธิ์จูตระกูล. Success Factors in Computing Research Research Computing Knowledge Scientific MethodAnalytical Skill Funding.
Meta Scheduling Sathish Vadhiyar Sources/Credits/Taken from: Papers listed in “References” slide.
1 Advance Computer Architecture CSE 8383 Ranya Alawadhi.
Lecture on Computer Science as a Discipline. 2 Computer “Science” some people argue that computer science is not a science in the same sense that biology.
Large-scale Deployment in P2P Experiments Using the JXTA Distributed Framework Gabriel Antoniu, Luc Bougé, Mathieu Jan & Sébastien Monnet PARIS Research.
Building a Parallel File System Simulator E Molina-Estolano, C Maltzahn, etc. UCSC Lab, UC Santa Cruz. Published in Journal of Physics, 2009.
Model-Driven Analysis Frameworks for Embedded Systems George Edwards USC Center for Systems and Software Engineering
F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal Grid eXplorer (GdX) An Instrument for eXploring the GRID F. Cappello,
Model Checking and Model-Based Design Bruce H. Krogh Carnegie Mellon University.
CISC Machine Learning for Solving Systems Problems John Cavazos Dept of Computer & Information Sciences University of Delaware
Emulation in Data Grid eXplorer. Emulation problematic Distributed applicationTarget environment Simulation Emulation App. Model Env. model Formal analysis.
Southern California Earthquake Center SCEC Technical Activity Groups (TAGs) Self-organized to develop and test critical methodologies for solving specific.
Platform Abstraction Group 3. Question How to deal with different types hardware and software platforms? What detail to expose to the programmer? What.
An Overview of Scientific Workflows: Domains & Applications Laboratoire Lorrain de Recherche en Informatique et ses Applications Presented by Khaled Gaaloul.
Author : Cedric Augonnet, Samuel Thibault, and Raymond Namyst INRIA Bordeaux, LaBRI, University of Bordeaux Workshop on Highly Parallel Processing on a.
Improving System Availability in Distributed Environments Sam Malek with Marija Mikic-Rakic Nels.
+ Support multiple virtual environment for Grid computing Dr. Lizhe Wang.
1 7 th IEEE International Symposium on Cluster Computing and the Grid Welcome to the TCSC Doctoral Symposium.
Pengenalan Ilmu Komputasi. Computational Science??
National Aeronautics and Space Administration Jet Propulsion Laboratory March 17, 2009 Workflow Orchestration: Conducting Science Efficiently on the Grid.
Denis Caromel1 OASIS Team INRIA -- CNRS - I3S -- Univ. of Nice Sophia-Antipolis -- IUF IPDPS 2003 Nice Sophia Antipolis, April Overview: 1. What.
IPDPS 2003, Nice, France Agent-Based Grid Load Balancing Using Performance-Driven Task Scheduling Junwei Cao (C&C Research Labs, NEC Europe Ltd., Germany)
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
Resource Specification Prediction Model Richard Huang joint work with Henri Casanova and Andrew Chien.
COMPUTATIONAL MODELS.
Security of Grid Computing Environments
The Problem Finding a needle in haystack An expert (CPU)
Model-Driven Analysis Frameworks for Embedded Systems
Improved schedulability on the ρVEX polymorphic VLIW processor
สมชาย ประสิทธิ์จูตระกูล
COMP60621 Fundamentals of Parallel and Distributed Systems
Professor Ioana Banicescu CSE 8843
Automated Analysis and Code Generation for Domain-Specific Models
Maria Méndez Real, Vincent Migliore, Vianney Lapotre, Guy Gogniat
COMP60611 Fundamentals of Parallel and Distributed Systems
CS 791Graduate Topics in Computer Science [Software Engineering]
Presentation transcript:

Experiments in computer science Emmanuel Jeannot INRIA – LORIA Aleae Kick-off meeting April 1st 2009

Experimental validationEmmanuel Jeannot 2/26 The discipline of computing: an experimental science Studied objects (hardware, programs, data, protocols, algorithms, network): more and more complex. Modern infrastructures: Processors have very nice features  Cache  Hyperthreading  Multi-core Operating system impacts the performance (process scheduling, socket implementation, etc.) The runtime environment plays a role (MPICH≠OPENMPI) Middleware have an impact (Globus≠GridSolve) Various parallel architectures that can be:  Heterogeneous  Hierarchical  Distributed  Dynamic

Experimental validationEmmanuel Jeannot 3/26 Analytic study Purely analytical (math) models: Demonstration of properties (theorem) Models need to be tractable: over- simplification? Good to understand the basic of the problem Most of the time ones still perform a experiments (at least for comparison) For a practical impact: analytic study not always possible or not sufficient

Experimental validationEmmanuel Jeannot 4/26 Experimental culture not comparable with other science Different studies: In the 90’s: between 40% and 50% of CS ACM papers requiring experimental validation had none (15% in optical engineering) [Lukovicz et al.] “Too many articles have no experimental validation” [Zelkowitz and Wallace 98]: 612 articles published by IEEE. Quantitatively more experiments with times Computer science not at the same level than some other sciences:  Nobody redo experiments (no funding). Lack of tool and methodologies. M.V. Zelkowitz and D.R. Wallace. Experimental models for validating technology. Computer, 31(5):23-31, May 1998.

Experimental validationEmmanuel Jeannot 5/26 Two types of experiments Test and compare: 1.Model validation (comparing models with reality) 2.Quantitative validation (measuring performance) Can occur at the same time. Ex. validation of the implementation of an algorithm: grounding modeling is precise design is correct Idea/need Design Implementation Experimental validation Experimental validation Observation Model Prediction Experimental test Experimental test

Experimental validation A good alternative to analytical validation: Provides a comparison between algorithms or programs Provides a validation of the model or helps to define the validity domain of the model Four Methodologies In-situ (Real scale) Simulation Emulation Benchmarking Emmanuel Jeannot 6/26Experimental validation

SimulationEmulation In-Situ (real scale)Benchmarking Four Methodologies for Expérimentation Experimental validationEmmanuel Jeannot 7/26 Real application Real environnement Model of the environnement Model of the application Grid’5000 Das-3 Planet Lab Linpack Montage Workflow NAS SimGRID GridSim P2PSim MicroGRID Wrekavoc ModelNet

Complementarity of the approaches Experimental validationEmmanuel Jeannot 8/26 Simgrid (simulation) BenchmarkInsituEmulation Real applicationNoModelYes AbstractionVery HighHighNoLow Execution timeSpeed-upPreserved FoldingMandatoryNo HeterogeneityControllableNo Controllable

Simulation vs real scale Experimental validationEmmanuel Jeannot 9/26 A good strategy : 1.Build benchmarks to asses the characteristics of the environments 2.Calibrate model with real-scale experiments 3.Real application tests with benchmarks 4.Perform large-scale and reproducible experiments with simulation

Conclusion Experimental validationEmmanuel Jeannot 10/26 Experimentation is important to validate our approach (model, solutions, etc.) But it has to be done carefully! This is an important part of the project Important : we get funded (partially) because we promised we will use Grid’500