Real Time Power and Performance Monitoring of Supercomputer Application Shankar Prajapati BS in Computer Science Claflin University

Presentation transcript:

Real Time Power and Performance Monitoring of Supercomputer Application Shankar Prajapati BS in Computer Science Claflin University Nate Rini (Mentor)

History and Progress of Supercomputers

Supercomputer in progress

α Relationship

Objectives

Benchmarking Measuring Performance

High Performance LINPACK (HPL): Tools to measure efficiency and performance

High Performance LINPACK (HPL): Since 1993, the fastest supercomputers have been ranked on the TOP500 list according to their LINPACK benchmark results.
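The arithmetic behind an HPL result can be sketched as follows. This is a minimal illustration, not the benchmark's own code: it uses only the leading 2/3 n^3 term of the operation count (the lower-order O(n^2) term is omitted), and the function names, core count, clock rate, and flops-per-cycle figure are hypothetical placeholders rather than values from the slides.

```python
def hpl_flops_leading(n: int) -> float:
    """Leading-order operation count for solving a dense n x n system (2/3 n^3).

    Assumption: the lower-order O(n^2) term is ignored here.
    """
    return (2.0 / 3.0) * float(n) ** 3


def achieved_gflops(n: int, seconds: float) -> float:
    """Approximate sustained rate (Rmax-style) in GFLOP/s for one run."""
    return hpl_flops_leading(n) / seconds / 1e9


def rpeak_gflops(nodes: int, cores_per_node: int, ghz: float,
                 flops_per_cycle: int) -> float:
    """Theoretical peak in GFLOP/s from hardware parameters."""
    return nodes * cores_per_node * ghz * flops_per_cycle


def efficiency(rmax: float, rpeak: float) -> float:
    """Fraction of theoretical peak actually achieved."""
    return rmax / rpeak


if __name__ == "__main__":
    # Hypothetical cluster: 28 nodes, 16 cores each, 2.6 GHz, 8 flops/cycle.
    peak = rpeak_gflops(28, 16, 2.6, 8)
    # Hypothetical run: problem size 100000 solved in 120 seconds.
    rmax = achieved_gflops(100_000, 120.0)
    print(f"Rpeak = {peak:.1f} GFLOP/s, Rmax = {rmax:.1f} GFLOP/s, "
          f"efficiency = {efficiency(rmax, peak):.1%}")
```

Efficiency here is simply the ratio of the measured rate to the theoretical peak, which is the figure usually quoted alongside a TOP500 entry.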

Message Passing Interface (MPI): High performance, portability, scalability

OSU Micro-Benchmarking

Ganglia test cluster

Ganglia Architecture: the Ganglia MONitor Daemon (gmond), the Ganglia METAdata Daemon (gmetad), Round Robin Database tools (RRDtool), a PHP-based web interface, and a web server
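As a rough illustration of how these pieces exchange data: gmond publishes the cluster state as an XML document (by default on TCP port 8649), which gmetad polls and stores in round-robin databases. The sketch below parses such a document to pull one metric per host. The sample XML is a hand-written, abbreviated stand-in with hypothetical host names and values, and `load_by_host` is an illustrative helper, not part of Ganglia.

```python
import xml.etree.ElementTree as ET

# Hand-written, abbreviated sample in the shape of gmond's XML output.
# Host names and values are hypothetical.
SAMPLE_XML = """<GANGLIA_XML VERSION="3.1.7" SOURCE="gmond">
  <CLUSTER NAME="jellystone" LOCALTIME="0">
    <HOST NAME="node01" IP="10.0.0.1" REPORTED="0">
      <METRIC NAME="load_one" VAL="15.82" TYPE="float" UNITS=" "/>
      <METRIC NAME="cpu_num" VAL="16" TYPE="uint16" UNITS="CPUs"/>
    </HOST>
    <HOST NAME="node02" IP="10.0.0.2" REPORTED="0">
      <METRIC NAME="load_one" VAL="16.1" TYPE="float" UNITS=" "/>
    </HOST>
  </CLUSTER>
</GANGLIA_XML>"""


def load_by_host(xml_text: str, metric: str = "load_one") -> dict:
    """Return {host name: metric value} for every host reporting the metric."""
    root = ET.fromstring(xml_text)
    values = {}
    for host in root.iter("HOST"):
        for m in host.iter("METRIC"):
            if m.get("NAME") == metric:
                values[host.get("NAME")] = float(m.get("VAL"))
    return values


if __name__ == "__main__":
    for name, load in sorted(load_by_host(SAMPLE_XML).items()):
        print(name, load)
```

On a live cluster the XML text would come from a socket connection to a gmond instance rather than from a string constant.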

Ganglia Web Interface for Jellystone test cluster

Ganglia load vs. time graph: LINPACK over 28 nodes on the Jellystone test cluster

Model Specific Register (MSR): Various control registers in the x86 instruction set used for debugging, program execution tracing, computer performance monitoring, and toggling certain CPU features.

MSR and librapl: librapl simplifies access to the RAPL values in the MSRs of modern Intel CPUs such as Sandy Bridge processors.

Intel® Power Gadget: Software-based power usage monitoring tool for 2nd generation Intel Core processors or later.

Host metrics from rvitals: rvitals retrieves hardware vital information from the on-board service processor for a single node or a range of nodes and groups.
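To make concrete what a RAPL reading involves, here is a minimal sketch of decoding the package energy counter directly from the MSRs on Linux. It is not librapl's implementation. It assumes the `msr` kernel module is loaded and the script runs with root privileges, and the helper names are illustrative. The register addresses (0x606 for the RAPL unit register, 0x611 for the package energy counter) and the bit layout follow Intel's documented MSR definitions.

```python
import os
import struct
import time

MSR_RAPL_POWER_UNIT = 0x606    # unit definitions for power, energy, time
MSR_PKG_ENERGY_STATUS = 0x611  # 32-bit running package energy counter


def read_msr(cpu: int, reg: int) -> int:
    """Read one 64-bit MSR via /dev/cpu/<cpu>/msr (needs root and the msr module)."""
    fd = os.open(f"/dev/cpu/{cpu}/msr", os.O_RDONLY)
    try:
        return struct.unpack("<Q", os.pread(fd, 8, reg))[0]
    finally:
        os.close(fd)


def energy_unit_joules(power_unit_raw: int) -> float:
    """Bits 12:8 hold the energy status unit ESU; one count is 1/2**ESU joules."""
    esu = (power_unit_raw >> 8) & 0x1F
    return 1.0 / (1 << esu)


def energy_delta_joules(before_raw: int, after_raw: int, unit_j: float) -> float:
    """Energy used between two counter samples, allowing one 32-bit wraparound."""
    delta = (after_raw - before_raw) & 0xFFFFFFFF
    return delta * unit_j


def average_power_watts(before_raw: int, after_raw: int,
                        unit_j: float, seconds: float) -> float:
    """Average package power over the sampling interval."""
    return energy_delta_joules(before_raw, after_raw, unit_j) / seconds


if __name__ == "__main__":
    unit = energy_unit_joules(read_msr(0, MSR_RAPL_POWER_UNIT))
    e0 = read_msr(0, MSR_PKG_ENERGY_STATUS) & 0xFFFFFFFF
    time.sleep(1.0)
    e1 = read_msr(0, MSR_PKG_ENERGY_STATUS) & 0xFFFFFFFF
    print(f"package power ~ {average_power_watts(e0, e1, unit, 1.0):.2f} W")
```

The counter is only 32 bits wide, so the masked subtraction matters: on a busy package it can wrap within minutes, and sampling intervals should stay well below that period.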

IBM iDataplex iPDU

Data Collection

Tools Comparison

Observations

Conclusion

Future

Acknowledgement

Questions