A Grid Parallel Application Framework Jeremy Villalobos PhD student Department of Computer Science University of North Carolina Charlotte.

Slides:



Advertisements
Similar presentations
NGS computation services: API's,
Advertisements

INTRODUCTION TO SIMULATION WITH OMNET++ José Daniel García Sánchez ARCOS Group – University Carlos III of Madrid.
Distributed Processing, Client/Server and Clusters
M. Muztaba Fuad Masters in Computer Science Department of Computer Science Adelaide University Supervised By Dr. Michael J. Oudshoorn Associate Professor.
A Dynamic World, what can Grids do for Multi-Core computing? Daniel Goodman, Anne Trefethen and Douglas Creager
1 Coven a Framework for High Performance Problem Solving Environments Nathan A. DeBardeleben Walter B. Ligon III Sourabh Pandit Dan C. Stanzione Jr. Parallel.
GridFTP: File Transfer Protocol in Grid Computing Networks
Piccolo – Paper Discussion Big Data Reading Group 9/20/2010.
Scripting Languages For Virtual Worlds. Outline Necessary Features Classes, Prototypes, and Mixins Static vs. Dynamic Typing Concurrency Versioning Distribution.
Parallel Programming Models and Paradigms
CHEP03 - UCSD - March 24th-28th 2003 T. M. Steinbeck, V. Lindenstruth, H. Tilsner, for the Alice Collaboration Timm Morten Steinbeck, Computer Science.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
Workload Management Massimo Sgaravatto INFN Padova.
MULTICOMPUTER 1. MULTICOMPUTER, YANG DIPELAJARI Multiprocessors vs multicomputers Interconnection topologies Switching schemes Communication with messages.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
KARMA with ProActive Parallel Suite 12/01/2009 Air France, Sophia Antipolis Solutions and Services for Accelerating your Applications.
RUNNING PARALLEL APPLICATIONS BEYOND EP WORKLOADS IN DISTRIBUTED COMPUTING ENVIRONMENTS Zholudev Yury.
Introduction and Overview Questions answered in this lecture: What is an operating system? How have operating systems evolved? Why study operating systems?
1 " Teaching Parallel Design Patterns to Undergraduates in Computer Science” Panel member SIGCSE The 45 th ACM Technical Symposium on Computer Science.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
EXPOSE GOOGLE APP ENGINE AS TASKTRACKER NODES AND DATA NODES.
High Throughput Computing on P2P Networks Carlos Pérez Miguel
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
G-JavaMPI: A Grid Middleware for Distributed Java Computing with MPI Binding and Process Migration Supports Lin Chen, Cho-Li Wang, Francis C. M. Lau and.
SALSASALSASALSASALSA Design Pattern for Scientific Applications in DryadLINQ CTP DataCloud-SC11 Hui Li Yang Ruan, Yuduo Zhou Judy Qiu, Geoffrey Fox.
BOF: Megajobs Gracie: Grid Resource Virtualization and Customization Infrastructure How to execute hundreds of thousands tasks concurrently on distributed.
MATRIX MULTIPLY WITH DRYAD B649 Course Project Introduction.
The Grid computing Presented by:- Mohamad Shalaby.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Bi-Hadoop: Extending Hadoop To Improve Support For Binary-Input Applications Xiao Yu and Bo Hong School of Electrical and Computer Engineering Georgia.
Grid Computing at Yahoo! Sameer Paranjpye Mahadev Konar Yahoo!
Chapter 8-2 : Multicomputers Multiprocessors vs multicomputers Multiprocessors vs multicomputers Interconnection topologies Interconnection topologies.
May 16-18, Skeletons and Asynchronous RPC for Embedded Data- and Task Parallel Image Processing IAPR Conference on Machine Vision Applications Wouter.
Operating Systems David Goldschmidt, Ph.D. Computer Science The College of Saint Rose CIS 432.
The Alternative Larry Moore. 5 Nodes and Variant Input File Sizes Hadoop Alternative.
Latest news on JXTA and JuxMem-C/DIET Mathieu Jan GDS meeting, Rennes, 11 march 2005.
1 MMORPG Servers. 2 MMORPGs Features Avatar Avatar Levels Levels RPG Elements RPG Elements Mission Mission Chatting Chatting Society & Community Society.
GVis: Grid-enabled Interactive Visualization State Key Laboratory. of CAD&CG Zhejiang University, Hangzhou
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Fortress John Burgess and Richard Chang CS691W University of Massachusetts Amherst.
Lab 2 Parallel processing using NIOS II processors
1 Supporting Dynamic Migration in Tightly Coupled Grid Applications Liang Chen Qian Zhu Gagan Agrawal Computer Science & Engineering The Ohio State University.
U N I V E R S I T Y O F S O U T H F L O R I D A Hadoop Alternative The Hadoop Alternative Larry Moore 1, Zach Fadika 2, Dr. Madhusudhan Govindaraju 2 1.
1 THE EARTH SIMULATOR SYSTEM By: Shinichi HABATA, Mitsuo YOKOKAWA, Shigemune KITAWAKI Presented by: Anisha Thonour.
Grid Computing Framework A Java framework for managed modular distributed parallel computing.
1 "Workshop 31: Developing a Hands-on Undergraduate Parallel Programming Course with Pattern Programming SIGCSE The 44 th ACM Technical Symposium.
A Fully Automated Fault- tolerant System for Distributed Video Processing and Off­site Replication George Kola, Tevfik Kosar and Miron Livny University.
MATRIX MULTIPLY WITH DRYAD B649 Course Project Introduction.
CS 351/ IT 351 Modeling and Simulation Technologies HPC Architectures Dr. Jim Holten.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 2.
Miron Livny Computer Sciences Department University of Wisconsin-Madison Condor and (the) Grid (one of.
Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 4: Threads.
SMP Basics KeyStone Training Multicore Applications Literature Number: SPRPxxx 1.
Use of Performance Prediction Techniques for Grid Management Junwei Cao University of Warwick April 2002.
Emulating Volunteer Computing Scheduling Policies Dr. David P. Anderson University of California, Berkeley May 20, 2011.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
GPU Architecture and Its Application
Workload Management Workpackage
Network Controllable MP3 Player
GWE Core Grid Wizard Enterprise (
Spark Presentation.
Pattern Parallel Programming
Accelerating MapReduce on a Coupled CPU-GPU Architecture
Supporting Fault-Tolerance in Streaming Grid Applications
Collaborative Offloading for Distributed Mobile-Cloud Apps
湖南大学-信息科学与工程学院-计算机与科学系
Mihir Awatramani Lakshmi kiran Tondehal Xinying Wang Y. Ravi Chandra
MapReduce: Simplified Data Processing on Large Clusters
Supporting Online Analytics with User-Defined Estimation and Early Termination in a MapReduce-Like Framework Yi Wang, Linchuan Chen, Gagan Agrawal The.
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

A Grid Parallel Application Framework Jeremy Villalobos PhD student Department of Computer Science University of North Carolina Charlotte

Overview Parallel Applications on the Grid Latency Hiding by Redundant Processing (LHRP)‏ PGAFramework Related work Conclusion

Parallel Applications on the Grid Advantages Access to more resources Lower costs Future profits from Grid Economy ? Challenges IO problem Need for easy-to-use Interface Heterogeneous hardware

Latency Hiding by Redundant Processing Latency Hiding problem LHRP Algorithm CPU type CPU task assigned to each CPU type Versioning system Mathematical model to describe LHRP Results

LHRP Latency Hiding Latency Hiding by Redundantly Processing

LHRP Algorithm Internal: Only communicates with LAN CPUs. Border: Communicates with LAN CPUs and one Buffer CPU Buffer: Communicates with LAN Border CPU and receives data from WAN Border CPU

Computation and Communication Stages Internal: Computes borders Transfers borders (Non-blocking)‏ Computes core matrix Waits for transfer ACK

Computation and Communication Stages Border: Computes borders Transfers borders (Non- blocking)‏ Sends far border Computes core matrix Waits for transfer ACK Checks on far border transfer ACK (if it is the last iteration Wait)‏

Computation and Communication Stages Buffer: Computes borders Transfers borders (Non- blocking)‏ Receives far border Computes core matrix Waits for transfer ACK Checks on far border transfer ACK (if it is the last iteration Wait)‏

Buffer Node Versioning Algorithm

LHRP Algorithm Review Node types: Internal Border Buffer Far Border transfer Buffer Node Versioning system

Estimated Algorithm Performance G: Grid Latency I: Internal Latency B: Amount of data tuples used by the Buffer Node W: Total amount of work for all CPUs C: Amount of CPUs doing non-redundant work

Estimated Algorithm Performance

Experimental Result: Memory Footprint 21% increase memory use over conventional form of Latency Hiding. Causes: Extra Matrix in Buffer Node to store old column versions Extra far border buffers.

Experimental Results: Performance

PGAFramework Objective Design Requirements Implementation technology choices API Design API Workpool Example Other API features Synchronization option Recursive option

PGAFramework Objective: To create an efficient parallel application framework for the grid that allows a user programmer easy interaction with the Grid resources.

Design Requirements Platform independence Self Deployment Easy-to-Use Interface Provide the following services without requiring extra effort on the part of the user programmer: Load Balancing Scheduling Fault tolerance Latency/Bandwidth tolerance

Design GPAFramework User's Application API (Interface)‏ Load Balancing Scheduling Fault Tolerance Latency Bandwidth Tolerance Globus Job Scheduler (Condor)‏ GPAFramework User's Applications Hardware Resources

Deployment GridWay ? Globus CondorPBS Globus SGE Desktop PCs Node Cluster computer node Super computer Scheduling Service Job Submit Node Resource Discovery

Implementation Java Platform Independence JXTA (JXSE)‏ Peer-to-peer API Provides tools to work-around NAT's and firewalls Provides library and module runtime loading API

Motivation for API Design Video Codecs Codecs follow an interfaces What happens inside the codec does not matter The input and output for the codec needs to be specified Display a Gui Load File... Output video to screen mpegoggh.264 Video Player Mpeg endoded stream Raw video Data

PGAFramework API There may be multiple “template” API's Each API has Interfaces that the user implements The user “Inserts” his module into the framework API Get data from framework Compute on data Return processed data Request sync (optional)‏ Give data to framework Get data from framework Store or pipe data Schedule processes on Resource Load user Data Create network Determine topology and net behavior Send user process to compute nodes Get Data from user class Send to master node Repeat process in loop until done

API Sample Code

API

API Sample Code

Synchronization option RemoteHandler provides an Interface to synchronize data Data is synced non-blocking User creates blocking procedures if needed

Recursive Feature Allows multiple level of parallelization (granularity)‏ Decode Video Cut Raw Video Into Pictures Blur pictures Blur portion of picture Pipeline Work pool Synchronous

Related Work MPI Implementation for the Grid MPICH-G2 GridMPI MPICH-V2 (MPICH-V1)‏ Peer-to-peer parallel frameworks P2PMPI (for cluster computing)‏ P3(for cluster computing)‏ Self deploying frameworks Jojo

Conclusions Parallel Applications on the Grid Latency Hiding by Redundant Processing (LHRP)‏ PGAFramework Related work