Yuan CMSC 838 Presentation Parallelisation of IBD computation for determining genetic disease map.

Slides:



Advertisements
Similar presentations
Linkage and Genetic Mapping
Advertisements

Distributed Systems CS
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
Using Parallel Genetic Algorithm in a Predictive Job Scheduling
Master/Slave Architecture Pattern Source: Pattern-Oriented Software Architecture, Vol. 1, Buschmann, et al.
Chess Problem Solver Solves a given chess position for checkmate Problem input in text format.
Basics of Linkage Analysis
Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.
Parallel Genehunter: Implementation of a linkage analysis package for distributed memory architectures Michael Moran CMSC 838T Presentation May 9, 2003.
Reference: Message Passing Fundamentals.
1 Distributed Computing Algorithms CSCI Distributed Computing: everything not centralized many processors.
Introduction CS 524 – High-Performance Computing.
Parallelized Evolution System Onur Soysal, Erkin Bahçeci Erol Şahin Dept. of Computer Engineering Middle East Technical University.
CS 584. A Parallel Programming Model We need abstractions to make it simple. The programming model needs to fit our parallel machine model. Abstractions.
Outline Introduction Image Registration High Performance Computing Desired Testing Methodology Reviewed Registration Methods Preliminary Results Future.
1 SWE Introduction to Software Engineering Lecture 23 – Architectural Design (Chapter 13)
Atomistic Protein Folding Simulations on the Submillisecond Timescale Using Worldwide Distributed Computing Qing Lu CMSC 838 Presentation.
Active Messages: a Mechanism for Integrated Communication and Computation von Eicken et. al. Brian Kazian CS258 Spring 2008.
PGA – Parallel Genetic Algorithm Hsuan Lee. Reference  E Cantú-Paz, A Survey on Parallel Genetic Algorithm, Calculateurs Paralleles, Reseaux et Systems.
Reconstructing Genealogies: a Bayesian approach Dario Gasbarra Matti Pirinen Mikko Sillanpää Elja Arjas Department of Mathematics and Statistics
On the Task Assignment Problem : Two New Efficient Heuristic Algorithms.
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
Introduction to Parallel Programming MapReduce Except where otherwise noted all portions of this work are Copyright (c) 2007 Google and are licensed under.
SBSE Course 4. Overview: Design Translate requirements into a representation of software Focuses on –Data structures –Architecture –Interfaces –Algorithmic.
Chapter 3: Operating-System Structures System Components Operating System Services System Calls System Programs System Structure Virtual Machines System.
ICOM 5995: Performance Instrumentation and Visualization for High Performance Computer Systems Lecture 7 October 16, 2002 Nayda G. Santiago.
Multiple Processor Systems. Multiprocessor Systems Continuous need for faster and powerful computers –shared memory model ( access nsec) –message passing.
Neural and Evolutionary Computing - Lecture 10 1 Parallel and Distributed Models in Evolutionary Computing  Motivation  Parallelization models  Distributed.
Bulk Synchronous Parallel Processing Model Jamie Perkins.
Introduction, background, jargon Jakub Yaghob. Literature T.G.Mattson, B.A.Sanders, B.L.Massingill: Patterns for Parallel Programming, Addison- Wesley,
GU Junli SUN Yihe 1.  Introduction & Related work  Parallel encoder implementation  Test results and Analysis  Conclusions 2.
A Survey of Distributed Task Schedulers Kei Takahashi (M1)
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
April 26, CSE8380 Parallel and Distributed Processing Presentation Hong Yue Department of Computer Science & Engineering Southern Methodist University.
CSC480 Software Engineering Lecture 11 September 30, 2002.
Blue Brain Project Carlos Osuna, Carlos Aguado, Fabien Delalondre.
Processes Introduction to Operating Systems: Module 3.
Finnish Genome Center Monday, 16 November Genotyping & Haplotyping.
C OMPARING T HREE H EURISTIC S EARCH M ETHODS FOR F UNCTIONAL P ARTITIONING IN H ARDWARE -S OFTWARE C ODESIGN Theerayod Wiangtong, Peter Y. K. Cheung and.
Supporting Systolic and Memory Communication in iWarp CS258 Paper Summary Computer Science Jaein Jeong.
SOFTWARE DESIGN. INTRODUCTION There are 3 distinct types of activities in design 1.External design 2.Architectural design 3.Detailed design Architectural.
Theory of Programming Languages Introduction. What is a Programming Language? John von Neumann (1940’s) –Stored program concept –CPU actions determined.
OBJECT ORIENTED AND FUNCTION ORIENTED DESIGN 1 Chapter 6.
CS- 492 : Distributed system & Parallel Processing Lecture 7: Sun: 15/5/1435 Foundations of designing parallel algorithms and shared memory models Lecturer/
Parallel Genetic Algorithms By Larry Hale and Trevor McCasland.
Computer Science 340 Software Design & Testing Software Architecture.
Technical Seminar Presentation 2004 Presented by- Geetanjali Konhar EE O81 1 Dynamic power management for embedded system “ Dynamic power management.
Image Processing A Study in Pixel Averaging Building a Resolution Pyramid With Parallel Computing Denise Runnels and Farnaz Zand.
Presented by: Dardan Xhymshiti Spring 2016:. Authors: Publication:  ICDM 2015 Type:  Research Paper 2 Michael ShekelyamGregor JosseMatthias Schubert.
1 The Software Development Process ► Systems analysis ► Systems design ► Implementation ► Testing ► Documentation ► Evaluation ► Maintenance.
Lecture VIII: Software Architecture
ECE 526 – Network Processing Systems Design Programming Model Chapter 21: D. E. Comer.
Network Systems Lab. Korea Advanced Institute of Science and Technology No.1 Ch. 1 Introduction EE692 Parallel and Distribution Computation | Prof. Song.
Holding slide prior to starting show. Processing Scientific Applications in the JINI-Based OGSA-Compliant Grid Yan Huang.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
System Components Operating System Services System Calls.
Operating System Structure Lecture: - Operating System Concepts Lecturer: - Pooja Sharma Computer Science Department, Punjabi University, Patiala.
1.3 Operating system services An operating system provide services to programs and to the users of the program. It provides an environment for the execution.
CEng 713, Evolutionary Computation, Lecture Notes parallel Evolutionary Computation.
Parallel Programming By J. H. Wang May 2, 2017.
Multi-Processing in High Performance Computer Architecture:
PA an Coordinated Memory Caching for Parallel Jobs
Hybrid Programming with OpenMP and MPI
COMP60621 Fundamentals of Parallel and Distributed Systems
Distributed Computing:
Parallel Algorithm Models
Operating System Introduction.
Presented By: Darlene Banta
COMP60611 Fundamentals of Parallel and Distributed Systems
Presentation transcript:

Yuan CMSC 838 Presentation Parallelisation of IBD computation for determining genetic disease map

CMSC 838T – Presentation Introduction u Parallel Genehunter package  In high level  Provide dynamic allocation mechanism  Reduce Space and CPU Time consumption

CMSC 838T – Presentation Reminder u Genehunter  Software package  Construction of human genetic maps  Generate inherited diseases maps  Gene therapy u IBD (Identity by Descent)  One function in Genehunter  Identify the identical locus transmitted by a common ancestor

CMSC 838T – Presentation u Markers  A number to quantify the genetic resemblance of two relatives affected by the same genetic disease u Non-founder  Individual whose parents are in the family u Genetic disease maps  Multipoint linkage analysis with many different markers  Account for all the family information Reminder

CMSC 838T – Presentation Problems with Genehunter u Space Requirement: O(2 2n ) n – number of non-founder u Time Requirement: O(m2 2n ) m-number of markers Genehunter is inefficient in terms of Space and Time usage !

CMSC 838T – Presentation Parallel Genehunter u Goals  Not alter the mathematical detail u Solution  Genehunter running for each family can be considered as an independent task  Master-slave model  Message Passing Interface

CMSC 838T – Presentation Algorithm P0 P1 P2 P3 P4 Master Slaves task1 task2 task3 task4

CMSC 838T – Presentation Algorithm P0 P1 P2 P3 P4 Master Slaves WORK_REQUEST New tasks

CMSC 838T – Presentation Evaluation u Experiment environment  Network of workstation(NOW): 10 processors (distributed memory)  HPC3500: 8 processors (shared memory)  SunFire6800: 24 processors( shared memory) u Parameter for test cases

CMSC 838T – Presentation Evaluation

CMSC 838T – Presentation Evaluation

CMSC 838T – Presentation Evaluation

CMSC 838T – Presentation Related Work u Low-level approach  Parallel Genehunter: Implementation of a linkage analysis package for distributed memory architectures

CMSC 838T – Presentation General approaches u Three model of parallelism  High level based on family  Low level based on different markers  Combine low-level and high-level approach u Select suitable model based on parameters  C1: number of family  C2: number of markers  C3: structure of family

CMSC 838T – Presentation Observations u Pros  Useful: Genehunter is a popular  Simple strategy u Cons:  Too simply to be efficient  Heavy communication between processors  Cannot scale to larger number of processors  Workload unbalance with large family  No concrete example how about to select different models  List Implementing combine model as future work  …