07/14/08. 2 Points Introduction. Cluster and Supercomputers. Cluster Types and Advantages. Our Cluster. Cluster Performance. Cluster Computer for Basic.

Slides:



Advertisements
Similar presentations
PARMON A Comprehensive Cluster Monitoring System PARMON Team Centre for Development of Advanced Computing, Bangalore, India Contact: Rajkumar Buyya
Advertisements

Distributed Processing, Client/Server and Clusters
Master/Slave Architecture Pattern Source: Pattern-Oriented Software Architecture, Vol. 1, Buschmann, et al.
2. Computer Clusters for Scalable Parallel Computing
Chapter 5: Server Hardware and Availability. Hardware Reliability and LAN The more reliable a component, the more expensive it is. Server hardware is.
Network+ Guide to Networks, Fourth Edition
Dinker Batra CLUSTERING Categories of Clusters. Dinker Batra Introduction A computer cluster is a group of linked computers, working together closely.
Presented by: Yash Gurung, ICFAI UNIVERSITY.Sikkim BUILDING of 3 R'sCLUSTER PARALLEL COMPUTER.
High Availability (HA) May 03, Motivation  New Technology  The opportunity to create a cluster  Exploring with Linux Operating system.
Beowulf Cluster Computing Each Computer in the cluster is equipped with: – Intel Core 2 Duo 6400 Processor(Master: Core 2 Duo 6700) – 2 Gigabytes of DDR.
Cluster Computer For Bioinformatics Applications Nile University, Bioinformatics Group. Hisham Adel 2008.
Network+ Guide to Networks, Fourth Edition Chapter 1 An Introduction to Networking.
DISTRIBUTED COMPUTING
Introduction to client/server architecture
© 2001 by Prentice Hall8-1 Local Area Networks, 3rd Edition David A. Stamper Part 3: Software Chapter 8 Client/Server Architecture.
Lecture 2 – MapReduce CPE 458 – Parallel Programming, Spring 2009 Except as otherwise noted, the content of this presentation is licensed under the Creative.
11 SERVER CLUSTERING Chapter 6. Chapter 6: SERVER CLUSTERING2 OVERVIEW  List the types of server clusters.  Determine which type of cluster to use for.
1b.1 Types of Parallel Computers Two principal approaches: Shared memory multiprocessor Distributed memory multicomputer ITCS 4/5145 Parallel Programming,
Chapter 2 Computer Clusters Lecture 2.1 Overview.
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
THE AFFORDABLE SUPERCOMPUTER HARRISON CARRANZA APARICIO CARRANZA JOSE REYES ALAMO CUNY – NEW YORK CITY COLLEGE OF TECHNOLOGY ECC Conference 2015 – June.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
U.S. Department of the Interior U.S. Geological Survey David V. Hill, Information Dynamics, Contractor to USGS/EROS 12/08/2011 Satellite Image Processing.
CLUSTER COMPUTING Prepared by: Kalpesh Sindha (ITSNS)
Parallel Computing The Bad News –Hardware is not getting faster fast enough –Too many architectures –Existing architectures are too specific –Programs.
Network+ Guide to Networks, Fourth Edition Chapter 1 An Introduction to Networking.
Remote OMNeT++ v2.0 Introduction What is Remote OMNeT++? Remote environment for OMNeT++ Remote simulation execution Remote data storage.
Basic Introduction of BLAST Jundi Wang School of Computing CSC691 09/08/2013.
Performance Concepts Mark A. Magumba. Introduction Research done on 1058 correspondents in 2006 found that 75% OF them would not return to a website that.
March 3rd, 2006 Chen Peng, Lilly System Biology1 Cluster and SGE.
PARMON A Comprehensive Cluster Monitoring System A Single System Image Case Study Developer: PARMON Team Centre for Development of Advanced Computing,
Cluster Workstations. Recently the distinction between parallel and distributed computers has become blurred with the advent of the network of workstations.
1 Selecting LAN server (Week 3, Monday 9/8/2003) © Abdou Illia, Fall 2003.
Loosely Coupled Parallelism: Clusters. Context We have studied older archictures for loosely coupled parallelism, such as mesh’s, hypercubes etc, which.
April 26, CSE8380 Parallel and Distributed Processing Presentation Hong Yue Department of Computer Science & Engineering Southern Methodist University.
1 CMPE 511 HIGH PERFORMANCE COMPUTING CLUSTERS Dilek Demirel İşçi.
PARALLEL COMPUTING overview What is Parallel Computing? Traditionally, software has been written for serial computation: To be run on a single computer.
CLUSTER COMPUTING TECHNOLOGY BY-1.SACHIN YADAV 2.MADHAV SHINDE SECTION-3.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
Distributed Programming CA107 Topics in Computing Series Martin Crane Karl Podesta.
Motivation: Sorting is among the fundamental problems of computer science. Sorting of different datasets is present in most applications, ranging from.
+ Clusters Alternative to SMP as an approach to providing high performance and high availability Particularly attractive for server applications Defined.
By Chi-Chang Chen.  Cluster computing is a technique of linking two or more computers into a network (usually through a local area network) in order.
COMP381 by M. Hamdi 1 Clusters: Networks of WS/PC.
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 1.
CS4315A. Berrached:CMS:UHD1 Introduction to Operating Systems Chapter 1.
CIP HPC CIP - HPC HPC = High Performance Computer It’s not a regular computer, it’s bigger, faster, more powerful, and more.
Cluster computing. 1.What is cluster computing? 2.Need of cluster computing. 3.Architecture 4.Applications of cluster computing 5.Advantages of cluster.
Background Computer System Architectures Computer System Software.
Computer System Evolution. Yesterday’s Computers filled Rooms IBM Selective Sequence Electroinic Calculator, 1948.
SYSTEM MODELS FOR ADVANCED COMPUTING Jhashuva. U 1 Asst. Prof CSE
System Models Advanced Operating Systems Nael Abu-halaweh.
Chapter 16 Client/Server Computing Dave Bremer Otago Polytechnic, N.Z. ©2008, Prentice Hall Operating Systems: Internals and Design Principles, 6/E William.
Chapter 1: Introduction
Chapter 1: Introduction
Hadoop Clusters Tess Fulkerson.
Chapter 1: Introduction
Chapter 16: Distributed System Structures
CSE8380 Parallel and Distributed Processing Presentation
Chapter 1: Introduction
Language Processors Application Domain – ideas concerning the behavior of a software. Execution Domain – Ideas implemented in Computer System. Semantic.
Network+ Guide to Networks, Fourth Edition
Multithreaded Programming
Chapter 1: Introduction
Chapter 1: Introduction
Database System Architectures
Chapter 1: Introduction
MapReduce: Simplified Data Processing on Large Clusters
Chapter 1: Introduction
Presentation transcript:

07/14/08

2 Points Introduction. Cluster and Supercomputers. Cluster Types and Advantages. Our Cluster. Cluster Performance. Cluster Computer for Basic Problems. General Idea about Sequence Alignment. BLAST and Parallel BLAST Algorithm. Sequence Alignment and Parallel Sequence Alignment. Learned Skills.

3 Introduction

4 Points Introduction. Cluster and Supercomputers. Cluster Types and Advantages. Our Cluster. Cluster Performance. Cluster Computer for Basic Problems. General Idea about Sequence Alignment. BLAST and Parallel BLAST Algorithm. Sequence Alignment and Parallel Sequence Alignment. Learned Skills.

5 Cluster Definition Group of computers and servers ( loosely connected computers ) that act like a single system. computers Each system called a Node. Node contain one or more Processor, Ram,Hard disk and LAN card. Nodes work in Parallel. We can increase performance by adding more Nodes. A c

07/14/08 These cluster computers are referred to by many names, from a poor-man's supercomputer to COWs (clusters of workstations), and NOWs (networks of workstations). cheaper than traditional MPP systems, and often use the same processors, but are more difficult to use since the network capabilities are currently much lower. Cluster computers are also usually much smaller, most often involving fewer than 100 computers

07/14/08 Figure 1. A Cluster Architecture.

07/14/08 large Linux cluster at the Chemnitz University of Technology, GermanyLinuxChemnitz University of Technology

9

07/14/08 A simple, home-built Beowulf clusterBeowulf cluster

11

Motivation for Clusters/Grids Many science and engineering problems today require large amounts of computational resources and cannot be executed in a single machine. Large commercial supercomputers are very expensive… A lot of computational power is underutilized around the world in machines sitting idle.

13 Points Introduction. Cluster and Supercomputers. Cluster Types and Advantages. Our Cluster. Cluster Performance. Cluster Computer for Basic Problems. General Idea about Sequence Alignment. BLAST and Parallel BLAST Algorithm. Sequence Alignment and Parallel Sequence Alignment. Learned Skills.

14 Cluster types Load Balancing Cluster (Parallel BLAST). Computing Cluster(Parallel sequence alignment). High-availability (HA) clusters.

15 Cluster types:Load Balancing Cluster Task

07/14/08 A load balancing cluster with two servers and 4 user stations

07/14/08 Load-balancingLoad-balancing clusters are configurations in which cluster-nodes share computational workload to provide better overall performance. For example, a web server cluster may assign different queries to different nodes, so the overall response time will be optimized

18 Cluster types:Computing Cluster Task

07/14/08 "Computer clusters" are used for computation-intensive purposes, rather than handling IO-oriented operations such as web service or databases.IO-oriented

20 Cluster type:High-availability Clusters

07/14/08 "High-availability clusters“ improve the availability of the cluster approach. They operate by having redundant nodes, which are then used to provide service when system components failHigh-availability clustersnodes

22 Cluster advantages Performance. Scalability. Maintenance. Cost.

23 Points Introduction. Cluster and Supercomputers. Cluster Types and Advantages. Our Cluster. Cluster Performance. Cluster Computer for Basic Problems. General Idea about Sequence Alignment. BLAST and Parallel BLAST Algorithm. Sequence Alignment and Parallel Sequence Alignment. Learned Skills.

24 Node 1 switch Node 4 Node 2 Node 3 Internet Cluster

25 Communication : Switch 5-Port 10/100Mbps. Processor and Ram: -Master Node Duo core Processor 1.86 GHZ. Ram 1GB. -Node 1 Pentium 4 Ram 1GB. -Node 2 Pentium 4 Ram 1GB -Node 3 Pentium 4 Ram 512 MB Cluster specification

26  Operating System OPEN SUSE  MPICH2 Cluster specification (cont’)‏

27 Points Introduction. Cluster and Supercomputers. Cluster Types and Advantages. Our Cluster. Cluster Performance. Cluster Computer for Basic Problems. General Idea about Sequence Alignment. BLAST and Parallel BLAST Algorithm. Sequence Alignment and Parallel Sequence Alignment. Learned Skills.

28 Performance of the Cluster is affected by 1-Node speed. 2-Running Program.

29 Working… Running Program(sequential)‏

30 Working… Running Program(sequential)‏

31 Working… Running Program(sequential)‏

32 Running Program(sequential)‏

33 Data sent Running Program(Parallel)‏

34 Working… Running Program(Parallel)‏

35 Finished… Results Get results… Running Program(Parallel)‏

36 Points Introduction. Cluster and Supercomputers. Cluster Types and Advantages. Our Cluster. Cluster Performance. Cluster Computer for Basic Problems. General Idea about Sequence Alignment. BLAST and Parallel BLAST Algorithm. Sequence Alignment and Parallel Sequence Alignment. Learned Skills.

37 Sequence Alignment

38 Sequence Alignment Used to : 1-Compare between sequences. 2-Search databases.

39 How to Align two Sequences. if we have two sequences A A A C G A A A T G A Let match=1, gap=-1, miss-match=0. they can be aligned as: 1- A A A C G A | | | | | | Score=3 A A T _ G A 2- A A A C _ G A | | | | | | | Score=1 A A _ _ T G A

40 Points Introduction. Cluster and Supercomputers. Cluster Types and Advantages. Our Cluster. Cluster Performance Cluster Computer for Basic Problems.. General Idea about Sequence Alignment. BLAST and Parallel BLAST Algorithm. Sequence Alignment and Parallel Sequence Alignment. Learned Skills.

41 BLAST (Basic Local Alignment Search Tool)‏ Searching DataBases

42 BLAST Algorithm (High scoring pairs)‏

43 Blast search types. BLASTN - Compares a nucleotide query sequence against a nucleotide sequence database. BLASTP- Compares an amino acid query sequence against a protein sequence database. TBLASTN- Compares a protein query sequence against a nucleotide sequence Database. BLASTX- Compares nucleotide query sequence against a protein sequence database.

44 Why We need BLAST to be parallelized ?

45 Our Program:Parallel BLAST

46 Parallel BLAST(cont’)‏ Formatdb.c Nucleotide sequence database “formatdb -i DATABASE -p F “. Protein sequence database “formatdb -i DATABASE -p T “.

47 Linux_Cluster_BLASTALL.c “blastall -p BLAST Search Type -d DATABASE -i QUERY FILE -o out. Txt” Parallel BLAST(cont’)‏

48 Results Average of running 1000 Query, 1000 times.

49 Results(cont’)‏ Average of running 1000 Query, 1000 times.

50 Results(cont’)‏ Average of running 1000 Query, 1000 times.

51 Conclusion about Parallel BLAST. Performane: Batter by using CLUSTER. Scalability:More Nodes time decrease.

52 Points Introduction. Cluster and Supercomputers. Cluster Types and Advantages. Our Cluster. Cluster Performance. Cluster Computer for Basic Problems. General Idea about Sequence Alignment. BLAST and Parallel BLAST Algorithm. Sequence Alignment and Parallel Sequence Alignment. Learned Skills.

53 Sequence Alignment Compare between sequences

54 Sequence Alignment Introduction. Sequence Alignment Benefits. Sequence Alignment Types.

55 Needleman-Wunsch Algorithm

56 Why We need Sequence Alignment to be parallelized ?

57 Parallel Sequence Alignment algorithm

58 Our Sequence Alignment Program Pairwise Alignment. Built Using Needleman-Wunsch algorithm.

59 Learned Skills. Using Linux (Suse 10.3) operating system. Programming using C language. Cluster computers and how to build one. MPICH2 for message passing interfaces between nodes. Latex. Team working, and helping each other. Presentation skills.

07/14/08 Message passing and communication Two widely used approaches for communication between cluster nodes are PVM, Parallel Virtual Machine :(1989)PVM provides a run- time environment for message-passing, task and resource management, and fault notification. PVM can be used by user programs written in C, C++, or Fortran, etcParallel Virtual Machine MPI, Message Passing Interface (1990)MPI implementations typically use TCP/IP and socket connections. [ MPI is now a widely available communications model that enables parallel programs to be written in languages such as C, Fortran, Python, etcMessage Passing InterfaceTCP/IP [CFortranPython

07/14/08 Cluster management: Task scheduling When a large multi-user cluster needs to access very large amounts of data, task scheduling becomes a challenge.task scheduling Node failure management When a node in a cluster fails, strategies such as "fencing" may be employed to keep the rest of the system operational.fencing Fencing is the process of isolating a node or protecting shared resources when a node appears to be malfunctioning.

Cluster Computing - challenges Software to create a single system image Fault tolerance Debugging tools Job scheduling All these have been/are being addressed since then and are leading towards a successful era for cluster computing

63 Thank you