1 Pertemuan 26 Parallel Processing 2 Matakuliah: H0344/Organisasi dan Arsitektur Komputer Tahun: 2005 Versi: 1/1.

Slides:



Advertisements
Similar presentations
1 Pertemuan > > Matakuliah: >/ > Tahun: > Versi: >
Advertisements

1 Pertemuan 04 Proxy/Cache Matakuliah: H0491/Praktikum Jaringan Komputer Tahun: 2005 Versi: 1/0.
Pertemuan <<12>> Paralel dan Basis Data Terdistribusi
1 Pertemuan 22 Radix Sort Matakuliah: T0016/Algoritma dan Pemrograman Tahun: 2005 Versi: versi 2.
1 Pertemuan 15 The Business Owner’s View Matakuliah: A0194/Pengendalian Rekayasa Ulang Informasi Tahun: 2005 Versi: 1/5.
1 Pertemuan 6 Elementary UDP Sockets Matakuliah: H0483 / Network Programming Tahun: 2005 Versi: 1.0.
Computer Organization and Architecture
1 Pertemuan 10 Membuat dan mengelola resiko dan kriminalitas sistem informasi Matakuliah: H0472 / Konsep Sistem Informasi Tahun: 2006 Versi: 1.
Cache Coherent Distributed Shared Memory. Motivations Small processor count –SMP machines –Single shared memory with multiple processors interconnected.
Distributed Processing, Client/Server, and Clusters
1 Pertemuan 10 Arsitektur Jaringan Model OSI Matakuliah: H0174/Jaringan Komputer Tahun: 2006 Versi: 1/0.
1 Minggu 12, Pertemuan 23 Introduction to Distributed DBMS (Chapter , 22.6, 3rd ed.) Matakuliah: T0206-Sistem Basisdata Tahun: 2005 Versi: 1.0/0.0.
1 Pertemuan 13 BACK PROPAGATION Matakuliah: H0434/Jaringan Syaraf Tiruan Tahun: 2005 Versi: 1.
1 Pertemuan 05 Firewall Matakuliah: H0451/Praktikum Jaringan Komputer Tahun: 2006 Versi: 1/0.
1 Pertemuan 12 Input/Output Matakuliah: H0344/Organisasi dan Arsitektur Komputer Tahun: 2005 Versi: 1/1.
1 Pertemuan 03 Routing Matakuliah: H0451/Praktikum Jaringan Komputer Tahun: 2006 Versi: 1/0.
1 Pertemuan 02 LAN Matakuliah: H0451/Praktikum Jaringan Komputer Tahun: 2006 Versi: 1/0.
1 Pertemuan 16 Instruction Set 2 Matakuliah: H0344/Organisasi dan Arsitektur Komputer Tahun: 2005 Versi: 1/1.
1 Pertemuan 17 Internal Memory: I Matakuliah: T0324 / Arsitektur dan Organisasi Komputer Tahun: 2005 Versi: 1.
1 Pertemuan 5 The structure part of object data model Matakuliah: M0174/OBJECT ORIENTED DATABASE Tahun: 2005 Versi: 1/0.
1 Pertemuan 9 Making an outline Matakuliah: G1072 – Reading 1 Tahun: 2005 Versi: revisi 0.
Teknik Routing Pertemuan 20 Matakuliah: H0484/Jaringan Komputer Tahun: 2007.
1 Pertemuan 21 Parallelism and Superscalar Matakuliah: H0344/Organisasi dan Arsitektur Komputer Tahun: 2005 Versi: 1/1.
Chapter 17 Parallel Processing.
1 Pertemuan 20 Time & Condition Clauses with Future reference Matakuliah: G0134 – Grammar III Tahun: 2005 Versi: revisi 1.
Transport Layer Pertemuan 24 Matakuliah: H0484/Jaringan Komputer Tahun: 2007.
1 Pertemuan 23 Reduced Instruction Set Computer 1 Matakuliah: H0344/Organisasi dan Arsitektur Komputer Tahun: 2005 Versi: 1/1.
1 Lecture 23: Multiprocessors Today’s topics:  RAID  Multiprocessor taxonomy  Snooping-based cache coherence protocol.
1 CSE SUNY New Paltz Chapter Nine Multiprocessors.
1 Pertemuan 25 Parallel Processing 1 Matakuliah: H0344/Organisasi dan Arsitektur Komputer Tahun: 2005 Versi: 1/1.
1 Pertemuan 20 Microprogrammed Control Matakuliah: H0344/Organisasi dan Arsitektur Komputer Tahun: 2005 Versi: 1/1.
1 Pertemuan 8 The Object Definition Language (Lanjutan) Matakuliah: M0174/OBJECT ORIENTED DATABASE Tahun: 2005 Versi: 1/0.
Chapter 18 Parallel Processing (Multiprocessing).
Lecture 37: Chapter 7: Multiprocessors Today’s topic –Introduction to multiprocessors –Parallelism in software –Memory organization –Cache coherence 1.
Introduction to Symmetric Multiprocessors Süha TUNA Bilişim Enstitüsü UHeM Yaz Çalıştayı
Advanced Architectures
1 Distributed Processing, Client/Server, and Clusters Chapter 13.
Parallel Processing – Page 1 of 63CSCI 4717 – Computer Architecture CSCI 4717/5717 Computer Architecture Topic: Symmetric Multiprocessors & Clusters Reading:
Parallel ProcessingCSCI 4717 – Computer Architecture CSCI 4717/5717 Computer Architecture Topic: Symmetric Multiprocessors & Clusters Reading: Stallings,
1 Pertemuan 5 Software: System Software Matakuliah: T0604-Pengantar Teknologi Informasi Tahun: 2008 Versi: 2.0/0.0 Williams, B.K, Stacy C. Sawyer (2007).
ECE200 – Computer Organization Chapter 9 – Multiprocessors.
Computer Organization
PARALLEL PROCESSOR- TAXONOMY. CH18 Parallel Processing {Multi-processor, Multi-computer} Multiple Processor Organizations Symmetric Multiprocessors Cache.
+ Clusters Alternative to SMP as an approach to providing high performance and high availability Particularly attractive for server applications Defined.
+ William Stallings Computer Organization and Architecture 10 th Edition © 2016 Pearson Education, Inc., Hoboken, NJ. All rights reserved.
Understanding Parallel Computers Parallel Processing EE 613.
August 13, 2001Systems Architecture II1 Systems Architecture II (CS ) Lecture 11: Multiprocessors: Uniform Memory Access * Jeremy R. Johnson Monday,
Multiprocessor  Use large number of processor design for workstation or PC market  Has an efficient medium for communication among the processor memory.
The University of Adelaide, School of Computer Science
Pertemuan 19 External Memory: I
Pertemuan 04 Proxy/Cache
Pertemuan 20 The Business Views of the Technology Architecture
Computer Organization and Architecture
Pertemuan 22 The Business Views of the Technology Architecture
Pertemuan 7 JARINGAN INSTAR DAN OUTSTAR
The University of Adelaide, School of Computer Science
Pertemuan 11 Model TCP/IP
Parallel and Multiprocessor Architectures – Shared Memory
Multiprocessors - Flynn’s taxonomy (1966)
High Performance Computing
The University of Adelaide, School of Computer Science
Lecture 24: Virtual Memory, Multiprocessors
CSL718 : Multiprocessors 13th April, 2006 Introduction
The University of Adelaide, School of Computer Science
Presentation transcript:

1 Pertemuan 26 Parallel Processing 2 Matakuliah: H0344/Organisasi dan Arsitektur Komputer Tahun: 2005 Versi: 1/1

2 Learning Outcomes Pada akhir pertemuan ini, diharapkan mahasiswa akan mampu : Menjelaskan prinsip kerja parallel processing

3 Outline Materi Multiple Processor Organization Symmetric Multiprocessor Cache Coherence and The MESI Protocol Clusters Non-uniform Memory Access Vector Computation

4 Cache coherence and MESI Protocol The cache coherence: Multiple copies of the same data can exist in different caches simultaneously, and if processors are allowed to update their own copies freely, an inconsistent view of memory can result. Software solution Hardware solution Directory protocol Snoopy protocol

5 Cache coherence and MESI Protocol MESI cache line states M Modified E Exclusive S Shared I Invalid This cache line valid? Yes No The memory copy is … Out of dateValid - Copies exist in other caches? No Maybe A write to this line … Does not go to bus Goes to bus and updates cache Goes directly to bus

6 Cache coherence and MESI Protocol MESI state transition diagram

7 Clusters Four benefits that can be achieved with clustering: Absolute scalability Incremental scalability High availability Superior price/performance

8 Clusters Cluster configuration

9 Clustering methods: benefits and limitations Clustering methodDescriptionBenefitsLimitation Passive standbyA secondary server takes over in case of primary server failure. Easy to implement.High cost because the secondary server is unavailable for other processing tasks. Active secondaryThe secondary server is also used for processing tasks. Reduces cost because secondary servers can be used for processing. Increased complexity. Separate serversSeparate servers have their own disks. Data are continuously copied from primary to secondary server. High availability.High network and server overhead due to copying operations. Servers connected to disk Servers are cabled to the same disks, but each server owns its disk. If one server fails, its disks are taken over by the other server. Reduced network and server overhead due to elimination of copying operations. Usually requires disk mirroring or RAID technology to compensate for risk of disk failure. Servers share disks Multiple servers simultaneously share access to disks. Low network and server overhead. Reduced risk of downtime caused by disk failure. Requires look manager software. Usually used with disk mirroring or RAID technology.

10 Operating system design issue: Failure management Load balancing Parallel computation Parallelizing compiler Parallelized application Parametric computing Clusters

11 Uniform memory access (UMA) Non uniform memory access (NUMA) Cache coherent NUMA (CC-NUMA) Non uniform memory access

12 Non uniform memory access CC-NUMA Organization

13 DO 100 I = 1, N DO 100 J = 1, NC(I, J) = 0.0 (J = 1, N) C(I, J) = 0.0DO 100 K = 1, N C(1, J) = C(I, J) + A(I, K) * B(K, J) (J = 1, N) C(I, J) = C(I, J) + A(I, K) * B(K, J)100CONTINUE 100CONTINUE (a) Scalar processing(b) Vector processing DO 50 J = 1, N - 1 FORK CONTINUE J = N 100DO 200 I = 1, N C(I, J) = 0.0 DO 200 K = 1, N C(I, J) = C(I, J) + A(I, K) * B(K, J) 200CONTINUE (c) Parallel processing Vector computation

14 Vector computation

15 Vector computation

16 Vector computation DO 100 J = 1, 50 CR(J) = AR(J) * BR(J) – AI(J) * BI(J) 100CI(J) = AR(J) * BI(J) + AI(J) * BR(J) OperationCycleOperationCycle AR(J) * BR(J)  T1(J)3AR(J)  V1(J)1 AI(J) * BI(J)  T2(J)3BR(J)  V2(J)1 T1(J) – T2(J)  CR(J)3V1(J) * V2(J)  V3(J)1 AR(J) * BI(J)  T3(J)3AI(J)  V4(J)1 AI(J) * BR(J)  T4(J)3BI(J)  V5(J)1 T3(J) + T4(J)  CI(J)3V4(J) * V5(J)  V6(J)1 TOTAL12V3(J) – V6(J)  V7(J)1 (a) Storage to storageV7(J)  CR(J)1 V1(J) * V5(J)  V8(J)1 V4(J) * V2(J)  V9(J)1 V8(J) + V9(J)  V0(J)1 V0(J)  CI(J)1 TOTAL12 (b) Register to register

17 Vector computation DO 100 J = 1, 50 CR(J) = AR(J) * BR(J) – AI(J) * BI(J) 100CI(J) = AR(J) * BI(J) + AI(J) * BR(J) OperationCycleOperationCy cle AR(J)  V1(J)1 1 V1(J) * BR(J)  V2(J)1 1 AI(J)  V3(J)1 1 V3(J) * BI(J)  V4(J)1V2(J) – V(3) * BI(J)  V2(J)1 V2(J) – V4(J)  V5(J)1V2(J)  CR(J)1 V5(J)  CR(J)1V1(J) * BI(J)  V4(J)1 V1(J) * BI(J)  V6(J)1V4(J) + V3(J) * BR(J)  V5(J)1 V4(J) * BR(J)  V7(J)1V5(J)  CI(J)1 V6(J) + V7(J)  V8(J)1TOTAL8 V8(J)  CI(J)1(d) Compound instruction TOTAL10 (c) Storage to storage