SSU 1 Dr.A.Srinivas PES Institute of Technology Bangalore, India 9 – 20 July 2012.

Slides:



Advertisements
Similar presentations
1 Lecture 13: Cache and Virtual Memroy Review Cache optimization approaches, cache miss classification, Adapted from UCB CS252 S01.
Advertisements

WHAT IS AN OPERATING SYSTEM? An interface between users and hardware - an environment "architecture ” Allows convenient usage; hides the tedious stuff.
Instructor: Sazid Zaman Khan Lecturer, Department of Computer Science and Engineering, IIUC.
Measuring Performance
CMPE 421 Parallel Computer Architecture MEMORY SYSTEM.
Discovering Computers 2010
Introduction to Systems Architecture Kieran Mathieson.
OPERATING SYSTEM OVERVIEW
EET 4250: Chapter 1 Performance Measurement, Instruction Count & CPI Acknowledgements: Some slides and lecture notes for this course adapted from Prof.
GCSE Computing - The CPU
Processors Menu  INTEL Core™ i Processor INTEL Core™ i Processor  INTEL Core i Processor INTEL Core i Processor  AMD A K.

Systems I Locality and Caching
1 Instant replay  The semester was split into roughly four parts. —The 1st quarter covered instruction set architectures—the connection between software.
Basic Microcomputer Design. Inside the CPU Registers – storage locations Control Unit (CU) – coordinates the sequencing of steps involved in executing.
Computers Central Processor Unit. Basic Computer System MAIN MEMORY ALUCNTL..... BUS CONTROLLER Processor I/O moduleInterconnections BUS Memory.
Computing hardware CPU.
Different CPUs CLICK THE SPINNING COMPUTER TO MOVE ON.
Unit 2 - Hardware Microprocessors & CPUs. What is a microprocessor? ● The brain of the computer, the microprocessor is responsible for organizing and.
Operating System Review September 10, 2012Introduction to Computer Security ©2004 Matt Bishop Slide #1-1.
Types of Computers Mainframe/Server Two Dual-Core Intel ® Xeon ® Processors 5140 Multi user access Large amount of RAM ( 48GB) and Backing Storage Desktop.
Introduction CSE 410, Spring 2008 Computer Systems
EET 4250: Chapter 1 Computer Abstractions and Technology Acknowledgements: Some slides and lecture notes for this course adapted from Prof. Mary Jane Irwin.
Multi-core Programming Introduction Topics. Topics General Ideas Moore’s Law Amdahl's Law Processes and Threads Concurrency vs. Parallelism.
Sogang University Advanced Computing System Chap 1. Computer Architecture Hyuk-Jun Lee, PhD Dept. of Computer Science and Engineering Sogang University.
1 Embedded Systems Computer Architecture. Embedded Systems2 Memory Hierarchy Registers Cache RAM Disk L2 Cache Speed (faster) Cost (cheaper per-byte)
Recall: Three I/O Methods Synchronous: Wait for I/O operation to complete. Asynchronous: Post I/O request and switch to other work. DMA (Direct Memory.
Hardware Trends. Contents Memory Hard Disks Processors Network Accessories Future.
The Central Processing Unit
Fall 2012 Chapter 2: x86 Processor Architecture. Irvine, Kip R. Assembly Language for x86 Processors 6/e, Chapter Overview General Concepts IA-32.
Lecture 19 Today’s topics Types of memory Memory hierarchy.
Computer Architecture By Chris Van Horn. CPU Basics “Brains of the Computer” Fetch Execute Cycle Instruction Branching.
Chapter 1 Computer Abstractions and Technology. Chapter 1 — Computer Abstractions and Technology — 2 The Computer Revolution Progress in computer technology.
Computer Structure & Architecture 7b - CPU & Buses.
The Goal: illusion of large, fast, cheap memory Fact: Large memories are slow, fast memories are small How do we create a memory that is large, cheap and.
Processor Architecture
Lecture#15. Cache Function The data that is stored within a cache might be values that have been computed earlier or duplicates of original values that.
Processor Structure and Function Chapter8:. CPU Structure  CPU must:  Fetch instructions –Read instruction from memory  Interpret instructions –Instruction.
Lecture on Central Process Unit (CPU)
DR. SIMING LIU SPRING 2016 COMPUTER SCIENCE AND ENGINEERING UNIVERSITY OF NEVADA, RENO CS 219 Computer Organization.
Central Processing Unit (CPU) MATTHEW BOWEN. Function  The function of the CPU is to execute all of the commands and calculations or “processes” that.
Ee314 Microprocessor Systems Dr. Mircea DABACAN Electrical Engineering & Computer Science Dept., Washington State University Office: EE/ME 504 Phone:
Lab Activities 1, 2. Some of the Lab Server Specifications CPU: 2 Quad(4) Core Intel Xeon 5400 processors CPU Speed: 2.5 GHz Cache : Each 2 cores share.
CACHE MEMORY CS 147 October 2, 2008 Sampriya Chandra.
COMPUTER SYSTEMS ARCHITECTURE A NETWORKING APPROACH CHAPTER 12 INTRODUCTION THE MEMORY HIERARCHY CS 147 Nathaniel Gilbert 1.
Chapter 11 System Performance Enhancement. Basic Operation of a Computer l Program is loaded into memory l Instruction is fetched from memory l Operands.
Hardware Architecture
MAHARANA PRATAP COLLEGE OF TECHNOLOGY SEMINAR ON- COMPUTER PROCESSOR SUBJECT CODE: CS-307 Branch-CSE Sem- 3 rd SUBMITTED TO SUBMITTED BY.
Multi-Core CPUs Matt Kuehn. Roadmap ► Intel vs AMD ► Early multi-core processors ► Threads vs Physical Cores ► Multithreading and Multi-core processing.
Computer Organization CS345 David Monismith Based upon notes by Dr. Bill Siever and from the Patterson and Hennessy Text.
Computer System Structures Storage
Cache Advanced Higher.
COSC3330 Computer Architecture
ECE232: Hardware Organization and Design
Yu-Lun Kuo Computer Sciences and Information Engineering
Chapter 1: A Tour of Computer Systems
Assembly Language for Intel-Based Computers, 5th Edition
What happens inside a CPU?
CS-301 Introduction to Computing Lecture 17
CACHE MEMORY.
Multicultural Social Community Development Institute ( MSCDI)
Introduction to Computing
CPU and Embedded Systems
Types of Computers Mainframe/Server
Morgan Kaufmann Publishers Memory Hierarchy: Introduction
Computer System Design Lecture 9
Computer Evolution and Performance
1-2 – Central Processing Unit
Learning Objectives To be able to describe the purpose of the CPU
Computer Science. The CPU The CPU is made up of 3 main parts : Cache ALU Control Unit.
Presentation transcript:

SSU 1 Dr.A.Srinivas PES Institute of Technology Bangalore, India 9 – 20 July 2012

Schedule Day-1: Module 1: Multicore Architecture Module-2 : Parallel Programming Days 2-5: Parallel Programming with OpenMP Assigment : JPEG Compression & Decompression using Parallel Programming OpenMP Directives 2

Memory Hierarchy of early computers: 3 levels CPU registers DRAM Memory Disk storage 3

CACHE MEMORY Principle of locality helped to speed up main memory access by introducing small fast memories known as CACHE MEMORIES that hold blocks of the most recently referenced instructions and data items. Cache is a small fast storage device that holds the operands and instructions most likely to be used by the CPU. 4

Due to increasing gap between CPU and main Memory, small SRAM memory called L1 cache is inserted. L1 caches can be accessed almost as fast as the registers, typically in 1 or 2 clock cycle Due to even more increasing gap between CPU and main memory, Additional cache: L2 cache inserted between L1 cache and main memory : accessed in fewer clock cycles. 5

L2 cache attached to the memory bus or to its own cache bus Some high performance systems also include additional L3 cache which sits between L2 and main memory. It has different arrangement but principle is the same. The cache is placed both physically closer and logically closer to the CPU than the main memory. 6

CACHE LINES / BLOCKS Cache memory is subdivided into cache lines Cache Lines / Blocks: The smallest unit of memory than can be transferred between the main memory and the cache. 7

8

9

Core Vs Processor - A core means, there could be more than one CPU inside; - A Quad core processor of 3 GHz will have four cores in the CPU running at 3 GHz, each with its own Cache.. 10

11

12

Amdahl’s Law Speedup = 13 In Terms of No. of Cores: Speedup = Where S is the time spent in executing the serialized portion of the parallelized version And n is the number of cores.

14

Multicore Philosophy - Two or more cores with in a single Die - each core has its own set of instructions and architectural resources 15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

Hyper Threading: - Parts of a single processor are shared between threads - Execution Engine is shared - OS task switching does not happen in Hyper threading. -Processor is kept as busy as possible 31

32

33

34

35

36

37

38

39

40

41

42 Branch Target Buffer, Translation Look aside Buffer

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

61

62

63

64

65

66

67

68

69

70

71

72

73

74

75

76

77

78

79

80

References [1] Stephen Blair-Chappell, “Intel Core 2 Architecture - Implication for software developers, Intel Compiler Labs. [2] Ruud van der Pas, “An Overview of OpenMP”, IWOMP 2010, CCS, University of Tsukuba, Tsukuba, Japan, June 14-16, 2010 [3] Jernej Barbic, “Multi-core architectures”, , Spring 2007, May 3, 2007 [4] Ankit Kurana, “ Intel’s Multicore Processors”, Intel Corporation. [5] OpenMP Application Program Interface, Version 3.1 July 2011 [6] Ruud van der Pas, “An Introduction Into OpenMP”, IWOMP 2005, University of Oregon, Eugene, Oregon, USA, June 1-4,