IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.01.

Slides:



Advertisements
Similar presentations
Multi-core processors. 2 Processor development till 2004 Out-of-order Instruction scheduling Out-of-order Instruction scheduling.
Advertisements

Multiprocessors— Large vs. Small Scale Multiprocessors— Large vs. Small Scale.
Lecture 6: Multicore Systems
Intel Multi-Core Technology. New Energy Efficiency by Parallel Processing – Multi cores in a single package – Second generation high k + metal gate 32nm.
MULTICORE PROCESSOR TECHNOLOGY.  Introduction  history  Why multi-core ?  What do you mean by multicore?  Multi core architecture  Comparison of.
C OMPUTING E SSENTIALS Timothy J. O’Leary Linda I. O’Leary Presentations by: Fred Bounds.
An Overview of the Computer System
Types of Computers & Computer Hardware Computer Technology.
1 COMP 206: Computer Architecture and Implementation Montek Singh Mon, Dec 5, 2005 Topic: Intro to Multiprocessors and Thread-Level Parallelism.
Computational Astrophysics: Methodology 1.Identify astrophysical problem 2.Write down corresponding equations 3.Identify numerical algorithm 4.Find a computer.
1 Pipelining for Multi- Core Architectures. 2 Multi-Core Technology Single Core Dual CoreMulti-Core + Cache + Cache Core 4 or more cores.
Types of Computers and Computer Systems.
GCSE Computing - The CPU
MIS 175 Spring Learning Objectives When you finish this chapter, you will: –Recognize major components of an electronic computer. –Understand how.
Single-Chip Multi-Processors (CMP) PRADEEP DANDAMUDI 1 ELEC , Fall 08.
Computer performance.
Hardware -Computer Organization. Hardware & Software A computer system consists of A computer system consists of –Hardware: anything you can touch, smell,
 Chasis / System cabinet  A plastic enclosure that contains most of the components of a computer (usually excluding the display, keyboard and mouse)
Computer System Architectures Computer System Software
Writer:-Rashedul Hasan Editor:- Jasim Uddin
Computer Processing of Data
Practical PC, 7th Edition Chapter 17: Looking Under the Hood
Multi Core Processor Submitted by: Lizolen Pradhan
 Design model for a computer  Named after John von Neuman  Instructions that tell the computer what to do are stored in memory  Stored program Memory.
Types of Computers Mainframe/Server Two Dual-Core Intel ® Xeon ® Processors 5140 Multi user access Large amount of RAM ( 48GB) and Backing Storage Desktop.
Classification of Computers
Multi-core architectures. Single-core computer Single-core CPU chip.
Multi-Core Architectures
Chapter 2 The CPU and the Main Board  2.1 Components of the CPU 2.1 Components of the CPU 2.1 Components of the CPU  2.2Performance and Instruction Sets.
Advanced Computer Architecture 0 Lecture # 1 Introduction by Husnain Sherazi.
Super computers Parallel Processing By Lecturer: Aisha Dawood.
+ CS 325: CS Hardware and Software Organization and Architecture Memory Organization.
6/4/ Key components of the computer Classification of computers based on Purposes Classification of computers based on Signals Classification of.
SKILL AREA: 1.2 MAIN ELEMENTS OF A PERSONAL COMPUTER.
Chapter 17 Looking “Under the Hood”. 2Practical PC 5 th Edition Chapter 17 Getting Started In this Chapter, you will learn: − How does a computer work.
Computer Organization & Assembly Language © by DR. M. Amer.
Chapter 1 Computer Hardware1 Computer Hardware A level Computing Book (Reference) By P.M.Heathcore.
Shashwat Shriparv InfinitySoft.
CPS 4150 Computer Organization Fall 2006 Ching-Song Don Wei.
Multi-core processors. 2 Processor development till 2004 Out-of-order Instruction scheduling Out-of-order Instruction scheduling.
THE BRIEF HISTORY OF 8085 MICROPROCESSOR & THEIR APPLICATIONS
Academic PowerPoint Computer System – Architecture.
MULTICORE PROCESSOR TECHNOLOGY.  Introduction  history  Why multi-core ?  What do you mean by multicore?  Multi core architecture  Comparison of.
Lecture on Central Process Unit (CPU)
Defining the terms assignment. I will define the following terms: computer literate, computer, storage, Data, Information, input, output, information processing.
Chapter 2.
Jeffrey Ellak CS 147. Topics What is memory hierarchy? What are the different types of memory? What is in charge of accessing memory?
CS4315A. Berrached:CMS:UHD1 Introduction to Operating Systems Chapter 1.
What is it and why do we need it? Chris Ward CS147 10/16/2008.
Chapter 11 System Performance Enhancement. Basic Operation of a Computer l Program is loaded into memory l Instruction is fetched from memory l Operands.
Lecture # 10 Processors Microcomputer Processors.
Hardware Architecture
CPU (Central Processing Unit). The CPU is the brain of the computer. Sometimes referred to simply as the processor or central processor, the CPU is where.
Fall 2012 Parallel Computer Architecture Lecture 4: Multi-Core Processors Prof. Onur Mutlu Carnegie Mellon University 9/14/2012.
CPU Central Processing Unit
Multiprocessing.
6. Structure of Computers
Assembly Language for Intel-Based Computers, 5th Edition
Phnom Penh International University (PPIU)
Introduction to Microprocessors
Scalable Processor Design
Multi-Core Computing Osama Awwad Department of Computer Science
An Overview of the Computer System
Multicultural Social Community Development Institute ( MSCDI)
BIC 10503: COMPUTER ARCHITECTURE
Microprocessor & Assembly Language
Types of Computers Mainframe/Server
Chapter 1 Introduction.
Classification of computers
Computer Evolution and Performance
Presentation transcript:

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.01

Classes of computers Based on their physical size, performance and application areas they are divided into four categories as Classification of computers Micro Hand-heldDesktopLaptop MiniMainframeSuper IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.02

Micro Computers Small,low cost digital computer, which usually consists of a microprocessor, a storage unit, an input channel, and an output channel, all of which may be on one chip inserted into one or several PC board. Power supply, connecting cables, peripherals an OS and software program can provide a complete micro computer system. Smallest of the computer family. They were designed for the individual users only but nowadays they have become powerful tools for many businesses that when networked together can serve more than one user. e.g: IBM-PC, Pentium 100, Pentium 200, Apple Macintosh IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.03

Desktop Computer:  PC (Personal computer) intended for stand alone use by an individual.  system unit, a display monitor, a keyboard, internal hard disk storage, and other peripheral devices.  Not very expensive.  APPLE, IBM, Dell & Hewlett Packard. IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.04

Laptop:  Portable computer. It resembles a notebook, it is also known as notebook.  Features of a normal desktop’s.  Advantage: use this at anywhere at any time ( when one is traveling).  No need of external power supply, only rechargeable battery is enough.  Expensive when compared to desktop computers. IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.05

Hand held Computers:  PDA (Personal Digital Assistant) conveniently be stored in a pocket used while the user is holding it.  Slightly bigger than the calculator.  Pen or electronic stylus as input device.  Also called as Palmtop computers.  No disk drive instead they have small cards to store programs and data. But they can be connected to a printer to get the output.  Limited memory & not powerful as desktop computers. IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.06

Mini Computers Small digital computer whose process & storage capacity micro computer. Speed in between mainframe & micro computer. Size: two drawing filing cabinet. Also called as mid range computer Supporting 4 to about 200 simultaneous users. Multi user system for real time controls & engineering work. IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.07

Mainframe Computer  Large expensive-simultaneous dp 100s or 1000s of users.  Used to store, manage & process large amounts of data that need to be reliable, secure & centralized.  Supports large volumes of dp, high performance online transaction processing systems & extensive data storage & retrieval.  Mainframes are 2nd largest IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.08

Super Computers  Special purpose machines. specially designed to maximise the numbers of FLOPS (Floating Point Operations Per Second).>1gigaflop/sec  Highest processing speed to solve engineering and scientific problems.  Number of CPU that operate in parallel.  Speed : MFLOPS.  Resolve complex mathematical equations in a few hours.  Fastest, costliest and most powerful computer.  Used area: Aerodynamic metrology, plasma physics, military strategist and Cinematics specialist. IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.09

10

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.011

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.012

Why multi-core ? Difficult to make single-core clock frequencies even higher Deeply pipelined circuits: –heat problems –Clock problems –Efficiency (Stall) problems Doubling issue rates above today’s 3-6 instructions per clock, say to 6 to 12 instructions, is extremely difficult –issue 3 or 4 data memory accesses per cycle, –rename and access more than 20 registers per cycle, and –fetch 12 to 24 instructions per cycle. Many new applications are multithreaded General trend in computer architecture (shift towards more parallelism) IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.013

Multi-Core Processor is a Special Kind of a Multiprocessor IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.014

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.015 Replicate multiple processor cores on a single die. Multi-Core Architectures

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.016 The cores fit on a single processor socket Also called CMP (Chip Multi-Processor) Multi-Core CPU Chip

Trends in Technology Capacity Speed (latency) Logic: 2x in 3 years 2x in 3 years DRAM: 4x in 3 years 2x in 10 years Disk: 4x in 3 years 2x in 10 years DRAM Generations Year Size Cycle Time Kb 250 ns Kb 220 ns Mb 190 ns Mb 165 ns Mb 120 ns Mb 110 ns Mb 100 ns Mb 90 ns Mb 80 ns Mb 60ns 16000:1 4:1 (Capacity) (Latency) IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.017

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.018

Power, Energy & Cost IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.019  A Multi-core processor uses less power than single-core processors  Cache coherency circuitry can operate at a much higher clock rate than is possible if the signals have to travel off-chip.  Signals between different CPUs travel shorter distances, those signals degrade less.  These higher quality signals allow more data to be sent in a given time period since individual signals can be shorter and do not need to be repeated as often.  Ability of multi-core processors to increase application performance depends on the use of multiple threads within applications.  Most Current video games will run faster on a 3 GHz single-core processor than on a 2GHz dual-core processor (of the same core architecture.  Two processing cores sharing the same system bus and memory bandwidth limits the real-world performance advantage.  If a single core is close to being memory bandwidth limited, going to multi-core might only give 30% to 70% improvement.  If memory bandwidth is not a problem, a 90% improvement can be expected.

Dependability Database servers Web servers (Web commerce) Compilers Multimedia applications Scientific applications, CAD/CAM In general, applications with Thread-level parallelism as opposed to instruction level parallelism IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.020

Measuring, Reporting and Summarizing Performance IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.021 processor-multicoreStep 2, masuk ke msconfig melalui menu "start...

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.022  All computers are now parallel computers!  Multi-core processors represent an important new trend in computer architecture.  Decreased power consumption and heat generation.  Minimized wire lengths and interconnect latencies.  They enable true thread-level parallelism with great energy efficiency and scalability.  To utilize their full potential, applications will need to move from a single to a multi-threaded model.  Parallel programming techniques likely to gain importance. the difficult problem is not building multi-core hardware, but programming it in a way that lets mainstream applications benefit from the continued exponential growth in CPU performance.  the software industry needs to get back into the state where existing applications run faster on new hardware Measuring, Reporting and Summarizing Performance

Processor-DRAM Performance To illustrate the performance impact, assume a single- issue pipelined CPU with CPI = 1 using non-ideal memory. The minimum cost of a full memory access in terms of number of wasted CPU cycles: CPU CPU Memory Minimum CPU cycles or Year speed cycle Access instructions wasted MHZ ns ns 1986: / = : /30 -1 = : / = : /5 -1 = : / = : /1 - 1 = : / = 159 IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.023

y MainMemor y Main memory generally uses (DRAM), which uses a single transistor to store a bit, but requires a periodic data refresh (~every 8 msec). Cache uses SRAM: Static Random Access Memory –No refresh (6 transistors/bit vs. 1 transistor/bit for DRAM) Size: DRAM/SRAM ­ 4-8, Cost & Cycle time: SRAM/DRAM ­ 8-16 Main memory performance: –Memory latency: Access time: The time it takes between a memory access request and the time the requested information is available to cache/CPU. Cycle time: The minimum time between requests to memory (greater than access time in DRAM to allow address lines to be stable) –Memory bandwidth: The maximum sustained data transfer rate between main memory and cache/CPU. IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.024

Quantitative Principles of Computer Design IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.025  Exploits increased feature-size and density  Increases functional units per chip (spatial efficiency)  Limits energy consumption per operation  Constrains growth in processor complexity

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.026 The Cores Run in Parallel

Classes of Parallelism Concurrent events  multiprogramming, multiprocessing, or multi computing. Parallelism  pipelining, vectorization, concurrency, simultaneity, data parallelism, partitioning, interleaving, overlapping, multiplicity, replication, time sharing, space sharing, multi tasking, multiprogramming, multithreading, and distributed computing at different process level. IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.027

Classes of Parallelism Parallelism in Hardware (Uniprocessor) – Pipelining, Superscalar, VLIW etc. Parallelism in Hardware -(SIMD, Vector processors, GPUs) Parallelism in Hardware (Multiprocessor) – Shared-memory multiprocessors – Distributed-memory multiprocessors – Chip-multiprocessors, Multi-cores Parallelism in Hardware (Multicomputer,clusters) Parallelism in Software – Task parallelism – Data parallelism IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.028

ILP,DLP,TLP and RLP 1ILPInstruction Level Parallelism 2LLPLoop Level Parallelism 3DLPData Level Parallelism 4TLPThread Level Parallelism(S) 5RLPR Level Parallelism IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.029

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.030

Loop Level Parallelism Instructions: iterations in a loop Accomplished by unrolling the loop H/W :Dynamic  if block size increased S/W : Static  compiler Loop controlling achieved by replicating the loop body multiple times & adjusting the loop termination code.(Creating multiple copies of the loop body) IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.031

Thread Level Parallelism A thread is a short sequence of instructions schedulable as a unit by a processor. This is parallelism on a more coarser scale Server can serve each client in a separate thread (Web server, database server) A computer game can do AI, graphics, and physics in three separate threads Single-core superscalar processors cannot fully exploit TLP Multi-core architectures are the next step in processor evolution: explicitly exploiting TLP IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.032

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.033

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.034

Inter-Core Bus IFETCE/ME/CSE/B.V.R.Raju/Iyear/ Isem/CP7103/MCA/Unit- 1/PPt/Ver1.0 35

Multithreading SIMULTANEOUSLYPermits multiple independent threads to execute SIMULTANEOUSLY on the SAME core Weaving together multiple “threads” on the same core Example: –if one thread is waiting for a floating point operation to complete, –another thread can use the integer units IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.036

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.037

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.038

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.039 SMT Archtechture

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.040 Multi-Core vs SMT Advantages/disadvantages? Multi-core: –Since there are several cores, each is smaller and not as powerful, but easier to design and manufacture –Great with thread-level parallelism SMT –Can have one large and fast superscalar core Great performance on a single thread Mostly still only exploits instruction-level parallelism

CMP Architecture IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.041

Limitations of Single core processors IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.042 Smarter Brain –(e.g. x386  x486  Pentium  P2  P3  P4) Larger Memory –Larger caches, DRAM, Disk Smaller Head –Fewer chips (integrate more things onto a chip) More Power Consumption –few Watts  120+ Watts! More Complex –  1Billion Transistors; design + verification complexity

Multi-Core System In multi-core systems, the term multi-CPU refers to multiple physically separate processing-units (which often contain special circuitry to facilitate communication between each other). The terms many-core and massively multi-core are sometimes used to describe multi-core architectures with an especially high number of cores (tens or hundreds). IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.043

A multi-core processor is a single computing component with 2 or more independent actual cpu (called "cores"), which are the units that read and execute program instructions. The instructions are ordinary CPU instructions such as add, move data, and branch, but the multiple cores can run multiple instructions at the same time, increasing overall speed for programs amenable to parallel computing. Manufacturers typically integrate the cores onto a single integrated circuit die (known as a chip multiprocessor or CMP), or onto multiple dies in a single chip package.computingprogram instructionsCPU instructionsparallel computing integrated circuitdiechip package IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.044

1 single-core 2 dual-core 3 tri-core /triple-core 4 quad-core 5penta-core 6hexa-core 7hepta-core 8octa-core/octo-core 9nona-core 10deca-core Number of cores Common names 11hendeca-core 12dodeca-core 13trideca-core 14tetradeca-core 15pentadeca-core 16hexadeca-core 17heptadeca-core 18octadeca-core 19enneadeca-core 20icosa-core IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.045

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.046 Multicore processors share the cache and MMU with short interconnects

What Is Multicore Processing? Multicore (or multi-core) processing uses software designed to run as parallel or asynchronously processed multiple applications over a multicore processor. The efficiency of multicore processing is dependent on how well the software application is optimized to take advantage of the multiple cores, the composition of those multicore processors, and the speed of the external interfaces and related hardware components in a system. Multicore processing can be especially useful for low-latency applications, with the largest boost in performance likely to be noticed in improved response time while running CPU-intensive processes. IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.047

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.048

The Multi core era A multi-core processor is a single computing component with two or more independent actual central processing units (called "cores"), which are the units that read and execute program instructions. [1] The instructions are ordinary CPU instructions such as add, move data, and branch, but the multiple cores can run multiple instructions at the same time, increasing overall speed for programs amenable to parallel computing. [2] Manufacturers typically integrate the cores onto a single integrated circuit die (known as a chip multiprocessor or CMP), or onto multiple dies in a single chip package.computingcentral processing unitsprogram instructions [1]CPU instructionsparallel computing [2] integrated circuitdiechip package IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.049

Case studies of Multi core Architectures IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.050

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.051

IFETCE/ME/CSE/B.V.R.Raju/Iyear/Isem/CP7103/MCA/Unit-1/PPt/Ver1.052