
Virtual Memory and Paging J. Nelson Amaral

Large Data Sets Size of address space: – 32-bit machines: 2^32 bytes = 4 GB – 64-bit machines: 2^64 bytes = a huge number Size of main memory: – approaching 4 GB How to handle: – Applications whose data set is larger than the main memory size? – Sets of applications that together need more space than the memory size? Baer, p. 60
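The address-space arithmetic above can be checked explicitly (a worked sketch; the 16 EB figure is an addition, the slide only calls it "a huge number"):

```python
# Address-space sizes from the slide, computed explicitly.
GB = 2**30

space_32 = 2**32                 # 32-bit address space
space_64 = 2**64                 # 64-bit address space

assert space_32 == 4 * GB        # 2^32 bytes = 4 GB
assert space_64 == 16 * 2**60    # 2^64 bytes = 16 exabytes
assert space_64 == space_32**2   # the 64-bit space is 2^32 times larger
```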

Multiprogramming More than one program resides in memory at the same time. I/O is slow: if the running program needs I/O, it relinquishes the CPU. Baer, p. 60

Multiprogramming Challenges How and where is a program loaded into memory? How does a program ask for more memory? How is one program protected from another? Baer, p. 60

Virtual Memory Solution: – Give each program the illusion that it can address the whole address space The CPU works with virtual addresses The memory works with real (physical) addresses Baer, p. 60

Virtual -> Physical Address Translation Paging System – Divide both the virtual and the physical address spaces into pages of the same size. – Virtual space: page – Physical space: frame Fully associative mapping between pages and frames. – any page can be stored in any frame Baer, p. 60
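The translation step can be sketched in a few lines (a minimal sketch assuming 4 KB pages; the dict-based page table and function names are illustrative, not from Baer):

```python
# Virtual -> physical translation with 4 KB pages (assumed size).
# The "page table" is a plain dict from virtual page number (VPN) to
# frame number; real hardware walks a table kept in memory.
OFFSET_BITS = 12                      # 4 KB pages -> 12 offset bits
PAGE_MASK = (1 << OFFSET_BITS) - 1

def translate(vaddr, page_table):
    vpn = vaddr >> OFFSET_BITS        # which page?
    offset = vaddr & PAGE_MASK        # where inside the page?
    frame = page_table[vpn]           # fully associative: any frame will do
    return (frame << OFFSET_BITS) | offset

page_table = {0: 7, 1: 3}             # page 0 in frame 7, page 1 in frame 3
assert translate(0x1234, page_table) == (3 << OFFSET_BITS) | 0x234
```

Note that the offset bits pass through unchanged; only the page number is translated.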

Paging System Virtual space is much larger than physical memory Memory does not need to store the whole program and its data at the same time Memory can be shared with little fragmentation Pages can be shared among programs Baer, p. 61

Address Translation valid bit = 0 implies a page fault (there is no frame in memory for this page) Baer, p. 62

Page Fault Exception generated in program P1 because valid bit = 0 in its Page Table Entry (PTE) – Page fault handler initiates an I/O read for P1 The I/O read takes several milliseconds to complete – a context switch occurs The O.S. saves the processor state and starts the I/O operation Hands CPU control to another program P2 – Restores P2's state into the CPU Baer, p. 62

Address Translation Virtual and physical addresses can be of different sizes. Example: 64-bit virtual addresses mapped to 40- or 48-bit physical addresses. Baer, p. 62

Translation Look-Aside Buffer (TLB) Problem: – Storing page table entries (PTEs) in memory would require a memory load for each address translation – Caching PTEs interferes with the flow of instructions or data into the cache Solution: the TLB, a small, high-associativity cache dedicated to caching PTEs Baer, p. 62

TLB Organization Each TLB entry consists of: – tag – data (a PTE) – valid bit – dirty bit – bits to encode memory protection – bits to encode recency of access A set of TLB entries may be reserved for the operating system Baer, p. 62
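A TLB entry and its lookup might be modeled as follows (a sketch; the field names mirror the bullet list above but are otherwise illustrative):

```python
# Fully associative TLB: every entry's tag is compared against the VPN.
from dataclasses import dataclass

@dataclass
class TLBEntry:
    tag: int              # virtual page number
    pte: int              # the cached PTE (reduced to a frame number here)
    valid: bool = True
    dirty: bool = False
    protection: int = 0   # encoded memory protection bits
    recency: int = 0      # encoded recency of access

def tlb_lookup(tlb, vpn):
    """Return the matching valid entry, or None on a TLB miss."""
    for entry in tlb:
        if entry.valid and entry.tag == vpn:
            return entry
    return None

tlb = [TLBEntry(tag=5, pte=9)]
assert tlb_lookup(tlb, 5).pte == 9
assert tlb_lookup(tlb, 6) is None
```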

TLB Characteristics (FA = fully associative)

Architecture   Page Size (KB)   I-TLB Entries   D-TLB Entries
Alpha                           (FA)            32 (FA)
Alpha                           (FA)            64 (FA)
Alpha                           (FA)            128 (FA)
Pentium        4                32 (4-way)      64 (4-way)
Pentium II     4                32 (4-way)      64 (4-way)
Pentium III    4                32 (4-way)      64 (4-way)
Pentium 4      4                64 (4-way)      128 (4-way)
Core Duo       4                64 (FA)

Baer, p. 63

Large Pages Recent processors implement a large page size (typically 4 MB) – reduces page faults in applications with large data sets (scientific and graphics) – requires that TLB entries be reserved for large pages Baer, p. 63

Referencing Memory Baer, p. 63

Memory Reference Process (flowchart) For each reference: check for a TLB hit; on a miss, handle the TLB miss. If the valid bit is 0, a page fault occurs. If the access violates the protection bits, an access violation exception is raised. If the access is a store, turn the PTE dirty bit on. Finally, update the recency of access. Baer, p. 63
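The flow above can be written out as straight-line code (a sketch; the exception types and dict fields are stand-ins for hardware mechanisms):

```python
# One memory reference through the TLB, following the flowchart.
def reference(tlb, vpn, is_store):
    entry = tlb.get(vpn)                     # TLB hit?
    if entry is None:
        raise LookupError("TLB miss")        # handled in HW or SW
    if not entry["valid"]:
        raise MemoryError("page fault")      # valid bit = 0
    if is_store and not entry["writable"]:
        raise PermissionError("access violation")
    if is_store:
        entry["dirty"] = True                # turn PTE dirty bit on
    entry["recent"] = True                   # update recency of access
    return entry["frame"]

tlb = {3: {"valid": True, "writable": True, "dirty": False,
           "recent": False, "frame": 8}}
assert reference(tlb, 3, is_store=True) == 8
assert tlb[3]["dirty"] and tlb[3]["recent"]
```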

Handling TLB Misses Must access page table in memory – entirely in hardware – entirely in software – combination of both Replacement Algorithms – LRU for 4-way associativity (Intel) – Not Most Recently Used for full associativity (Alpha) Baer, p. 64

Handling TLB Misses (cont.) Serving a TLB miss takes long enough to have a significant impact on performance, yet is too short to justify a context switch – even a small TLB miss rate affects the CPI Baer, p. 64

OS Handling of a Page Fault Reserve a frame from the free list Determine whether the faulting page is on disk Invalidate portions of the TLB (and maybe of the cache) Initiate a read for the faulting page Find a page to replace if there is no free frame Invalidate cache lines mapping to the replaced page Write dirty replaced pages to the disk Baer, p. 64

When the Page Arrives in Memory An I/O interrupt is raised The OS updates the PTE of the page The OS schedules the requesting process for execution Baer, p. 64

Invalidating TLB Entries on Context Switch Page Fault → Exception → Context Switch Let: – P_R: relinquishing process – P_I: incoming process Problem: the TLB entries are for P_R, not P_I – Invalidating the entire TLB on a context switch leads to many TLB misses when P_I is restored Solution: use a process ID number (PID) Baer, p. 64

Process ID (PID) Number O.S. sets a PID for each program The PID is added to the tag in the TLB entries A PID Register stores the PID of the active process Match PID Register with PID in TLB entry No need to invalidate TLB entries on context switch PIDs are recycled by the OS Baer, p. 64
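Extending the TLB tag with a PID makes the no-flush scheme concrete (a sketch with illustrative field names):

```python
# PID-tagged TLB: a hit requires matching both the PID register and the
# VPN, so entries from two processes can coexist across a context switch.
def tlb_lookup(tlb, pid_register, vpn):
    for entry in tlb:
        if (entry["valid"] and entry["pid"] == pid_register
                and entry["vpn"] == vpn):
            return entry["frame"]
    return None          # miss, but nothing had to be invalidated

tlb = [
    {"valid": True, "pid": 1, "vpn": 4, "frame": 10},   # left by P_R
    {"valid": True, "pid": 2, "vpn": 4, "frame": 11},   # belongs to P_I
]
assert tlb_lookup(tlb, pid_register=2, vpn=4) == 11     # P_I sees its own
assert tlb_lookup(tlb, pid_register=1, vpn=4) == 10     # P_R's entry survives
```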

Page Size vs. Read/Write Time (figure) Reading or writing a page pays a seek time (0 to 10 ms) and a rotation time (~3 ms) regardless of page size, plus a transfer time that grows with the size; a page of size 2x pays the fixed costs only once. Amortizing I/O time: use a large page size, and read/write consecutive pages. Baer, p. 65

Large Pages Amortize the I/O time to transfer pages Smaller page tables – More PTEs fit in main memory, lowering the probability of a double page fault for a single memory reference Fewer TLB misses – A single TLB entry translates more locations Pages cannot be too large – transfer time and fragmentation grow with page size Baer, p. 65
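The "fewer TLB misses" point is easy to quantify as TLB reach (back-of-envelope; the 64-entry count is taken from the processors listed earlier):

```python
# TLB "reach": memory covered by a 64-entry TLB at two page sizes.
ENTRIES = 64
KB, MB = 2**10, 2**20

reach_4kb = ENTRIES * 4 * KB          # with 4 KB pages
reach_4mb = ENTRIES * 4 * MB          # with 4 MB large pages

assert reach_4kb == 256 * KB          # 256 KB of coverage
assert reach_4mb == 256 * MB          # 256 MB of coverage
assert reach_4mb // reach_4kb == 1024 # 1024x more reach per TLB
```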

Performance of Memory Hierarchy Baer, p. 66

When to bring a missing item (to cache, TLB, or memory)? On demand.

Level        Miss Frequency                         Miss Resolution
Cache        few times per 100 references           5-100 cycles, entirely in hardware
TLB          few times per 10,000 references        cycles, in hardware or software
Page fault   few times per 10,000,000 references    millions of cycles, requires a context switch

Baer, p. 66

Where to put the missing item? Cache: restrictive mapping (direct or low associativity) TLB: fully associative or high set associativity Paging System: general mapping Baer, p. 66

How do we know it is there? Cache: Compare tags and check valid bits TLB: Compare tags, PID, check valid bits Memory: Check Page Tables Baer, p. 67

What happens on a replacement? Caches and TLBs: (an approximation to) LRU Paging systems: – Sophisticated algorithms to keep the page fault rate very low – O.S. policies allocate a number of pages to each program according to its working set Baer, p. 67

Simulating the Memory Hierarchy Memory hierarchy simulation is faster than simulation to assess IPC or execution time Stack property of some replacement algorithms: – for a given sequence of memory references at a given level of the hierarchy, the number of misses is monotonically non-increasing with the size of the memory – a single simulation pass can therefore evaluate a whole range of sizes Baer, p. 67
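The stack property can be exploited directly: for LRU, one pass computing each reference's stack distance yields the miss count for every fully associative size at once (a sketch, not Baer's code):

```python
# One-pass simulation using LRU stack distances.
def stack_distances(trace):
    stack, dists = [], []          # stack[0] is the most recently used
    for ref in trace:
        if ref in stack:
            d = stack.index(ref)   # depth = LRU stack distance
            stack.pop(d)
        else:
            d = float("inf")       # first touch misses at every size
        dists.append(d)
        stack.insert(0, ref)
    return dists

def misses(dists, size):
    """Misses in a fully associative memory holding `size` blocks."""
    return sum(1 for d in dists if d >= size)

d = stack_distances(["a", "b", "a", "c", "b", "a"])
# Monotonically non-increasing with size: the stack property.
assert misses(d, 1) >= misses(d, 2) >= misses(d, 3)
assert misses(d, 3) == 3           # only the three cold misses remain
```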

Belady’s Algorithm Belady’s algorithm: replace the entry that will be accessed furthest in the future – It is the optimal algorithm – It needs to know the future, so it is not realizable in practice, but it is useful in simulation as a baseline for practical algorithms Baer, p. 67
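For use as such a baseline, Belady's algorithm is short to implement offline (a sketch; the example reference string is a common textbook one, not from Baer):

```python
# Belady's optimal (OPT) replacement: evict the resident page whose next
# use lies furthest in the future (or that is never used again). It needs
# the whole trace, so it only runs offline, e.g. in a simulator.
def belady_misses(trace, capacity):
    resident, misses = set(), 0
    for i, page in enumerate(trace):
        if page in resident:
            continue                      # hit
        misses += 1
        if len(resident) >= capacity:
            future = trace[i + 1:]
            def next_use(p):
                return future.index(p) if p in future else float("inf")
            resident.discard(max(resident, key=next_use))
        resident.add(page)
    return misses

# 3 frames: OPT takes 7 misses on this reference string.
assert belady_misses([7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2], 3) == 7
```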