Optimizing Dynamic Memory Management

Slides:



Advertisements
Similar presentations
Dynamic Memory Management
Advertisements

Chapter 6: Memory Management
Lecture 10: Heap Management CS 540 GMU Spring 2009.
File Systems.
KERNEL MEMORY ALLOCATION Unix Internals, Uresh Vahalia Sowmya Ponugoti CMSC 691X.
1 Optimizing Malloc and Free Professor Jennifer Rexford COS 217 Reading: Section 8.7 in K&R book
CPSC 388 – Compiler Design and Construction
Memory Management Chapter 4. Memory hierarchy Programmers want a lot of fast, non- volatile memory But, here is what we have:
Memory Management Memory Areas and their use Memory Manager Tasks:
1 Optimizing Malloc and Free Professor Jennifer Rexford
Memory Management A memory manager should take care of allocating memory when needed by programs release memory that is no longer used to the heap. Memory.
Memory Management (continued) CS-3013 C-term Memory Management CS-3013 Operating Systems C-term 2008 (Slides include materials from Operating System.
Memory Management 1 CS502 Spring 2006 Memory Management CS-502 Spring 2006.
CS-3013 & CS-502, Summer 2006 Memory Management1 CS-3013 & CS-502 Summer 2006.
Computer Organization and Architecture
1 Inner Workings of Malloc and Free Professor Jennifer Rexford COS 217.
Memory Allocation CS Introduction to Operating Systems.
File Implementation. File System Abstraction How to Organize Files on Disk Goals: –Maximize sequential performance –Easy random access to file –Easy.
Dynamic Memory Allocation Questions answered in this lecture: When is a stack appropriate? When is a heap? What are best-fit, first-fit, worst-fit, and.
1 Memory Management Memory Management COSC513 – Spring 2004 Student Name: Nan Qiao Student ID#: Professor: Dr. Morteza Anvari.
1. Memory Manager 2 Memory Management In an environment that supports dynamic memory allocation, the memory manager must keep a record of the usage of.
Subject: Operating System.
CSCI-375 Operating Systems Lecture Note: Many slides and/or pictures in the following are adapted from: slides ©2005 Silberschatz, Galvin, and Gagne Some.
1 Address Translation Memory Allocation –Linked lists –Bit maps Options for managing memory –Base and Bound –Segmentation –Paging Paged page tables Inverted.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition, Chapter 11: File System Implementation.
Chapter 17 Free-Space Management Chien-Chung Shen CIS, UD
Memory Management -Memory allocation -Garbage collection.
Operating Systems ECE344 Ashvin Goel ECE University of Toronto Memory Management Overview.
Dynamic Memory Management
Consider Starting with 160 k of memory do: Starting with 160 k of memory do: Allocate p1 (50 k) Allocate p1 (50 k) Allocate p2 (30 k) Allocate p2 (30 k)
Dynamic Memory Allocation II
CS 241 Discussion Section (12/1/2011). Tradeoffs When do you: – Expand Increase total memory usage – Split Make smaller chunks (avoid internal fragmentation)
Lecture 7 Page 1 CS 111 Summer 2013 Dynamic Domain Allocation A concept covered in a previous lecture We’ll just review it here Domains are regions of.
CSE 351 Dynamic Memory Allocation 1. Dynamic Memory Dynamic memory is memory that is “requested” at run- time Solves two fundamental dilemmas: How can.
Dynamic Memory Management Jennifer Rexford 1. 2 Goals of this Lecture Dynamic memory management techniques Garbage collection by the run-time system (Java)
COMP091 – Operating Systems 1 Memory Management. Memory Management Terms Physical address –Actual address as seen by memory unit Logical address –Address.
Memory Management I: Dynamic Storage Allocation Oct 8, 1998 Topics User-level view Policies Mechanisms class14.ppt Introduction to Computer Systems.
Memory Management What if pgm mem > main mem ?. Memory Management What if pgm mem > main mem ? Overlays – program controlled.
Chapter 17 Free-Space Management
Section 10: Memory Allocation Topics
Memory Management Memory Areas and their use Memory Manager Tasks:
Understanding Operating Systems Seventh Edition
CPSC 231 Organizing Files for Performance (D.H.)
Lectures linked lists Chapter 6 of textbook
CS703 - Advanced Operating Systems
Dynamic Domain Allocation
Dynamic Memory Allocation
Operating Systems (CS 340 D)
Memory Management Memory Areas and their use Memory Manager Tasks:
PA1 is out Best by Feb , 10:00 pm Enjoy early
Main Memory Management
LINKED LISTS CSCD Linked Lists.
Lecture 10: Buffer Manager and File Organization
Chapter 11: File System Implementation
Optimizing Malloc and Free
CS Introduction to Operating Systems
Lecturer SOE Dan Garcia
Main Memory Background Swapping Contiguous Allocation Paging
Memory Management Overview
Virtual Memory Hardware
Memory Allocation II CSE 351 Autumn 2017
CSE 451: Operating Systems Autumn 2005 Memory Management
Operating System Chapter 7. Memory Management
Memory Management (1).
CSE 451: Operating Systems Autumn 2003 Lecture 9 Memory Management
CSE 451: Operating Systems Autumn 2003 Lecture 9 Memory Management
Memory Management Memory Areas and their use Memory Manager Tasks:
CS703 - Advanced Operating Systems
COMP755 Advanced Operating Systems
Presentation transcript:

Optimizing Dynamic Memory Management Jennifer Rexford

Goals of this Lecture Details of K&R heap manager Heap mgr optimizations related to Assignment #6 Faster free() via doubly-linked list, redundant sizes, and status bits Faster malloc() via binning Other heap mgr optimizations Best/good fit block selection Selective splitting Deferred coalescing Segregated data Segregated meta-data Memory mapping

Details of the K&R Heap Manager Part 1: Details of the K&R Heap Manager

An Implementation Challenge Need information about each free block Starting address of the block of memory Length of the free block Pointer to the next block in the free list Where should this information be stored? Number of free blocks is not known in advance So, need to store the information on the heap But, wait, this code is what manages the heap!!! Can’t call malloc() to allocate storage for this info Can’t call free() to deallocate the storage, either

Store Information in the Free Block Solution: Store the information directly in the block Since the memory isn’t being used for anything anyway And allows data structure to grow and shrink as needed

Block Headers Every free block has a header, containing: Pointer to (i.e., address of) the next free block Size of the free block Challenge: programming outside the type system p (address returned to the user) header size user data

Free List: Circular Linked List Free blocks, linked together Example: circular linked list Keep list in order of increasing addresses Makes it easier to coalesce adjacent free blocks Free list In use In use In use

Malloc: First-Fit Algorithm Start at the beginning of the list Sequence through the list Keep a pointer to the previous element Stop when reaching first block that is big enough Patch up the list Return a pointer to the user prev p prev p p

Malloc: First Case: Perfect Fit Suppose the first fit is a perfect fit Remove the block from the list Link the previous free block with the next free block Return the current to the user (skipping header) p+1 prev p

Malloc: Second Case: Big Block Suppose the block is bigger than requested Divide the free block into two blocks Keep first (now smaller) block in the free list Allocate the second block to the user Bonus: No need to manipulate links p p

Free User passes a pointer to the memory block void free(void *ap); free() function inserts block into the list Identify the start of entry Find the location in the free list Add to the list, coalescing entries, if needed bp ap

Free: Finding Location to Insert Start at the beginning Sequence through the list Stop at last entry before the to-be-freed element Free list p bp In use In use FREE ME

Free: Handling Corner Cases Check for wrap-around in memory To-be-freed block is before first entry in the free list, or To-be-freed block is after the last entry in the free list Free list bp p FREE ME In use In use

Free: Inserting Into Free List New element to add to free list Insert in between previous and next entries But, there may be opportunities to coalesce bp p p->s.ptr

Coalescing With Neighbors Scanning the list finds the location for inserting Pointer to to-be-freed element: bp Pointer to previous element in free list: p Coalescing into larger free blocks Check if contiguous to upper and lower neighbors Free list p bp In use In use FREE ME lower upper

Coalesce With Upper Neighbor Check if next part of memory is in the free list If so, make into one bigger block Else, simply point to the next free element p bp p->s.ptr upper p p->s.ptr

Coalesce With Lower Neighbor Check if previous part of memory is in the free list If so, make into one bigger block p bp p->s.ptr lower p p->s.ptr

Strengths of K&R Approach Advantages Simplicity of the code Optimizations to malloc() Splitting large free block to avoid wasting space Optimization to free() Roving free-list pointer is left at the last place a block was allocated Coalescing contiguous free blocks to reduce fragmentation bp upper p p->s.ptr

Weaknesses of K&R Approach Inefficient use of memory: fragmentation First-fit policy can leave lots of “holes” of free blocks in memory Long execution times: linear-time overhead malloc() scans the free list to find a big-enough block free() scans the free list to find where to insert a block Accessing a wide range of memory addresses in free list Can lead to large amount of paging to/from the disk In use Free list 20 8 50

Optimizations Related to Assignment 6 Part 2: Optimizations Related to Assignment 6

Faster Free Performance problems with K&R free() Scanning the free list to know where to insert Keeping track of the “previous” node to do the insertion Doubly-linked, non-circular list Header Size of the block (in # of units) Flag indicating whether the block is free or in use If free, a pointer to the next free block Footer If free, a pointer to the previous free block h e a d f o t

Size: Finding Next Block Go quickly to next block in memory Start with the user’s data portion of the block Go backwards to the head of the block Easy, since you know the size of the header Go forward to the head of the next block Easy, since you know the size of the current block

Size: Finding Previous Block Go quickly to previous chunk in memory Start with the user’s data portion of the block Go backwards to the head of the block Easy, since you know the size of the header Go backwards to the footer of the previous block Easy, since you know the size of the footer Go backwards to the header of the previous block Easy, since you know the size from the footer

Pointers: Next Free Block Go quickly to next free block in memory Start with the user’s data portion of the block Go backwards to the head of the block Easy, since you know the size of the header Go forwards to the next free block Easy, since you have the next free pointer

Pointers: Previous Free Block Go quickly to previous free block in memory Start with the user’s data portion of the block Go backwards to the head of the block Easy, since you know the size of the header Go forwards to the footer of the block Easy, since you know the block size from the header Go backwards to the previous free block Easy, since you have the previous free pointer

Efficient Free Before: K&R Scan the free list till you find the place to insert Needed to see if you can coalesce adjacent blocks Expensive for loop with several pointer comparisons After: with header/footer and doubly-linked list Coalescing with the previous block in memory Check if previous block in memory is also free If so, coalesce Coalescing with the next block in memory the same way Add the new, larger block to the front of the linked list

But Malloc is Still Slow… Still need to scan the free list To find the first, or best, block that fits Root of the problem Free blocks have a wide range of sizes Solution: binning Separate free lists by block size Implemented as an array of free-list pointers

Binning Strategies: Exact Fit Have a bin for each block size, up to a limit Advantages: no search for requests up to that size Disadvantages: many bins, each storing a pointer Except for a final bin for all larger free blocks For allocating larger amounts of memory For splitting to create smaller blocks, when needed 1 1 1 1 2 3 3 3 4 > 4 5 8

Binning Strategies: Range Have a bin cover a range of sizes, up to a limit Advantages: fewer bins Disadvantages: need to search for a big enough block Except for a final bin for all larger free chunks For allocating larger amounts of memory For splitting to create smaller blocks, when needed 1-2 1 2 1 3-4 5-6 6 5 7-8 > 8 10 14

Suggestions for Assignment #6 Debugging memory management code is hard A bug in your code might stomp on the headers or footers … making it very hard to understand where you are in memory Suggestion: debug carefully as you go along Write little bits of code at a time, and test as you go Use assertion checks very liberally to catch mistakes early Use functions to apply higher-level checks on your list E.g,. all free-list blocks are marked as free E.g., each block pointer is within the heap range E.g., the block size in header and footer are the same Suggestion: draw lots and lots of pictures

Part 3: Other Optimizations

Best/Good Fit Block Selection Observation: K&R uses “first fit” (really, “next fit”) strategy Example: malloc(8) would choose the 20-byte block Alternative: “best fit” or “good fit” strategy Example: malloc(8) would choose the 8-byte block Applicable if not binning, or if a bin has blocks of variable sizes Pro: Minimizes internal fragmentation and splitting Con: Increases cost of choosing free block Free list In use In use In use 8 50 20

Selective Splitting Free list In use In use In use 8 50 20 Observation: K&R malloc() splits whenever chosen block is too big Example: malloc(14) splits the 20-byte block Alternative: selective splitting Split only when the saving is big enough Example: malloc(14) allocates the entire 20-byte block Pro: Reduces external fragmentation Con: Increases internal fragmentation Free list In use In use In use 8 50 20

Deferred Coalescing Free list p bp In use In use FREE ME lower upper Observation: K&R does coalescing in free() whenever possible Alternative: deferred coalescing Wait, and coalesce many blocks at a later time Pro: Handles “malloc(x);free();malloc(x)” sequences well Con: Complicates algorithms Free list p bp In use In use FREE ME lower upper

Segregated Data Observation: Problem: Solution: Segregated data Splitting and coalescing consume lots of overhead Problem: How to eliminate that overhead? Solution: Segregated data Make use of the virtual memory concept… Store each bin’s blocks in a distinct virtual memory page Elaboration…

Segregated Data (cont.) Each bin contains blocks of fixed sizes E.g. 32, 64, 128, … All blocks within a bin are from same virtual memory page Malloc never splits! Examples: Malloc for 32 bytes => provide 32 Malloc for 5 bytes => provide 32 Malloc for 100 bytes => provide 128 Free never coalesces! Free block => examine address, infer virtual memory page, infer bin, insert into that bin Pro: Completely eliminates splitting and coalescing overhead Pro: Eliminates most meta-data; only forward links are required (no backward links, sizes, status bits, footers) Con: Some usage patterns cause excessive external fragmentation

Segregated Meta-Data Observations: Problem 1: Problem 2: Meta-data (block sizes, status flags, links, etc.) are scattered across the heap, interspersed with user data Heap mgr often must traverse meta-data Problem 1: User error easily can corrupt meta-data Problem 2: Frequent traversal of meta-data can cause excessive page faults Solution: Segregated meta-data Make use of the virtual memory concept… Store meta-data in a distinct (segregated) virtual memory page from user data

Memory Mapping Observations: Problem: Solution: Memory mapping Heap mgr might want to release heap memory to OS (e.g. for use as stack) Heap mgr can call brk(currentBreak–x) to release freed memory to OS, but… Difficult to know when memory at high end of heap is free, and… Often freed memory is not at high end of heap! Problem: How can heap mgr effectively release freed memory to OS? Solution: Memory mapping Make use of virtual memory concept… Allocate memory via mmap() system call Free memory via munmap() system call

mmap() and munmap() Typical call of mmap() Typical call of munmap() p = mmap(NULL, size, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANON, 0, 0); Asks the OS to map a new private read/write area of virtual memory containing size bytes Returns the virtual address of the new area on success, NULL on failure Typical call of munmap() status = munmap(p, size); Unmaps the area of virtual memory at virtual address p consisting of size bytes Returns 1 on success, 0 on failure See Bryant & O’Hallaron book and man pages for details

Using mmap() and munmap() Typical strategy: Allocate small block => Call brk() if necessary Manipulate data structures described earlier in this lecture Free small block => Do not call brk() Allocate large block => Call mmap() Free large block => Call munmap()

Summary Details of K&R heap manager Heap mgr optimizations related to Assignment #6 Faster free() via doubly-linked list, redundant sizes, and status bits Faster malloc() via binning Other heap mgr optimizations Best/good fit block selection Selective splitting Deferred coalescing Segregated data Segregated meta-data Memory mapping