Presentation is loading. Please wait.

Presentation is loading. Please wait.

Garbage Collection Introduction and Overview Christian Schulte Excerpted from presentation by Christian Schulte Programming Systems Lab Universität des.

Similar presentations


Presentation on theme: "Garbage Collection Introduction and Overview Christian Schulte Excerpted from presentation by Christian Schulte Programming Systems Lab Universität des."— Presentation transcript:

1 Garbage Collection Introduction and Overview Christian Schulte Excerpted from presentation by Christian Schulte Programming Systems Lab Universität des Saarlandes, Germany schulte@ps.uni-sb.de

2 Garbage Collection… …is concerned with the automatic reclamation of dynamically allocated memory after its last use by a program

3 Garbage collection… Dynamically allocated memory Dynamically allocated memory Last use by a program Last use by a program Examples for automatic reclamation Examples for automatic reclamation

4 Kinds of Memory Allocation static int i; void foo(void) { int j; int* p = (int*) malloc(…); }

5 Static Allocation By compiler (in text area) By compiler (in text area) Available through entire runtime Available through entire runtime Fixed size Fixed size static int i; void foo(void) { int j; int* p = (int*) malloc(…); }

6 Automatic Allocation Upon procedure call (on stack) Upon procedure call (on stack) Available during execution of call Available during execution of call Fixed size Fixed size static int i; void foo(void) { int j; int* p = (int*) malloc(…); }

7 Dynamic Allocation Dynamically allocated at runtime (on heap) Dynamically allocated at runtime (on heap) Available until explicitly deallocated Available until explicitly deallocated Dynamically varying size Dynamically varying size static int i; void foo(void) { int j; int* p = (int*) malloc(…); }

8 Dynamically Allocated Memory Also: heap-allocated memory Also: heap-allocated memory Allocation: malloc, new, … Allocation: malloc, new, … –before first usage Deallocation: free, delete, dispose, … Deallocation: free, delete, dispose, … –after last usage Needed for Needed for –C++, Java: objects –SML:datatypes, procedures –anything that outlives procedure call

9 Getting it Wrong Forget to free (memory leak) Forget to free (memory leak) –program eventually runs out of memory –long running programs: OSs. servers, … Free to early (dangling pointer) Free to early (dangling pointer) –lucky: illegal access detected by OS –horror: memory reused, in simultaneous use programs can behave arbitrarily crashes might happen much later Estimates of effort Estimates of effort –Up to 40%! [Rovner, 1985]

10 Nodes and Pointers Node n Node n –Memory block, cell Pointer p Pointer p –Link to node –Node access: *p Children children(n) Children children(n) –set of pointers to nodes referred by n n p

11 Mutator Abstraction of program Abstraction of program –introduces new nodes with pointer –redirects pointers, creating garbage

12 Nodes referred to by several pointers Nodes referred to by several pointers Makes manual deallocation hard Makes manual deallocation hard –local decision impossible –respect other pointers to node Cycles instance of sharing Cycles instance of sharing Shared Nodes

13 Last Use by a Program Question: When is node M not any longer used by program? Question: When is node M not any longer used by program? –Let P be any program not using M –New program sketch: Execute P; Use M; –Hence: M used  P terminates –We are doomed: halting problem! So “last use” undecidable! So “last use” undecidable!

14 Safe Approximation Decidable and also simple Decidable and also simple What means safe? What means safe? –only unused nodes freed What means approximation? What means approximation? –some unused nodes might not be freed Idea Idea –nodes that can be accessed by mutator

15 Reachable Nodes Reachable from root set Reachable from root set –processor registers –static variables –automatic variables (stack) Reachable from reachable nodes Reachable from reachable nodes root

16 Summary: Reachable Nodes A node n is reachable, iff A node n is reachable, iff –n is element of the root set, or –n is element of children(m) and m is reachable Reachable node also called “live” Reachable node also called “live”

17 Mark and Sweep Compute set of reachable nodes Compute set of reachable nodes Free nodes known to be not reachable Free nodes known to be not reachable

18 Reachability: Safe Approximation Safe Safe –access to not reachable node impossible –depends on language semantics –but C/C++? later… Approximation Approximation –reachable node might never be accessed –programmer must know about this! –have you been aware of this?

19 Example Garbage Collectors Mark-Sweep Mark-Sweep Others Others –Mark-Compact –Reference Counting –Copying –see Chapter 1&2 of [Lins&Jones,96]

20 The Mark-Sweep Collector Compute reachable nodes: Mark Compute reachable nodes: Mark –tracing garbage collector Free not reachable nodes: Sweep Free not reachable nodes: Sweep Run when out of memory: Allocation Run when out of memory: Allocation First used with LISP [McCarthy, 1960] First used with LISP [McCarthy, 1960]

21 Allocation node* new() { if (free_pool is empty) mark_sweep(); …

22 Allocation node* new() { if (free_pool is empty) mark_sweep(); return allocate(); }

23 The Garbage Collector void mark_sweep() { for (r in roots) mark(r); …

24 The Garbage Collector void mark_sweep() { for (r in roots) mark(r); … all live nodes marked

25 Recursive Marking void mark(node* n) { if (!is_marked(n)) { set_mark(n); … }

26 Recursive Marking void mark(node* n) { if (!is_marked(n)) { set_mark(n); … } nodes reachable from n marked

27 Recursive Marking void mark(node* n) { if (!is_marked(n)) { set_mark(n); for (m in children(n)) mark(m); } i-th recursion: nodes on path with length i marked

28 The Garbage Collector void mark_sweep() { for (r in roots) mark(r); sweep(); …

29 The Garbage Collector void mark_sweep() { for (r in roots) mark(r); sweep(); … all nodes on heap live

30 The Garbage Collector void mark_sweep() { for (r in roots) mark(r); sweep(); … all nodes on heap live and not marked

31 Eager Sweep void sweep() { node* n = heap_bottom; while (n < heap_top) { … }

32 Eager Sweep void sweep() { node* n = heap_bottom; while (n < heap_top) { if (is_marked(n)) clear_mark(n); else free(n); n += sizeof(*n); }

33 The Garbage Collector void mark_sweep() { for (r in roots) mark(r); sweep(); if (free_pool is empty) abort(“Memory exhausted”); }

34 Assumptions Nodes can be marked Nodes can be marked Size of nodes known Size of nodes known Heap contiguous Heap contiguous Memory for recursion available Memory for recursion available Child fields known! Child fields known!

35 Assumptions: Realistic Nodes can be marked Nodes can be marked Size of nodes known Size of nodes known Heap contiguous Heap contiguous Memory for recursion available Memory for recursion available Child fields known Child fields known

36 Assumptions: Conservative Nodes can be marked Nodes can be marked Size of nodes known Size of nodes known Heap contiguous Heap contiguous Memory for recursion available Memory for recursion available Child fields known Child fields known

37 Mark-Sweep Properties Covers cycles and sharing Covers cycles and sharing Time depends on Time depends on –live nodes (mark) –live and garbage nodes (sweep) Computation must be stopped Computation must be stopped –non-interruptible stop/start collector –long pause Nodes remain unchanged (as not moved) Nodes remain unchanged (as not moved) Heap remains fragmented Heap remains fragmented

38 Software Engineering Issues Design goal in SE: Design goal in SE: decompose systems in orthogonal components Clashes with letting each component do its memory management Clashes with letting each component do its memory management liveness is global property leads to “local leaks” lacking power of modern gc methods

39 Typical Cost Early systems (LISP) Early systems (LISP) up to 40% [Steele,75] [Gabriel,85] “garbage collection is expensive” myth Well engineered system of today Well engineered system of today 10% of entire runtime [Wilson, 94]

40 Areas of Usage Programming languages and systems Programming languages and systems –Java, C#, Smalltalk, … –SML, Lisp, Scheme, Prolog, … –Perl, Python, PHP, JavaScript –Modula 3, Microsoft.NET Extensions Extensions –C, C++ (Conservative) Other systems Other systems –Adobe Photoshop –Unix filesystem –Many others in [Wilson, 1996]

41 Understanding Garbage Collection: Benefits Programming garbage collection Programming garbage collection –programming systems –operating systems Understand systems with garbage collection (e.g. Java) Understand systems with garbage collection (e.g. Java) –memory requirements of programs –performance aspects of programs –interfacing with garbage collection (finalization)

42 References Garbage Collection. Richard Jones and Rafael Lins, John Wiley & Sons, 1996. Garbage Collection. Richard Jones and Rafael Lins, John Wiley & Sons, 1996. Uniprocessor garbage collection techniques. Paul R. Wilson, ACM Computing Surveys. To appear. Uniprocessor garbage collection techniques. Paul R. Wilson, ACM Computing Surveys. To appear. Extended version of IWMM 92, St. Malo.


Download ppt "Garbage Collection Introduction and Overview Christian Schulte Excerpted from presentation by Christian Schulte Programming Systems Lab Universität des."

Similar presentations


Ads by Google