CSE332: Data Abstractions Lecture 23: Programming with Locks and Critical Sections Dan Grossman Spring 2010.


Concurrency: where are we

Done:
– The semantics of locks
– Locks in Java
– Using locks for mutual exclusion: bank-account example

This lecture:
– More bad interleavings (learn to spot these!)
– Guidelines/idioms for shared-memory and using locks correctly
– Coarse-grained vs. fine-grained

Upcoming lectures:
– Readers/writer locks
– Deadlock
– Condition variables
– Data races and memory-consistency models

Races

A race condition occurs when the computation's result depends on scheduling (how threads are interleaved). Race conditions are bugs that exist only due to concurrency: with one thread there is no interleaved scheduling. Typically the problem is some intermediate state that "messes up" a concurrent thread that "sees" that state.

Example

class Stack<E> {
  …
  synchronized boolean isEmpty() { … }
  synchronized void push(E val) { … }
  synchronized E pop() {
    if(isEmpty())
      throw new StackEmptyException();
    …
  }
  E peek() { // not synchronized!
    E ans = pop();
    push(ans);
    return ans;
  }
}

peek, sequentially speaking

In a sequential world, this code is of questionable style, but unquestionably correct. The "algorithm" is the only way to write a peek helper method if all you had was this interface:

interface Stack<E> {
  boolean isEmpty();
  void push(E val);
  E pop();
}

class C {
  static <E> E myPeek(Stack<E> s){ ??? }
}

peek, concurrently speaking

peek has no overall effect on the shared data:
– It is a "reader" not a "writer"

But the way it is implemented creates an inconsistent intermediate state:
– Even though calls to push and pop are synchronized, so there are no data races on the underlying array/list/whatever

This intermediate state should not be exposed:
– Leads to several wrong interleavings…

peek and isEmpty

Property we want: If there has been a push and no pop, then isEmpty returns false.

With peek as written, the property can be violated – how? One bad interleaving:

  Thread 1 (peek)        Thread 2
                         push(x)
  E ans = pop();
                         boolean b = isEmpty()  // true – property violated!
  push(ans);
  return ans;


peek and push

Property we want: Values are returned from pop in LIFO order.

With peek as written, the property can be violated – how? One bad interleaving:

  Thread 1 (peek)        Thread 2
                         push(x)
  E ans = pop();
                         push(y)
  push(ans);
  return ans;
                         E e = pop()  // returns x, not y – not LIFO!


peek and pop

Property we want: Values are returned from pop in LIFO order.

With peek as written, the property can be violated – how? One bad interleaving:

  Thread 1 (peek)        Thread 2
                         push(x)
                         push(y)
  E ans = pop();
                         E e = pop()  // returns x, not y – not LIFO!
  push(ans);
  return ans;

peek and peek

Property we want: peek doesn't throw an exception if the number of pushes exceeds the number of pops.

With peek as written, the property can be violated – how? With one element on the stack, two concurrent peeks can both pop first:

  Thread 1 (peek)        Thread 2 (peek)
  E ans = pop();
                         E ans = pop();  // stack empty – throws!
  push(ans);
  return ans;


The fix

In short, peek needs synchronization to disallow bad interleavings:
– The key is to make a larger critical section
– Re-entrant locks allow the nested calls to push and pop

class Stack<E> {
  …
  synchronized E peek(){
    E ans = pop();
    push(ans);
    return ans;
  }
}

class C {
  static <E> E myPeek(Stack<E> s){
    synchronized (s) {
      E ans = s.pop();
      s.push(ans);
      return ans;
    }
  }
}

The wrong "fix"

Focus so far: problems from peek doing writes that lead to an incorrect intermediate state.

Tempting but wrong: if an implementation of peek (or isEmpty) does not write anything, then maybe we can skip the synchronization?

This does not work, due to data races with push and pop…

Example, again (no resizing or checking)

class Stack<E> {
  private E[] array = (E[])new Object[SIZE];
  int index = -1;

  boolean isEmpty() { // unsynchronized: wrong?!
    return index==-1;
  }

  synchronized void push(E val) {
    array[++index] = val;
  }

  synchronized E pop() {
    return array[index--];
  }

  E peek() { // unsynchronized: wrong!
    return array[index];
  }
}

Why wrong?

It looks like isEmpty and peek can "get away with this" since push and pop adjust the state "in one tiny step". But this code is still wrong, and depends on language-implementation details you cannot assume:
– Even "tiny steps" may require multiple steps in the implementation: array[++index] = val probably takes at least two steps
– The code has a data race, allowing very strange behavior (important discussion in a future lecture)

Moral: Don't introduce a data race, even if every interleaving you can think of is correct.
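To see why array[++index] = val is not one tiny step, here is a single-threaded simulation of one bad interleaving of two unsynchronized pushes. The class name PushInterleaving and the explicit step decomposition are illustrative assumptions, not from the lecture; each simulated "thread" performs the read-increment and the write as two separate steps, as an implementation might.

```java
// Simulates push broken into two steps: (1) compute the new index,
// (2) store the value and publish the index. Interleaving two pushes
// loses one of the two pushed elements.
public class PushInterleaving {
    static Object[] array = new Object[10];
    static int index = -1;

    public static void main(String[] args) {
        // "Thread A", step 1: reads index (-1), computes slot 0
        int slotA = index + 1;
        // "Thread B", step 1: also reads index (-1), also computes slot 0
        int slotB = index + 1;
        // "Thread A", step 2: writes "a" into slot 0, publishes index = 0
        array[slotA] = "a"; index = slotA;
        // "Thread B", step 2: overwrites slot 0 with "b"; index stays 0
        array[slotB] = "b"; index = slotB;

        // Two pushes happened, but the stack holds only one element:
        System.out.println(index);    // 0, not 1
        System.out.println(array[0]); // b  ("a" was lost)
    }
}
```

A real data race is worse than this simulation suggests: the compiler and hardware may reorder or cache the unsynchronized reads, so behavior is not limited to the interleavings you can enumerate.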

Getting it right

Avoiding race conditions on shared resources is difficult:
– Decades of bugs have led to some conventional wisdom: general techniques that are known to work

The rest of this lecture distills the key ideas and trade-offs:
– Parts paraphrased from "Java Concurrency in Practice", Chapter 2 (the rest of the book is more advanced)
– But none of this is specific to Java or to a particular book!

3 choices

For every memory location (e.g., object field) in your program, you must obey at least one of the following:

1. Thread-local: Don't use the location in more than one thread
2. Immutable: Don't write to the memory location
3. Synchronized: Use synchronization to control access to the location

(Picture: all memory, partitioned into thread-local memory, immutable memory, and memory that needs synchronization)

Thread-local

Whenever possible, don't share resources:
– It is easier to have each thread use its own thread-local copy of a resource than to have one resource with shared updates
– This is correct only if threads don't need to communicate through the resource; that is, multiple copies must be a correct approach
– Example: Random objects
– Note: Since each call-stack is thread-local, you never need to synchronize on local variables

In typical concurrent programs, the vast majority of objects should be thread-local: shared memory should be rare – minimize it.
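One standard way to give each thread its own Random is java.lang.ThreadLocal. A minimal sketch (the class name PerThreadRandom and the fixed seed are assumptions for the demo):

```java
import java.util.Random;

public class PerThreadRandom {
    // Each thread that calls rng.get() lazily gets its own Random,
    // so no synchronization is ever needed on the Random itself.
    private static final ThreadLocal<Random> rng =
        ThreadLocal.withInitial(() -> new Random(42)); // fixed seed just for the demo

    public static int nextInt(int bound) {
        return rng.get().nextInt(bound);
    }
}
```

Since Java 7, java.util.concurrent.ThreadLocalRandom.current() packages up exactly this idiom.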

Immutable

Whenever possible, don't update objects:
– Make new objects instead
– This is one of the key tenets of functional programming (see CSE341): generally helpful to avoid side-effects, and much more helpful in a concurrent setting

If a location is only read, never written, then no synchronization is necessary!
– Simultaneous reads are not races, and not a problem

In practice, programmers usually over-use mutation – minimize it.
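A minimal sketch of an immutable class (the name ImmutablePoint is made up for illustration): all fields are final, there are no setters, and "updates" return a new object, so instances can be freely shared across threads with no locks at all.

```java
public final class ImmutablePoint {
    private final int x, y;

    public ImmutablePoint(int x, int y) { this.x = x; this.y = y; }

    public int x() { return x; }
    public int y() { return y; }

    // "Update" by building a new object. The original is untouched,
    // so any thread still holding a reference to it sees no change.
    public ImmutablePoint translate(int dx, int dy) {
        return new ImmutablePoint(x + dx, y + dy);
    }
}
```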

The rest

After minimizing the amount of memory that is both (1) thread-shared and (2) mutable, we need guidelines for how to use locks to keep the remaining data consistent.

Guideline #0: No data races. Never allow two threads to read/write or write/write the same location at the same time.
– Necessary: In Java or C, a program with a data race is almost always wrong
– Not sufficient: Our peek example had no data races

Consistent Locking

Guideline #1: For each location needing synchronization, have a lock that is always held when reading or writing the location.

We say the lock guards the location. The same lock can (and often should) guard multiple locations. Clearly document the guard for each location.

In Java, often the guard is the object containing the location:
– this, inside the object's methods

Consistent Locking continued

The mapping from locations to guarding locks is conceptual: it partitions the shared-and-mutable locations by which lock guards them.

Consistent locking is:
– Not sufficient: It prevents all data races, but still allows higher-level races (exposed intermediate states). Our peek example used consistent locking.
– Not necessary: The locking protocol can change dynamically…

Beyond consistent locking

Consistent locking is an excellent guideline:
– A "default assumption" about program design

But it isn't required for correctness: different program phases can use different invariants, provided all threads coordinate when moving to the next phase.

Example from Project 3, Version 5:
– A shared grid is being updated, so use a lock for each entry
– But after the grid is filled out, all threads except one terminate, so synchronization is no longer necessary (the grid is thread-local)
– And later the grid becomes immutable, making synchronization doubly unnecessary

Lock granularity

Coarse-grained: Fewer locks, i.e., more objects per lock
– Example: One lock for an entire data structure (e.g., an array)
– Example: One lock for all bank accounts

Fine-grained: More locks, i.e., fewer objects per lock
– Example: One lock per data element (e.g., per array index)
– Example: One lock per bank account

"Coarse-grained vs. fine-grained" is really a continuum.

Trade-offs

Coarse-grained advantages:
– Simpler to implement
– Faster/easier to implement operations that access multiple locations (because they are all guarded by the same lock)
– Much easier for operations that modify the data structure's shape

Fine-grained advantages:
– More simultaneous access (better performance when coarse-grained locking would lead to unnecessary blocking)

Guideline #2: Start with coarse-grained (simpler) and move to fine-grained (performance) only if contention on the coarser locks becomes an issue. Alas, this often leads to bugs.

Example: Hashtable

Coarse-grained: One lock for the entire hashtable
Fine-grained: One lock for each bucket

Which supports more concurrency for insert and lookup?
Which makes implementing resize easier? How would you do it?

Note: If a hashtable has a numElements field, maintaining it will destroy the benefits of using separate locks for each bucket.
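The fine-grained (per-bucket) idea is sometimes called lock striping. A sketch under stated assumptions: the class StripedTable and its fields are made up for illustration, resize is deliberately omitted (it would need to hold every bucket lock), and there is no shared numElements field, for exactly the reason noted above.

```java
import java.util.HashMap;
import java.util.Map;

public class StripedTable<K, V> {
    private static final int BUCKETS = 16;

    // One small map and one lock object per bucket; bucket i is
    // guarded by locks[i], so operations on different buckets
    // never block each other.
    private final Map<K, V>[] buckets;
    private final Object[] locks;

    @SuppressWarnings("unchecked")
    public StripedTable() {
        buckets = new Map[BUCKETS];
        locks = new Object[BUCKETS];
        for (int i = 0; i < BUCKETS; i++) {
            buckets[i] = new HashMap<>();
            locks[i] = new Object();
        }
    }

    private int bucketOf(K key) {
        return (key.hashCode() & 0x7fffffff) % BUCKETS;
    }

    public void insert(K key, V value) {
        int i = bucketOf(key);
        synchronized (locks[i]) {
            buckets[i].put(key, value);
        }
    }

    public V lookup(K key) {
        int i = bucketOf(key);
        synchronized (locks[i]) {
            return buckets[i].get(key);
        }
    }
}
```

Early versions of java.util.concurrent.ConcurrentHashMap used a striping scheme along these lines.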

Critical-section granularity

A second, orthogonal granularity issue is critical-section size: how much work to do while holding the lock(s).

If critical sections run for too long:
– Performance loss, because other threads are blocked

If critical sections are too short:
– Bugs, because you broke up an operation whose intermediate state other threads should not be able to see

Guideline #3: Don't do expensive computations or I/O in critical sections, but also don't introduce race conditions.

Example

Suppose we want to change the value for a key in a hashtable without removing it from the table. Assume lock guards the whole table.

synchronized(lock) {
  v1 = table.lookup(k);
  v2 = expensive(v1);
  table.remove(k);
  table.insert(k,v2);
}

Papa Bear's critical section was too long: the table is locked during the expensive call.

Example

Suppose we want to change the value for a key in a hashtable without removing it from the table. Assume lock guards the whole table.

synchronized(lock) {
  v1 = table.lookup(k);
}
v2 = expensive(v1);
synchronized(lock) {
  table.remove(k);
  table.insert(k,v2);
}

Mama Bear's critical section was too short: if another thread updated the entry in between, we will lose that update.

Example

Suppose we want to change the value for a key in a hashtable without removing it from the table. Assume lock guards the whole table.

done = false;
while(!done) {
  synchronized(lock) {
    v1 = table.lookup(k);
  }
  v2 = expensive(v1);
  synchronized(lock) {
    if(table.lookup(k)==v1) {
      done = true;
      table.remove(k);
      table.insert(k,v2);
    }
  }
}

Baby Bear's critical section was just right: if another update occurred, try our update again.
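The Baby Bear pattern can be sketched as a runnable method. Assumptions for illustration: the class name UpdateDemo is made up, a plain HashMap guarded by a single lock stands in for the lecture's hashtable, and a caller-supplied function stands in for expensive.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Objects;
import java.util.function.UnaryOperator;

public class UpdateDemo {
    private static final Object lock = new Object();
    // guarded by lock
    private static final Map<String, Integer> table = new HashMap<>();

    // Recompute the value for k outside the lock, then commit only if
    // no other thread changed the entry in the meantime; otherwise retry.
    static void update(String k, UnaryOperator<Integer> expensive) {
        boolean done = false;
        while (!done) {
            Integer v1;
            synchronized (lock) {
                v1 = table.get(k);
            }
            Integer v2 = expensive.apply(v1); // potentially slow: no lock held
            synchronized (lock) {
                if (Objects.equals(table.get(k), v1)) {
                    done = true;
                    table.put(k, v2);
                }
            }
        }
    }

    static void insert(String k, Integer v) {
        synchronized (lock) { table.put(k, v); }
    }

    static Integer lookup(String k) {
        synchronized (lock) { return table.get(k); }
    }
}
```

Note the check-then-commit shape: this is the same optimistic idea behind compare-and-swap, applied at the level of a lock-guarded table.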

Atomicity

An operation is atomic if no other thread can see it partly executed:
– Atomic as in "(appears) indivisible"
– Typically we want ADT operations to be atomic, even with respect to other threads running operations on the same ADT

Guideline #4: Think in terms of what operations need to be atomic.
– Make critical sections just long enough to preserve atomicity
– Then design the locking protocol to implement the critical sections correctly

That is: think about atomicity first and locks second.

Don't roll your own

It is rare that you should write your own data structure:
– Standard libraries provide them
– The point of CSE332 is to understand the key trade-offs and abstractions

This is especially true for concurrent data structures:
– It is far too difficult to provide fine-grained synchronization without data races
– Standard thread-safe libraries like ConcurrentHashMap are written by world experts

Guideline #5: Use built-in libraries whenever they meet your needs.
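For instance, java.util.concurrent.ConcurrentHashMap already provides atomic read-modify-write operations, so the whole Baby Bear retry loop collapses to one library call (a minimal sketch):

```java
import java.util.concurrent.ConcurrentHashMap;

public class LibraryDemo {
    public static void main(String[] args) {
        ConcurrentHashMap<String, Integer> table = new ConcurrentHashMap<>();
        table.put("x", 5);

        // compute applies its update function atomically for the key,
        // so no explicit lock or retry loop is needed.
        table.compute("x", (k, v) -> v * v);

        System.out.println(table.get("x")); // 25
    }
}
```

One caveat: compute holds an internal lock while the function runs, so unlike the hand-rolled retry loop it is not a good place for a truly expensive computation; keep the function cheap.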