Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Languages and Compilers (SProg og Oversættere) Bent Thomsen Department of Computer Science Aalborg University With acknowledgement to John Mitchell whose.

Similar presentations


Presentation on theme: "1 Languages and Compilers (SProg og Oversættere) Bent Thomsen Department of Computer Science Aalborg University With acknowledgement to John Mitchell whose."— Presentation transcript:

1 1 Languages and Compilers (SProg og Oversættere) Bent Thomsen Department of Computer Science Aalborg University With acknowledgement to John Mitchell whose slides this lecture is based on.

2 2 Concurrency, distributed computing, the Internet Traditional view: Let the OS deal with this => It is not a programming language issue! End of Lecture Wait-a-minute … Maybe “the traditional view” is getting out of date?

3 3 Languages with concurrency constructs Maybe the “traditional view” was always out of date? Simula Modula3 Occam Concurrent Pascal ADA Linda CML Facile Jo-Caml Java C# …

4 4 Possibilities for Concurrency Architecture: Uniprocessor with: -I/O channel - I/o processor - DMA Multiprogramming, multiple process system Programs Network of uniprocessors Distributed programming Multiple CPU’sParallel programming

5 5 The promise of concurrency Speed –If a task takes time t on one processor, shouldn’t it take time t/n on n processors? Availability –If one process is busy, another may be ready to help Distribution –Processors in different locations can collaborate to solve a problem or work together Humans do it so why can’t computers? –Vision, cognition appear to be highly parallel activities

6 6 Challenges Concurrent programs are harder to get right –Folklore: Need an order of magnitude speedup (or more) to be worth the effort Some problems are inherently sequential –Theory – circuit evaluation is P-complete –Practice – many problems need coordination and communication among sub-problems Specific issues –Communication – send or receive information –Synchronization – wait for another process to act –Atomicity – do not stop in the middle and leave a mess

7 7 Why is concurrent programming hard? Nondeterminism –Deterministic: two executions on the same input it always produce the same output –Nondeterministic: two executions on the same input may produce different output Why does this cause difficulty? –May be many possible executions of one system –Hard to think of all the possibilities –Hard to test program since some may occur infrequently

8 8 Concurrency and programming Languages Library view –All(most all) concurrency constructs are packaged as library functions/methods –FORTRAN, C, Lisp, CML, Java (except for a few keywords like synchronized) Language view –Syntactic manifestation in the language –Concurrent Pascal –Occam –ADA –Facile –Erlang –…

9 9 Library view in C System Calls - fork( ) - wait( ) - pipe( ) - write( ) - read( ) Examples

10 10 Process Creation Fork( ) NAME fork() – create a new process SYNOPSIS # include pid_t fork(void) RETURN VALUE success parent- child pid child- 0 failure

11 11 Fork() system call- example #include Main() { printf(“[%ld] parent process id: %ld\n”, getpid(), getppid()); fork(); printf(“\n[%ld] parent process id: %ld\n”, getpid(), getppid()); }

12 12 Fork() system call- example [17619] parent process id: 12729 [2372] parent process id: 17619

13 13 Fork()- program structure #include Main() { pid_t pid; if((pid = fork())>0){ /* parent */ } else if ((pid==0){ /*child*/ } else { /* cannot fork* } exit(0); }

14 14 Wait() system call Wait()- wait for the process whose pid reference is passed to finish executing SYNOPSIS #include pid_t wait(int *stat)loc) The unsigned decimal integer process ID for which to wait RETURN VALUE success- child pid failure- -1 and errno is set

15 15 Wait()- program structure #include #include #include #include Main(int argc, char* argv[]) { pid_t childPID; if((childPID = fork())==0){ /*child*/ } else { /* parent* wait(0); } exit(0); }

16 16 Pipe() system call Pipe()- to create a read-write pipe that may later be used to communicate with a process we’ll fork off. SYNOPSIS Int pipe(pfd) int pfd[2]; PARAMETER Pfd is an array of 2 integers, which that will be used to save the two file descriptors used to access the pipe RETURN VALUE: 0 – success; -1 – error.

17 17 Pipe() - structure /* first, define an array to store the two file descriptors*/ Int pipe[2]; /* now, create the pipe*/ int rc = pipe (pipes); if(rc = = -1) { /* pipe() failed*/ Perror(“pipe”); exit(1); } If the call to pipe() succeeded, a pipe will be created, pipes[0] will contain the number of its read file descriptor, and pipes[1] will contain the number of its write file descriptor.

18 18 Write() system call Write() – used to write data to a file or other object identified by a file descriptor. SYNOPSIS #include Size_t write(int fildes, const void * buf, size_t nbyte); PARAMETER fildes is the file descriptor, buf is the base address of area of memory that data is copied from, nbyte is the amount of data to copy RETURN VALUE The return value is the actual amount of data written, if this differs from nbyte then something has gone wrong

19 19 Read() system call Read() – read data from a file or other object identified by a file descriptor SYNOPSIS #include Size_t read(int fildes, void *buf, size_t nbyte); ARGUMENT fildes is the file descriptor, buf is the base address of the memory area into which the data is read, nbyte is the maximum amount of data to read. RETURN VALUE The actual amount of data read from the file. The pointer is incremented by the amount of data read.

20 20 Solaris 2 Synchronization Implements a variety of locks to support multitasking, multithreading (including real-time threads), and multiprocessing. Uses adaptive mutexes for efficiency when protecting data from short code segments. Uses condition variables and readers-writers locks when longer sections of code need access to data. Uses turnstiles to order the list of threads waiting to acquire either an adaptive mutex or reader-writer lock.

21 21 Windows 2000 Synchronization Uses interrupt masks to protect access to global resources on uniprocessor systems. Uses spinlocks on multiprocessor systems. Also provides dispatcher objects which may act as wither mutexes and semaphores. Dispatcher objects may also provide events. An event acts much like a condition variable.

22 22 Basic question Maybe the library approach is not such a good idea? How can programming languages make concurrent and distributed programming easier?

23 23 Language support for concurrency Make invariants and intentions more apparent (part of the interface) Good software engineering Allows the compiler much more freedom to choose different implementations Also helps other tools

24 24 What could languages provide? Abstract model of system –abstract machine => abstract system Example high-level constructs –Process as the value of an expression Pass processes to functions Create processes at the result of function call –Communication abstractions Synchronous communication Buffered asynchronous channels that preserve msg order –Mutual exclusion, atomicity primitives Most concurrent languages provide some form of locking Atomicity is more complicated, less commonly provided

25 25 Basic issue: conflict between processes Critical section –Two processes may access shared resource –Inconsistent behavior if two actions are interleaved –Allow only one process in critical section Deadlock –Process may hold some locks while awaiting others –Deadlock occurs when no process can proceed

26 26 Cobegin/coend Limited concurrency primitive Example x := 0; cobegin begin x := 1; x := x+1 end; begin x := 2; x := x+1 end; coend; print(x); execute sequential blocks in parallel x := 0 x := 2 x := 1 print(x) x := x+1 Atomicity at level of assignment statement

27 27 Mutual exclusion Sample action procedure sign_up(person) begin number := number + 1; list[number] := person; end; Problem with parallel execution cobegin sign_up(fred); sign_up(bill); end; bob fred bill fred

28 28 Locks and Waiting cobegin begin sign_up(fred); // critical section end; begin sign_up(bill); // critical section end; Need atomic operations to implement wait

29 29 Mutual exclusion primitives Atomic test-and-set –Instruction atomically reads and writes some location –Common hardware instruction –Combine with busy-waiting loop to implement mutex Semaphore –Avoid busy-waiting loop –Keep queue of waiting processes –Scheduler has access to semaphore; process sleeps –Disable interrupts during semaphore operations OK since operations are short

30 30 Monitor Brinch-Hansen, Dahl, Dijkstra, Hoare Synchronized access to private data. Combines: –private data –set of procedures (methods) –synchronization policy At most one process may execute a monitor procedure at a time; this process is said to be in the monitor. If one process is in the monitor, any other process that calls a monitor procedure will be delayed. Modern terminology: synchronized object

31 31 OCCAM Program consists of processes and channels Process is code containing channel operations Channel is a data object All synchronization is via channels Formal foundation based on CSP

32 32 Channel Operations in OCCAM Read data item D from channel C –D ? C Write data item Q to channel C –Q ! C If reader accesses channel first, wait for writer, and then both proceed after transfer. If writer accesses channel first, wait for reader, and both proceed after transfer.

33 33 Tasking in Ada Declare a task type The specification gives the entries –task type T is entry Put (data : in Integer); entry Get (result : out Integer); end T; The entries are used to access the task

34 34 Declaring Task Body Task body gives actual code of task task body T is x : integer; -- local per thread declaration begin … accept Put (M : Integer) do … end Put; … end T;

35 35 Creating an Instance of a Task Declare a single task X : T; or an array of tasks P : array (1.. 50) of T; or a dynamically allocated task type AT is access T; P : AT; … P := new T;

36 36 Task execution Each task executes independently, until –an accept call wait for someone to call entry, then proceed with rendezvous code, then both tasks go on their way –an entry call wait for addressed task to reach corresponding accept statement, then proceed with rendezvous, then both tasks go on their way.

37 37 More on the Rendezvous During the Rendezvous, only the called task executes, and data can be safely exchanged via the entry parameters If accept does a simple assignment, we have the equivalent of a simple CSP channel operation, but there is no restriction on what can be done within a rendezvous

38 38 Termination of Tasks A task terminates when it reaches the end of the begin- end code of its body. Tasks may either be very static (create at start of execution and never terminate) Or very dynamic, e.g. create a new task for each new radar trace in a radar system.

39 39 The Delay Statement Delay statements temporarily pause a task –Delay xyz where xyz is an expression of type duration causes execution of the thread to be delayed for (at least) the given amount of time –Delay until tim where tim is an expression of type time, causes execution of the thread to be delayed until (at the earliest) the given tim

40 40 Selective Accept Select statement allows a choice of actions select entry1 (…) do.. end; or when bla entry2 (…); or delay...ddd...; end select; –Take whichever open entry arrives first, or if none arrives by end of delay, do …ddd…stmts.

41 41 Timed Entry Call Timed Entry call allows timeout to be set select entry-call-statement or delay xxx; … end select –We try to do the entry call, but if the task won’t accept in xxx time, then do the delay stmts.

42 42 Conditional Entry Call Make a call only if it will be accepted select entry-call.. else statements end select; –If entry call is accepted immediately, fine, otherwise execute the else statements.

43 43 Task Abort Unconditionally terminate a task abort taskname; –task is immediately terminated –(it is allowed to do finalization actions) –but whatever it was doing remains incomplete –code that can be aborted must be careful to leave things in a coherent state if that is important!

44 44 Java Concurrency Threads –Create process by creating thread object Communication –shared variables –method calls Mutual exclusion and synchronization –Every object has a lock (inherited from class Object) synchronized methods and blocks –Synchronization operations (inherited from class Object) wait : pause current thread until another thread calls notify notify : wake up waiting threads

45 45 Java Threads Thread –Set of instructions to be executed one at a time, in a specified order Java thread objects –Object of class Thread –Methods inherited from Thread: start : method called to spawn a new thread of control; causes VM to call run method suspend : freeze execution interrupt : freeze execution and throw exception to thread stop : forcibly cause thread to halt

46 46 Example subclass of Thread class PrintMany extends Thread { private String msg; public PrintMany (String m) {msg = m;} public void run() { try { for (;;){ System.out.print(msg + “ “); sleep(10); } } catch (InterruptedException e) { return; } } (inherits start from Thread)

47 47 Interaction between threads Shared variables –Two threads may assign/read the same variable –Programmer responsibility Avoid race conditions by explicit synchronization!! Method calls –Two threads may call methods on the same object Synchronization primitives –Each object has internal lock, inherited from Object –Synchronization primitives based on object locking

48 48 Synchronization example Objects may have synchronized methods Can be used for mutual exclusion –Two threads may share an object. –If one calls a synchronized method, this locks object. –If the other calls a synchronized method on same object, this thread blocks until object is unlocked.

49 49 Synchronized methods Marked by keyword public synchronized void commitTransaction(…) {…} Provides mutual exclusion –At most one synchronized method can be active –Unsynchronized methods can still be called Programmer must be careful Not part of method signature –sync method equivalent to unsync method with body consisting of a synchronized block –subclass may replace a synchronized method with unsynchronized method

50 50 Join, another form of synchronization Wait for thread to terminate class Future extends Thread { private int result; public void run() { result = f(…); } public int getResult() { return result;} } … Future t = new future; t.start() // start new thread … t.join(); x = t.getResult(); // wait and get result

51 51 Aspects of Java Threads Portable since part of language –Easier to use in basic libraries than C system calls –Example: garbage collector is separate thread General difficulty combining serial/concur code –Serial to concurrent Code for serial execution may not work in concurrent sys –Concurrent to serial Code with synchronization may be inefficient in serial programs (10-20% unnecessary overhead) Abstract memory model –Shared variables can be problematic on some implementations

52 52 Concurrent garbage collector How much concurrency? –Need to stop thread while mark and sweep –Other GC: may not need to stop all program threads Problem –Program thread may change objects during collection Solution –Prevent read/write to memory area –Details are subtle; generational, copying GC Modern GC distinguishes short-lived from long-lived objects Copying allows read to old area if writes are blocked … Relatively efficient methods for read barrier, write barrier

53 53 Java memory model Main ideas –Threads have local memory (cache) –Threads fill/flush from main memory Interaction restricted by constraints on actions –Use/assign are local thread memory actions –Load/store fill or flush local memory –Read/write are main memory actions

54 54 Thread Memory Hierarchy ThreadCache Shared Memory Cache code use/assign load/store read/write

55 55 Example Program r.i = r.i+1 load r.i store r.i use r.i assign r.i write r.i read r.i The value of field i of object i is –read from main memory –loaded into the local cache of the thread –used in the addition r.i+1 Similar steps to place the value of r.i in shared memory

56 56 Java Memory Model [Java Lang Spec] Example constraints on use, assign, load, store: –use and assign actions by thread must occur in the order specified by the program –Thread is not permitted to lose its most recent assign –Thread is not permitted to write data from its working memory to main memory for no reason –New thread starts with an empty working memory –New variable created only in main memory, not thread working memory “Provided that all the constraints are obeyed, a load or store action may be issued at any time by any thread on any variable, at the whim of the implementation.”

57 57 Access to Main Memory Constraints on load, store, read,write –For every load, must be a preceding read action –For every store, must be a following write action –Actions on master copy of a variable are performed by the main memory in order requested by thread

58 58 Prescient stores Under certain conditions … –Store actions (from cache to shared memory) may occur earlier than you would otherwise expect –Purpose: Allow optimizations that make properly synchronized programs run faster These optimizations may allow out-of-order operations for programs that are not properly synchronized Details are complicated. Main point: there’s more to designing a good memory model than you might think!

59 59 Criticism [Pugh] Model is hard to interpret and poorly understood Constraints –prohibit common compiler optimizations –expensive to implement on existing hardware Not commonly followed –Java Programs Sun Java Development Kit not guaranteed valid by the existing Java memory model –Implementations not compliant Sun Classic Wintel JVM, Sun Hotspot Wintel JVM, IBM 1.1.7b Wintel JVM, Sun production Sparc Solaris JVM, Microsoft JVM

60 60 Thread States born ready running blocked dead sleeping waiting start() sleep() yield() wait() notify() timeout terminate I/O started I/O completed

61 61 What the Language Spec. Says From The Java Language Specification Vague enough for you? Every thread has a priority. When there is competition for processing resources, threads with higher priority are generally executed in preference to threads with lower priority. Such preference is not, however, a guarantee that the highest priority thread will always be running, and thread priorities cannot be used to reliably implement mutual exclusion.

62 62 Multiple threads at same priority? Language gives implementer freedom Calling yield() suspends current thread to allow other at same priority to run … maybe Solaris implementation runs threads until they stop themselves (wait(), yield(), etc.) Windows implementation timeslices

63 63 Concurrent ML Threads –New type of entity Communication –Synchronous channels Synchronization –Channels –Events Atomicity –No specific language support

64 64 Threads Thread creation –spawn : (unit  unit)  thread_id Example code CIO.print "begin parent\n"; spawn (fn () => (CIO.print "child 1\n";)); spawn (fn () => (CIO.print "child 2\n";)); CIO.print "end parent\n“ Result end parent child 2 child 1 begin parent

65 65 Channels Channel creation –channel : unit  ‘a chan Communication –recv : ‘a chan  ‘a –send : ( ‘a chan * ‘a )  unit Example ch = channel(); spawn (fn()=> … … send(ch,0); … …); spawn (fn()=> … … recv ch; … …); Result send/recv

66 66 CML programming Functions –Can write functions : channels  threads –Build concurrent system by declaring channels and “wiring together” sets of threads Events –Delayed action that can be used for synchronization –Powerful concept for concurrent programming Sample Application –eXene – concurrent uniprocessor window system

67 67 CML from continuations Continuation primitives –callcc : ('a cont  'a)  'a Call function argument with current continuation –throw : 'a cont -> 'a -> 'b –Curried function to invoke continuation with arg Example fun f(x,k) = throw k(x+3); fun g(y,k) = f(y+2,k) + 10; fun h(z) = z + callcc(fn k => g(z+1,k)); h(1);

68 68 A CML implementation (simplified) Use queues with side-effecting functions datatype 'a queue = Q of {front: 'a list ref, rear: 'a list ref} fun queueIns (Q(…)) = (* insert into queue *) fun queueRem (Q(…)) = (* remove from queue *) And continuations val enqueue = queueIns rdyQ fun dispatch () = throw (queueRem rdyQ) () fun spawn f = callcc (fn parent_k => ( enqueue parent_k; f (); dispatch())) Source: Appel, Reppy

69 69 Facile Programming Language Integration of Multiple Paradigms –Functions –Types/complex data types –Concurrency –Distribution/soft real-time –Dynamic connectivity Implemented as extension to SML Syntax for concurrency similar to CML

70 70

71 71 Facile implementation Pre-emptive scheduler implemented at the lowest level –Exploiting CPS translation => state characterised by the set of registers Garbage collector used for linearizing data structures Lambda level code used as intermediate language when shipping data (including code) in heterogeneous networks Native representation is shipped when possible –i.e. same architecture and within same trust domain Possibility to mix between interpretation or JIT depending on usage

72 72 Conclusion Concurrency may be an order of magnitude more difficult to handle Programming language support for concurrency may help make the task easier Which concurrency constructs to add to the language is still a very active research area


Download ppt "1 Languages and Compilers (SProg og Oversættere) Bent Thomsen Department of Computer Science Aalborg University With acknowledgement to John Mitchell whose."

Similar presentations


Ads by Google