Overview Motivation Scala on LLVM Challenges Interesting Subsets.

Slides:



Advertisements
Similar presentations
Borland Optimizeit™ Profiler for the Microsoft .NET Framework
Advertisements

CSC 360- Instructor: K. Wu Overview of Operating Systems.
Virtual Memory Primitives for User Programs Andrew W. Appel and Kai Li Presented by Phil Howard.
Hadi JooybarGPUDet: A Deterministic GPU Architecture1 Hadi Jooybar 1, Wilson Fung 1, Mike O’Connor 2, Joseph Devietti 3, Tor M. Aamodt 1 1 The University.
ECE 562 Computer Architecture and Design Project: Improving Feature Extraction Using SIFT on GPU Rodrigo Savage, Wo-Tak Wu.
MPI and C-Language Seminars Seminar Plan  Week 1 – Introduction, Data Types, Control Flow, Pointers  Week 2 – Arrays, Structures, Enums, I/O,
Contiki A Lightweight and Flexible Operating System for Tiny Networked Sensors Presented by: Jeremy Schiff.
The PTX GPU Assembly Simulator and Interpreter N.M. Stiffler Zheming Jin Ibrahim Savran.
Multicore experiment: Plurality Hypercore Processor Performed by: Anton Fulman Ze’ev Zilberman Supervised by: Mony Orbach Characterization presentation.
0 HPEC 2010 Automated Software Cache Management.
Accelerating SQL Database Operations on a GPU with CUDA Peter Bakkum & Kevin Skadron The University of Virginia GPGPU-3 Presentation March 14, 2010.
UNIX System Administration OS Kernal Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept Kernel or MicroKernel Concept: An OS architecture-design.
Revisiting Kirchhoff Migration on GPUs Rice Oil & Gas HPC Workshop
Bill Au CBS Interactive Troubleshooting Slow or Hung Java Applications.
Bill Au CBS Interactive Troubleshooting Slow or Hung Java Applications.
Instructor Notes GPU debugging is still immature, but being improved daily. You should definitely check to see the latest options available before giving.
CUDA All material not from online sources/textbook copyright © Travis Desell, 2012.
Automatic translation from CUDA to C++ Luca Atzori, Vincenzo Innocente, Felice Pantaleo, Danilo Piparo 31 August, 2015.
GPU Architecture and Programming
Trace-Based Optimization for Precomputation and Prefetching Madhusudan Raman Supervisor: Prof. Michael Voss.
GPU Programming with CUDA – CUDA 5 and 6 Paul Richmond
Parallelization and Characterization of Pattern Matching using GPUs Author: Giorgos Vasiliadis 、 Michalis Polychronakis 、 Sotiris Ioannidis Publisher:
CUDA - 2.
Performance Comparison Xen vs. KVM vs. Native –Benchmarks: SPEC CPU2006, SPEC JBB 2005, SPEC WEB, TPC –Case studies Design instrumentations for figure.
Challenges and Solutions for Embedded Java Michael Wortley Computer Integrated Surgery March 1, 2001.
CIS250 OPERATING SYSTEMS Chapter One Introduction.
CSE 598c – Virtual Machines Survey Proposal: Improving Performance for the JVM Sandra Rueda.
Euro-Par, 2006 ICS 2009 A Translation System for Enabling Data Mining Applications on GPUs Wenjing Ma Gagan Agrawal The Ohio State University ICS 2009.
MIDORI The Windows Killer!! by- Sagar R. Yeole Under the guidance of- Prof. T. A. Chavan.
Common Language Runtime Introduction  The common language runtime is one of the most essential component of the.Net Framework.  It acts.
Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 4: Threads.
Flashback : A Lightweight Extension for Rollback and Deterministic Replay for Software Debugging Sudarshan M. Srinivasan, Srikanth Kandula, Christopher.
Software Toolchains. Motivation 2 Write Run Edit, compile, link, run, debug same platform Desktop Write Run Edit, compile, link, debug on host; run on.
Report on Vector Prototype J.Apostolakis, R.Brun, F.Carminati, A. Gheata 10 September 2012.
Our Graphics Environment Landscape Rendering. Hardware  CPU  Modern CPUs are multicore processors  User programs can run at the same time as other.
1 ”MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs” John A. Stratton, Sam S. Stone and Wen-mei W. Hwu Presentation for class TDT24,
Directions in Linux OpenGL
© D. J. Foreman, Structure of an O/S. © D. J. Foreman, Overview  Required functionality –Handle interrupts –Manage resources Processes.
Chapter 4: Threads Modified by Dr. Neerja Mhaskar for CS 3SH3.
Introduction to threads
Prof. Zhang Gang School of Computer Sci. & Tech.
Chapter 4: Multithreaded Programming
Before You Begin Nahla Abuel-ola /WIT.
Current Generation Hypervisor Type 1 Type 2.
Report on Vector Prototype
Lecture 5: GPU Compute Architecture
Quick Start Guide for Visual Studio 2010
Presented by: Isaac Martin
High Performance Computing (CS 540)
Lecture 5: GPU Compute Architecture for the last time
Chapter 4: Threads.
for Network Processors
The Case for Operating System Services on GPUs
SDK Demo/Tutorial John DeHart.
Discussion HPC Priority project for COSMO consortium
Using OpenMP offloading in Charm++
Upcoming Improvements and Features in Charm++
Operating Systems (CS 340 D)
Module 10: Implementing Managed Code in the Database
Introduction to CUDA.
Operating System Introduction.
Chapter 4: Threads & Concurrency
The Challenge of Cross - Language Interoperability
Compiler Construction
Parallel Computing Explained How to Parallelize a Code
Interpreting Java Program Runtimes
System Programming By Prof.Naveed Zishan.
Peter Oostema & Rajnish Aggarwal 6th March, 2019
CSC Multiprocessor Programming, Spring, 2011
Presentation transcript:

Overview Motivation Scala on LLVM Challenges Interesting Subsets

Motivation

Scala on LLVM

Challenges: Must-Have Garbage Collector

Challenges: Optional Threading Reflection Debugging Code Loading

Challenges: Code Loading

Scala Specific Optimisations Improving Function Handling – Get Rid of Object Overhead – Inlining

Interesting Subsets Tiny Scala On Small Systems Compiled Scala Scala In Flavors

Scala in Flavors

Run the ‘regular’ Code on CPU Run data parallel on GPU or other dedicated hardware Issues – Interchanging Data – Vectorisation – Memory-Management