Overview Motivation Scala on LLVM Challenges Interesting Subsets.

Slides:

Advertisements

Similar presentations

Borland Optimizeit™ Profiler for the Microsoft .NET Framework

Advertisements

CSC 360- Instructor: K. Wu Overview of Operating Systems.

Virtual Memory Primitives for User Programs Andrew W. Appel and Kai Li Presented by Phil Howard.

Hadi JooybarGPUDet: A Deterministic GPU Architecture1 Hadi Jooybar 1, Wilson Fung 1, Mike O’Connor 2, Joseph Devietti 3, Tor M. Aamodt 1 1 The University.

ECE 562 Computer Architecture and Design Project: Improving Feature Extraction Using SIFT on GPU Rodrigo Savage, Wo-Tak Wu.

MPI and C-Language Seminars Seminar Plan  Week 1 – Introduction, Data Types, Control Flow, Pointers  Week 2 – Arrays, Structures, Enums, I/O,

Contiki A Lightweight and Flexible Operating System for Tiny Networked Sensors Presented by: Jeremy Schiff.

The PTX GPU Assembly Simulator and Interpreter N.M. Stiffler Zheming Jin Ibrahim Savran.

Multicore experiment: Plurality Hypercore Processor Performed by: Anton Fulman Ze’ev Zilberman Supervised by: Mony Orbach Characterization presentation.

0 HPEC 2010 Automated Software Cache Management.

Accelerating SQL Database Operations on a GPU with CUDA Peter Bakkum & Kevin Skadron The University of Virginia GPGPU-3 Presentation March 14, 2010.

UNIX System Administration OS Kernal Copyright 2002, Dr. Ken Hoganson All rights reserved. OS Kernel Concept Kernel or MicroKernel Concept: An OS architecture-design.

Revisiting Kirchhoff Migration on GPUs Rice Oil & Gas HPC Workshop

Bill Au CBS Interactive Troubleshooting Slow or Hung Java Applications.

Bill Au CBS Interactive Troubleshooting Slow or Hung Java Applications.

Instructor Notes GPU debugging is still immature, but being improved daily. You should definitely check to see the latest options available before giving.

CUDA All material not from online sources/textbook copyright © Travis Desell, 2012.

Automatic translation from CUDA to C++ Luca Atzori, Vincenzo Innocente, Felice Pantaleo, Danilo Piparo 31 August, 2015.

GPU Architecture and Programming

Trace-Based Optimization for Precomputation and Prefetching Madhusudan Raman Supervisor: Prof. Michael Voss.

GPU Programming with CUDA – CUDA 5 and 6 Paul Richmond

Parallelization and Characterization of Pattern Matching using GPUs Author: Giorgos Vasiliadis 、 Michalis Polychronakis 、 Sotiris Ioannidis Publisher:

Performance Comparison Xen vs. KVM vs. Native –Benchmarks: SPEC CPU2006, SPEC JBB 2005, SPEC WEB, TPC –Case studies Design instrumentations for figure.

Challenges and Solutions for Embedded Java Michael Wortley Computer Integrated Surgery March 1, 2001.

CIS250 OPERATING SYSTEMS Chapter One Introduction.

CSE 598c – Virtual Machines Survey Proposal: Improving Performance for the JVM Sandra Rueda.

Euro-Par, 2006 ICS 2009 A Translation System for Enabling Data Mining Applications on GPUs Wenjing Ma Gagan Agrawal The Ohio State University ICS 2009.

MIDORI The Windows Killer!! by- Sagar R. Yeole Under the guidance of- Prof. T. A. Chavan.

Common Language Runtime Introduction  The common language runtime is one of the most essential component of the.Net Framework.  It acts.

Silberschatz, Galvin and Gagne ©2013 Operating System Concepts – 9 th Edition Chapter 4: Threads.

Flashback : A Lightweight Extension for Rollback and Deterministic Replay for Software Debugging Sudarshan M. Srinivasan, Srikanth Kandula, Christopher.

Software Toolchains. Motivation 2 Write Run Edit, compile, link, run, debug same platform Desktop Write Run Edit, compile, link, debug on host; run on.

Report on Vector Prototype J.Apostolakis, R.Brun, F.Carminati, A. Gheata 10 September 2012.

Our Graphics Environment Landscape Rendering. Hardware  CPU  Modern CPUs are multicore processors  User programs can run at the same time as other.

1 ”MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs” John A. Stratton, Sam S. Stone and Wen-mei W. Hwu Presentation for class TDT24,

Directions in Linux OpenGL

© D. J. Foreman, Structure of an O/S. © D. J. Foreman, Overview  Required functionality –Handle interrupts –Manage resources Processes.

Chapter 4: Threads Modified by Dr. Neerja Mhaskar for CS 3SH3.

Introduction to threads

Prof. Zhang Gang School of Computer Sci. & Tech.

Chapter 4: Multithreaded Programming

Before You Begin Nahla Abuel-ola /WIT.

Current Generation Hypervisor Type 1 Type 2.

Report on Vector Prototype

Lecture 5: GPU Compute Architecture

Quick Start Guide for Visual Studio 2010

Presented by: Isaac Martin

High Performance Computing (CS 540)

Lecture 5: GPU Compute Architecture for the last time

Chapter 4: Threads.

for Network Processors

The Case for Operating System Services on GPUs

SDK Demo/Tutorial John DeHart.

Discussion HPC Priority project for COSMO consortium

Using OpenMP offloading in Charm++

Upcoming Improvements and Features in Charm++

Operating Systems (CS 340 D)

Module 10: Implementing Managed Code in the Database

Introduction to CUDA.

Operating System Introduction.

Chapter 4: Threads & Concurrency

The Challenge of Cross - Language Interoperability

Compiler Construction

Parallel Computing Explained How to Parallelize a Code

Interpreting Java Program Runtimes

System Programming By Prof.Naveed Zishan.

Peter Oostema & Rajnish Aggarwal 6th March, 2019

CSC Multiprocessor Programming, Spring, 2011

Presentation transcript:

Overview Motivation Scala on LLVM Challenges Interesting Subsets

Motivation

Scala on LLVM

Challenges: Must-Have Garbage Collector

Challenges: Optional Threading Reflection Debugging Code Loading

Challenges: Code Loading

Scala Specific Optimisations Improving Function Handling – Get Rid of Object Overhead – Inlining

Interesting Subsets Tiny Scala On Small Systems Compiled Scala Scala In Flavors

Scala in Flavors

Run the ‘regular’ Code on CPU Run data parallel on GPU or other dedicated hardware Issues – Interchanging Data – Vectorisation – Memory-Management