By Chad Andrus. TILE-Gx100  100 Identical Processor Cores Each core has its own L2 & L3 cache Each can run its own OS or group together for multiprocessing.

Slides:



Advertisements
Similar presentations
Automatic Data Movement and Computation Mapping for Multi-level Parallel Architectures with Explicitly Managed Memories Muthu Baskaran 1 Uday Bondhugula.
Advertisements

GDB Improvements for Telecom System. GDBs Strengths, Today Primary debugger for Linux Used by Eclipse for C, C++ debugging –Eclipse communicates with.
Threads, SMP, and Microkernels
The Development of Mellanox - NVIDIA GPUDirect over InfiniBand A New Model for GPU to GPU Communications Gilad Shainer.
1 Hardware Support for Isolation Krste Asanovic U.C. Berkeley MURI “DHOSA” Site Visit April 28, 2011.
Will Wireless Gaming Be a Winner? February 1, 2007 Sumit Rohit Thaker Sung Ho Hwang.
Dr. Alexandra Fedorova August 2007 Introduction to Systems Research at SFU.
Page-based Commands for DRAM Systems Aamer Jaleel Brinda Ganesh Lei Zong.
Page-based Commands for DRAM Systems Aamer Jaleel Brinda Ganesh Lei Zong.
Memory: Virtual MemoryCSCE430/830 Memory Hierarchy: Virtual Memory CSCE430/830 Computer Architecture Lecturer: Prof. Hong Jiang Courtesy of Yifeng Zhu.
Murali Vijayaraghavan MIT Computer Science and Artificial Intelligence Laboratory RAMP Retreat, UC Berkeley, January 11, 2007 A Shared.
Parallel Computer Architectures
Manycore Network Interfaces for In-Memory Rack-Scale Computing Alexandros Daglis, Stanko Novakovic, Edouard Bugnion, Babak Falsafi, Boris Grot.
Digital Graphics and Computers. Hardware and Software Working with graphic images requires suitable hardware and software to produce the best results.
Operating Systems.
Silberschatz, Galvin and Gagne  2002 Modified for CSCI 399, Royden, Operating System Concepts Operating Systems Lecture 1 Introduction Read:
Multi Core Processor Submitted by: Lizolen Pradhan
Mathematical Finance Seminar. What is Mathematical Finance Other Terms Financial Engineering Quantitative Finance Computational Finance Mathematical Finance.
1 Hardware Support for Collective Memory Transfers in Stencil Computations George Michelogiannakis, John Shalf Computer Architecture Laboratory Lawrence.
StimulusCache: Boosting Performance of Chip Multiprocessors with Excess Cache Hyunjin Lee Sangyeun Cho Bruce R. Childers Dept. of Computer Science University.
TILEmpower-Gx36 - Architecture overview & performance benchmarks – Presented by Younghyun Jo 2013/12/18.
Tessellation: Space-Time Partitioning in a Manycore Client OS Rose Liu 1,2, Kevin Klues 1, Sarah Bird 1, Steven Hofmeyr 3, Krste Asanovic 1, John Kubiatowicz.
Annapolis Micro Systems, Inc. 190 Admiral Cochrane Dr., Ste 130, Annapolis, MD Web: HQ Phone: (410) HQ Fax: (410)
Types of computer operation. There a several different methods of operation. Most computers can undertake each of these simultaneously. These methods.
TEMPLATE DESIGN © Hardware Design, Synthesis, and Verification of a Multicore Communication API Ben Meakin, Ganesh Gopalakrishnan.
Hardware. Make sure you have paper and pen to hand as you will need to take notes and write down answers and thoughts that you can refer to later on.
CSE 661 PAPER PRESENTATION
80-Tile Teraflop Network-On- Chip 1. Contents Overview of the chip Architecture ▫Computational Core ▫Mesh Network Router ▫Power save features Performance.
The TILE-Gx Processor: Enabling HPC through Massive-Scale Manycore Bob Doud Director of Processor Strategy, Tilera Corp. HPEC, September 2011.
Server Virtualization
Managing Distributed, Shared L2 Caches through OS-Level Page Allocation Jason Bosko March 5 th, 2008 Based on “Managing Distributed, Shared L2 Caches through.
Test Architecture Design and Optimization for Three- Dimensional SoCs Li Jiang, Lin Huang and Qiang Xu CUhk Reliable Computing Laboratry Department of.
Shashwat Shriparv InfinitySoft.
I NTRODUCTION TO N ETWORK A DMINISTRATION. W HAT IS A N ETWORK ? A network is a group of computers connected to each other to share information. Networks.
HyperThreading ● Improves processor performance under certain workloads by providing useful work for execution units that would otherwise be idle ● Duplicates.
Chapter 1: How are computers organized?. Software, data, & processing ? A computers has no insight or intuition A computers has no insight or intuition.
Shouqing Hao Institute of Computing Technology, Chinese Academy of Sciences Processes Scheduling on Heterogeneous Multi-core Architecture.
Chap 4: Processors Mainly manufactured by Intel and AMD Important features of Processors: Processor Speed (900MHz, 3.2 GHz) Multiprocessing Capabilities.
Multiprocessor  Use large number of processor design for workstation or PC market  Has an efficient medium for communication among the processor memory.
I NTRODUCTION TO N ETWORK A DMINISTRATION. W HAT IS A N ETWORK ? A network is a group of computers connected to each other to share information. Networks.
Parallel Computers Today Oak Ridge / Cray Jaguar > 1.75 PFLOPS Two Nvidia 8800 GPUs > 1 TFLOPS Intel 80- core chip > 1 TFLOPS  TFLOPS = floating.
Introduction Goal: connecting multiple computers to get higher performance – Multiprocessors – Scalability, availability, power efficiency Job-level (process-level)
1.3 Operating system services An operating system provide services to programs and to the users of the program. It provides an environment for the execution.
Introduction to FOSS. Classes of software  Operating System (OS)  Software that manages all the application programs in a computer  Manages the computer.
Computer systems Quiz. The CPU What does CPU stand for?(1) Which 3 step cycle does the CPU follow?(1) In order to run, name 3 things that the CPU needs?(3)
Parallel Computers Today LANL / IBM Roadrunner > 1 PFLOPS Two Nvidia 8800 GPUs > 1 TFLOPS Intel 80- core chip > 1 TFLOPS  TFLOPS = floating point.
Presented by: Nick Kirchem Feb 13, 2004
Multiprocessing.
CS427 Multicore Architecture and Parallel Computing
Chapter 1: Introduction
Central Processing Unit- CPU
High-performance tracing of many-core systems with LTTng
CS 147 – Parallel Processing
Presented by: Tim Olson, Architect
Core i7 micro-processor
Multi-Processing in High Performance Computer Architecture:
CS 286 Computer Organization and Architecture
Multi-Processing in High Performance Computer Architecture:
Assignment 4 – (a) Consider a symmetric MP with two processors and a cache invalidate write-back cache. Each block corresponds to two words in memory.
CPU and Embedded Systems
Real-Time Systems Group
Interconnect with Cache Coherency Manager
Introduction to Multiprocessors
Chapter 1: How are computers organized?
Faustino J. Gomez, Doug Burger, and Risto Miikkulainen
Fast Communication and User Level Parallelism
Leveraging Optical Technology in Future Bus-based Chip Multiprocessors
Operating System Introduction.
Objectives Describe how common characteristics of CPUs affect their performance: clock speed, cache size, number of cores Explain the purpose and give.
Learning Objectives To be able to describe the purpose of the CPU
Presentation transcript:

By Chad Andrus

TILE-Gx100  100 Identical Processor Cores Each core has its own L2 & L3 cache Each can run its own OS or group together for multiprocessing Cores can be dedicated to specific applications  Targeted to run Performance Applications

Specs  1.5GHz per tile  256K L2 cache per tile  750 billion operations per second  500 Gbps memory bandwidth  200 Tbps of on-chip mesh interconnect (iMesh) "Think of it as the Internet on a chip“ - Bob Doud

What is iMesh?  Five different layers: Memory Input/Output Cache coherency User Space (2)  Tile system allows each core to interact with every other core

Benefits  Reduce costs – can eliminate dozens of computing systems  Efficiency - processing resources can be specifically allocated Optimize performance, save power  Fast computing

Sources  processor-announced-tilera.htm processor-announced-tilera.htm   Cores-Into-Next-Gen-Processors html Cores-Into-Next-Gen-Processors html