The TILE-Gx Processor: Enabling HPC through Massive-Scale Manycore Bob Doud Director of Processor Strategy, Tilera Corp. HPEC, September 2011.

Slides:



Advertisements
Similar presentations
Tile Processors: Many-Core for Embedded and Cloud Computing
Advertisements

CSCE 432/832 High Performance ---- An Introduction to Multicore Memory Hierarchy Dongyuan Zhan CS252 S05.
Structure of Computer Systems
Multi-core systems System Architecture COMP25212 Daniel Goodman Advanced Processor Technologies Group.
Evolution of Chip Design ECE 111 Spring A Brief History 1958: First integrated circuit – Flip-flop using two transistors – Built by Jack Kilby at.
OPTERON (Advanced Micro Devices). History of the Opteron AMD's server & workstation processor line 2003: Original Opteron released o 32 & 64 bit processing.
Linux Clustering A way to supercomputing. What is Cluster? A group of individual computers bundled together using hardware and software in order to make.
Performance Characterization of the Tile Architecture Précis Presentation Dr. Matthew Clark, Dr. Eric Grobelny, Andrew White Honeywell Defense & Space,
1 BGL Photo (system) BlueGene/L IBM Journal of Research and Development, Vol. 49, No. 2-3.
Multiprocessors II Andreas Klappenecker CPSC321 Computer Architecture.
1 AppliedMicro X-Gene ® ARM Processors Optimized Scale-Out Solutions for Supercomputing.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
Understanding Computers: Today and Tomorrow, 13th Edition 1 The Motherboard Computer chip: Circuit board: Motherboard or system board: –All devices must.
TM Freescale Semiconductor Confidential and Proprietary Information. Freescale™ and the Freescale logo are trademarks of Freescale Semiconductor, Inc.
5.3 HS23 Blade Server. The HS23 blade server is a dual CPU socket blade running Intel´s new Xeon® processor, the E5-2600, and is the first IBM BladeCenter.
Hardware Overview Net+ARM – Well Suited for Embedded Ethernet
Cluster computing facility for CMS simulation work at NPD-BARC Raman Sehgal.
Appro Products and Solutions Anthony Kenisky, Vice President of Sales Appro, Premier Provider of Scalable Supercomputing Solutions: 4/16/09.
ADVANCE FORENSIC WORKSTATION. SPECIFICATION Mother board : Xeon 5000 Series Server Board support 667MHz, 1066MHz and 1333MHz1 Processor : Two Intel Quad.
Server System. Introduction A server system is a computer, or series of computers, that link other computers or electronic devices together. They often.
Serial vs.Parallel Computing Scalable Perf. vs. Availability
NetBurner MOD 5282 Network Development Kit MCF 5282 Integrated ColdFire 32 bit Microcontoller 2 DB-9 connectors for serial I/O supports: RS-232, RS-485,
Computer Architecture HPC in 2020 CASC 2009 Steve Wallach swallach”at”conveycomputer.com © 2009 Convey Computer Corporation.
Different CPUs CLICK THE SPINNING COMPUTER TO MOVE ON.
1 Introduction to ARM A15 Linux DSP Platform Software Apps Team 04/19/2013 1TI Confidential - NDA Restrictions.
The Tile Processor: A 64-Core Multicore for Embedded Processing Anant Agarwal Tilera Corporation HPEC 2007.
MIDeA :A Multi-Parallel Instrusion Detection Architecture Author: Giorgos Vasiliadis, Michalis Polychronakis,Sotiris Ioannidis Publisher: CCS’11, October.
TILEmpower-Gx36 - Architecture overview & performance benchmarks – Presented by Younghyun Jo 2013/12/18.
Real-Time HD Harmonic Inc. Real Time, Single Chip High Definition Video Encoder! December 22, 2004.
Confidential 1 SpecificationsFeatures ProcessorFreescale MPC8640 Single 1 GHz DDRAMDual channel DDR2 with ECC, 512 MB (expandable up to 2GB) Flash.
The IT700 PIM only supports up to network layer, all other above layers must be executed by other processor. Therefore in the PLC control network two types.
High Performance Computing Processors Felix Noble Mirayma V. Rodriguez Agnes Velez Electric and Computer Engineer Department August 25, 2004.
History of Microprocessor MPIntroductionData BusAddress Bus
Annapolis Micro Systems, Inc. 190 Admiral Cochrane Dr., Ste 130, Annapolis, MD Web: HQ Phone: (410) HQ Fax: (410)
HPC Advisory Council Panel: AMD. 2 | Oak Ridge National Labs (ORNL) Briefing | 2/25/2009 | Confidential AMD HPC Strategy Deliver industry-leading solutions.
Developing Power-Aware Strategies for the Blackfin Processor Steven VanderSanden Giuseppe Olivadoti David Kaeli Richard Gentile Northeastern University.
Standard Sound Card (PCI) This is a typical sound card design. Although most of the newer sound cards use a PCI-E 1x (1Lane) to 8x (with exceptions for.
80-Tile Teraflop Network-On- Chip 1. Contents Overview of the chip Architecture ▫Computational Core ▫Mesh Network Router ▫Power save features Performance.
Guangdeng Liao, Xia Zhu, Steen Larsen, Laxmi Bhuyan, Ram Huggahalli University of California, Riverside Intel Labs.
Hardware Benchmark Results for An Ultra-High Performance Architecture for Embedded Defense Signal and Image Processing Applications September 29, 2004.
IDC HPC User Forum April 14 th, 2008 A P P R O I N T E R N A T I O N A L I N C Steve Lyness Vice President, HPC Solutions Engineering
1 Lecture 1: Computer System Structures We go over the aspects of computer architecture relevant to OS design  overview  input and output (I/O) organization.
Power and Cooling at Texas Advanced Computing Center Tommy Minyard, Ph.D. Director of Advanced Computing Systems 42 nd HPC User Forum September 8, 2011.
Computer Hardware & Processing Inside the Box CSC September 16, 2010.
By Chad Andrus. TILE-Gx100  100 Identical Processor Cores Each core has its own L2 & L3 cache Each can run its own OS or group together for multiprocessing.
Rasberry pi 2 model B. Selve computeren i rasberry pi’en.
Introducing the Raspberry Pi Nauru ICT Department April 2016.
Operations Management - Problems Students cannot play HW Requirements Self-RegistrationSW requirements Upload by Paris SW Capacity Internet connection.
BLUE GENE Sunitha M. Jenarius. What is Blue Gene A massively parallel supercomputer using tens of thousands of embedded PowerPC processors supporting.
Compute Node Tutorial(2) Agenda Introduce to RocketIO How to build a optical link connection Backplane and cross link communications How to.
Hardware Architecture
The Moonshot BladeSystem By Hewlett-Packard Carl J. Hoppe 7 October 2013 COSC
Business-class control Effective management of your office environment is within reach. The OptiPlex™ enables IT administrators to take total control over.
EXtreme Data Workshop Readout Technologies Rob Halsall The Cosener’s House 18 April 2012.
i.MX 8 Series: 3 Processor Families with Targeted Features
Manycore processors Sima Dezső October Version 6.2.
Lynn Choi School of Electrical Engineering
Hands On SoC FPGA Design
Microarchitecture.
Appro Xtreme-X Supercomputers
High-performance tracing of many-core systems with LTTng
What happens inside a CPU?
FPGAs in AWS and First Use Cases, Kees Vissers
Lifecycle Suppose we have two processes that require the CPU. The first one had the CPU and you would like to let the second process run, ie context switch.
Interconnect with Cache Coherency Manager
The Greening of IT November 1, 2007.
Learning Objectives To be able to describe the purpose of the CPU
Ampere for the openEDGE
Presentation transcript:

The TILE-Gx Processor: Enabling HPC through Massive-Scale Manycore Bob Doud Director of Processor Strategy, Tilera Corp. HPEC, September 2011

2 Tilera TILE-Gx Family Manycore Processors with up to 100 Cores HPEC September 2011 Performance 450 BOPS on a single TILE-Gx processor Shared, coherent cache across all cores Power Efficiency 60 Watts typical power dissipation 5x Performance-per-Watt of x86 class CPUs I/O & Connectivity >160G of I/O on the processor Integrated quad DDR3 memory controllers © 2011 Tilera Corporation

The TILE-Gx8100 ™ Processor: System-on-a-Chip with bit cores 3 © 2011 Tilera Corporation HPEC September Memory Controller (DDR3) mPIPE USB x2, UART x2, JTAG, I 2 C, SPI, GPIO USB x2, UART x2, JTAG, I 2 C, SPI, GPIO SerDes PCIe lane PCIe lane SerDes PCIe lane PCIe lane Interlaken 10 GbE XAUI 10 GbE XAUI SerDes 4x GbE SGMII 10 GbE XAUI 10 GbE XAUI SerDes 4x GbE SGMII 10 GbE XAUI 10 GbE XAUI SerDes 4x GbE SGMII 10 GbE XAUI 10 GbE XAUI SerDes 4x GbE SGMII 10 GbE XAUI 10 GbE XAUI SerDes 4x GbE SGMII 10 GbE XAUI 10 GbE XAUI SerDes 4x GbE SGMII 10 GbE XAUI 10 GbE XAUI SerDes 4x GbE SGMII 10 GbE XAUI 10 GbE XAUI SerDes 4x GbE SGMII SerDes PCIe lane PCIe lane MiCA 450 BOPS 32MBytes Coherent Cache ~60 Watts 450 BOPS 32MBytes Coherent Cache ~60 Watts Runs SMP Linux

HPEC September Peta-Op Integer Compute at <500KW TILE-Gx100 Processor: – 3-way core, 1.5GHz, 100 cores = 450 BOPS per chip 1 Tilera Server Shelf: – 3U rack space; 12 blades, 3 processors/blade 1 Tilera Rack; – 13 Shelves, 468 processors, 46,800 cores 5 Tilera Racks; – 2250 processors, 225,000 cores; ~450 Kilowatts © 2011 Tilera Corporation 5 Racks 2250 Tile Processors = Peta-Ops Up to 180 Tbps of I/O 288 TBytes DDR3 Memory Up to 180 Tbps of I/O 288 TBytes DDR3 Memory 3600 cores

TILE-Gx Enables a Range of HPC Applications © 2011 Tilera Corporation 5 HPEC September 2011 Real-Time Informatics Cyber Security Data Harvesting Threat Analysis / Forensics Video Surveillance & Analysis Image compression/decompression Target Tracking / Pattern Recognition Network Security Processing Intrusion Prevention (IPS/IDS) Data Leakage Protection (DLP) Integer Compute Hyper-Scale Integer Computing Compute-Intensive SIMD & DSP

Please stop by our table outside the auditorium TILE-Gx processor solutions on display We’ll be happy to discuss your HPC requirements Thank You © 2011 Tilera Corporation 6 HPEC September 2011