Presentation is loading. Please wait.

Presentation is loading. Please wait.

CAPS project-team Compilation et Architectures pour Processeurs Superscalaires et Spécialisés.

Similar presentations


Presentation on theme: "CAPS project-team Compilation et Architectures pour Processeurs Superscalaires et Spécialisés."— Presentation transcript:

1 CAPS project-team Compilation et Architectures pour Processeurs Superscalaires et Spécialisés

2 André Seznec CAPS project-team Irisa-Inria 2 History of CAPS project-team  Created in 1994:  “Compiler Parallel Architectures and Systems”  Common focus: high performance through optimizing the memory hierarchy  January 2000:  split in PARIS and “historical canal” CAPS  “historical canal” CAPS: Mainly compiler and architecture for uniprocessor CAPS was renamed, but the acronym was conserved

3 André Seznec CAPS project-team Irisa-Inria 3 CAPS themes  Two main interacting activities  microprocessor architecture High performance Migrating high performance concepts to embedded systems  Performance oriented compilation: High performance Embedded processors

4 André Seznec CAPS project-team Irisa-Inria 4 CAPS « missions »  Defining the tradeoffs between:  what should be done through hardware  what can be done by the compiler  for maximum performance  or for minimum cost  or for minimum size, power..

5 André Seznec CAPS project-team Irisa-Inria 5 Issues on high performance processor architecture  Memory hierarchy management:  1 cycle L1 – 10 cycles L2 – 30 cycles L3 – 200 cycles memory  Branch prediction :  30 cycles penalty x N instructions per cycle  Single cycle next instruction block address generation ?  Complexity quadratic with issue width:  Register file, bypass network, issue logic  Single chip hardware thread parallelism is available:  How do we exploit it ?  Power

6 André Seznec CAPS project-team Irisa-Inria 6 Issues on code generation/software environments for embedded processors  ILP, caches are entering embedded processor world  Code generation must manage them  Binary compatibility is not critical, time-to-market is critical  Retargetable platforms are wanted: ISAs, architecture  Performance is not the only ultimate goal:  Code size/ performance  Power/ performance  System cost/ performance

7 André Seznec CAPS project-team Irisa-Inria 7 CAPS project-team composition February 2004  Core team: A. Seznec, DR INRIA Back from Compaq 03/2000 F. Bodin, Pr Rennes I P. Michaud, CR INRIA With Intel 07/01 -07/02  Temporary member: H.P. Charles, U. Versailles 09/02-09/04  Honorary member: J. Lenfant, ex-president Rennes I  Ph. D students: A. Darsch, A. Djabelkhir, R. Dolbeau, A. Fraboulet, K. Heydemann, L. Morin, G. Pokam, O. Rochecouste, P. Terjan, E. Toullec  Temporary engineers: A. Colin T. Lafage  Post-doc: J. Hernandez

8 André Seznec CAPS project-team Irisa-Inria 8 Scientific contributions (1) Processor architecture  Global history branch predictors and instruction fetch front-end  2bcgskew used in Compaq EV8  Pipelining the I-fetch front end  Limiting hardware complexity on superscalar processors  Dataflow prescheduling: instruction window  WSRS architecture: register file, bypass network and issue logic  Thread parallelism and single chip parallelism :  CASH: CMP and SMT hybrid  Execution migration: single thread on a multicore, to use all the cache space

9 André Seznec CAPS project-team Irisa-Inria 9 Scientific contributions (2) architecture/compiler interaction  ISA simulation:  ABSCISS : ISA and architecture retargetable high speed simulator for VLIW processor  IATO : simulation of out-of-order execution IA64  Low power and architecture configurability:  Cache reconfiguration at software level on phase basis  Hardware/software speculative management of data path and register file width  SWARP: retargetable C-to-C preprocessor to enhance multimedia instruction use

10 André Seznec CAPS project-team Irisa-Inria 10 Scientific contributions (3) compiler and software environments  Artificial intelligence in performance tuning  CAHT: case based reasoning for assisting performance tuning  Automatic derivation of compiler heuristics: using machine learning to derive compiler heuristics  Performance code size tradeoffs:  Iterative compilation  Mixing interpretation on compressed code and native execution

11 André Seznec CAPS project-team Irisa-Inria 11 Scientific contributions (4) unpredictable random number generation  HAVEGE:  HArdware Volatile Entropy Gathering and Expansion Combining entropy gathering on non-architectural hardware states of a microprocessor with pseudo- random number generation

12 André Seznec CAPS project-team Irisa-Inria 12 Direct technology transfers  A. Seznec’s sabbatical with Compaq Alpha Development group (1999-2000):  EV8 branch predictor directly issued from CAPS project-team researches  Parallel access scheme to strided vectors in caches in Tarantula vector processor project directly derived from “old” vector CAPS background  P. Michaud’s sabbatical with Intel (2001-2002):  Covered by NDA

13 André Seznec CAPS project-team Irisa-Inria 13 Industrially funded Ph. Ds  3 CIFRE Ph. D. fellowships with STmicroelectronics  Flexible infrastructure for code scheduling and optimization  Compilation and power consumption  Thread extraction for heterogeneous embedded systems  1 CIFRE Ph. D. fellowship with Thomson MMD  Code analysis and performance evaluation for embedded systems  2 Ph. D. fellowships funded through Intel grant:  WSRS architecture  Pipelined instruction fetch

14 André Seznec CAPS project-team Irisa-Inria 14 Institutionally funded projects  EPICEA project ( ministry of industry)  collaboration with UVSQ, CEA and Bull  Software environment for scientific IA64  IATO simulation toolkit  MEDEA SMT – MEDEA+ MESA (European project):  Philips, STM, Bull, Eads, + SMEs + academics  Multiprocessor on-a-chip for embedded systems  Paccman project (Oppidum, ministry of industry)  Eads, Matra, Bull  Cryptographic ASIP

15 André Seznec CAPS project-team Irisa-Inria 15 Software prototypes  Calvin2+DICE: traces and emulates SPARC, fast skipping mode  used internally for 4 years  ALISE: Assembly Level Infrastructure for Software Enhancement  Prototype validated on MMDSP, property of STmicroelectronics  Atllas: Analysis and Tuning tool for Low Level Assembly and Source code  Prototype, property of Thomson MMD  CAHT: Computer Aided Hand Tuning  Prototype validated on scientific applications and embedded applications  SOFAN : code optimizer for a custom designed processor for QCD  ApeNEXT project: multiteraflop machine for QCD

16 André Seznec CAPS project-team Irisa-Inria 16 Major (mature) software developments  ABSCISS: retargetable processor simulation  SALTO: System for Assembly Languages Transformation and optimization  SWARP: C-to-C retargetable preprocessor for multimedia instructions  Menhir: Matlab to C parallel code generator  PACCMAN compiler/simulator  HAVEGE random number generator  IATO toolkit: IA64 simulation  Transferred to industry  Distributed on demand, also transferred to industry  Transferred to industry  Maintenance by industry  Distributed for non-commercial use  GPL

17 André Seznec CAPS project-team Irisa-Inria 17 Set-up of the start-up CAPS Entreprise  Software tools for high performance embedded systems: Simulation, code transformation, worst-case execution time  Custom consulting services: Performance analysis, instruction set evaluations,..  Awarded as an innovative company by ministry of Industry  Currently 8 employees, including 5 former CAPS project-team members

18 André Seznec CAPS project-team Irisa-Inria 18 Positionning at INRIA and in France  Four groups on performance oriented compilation and architecture: A3, CAPS, Compsys, R2D2  CAPS: uniprocessor architecture and compiler for embedded uniprocessor  A3, Compsys: parallel architectures, now parallel embedded architectures  R2D2: reconfigurable FPGAs and SoCs

19 André Seznec CAPS project-team Irisa-Inria 19 International positioning  Processor architecture:  A single large visible group in Europe: UPC Barcelona  A few visible seniors in Europe: P. Stenstrom, S. Vassiliadis,..  Many groups in USA, Wisconsin, Michigan, Washington,..  + industry labs : Intel, IBM,..  CAPS highly recognized on caches and branch prediction  Compiler/software environment for embedded systems :  Original position associated with the focus on developing retargetable platforms  Related activities: R. Leupers (Aachen), R. Wilhelm (Saarbruck)

20 André Seznec CAPS project-team Irisa-Inria 20 CAPS future objectives  Leverage current expertise of CAPS core team  Microarchitecture  Compiler/software environment

21 André Seznec CAPS project-team Irisa-Inria 21 “New-CAPS” objectives (2)  High-end microprocessor architecture: From “ultimate performance” to “ maintaining performance to cheaper”  Migrating “high-end” concepts to embedded processors:  (limited) O-O-O execution  Compiler/architecture power management

22 André Seznec CAPS project-team Irisa-Inria 22 “New-CAPS” objectives (3)  Embedded systems are more and more complex:  performance often comes with unpredictability and unstabibility Dimensioning a system ? Real time constraints ?  Research on performance predictability and stability:  Predictable/stable performance oriented code generation  Predictable/stable performance oriented architecture

23 André Seznec CAPS project-team Irisa-Inria 23 “New-CAPS” objectives (4)  On-chip thread parallelism is a new opportunity:  Homogeneous: SMT/CMP Tradeoffs, sharing, synchronization  Heterogeneous: single ISA –Power, performance, multiple ISAs (e.g. SoC) –Thread extraction

24 André Seznec CAPS project-team Irisa-Inria 24 “New-CAPS” objectives (4)  Capitalize on pioneering researches:  Iterative compilation  Use of machine learning techniques for compilers  Believe in retargetable software environments:  ISAs, architecture,..


Download ppt "CAPS project-team Compilation et Architectures pour Processeurs Superscalaires et Spécialisés."

Similar presentations


Ads by Google