HPEC2002_Session1 1 DRM 11/11/2015 MIT Lincoln Laboratory Session 1: Novel Hardware Architectures David R. Martinez 24 September 2002 This work is sponsored.

Slides:



Advertisements
Similar presentations
The HPEC Challenge Benchmark Suite
Advertisements

EEL6686 Guest Lecture February 25, 2014 A Framework to Analyze Processor Architectures for Next-Generation On-Board Space Computing Tyler M. Lovelly Ph.D.
Andrew Prout, William Arcand, David Bestor, Chansup Byun, Bill Bergeron, Matthew Hubbell, Jeremy Kepner, Peter Michaleas, Julie Mullen, Albert Reuther,
Presenter: Jeremy W. Webb Course: EEC 289Q: Reconfigurable Computing Course Instructor: Professor Soheil Ghiasi Processor Architectures At A Glance: M.I.T.
Graduate Computer Architecture I Lecture 15: Intro to Reconfigurable Devices.
K-means clustering –An unsupervised and iterative clustering algorithm –Clusters N observations into K clusters –Observations assigned to cluster with.
Silicon Graphics, Inc. Poster Presented by: SGI Proprietary Technologies for Breakthrough Research Rosario Caltabiano North East Higher Education & Research.
Reconfigurable Application Specific Computers RASCs Advanced Architectures with Multiple Processors and Field Programmable Gate Arrays FPGAs Computational.
Presenter: Jyun-Yan Li Multiprocessor System-on-Chip Profiling Architecture: Design and Implementation Po-Hui Chen, Chung-Ta King, Yuan-Ying Chang, Shau-Yin.
1 Dr. Frederica Darema Senior Science and Technology Advisor NSF Future Parallel Computing Systems – what to remember from the past RAMP Workshop FCRC.
IBM RS/6000 SP POWER3 SMP Jari Jokinen Pekka Laurila.
University of Kansas Research Interests David Andrews Rm. 324 Nichols
DaMoN 2011 Paper Preview Organized by Stavros Harizopoulos and Qiong Luo Athens, Greece Jun 13, 2011.
SSS 4/9/99CMU Reconfigurable Computing1 The CMU Reconfigurable Computing Project April 9, 1999 Mihai Budiu
HPEC_GPU_DECODE-1 ADC 8/6/2015 MIT Lincoln Laboratory GPU Accelerated Decoding of High Performance Error Correcting Codes Andrew D. Copeland, Nicholas.
1 DSP Implementation on FPGA Ahmed Elhossini ENGG*6090 : Reconfigurable Computing Systems Winter 2006.
On Chip Nonlinear Digital Compensation for RF Receiver Karen M. G. V. Gettings, Andrew K. Bolstad, Show-Yah Stuart Chen, Benjamin A. Miller and Michael.
Implementation of Digital Front End Processing Algorithms with Portability Across Multiple Processing Platforms September 20-21, 2011 John Holland, Jeremy.
BENCHMARK SUITE RADAR SIGNAL & DATA PROCESSING CERES EPC WORKSHOP
Darema Dr. Frederica Darema NSF Dynamic Data Driven Application Systems (Symbiotic Measurement&Simulation Systems) “A new paradigm for application simulations.
Supporting GPU Sharing in Cloud Environments with a Transparent
MIT Lincoln Laboratory HPEC 2004 KASSPER Testbed-1 GES 4/21/2004 A KASSPER Real-Time Embedded Signal Processor Testbed Glenn Schrader Massachusetts Institute.
XYZ 10/2/2015 MIT Lincoln Laboratory Sidecar Network Adapters: Open Architecture for ISR Data Sources Craig McNally, Larry Barbieri, Charles Yee.
Semantic Information Fusion Shashi Phoha, PI Head, Information Science and Technology Division Applied Research Laboratory The Pennsylvania State.
MIT Lincoln Laboratory hch-1 HCH 5/26/2016 Achieving Portable Task and Data Parallelism on Parallel Signal Processing Architectures Hank Hoffmann.
Cognitive Radio Evolution from Agile Platforms to Omniscient Networks: the Road from Dreams to Prototypes Charles W. Bostian Virginia Tech
HPEC_NDP-1 MAH 10/11/2015 MIT Lincoln Laboratory Efficient Multidimensional Polynomial Filtering for Nonlinear Digital Predistortion Matthew Herman, Benjamin.
Slide 1 MIT Lincoln Laboratory Toward Mega-Scale Computing with pMatlab Chansup Byun and Jeremy Kepner MIT Lincoln Laboratory Vipin Sachdeva and Kirk E.
HPEC-1 MIT Lincoln Laboratory Session 1: GPU: Graphics Processing Units Miriam Leeser / Northeastern University HPEC Conference 15 September 2010.
Heterogeneous Multikernel OS Yauhen Klimiankou BSUIR
Radar Pulse Compression Using the NVIDIA CUDA SDK
October 12, 2004Thomas Sterling - Caltech & JPL 1 Roadmap and Change How Much and How Fast Thomas Sterling California Institute of Technology and NASA.
Cousins HPEC 2002 Session 4: Emerging High Performance Software David Cousins Division Scientist High Performance Computing Dept. Newport,
Improved air combat awareness - with AESA and next-generation signal processing Main beam jamming rejection Wide transmit beam Communication Side lobe.
Haney - 1 HPEC 9/28/2004 MIT Lincoln Laboratory pMatlab Takes the HPCchallenge Ryan Haney, Hahn Kim, Andrew Funk, Jeremy Kepner, Charles Rader, Albert.
A new perspective on processing-in-memory architecture design These data are submitted with limited rights under Government Contract No. DE-AC52-8MA27344.
MIT Lincoln Laboratory SPP-HPEC09-1 BAM 12/5/2015 A Special-Purpose Processor System with Software-Defined Connectivity Benjamin Miller, Sara Siegal, James.
Hardware Benchmark Results for An Ultra-High Performance Architecture for Embedded Defense Signal and Image Processing Applications September 29, 2004.
An Open Architecture for an Embedded Signal Processing Subsystem
SLAAC Systems Level Applications of Adaptive Computing Delivering ACS Technology to Applications DARPA/ITO Adaptive Computing Systems PI Meeting San Juan,
Slide-1 Multicore Theory MIT Lincoln Laboratory Theory of Multicore Algorithms Jeremy Kepner and Nadya Bliss MIT Lincoln Laboratory HPEC 2008 This work.
George Mason University Follow-up Courses. ECE Department MS in Electrical Engineering MS EE MS in Computer Engineering MS CpE COMMUNICATIONS & NETWORKING.
Interconnection network network interface and a case study.
HPEC HN 9/23/2009 MIT Lincoln Laboratory RAPID: A Rapid Prototyping Methodology for Embedded Systems Huy Nguyen, Thomas Anderson, Stuart Chen, Ford.
Current Research Overview Jeremy Espenshade 09/04/08.
Cray XD1 Reconfigurable Computing for Application Acceleration.
A 1.2V 26mW Configurable Multiuser Mobile MIMO-OFDM/-OFDMA Baseband Processor Motivations –Most are single user, SISO, downlink OFDM solutions –Training.
PTCN-HPEC-02-1 AIR 25Sept02 MIT Lincoln Laboratory Resource Management for Digital Signal Processing via Distributed Parallel Computing Albert I. Reuther.
The objective of Nonlinear Equalization (NLEQ) is to extend the dynamic range of RF receivers by reducing in-band nonlinear distortions. This enables detection.
© SKY Computers, Inc. All Rights Reserved 2/8/00 The Decomposition of HPEC Applications Mapped to the Natural Decomposition of a Solution Architecture.
MIT Lincoln Laboratory 1 of 4 MAA 3/8/2016 Development of a Real-Time Parallel UHF SAR Image Processor Matthew Alexander, Michael Vai, Thomas Emberley,
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy’s National Nuclear.
3/12/07CS Visit Days1 A Sea Change in Processor Design Uniprocessor SpecInt Performance: From Hennessy and Patterson, Computer Architecture: A Quantitative.
HPEC-1 SMHS 7/7/2016 MIT Lincoln Laboratory Focus 3: Cell Sharon Sacco / MIT Lincoln Laboratory HPEC Workshop 19 September 2007 This work is sponsored.
COMP7330/7336 Advanced Parallel and Distributed Computing Data Parallelism Dr. Xiao Qin Auburn University
George Viamontes, Mohammed Amduka, Jon Russo, Matthew Craven, Thanhvu Nguyen Lockheed Martin Advanced Technology Laboratories 3 Executive Campus, 6th Floor.
Albert I. Reuther & Joel Goodman HPEC Sept 2003
M. Richards1 ,D. Campbell1 (presenter), R. Judd2, J. Lebak3, and R
For Massively Parallel Computation The Chaotic State of the Art
Welcome: Intel Multicore Research Conference
OCP: High Performance Computing Project
SmartCell: A Coarse-Grained Reconfigurable Architecture for High Performance and Low Power Embedded Computing Xinming Huang Depart. Of Electrical and Computer.
Session 4: Reconfigurable Computing
Reconfigurable Computing MONARCH/MCHIP
Many-core Software Development Platforms
Parallel Processing in ROSA II
Matthew Lyon James Montgomery Lucas Scotta 13 October 2010
BlueGene/L Supercomputer
Paper discussed in class: H. Zhang, V. Prabhu, V. George, M. Wan, M
Star Bridge Systems, Inc.
Presentation transcript:

HPEC2002_Session1 1 DRM 11/11/2015 MIT Lincoln Laboratory Session 1: Novel Hardware Architectures David R. Martinez 24 September 2002 This work is sponsored by the High Performance Computing Modernization Office under Air Force Contract F C Opinions, interpretations, conclusions, and recommendations are those of the author and are not necessarily endorsed by the Department of Defense.

MIT Lincoln Laboratory HPEC2002_Session1 2 DRM 11/11/2015 High Performance Embedded Computing - A Historical Perspective Computing Systems Enabling Technologies 50 MHz i nodes 0.07 Mflop/s per watt Intel Paragon STAP Processor 40 MHz ADSP ~1000 nodes 12 Mflop/s per watt Distributed Computing Heterogeneous processors 100s of Mflop/s per watt to Gflop/s per watt Vector Signal and Image Processing Library 1.0 API MPI-RT standard Adaptive Computing Systems & Reconfigurable Computing Data Reorganization forum VLSI Photonics High- Performance CORBA Polymorphous Computing Architectures Fault tolerant/fault recovery High- Performance Embedded Interconnects –InfiniBand –Serial RapidIO Parallel MATLAB Mk48 Torpedo CBASS BSAR 80 MHz ADSP nodes 91 Mflop/s per watt Improved Space Processor Architecture 266 MHz PowerPC 603e 20 nodes / 40 PowerPCs 30 Mop/s per watt Cognitive Processing Integrated ASICs, FPGAs, PIM, and/or prog. devices Real-Time Linux VXS: VMEbus Switched Serial MIT Lincoln Laboratory MA 04 Sep 2002 AFRL High Performance Computing System 333 MHz PowerPC 603e 384 nodes 39 Mflop/s per watt

MIT Lincoln Laboratory HPEC2002_Session1 3 DRM 11/11/2015 Session 1: Papers Highlights Cognitive Systems Goals Reason Learn Explain Be Aware Respond Cognitive Systems Goals Reason Learn Explain Be Aware Respond Cognitive Information Processing Technology (Invited) Zach Lemnios / DARPA / IPTO MIND: Scalable Embedded Computing Through Advanced Processor in Memory (PIM) Architecture(Invited) Tom Sterling / CalTech / JPL Processor in Memory 10x x memory BW 4x – 10x reduced latency >10x power efficiency PIM for unmanned space vehicles Processor in Memory 10x x memory BW 4x – 10x reduced latency >10x power efficiency PIM for unmanned space vehicles MONARCH: A High Performance Embedded Processor Architecture with Two Native Computing Nodes John Granacki / USC / ISI DARPA DIS Embedded Computing Benchmarks for Critical Defense Signal Processing Applications Stephen Shank / Lockheed Martin Data Intensive System Benchmarks Novel DIS architectures Applications: pulse comp., sidelobe canceller, digital target generation, and beam steering Data Intensive System Benchmarks Novel DIS architectures Applications: pulse comp., sidelobe canceller, digital target generation, and beam steering Integrated Heterogeneous Processors System on a chip architecture Flexible VLSI device for stream front-end and threaded back-end processing Integrated Heterogeneous Processors System on a chip architecture Flexible VLSI device for stream front-end and threaded back-end processing