1 An FPGA Implementation of the Two-Dimensional Finite-Difference Time-Domain (FDTD) Algorithm Wang Chen Panos Kosmas Miriam Leeser Carey Rappaport Northeastern.

Slides:



Advertisements
Similar presentations
The 3D FDTD Buried Object Detection Forward Model used in this project was developed by Panos Kosmas and Dr. Carey Rappaport of Northeastern University.
Advertisements

Mohammed Yousef Abd El ghany, Faculty of Eng., Comm. Dep., 3rd year. Digital Signal Processor The Heart of Modern Real-Time Control Systems.
Bryan Lahartinger. “The Apriori algorithm is a fundamental correlation-based data mining [technique]” “Software implementations of the Aprioiri algorithm.
Hardware Implementation of Antenna Beamforming using Genetic Algorithm Kevin Hsiue Bryan Teague.
K-means clustering –An unsupervised and iterative clustering algorithm –Clusters N observations into K clusters –Observations assigned to cluster with.
Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.
Steven Koelmeyer BDS(hons)1 Reconfigurable Hardware for use in Ad Hoc Sensor Networks Supervisors Charles Greif Nandita Bhattacharjee.
Characterization Presentation Neural Network Implementation On FPGA Supervisor: Chen Koren Maria Nemets Maxim Zavodchik
The Design Process Outline Goal Reading Design Domain Design Flow
Evolution of implementation technologies
Field-Programmable Logic and its Applications INTERNATIONAL CONFERENCE August 30 – September 01, 2004 Albert A. Conti, Tom Van Court, Martin C. Herbordt.
Target Control Electronics Upgrade 08/01/2009 J. Leaver P. Smith.
Final Presentation Neural Network Implementation On FPGA Supervisor: Chen Koren Maria Nemets Maxim Zavodchik
A Parameterized Floating Point Library Applied to Multispectral Image Clustering Xiaojun Wang Dr. Miriam Leeser Rapid Prototyping Laboratory Northeastern.
Finite Difference Time Domain Method (FDTD)
Field Programmable Gate Array (FPGA) Layout An FPGA consists of a large array of Configurable Logic Blocks (CLBs) - typically 1,000 to 8,000 CLBs per chip.
Using Programmable Logic to Accelerate DSP Functions 1 Using Programmable Logic to Accelerate DSP Functions “An Overview“ Greg Goslin Digital Signal Processing.
FPGA Based Fuzzy Logic Controller for Semi- Active Suspensions Aws Abu-Khudhair.
The Goal Redesign Microprocessor Labs For New Chip Redesign Microprocessor Labs For New Chip Continuation of work started by Miguel Morales last year Continuation.
Fernando Ortiz EM Photonics, Inc. Newark, DE
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical.
Finite Differences Finite Difference Approximations  Simple geophysical partial differential equations  Finite differences - definitions  Finite-difference.
University of Veszprém Department of Image Processing and Neurocomputing Emulated Digital CNN-UM Implementation of a 3-dimensional Ocean Model on FPGAs.
Student : Andrey Kuyel Supervised by Mony Orbach Spring 2011 Final Presentation High speed digital systems laboratory High-Throughput FFT Technion - Israel.
Making FPGAs a Cost-Effective Computing Architecture Tom VanCourt Yongfeng Gu Martin Herbordt Boston University BOSTON UNIVERSITY.
Floating Point vs. Fixed Point for FPGA 1. Applications Digital Signal Processing -Encoders/Decoders -Compression -Encryption Control -Automotive/Aerospace.
Operating Systems for Reconfigurable Systems John Huisman ID:
Efficient FPGA Implementation of QR
Variable Precision Floating Point Division and Square Root Albert Conti Xiaojun Wang Dr. Miriam Leeser Rapid Prototyping Laboratory Northeastern University,
FPGA FPGA2  A heterogeneous network of workstations (NOW)  FPGAs are expensive, available on some hosts but not others  NOW provide coarse- grained.
ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.
J. Christiansen, CERN - EP/MIC
VORPAL Optimizations for Petascale Systems Paul Mullowney, Peter Messmer, Ben Cowan, Keegan Amyx, Stefan Muszala Tech-X Corporation Boyana Norris Argonne.
1 Fly – A Modifiable Hardware Compiler C. H. Ho 1, P.H.W. Leong 1, K.H. Tsoi 1, R. Ludewig 2, P. Zipf 2, A.G. Oritz 2 and M. Glesner 2 1 Department of.
An FPGA Implementation of the Ewald Direct Space and Lennard-Jones Compute Engines By: David Chui Supervisor: Professor P. Chow.
4/19/20021 TCPSplitter: A Reconfigurable Hardware Based TCP Flow Monitor David V. Schuehler.
Floating-Point Divide and Square Root for Efficient FPGA Implementation of Image and Signal Processing Algorithms Xiaojun Wang, Miriam Leeser
FPGA Based Smoke Simulator Jonathan Chang Yun Fei Tianming Miao Guanduo Li.
Wang Chen, Dr. Miriam Leeser, Dr. Carey Rappaport Goal Speedup 3D Finite-Difference Time-Domain.
Hardware Accelerator for Combinatorial Optimization Fujian Li Advisor: Dr. Areibi.
FPL Sept. 2, 2003 Software Decelerators Eric Keller, Gordon Brebner and Phil James-Roxby Xilinx Research Labs.
CPS 4150 Computer Organization Fall 2006 Ching-Song Don Wei.
SLAAC SLD Update Steve Crago USC/ISI September 14, 1999 DARPA.
DSP Architectures Additional Slides Professor S. Srinivasan Electrical Engineering Department I.I.T.-Madras, Chennai –
Remcom Inc. 315 S. Allen St., Suite 416  State College, PA  USA Tel:  Fax:   ©
Acceleration of the Retinal Vascular Tracing Algorithm using FPGAs Direction Filter Design FPGA FIREBIRD BOARD Framegrabber PCI Bus Host Data Packing Design.
1 EEE 431 Computational Methods in Electrodynamics Lecture 8 By Dr. Rasime Uyguroglu
November 29, 2011 Final Presentation. Team Members Troy Huguet Computer Engineer Post-Route Testing Parker Jacobs Computer Engineer Post-Route Testing.
FPGA Hardware Synthesis Jessica Baxter. Reference M. Haldar, A. Nayak, N. Shenoy, A. Choudhary and P. Banerjee, “FPGA Hardware Synthesis from MATLAB”,
Backprojection and Synthetic Aperture Radar Processing on a HHPC Albert Conti, Ben Cordes, Prof. Miriam Leeser, Prof. Eric Miller
Fail-Safe Module for Unmanned Autonomous Vehicle
1 EEE 431 Computational Methods in Electrodynamics Lecture 7 By Dr. Rasime Uyguroglu
DDRIII BASED GENERAL PURPOSE FIFO ON VIRTEX-6 FPGA ML605 BOARD PART B PRESENTATION STUDENTS: OLEG KORENEV EUGENE REZNIK SUPERVISOR: ROLF HILGENDORF 1 Semester:
Formulation of 2D‐FDTD without a PML.
In this work, we propose a novel local mesh refinement algorithm based on the use of transformation optics. The new algorithm is an alternative way to.
The Finite Difference Time Domain Method FDTD Haythem H. abdullah ERI, Electronics Research Institute, Microwave Engineering Dept.
1 Modeling Considerations for the Hardware-Software Co-design of Flexible Modern Wireless Transceivers Benjamin Drozdenko, Matthew Zimmermann, Tuan Dao,
Relaxation Methods in the Solution of Partial Differential Equations
Sridhar Rajagopal Bryan A. Jones and Joseph R. Cavallaro
Fast & Accurate Biophotonic Simulation for Personalized Photodynamic Cancer Therapy Treatment Planning Investigators: Vaughn Betz, University of Toronto.
Backprojection Project Update January 2002
Two-Dimensional Phase Unwrapping On FPGAs And GPUs
Cellular Automata Project:
Dynamo: A Runtime Codesign Environment
Parallel Beam Back Projection: Implementation
Dynamo: A Runtime Codesign Environment
Wavelet “Block-Processing” for Reduced Memory Transfers
XILINX CPLDs The Total ISP Solution
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
Optimizing stencil code for FPGA
Presentation transcript:

1 An FPGA Implementation of the Two-Dimensional Finite-Difference Time-Domain (FDTD) Algorithm Wang Chen Panos Kosmas Miriam Leeser Carey Rappaport Northeastern University Boston, MA

2 FDTD Algorithm and Implementation  Finite Difference Time-Domain  Method for solving Maxwell’s equations  Used for buried object detection  Hardware Implementation  3D to 2D model simplification  Data dependency analysis  Fixed-point quantization

3 Finite-Difference Time-Domain Method Maxwell’s Equations  A direct time-domain solution of Maxwell's equations  Accurate and flexible for solving electromagnetic problems  Discretize time and electromagnetic space

4 FDTD Method (cont’d) One FDTD Equation Yee CellTaylor Series Expansion Adjacent Cells

5 FDTD Applications  Antenna Design  Discrete Scattering Studies  Medical Studies  The study of the cell phone electromagnetic waves' effect on human brain  The study of breast cancer detection using electromagnetic antenna

6 Buried Object Detection Forward Model Buried Object Detection Model Space

7 FDTD Simulated Model Space

8 FDTD Simulated Model Space (cont’d)

9 Related Work  Software acceleration of FDTD  Parallel computers do not provide significant speedup  FPGA implementations of FDTD  1D FDTD on hardware: architecture is too simple  Full 3D FDTD on hardware developed at UDel  Design is slower than software: uses complex floating-point representation no parallelism or pipelining  Our 2D FDTD hardware implementation  24 times speedup compare to 3.0G PC:  fixed-point representation  expandable structure

10 3D to 2D Model Simplification

11 Exterior Boundary Conditions 3D Model Space 2D Model Space 4 Edges 6 Faces and 12 Edges Mur-type Absorbing Boundary Condition

12 Data Dependency Analysis

13 Hardware Acceleration  Smart memory interface  Parallelism  Pipelining  Quantized fixed point representation  Less area in datapath -- more parallelism  Careful error analysis to ensure accurate results

14 Fixed-point quantization

15 Design Flow

16 Firebird FPGA Board from Annapolis  A Xilinx VIRTEX-E XCV2000E with 2.5 million system gates  Processing clock up to 150MHz FDTD runs at 70 MHz  Five independent memory banks (4 x 64-bit, 1 x 32-bit) 288Mbytes in total  6.6Gbytes/sec of memory bandwidth  3Gbytes/sec of I/O bandwidth SlicesBlockRAM Number Available Number Used Percentage Used46%54% Utilization of Xilinx XCV2000E FPGA Chip

17 FDTD on Firebird Board

18 Memory Interface

19 Pipelining and Parallelism

20 Data Flow

21 Results and Performance

22 Conclusions  FPGA Implementation of FDTD exhibits significant speedup compared to software: 24 times faster than 3GHz PC  With larger FPGA, more parallelism will be available, hence more speedup  Current design easily extendible to handle multiple types of materials, 3D space

23 Future Work  Upgrade curent design to handle multiple types of materials  Upgrade to 3D model space  Add three more field updating algorithms: same structure as the original three algorithms  Upgrade boundary condition updating algorithm  Redesign memory interface  Apply FDTD Hardware to other applications