High Speed, Low Power FIR Digital Filter Implementation Presented by, Praveen Dongara and Rahul Bhasin.

Slides:



Advertisements
Similar presentations
UNIT 2: Data Flow description
Advertisements

DSPs Vs General Purpose Microprocessors
Programmable FIR Filter Design
1 ECE734 VLSI Arrays for Digital Signal Processing Chapter 3 Parallel and Pipelined Processing.
Distributed Arithmetic
1 KU College of Engineering Elec 204: Digital Systems Design Lecture 9 Programmable Configurations Read Only Memory (ROM) – –a fixed array of AND gates.
Using Carry-Save Adders For Radix- 4, Can Be Used to Generate 3a – No Booth’s Slight Delay Penalty from CSA – 3 Gates.
Modern VLSI Design 2e: Chapter 6 Copyright  1998 Prentice Hall PTR Topics n Multipliers.
FIR Tap Filter Optimization CE222 Final Project Spring 2003 S oleste H ilberg N icole S tarr.
VLSI Design Spring03 UCSC By Prof Scott Wakefield Final Project By Shaoming Ding Jun Hu
UNIVERSITY OF MASSACHUSETTS Dept
Digital Kommunikationselektronik TNE027 Lecture 4 1 Finite Impulse Response (FIR) Digital Filters Digital filters are rapidly replacing classic analog.
Contemporary Logic Design Arithmetic Circuits © R.H. Katz Lecture #24: Arithmetic Circuits -1 Arithmetic Circuits (Part II) Randy H. Katz University of.
ECE 301 – Digital Electronics
Vector Multiplication & Color Convolution Team Members Vinay Chinta Sreenivas Patil EECC VLSI Design Projects Dr. Ken Hsu.
1 Real time signal processing SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and Applications Miodrag Bolic.
Kathy Grimes. Signals Electrical Mechanical Acoustic Most real-world signals are Analog – they vary continuously over time Many Limitations with Analog.
FPGA Based Fuzzy Logic Controller for Semi- Active Suspensions Aws Abu-Khudhair.
Word-Size Optimization for Low Energy, Variable Workload Sub-threshold Systems Sudhanshu Khanna, Anurag Nigam ECE 632 – Fall 2008 University of Virginia.
Aug Shift Operations Source: David Harris. Aug Shifter Implementation Regular layout, can be compact, use transmission gates to avoid threshold.
Institute of Applied Microelectronics and Computer Engineering College of Computer Science and Electrical Engineering, University of Rostock Slide 1 Spezielle.
Matrix Multiplication on FPGA Final presentation One semester – winter 2014/15 By : Dana Abergel and Alex Fonariov Supervisor : Mony Orbach High Speed.
Digital Integrated Circuits Chpt. 5Lec /29/2006 CSE477 VLSI Digital Circuits Fall 2002 Lecture 21: Multiplier Design Mary Jane Irwin (
P. 4.1 Digital Technology and Computer Fundamentals Chapter 4 Digital Components.
L7: Pipelining and Parallel Processing VADA Lab..
Software Defined Radio 長庚電機通訊組 碩一 張晉銓 指導教授 : 黃文傑博士.
FPGA-Based System Design: Chapter 4 Copyright  2004 Prentice Hall PTR Topics n Multipliers.
Lecture 4 Multiplier using FPGA 2007/09/28 Prof. C.M. Kyung.
Scott Robinson Aaron Sikorski Peter Phelps.  Introduction  FIR Filter Design  Optimization  Application  Edge Detection  Sobel Filter  Communications.
A Reconfigurable Low-power High-Performance Matrix Multiplier Architecture With Borrow Parallel Counters Counters : Rong Lin SUNY at Geneseo
Pipelined and Parallel Computing Data Dependency Analysis for 1 Hongtao Du AICIP Research Mar 9, 2006.
Scott Robinson Aaron Sikorski Peter Phelps.  Introduction  FIR Filter Design  Optimization  Application  Edge Detection  Sobel Filter  Communications.
ECE 734 Project Implementation of Multiple Constant Multiplication Algorithms for FIR Filters Hamid Shojaei.
ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU Under-Graduate Project Improving Timing, Area, and Power Speaker: 黃乃珊 Adviser: Prof.
ELEC692 VLSI Signal Processing Architecture Lecture 2 Pipelining and Parallel Processing.
Which one? You have a vector, a[ ], of random integers, which can modern CPUs do faster and why? //find max of vector of random ints max=0; for (inda=0;
By: Daniel Barsky, Natalie Pistunovich Supervisors: Rolf Hilgendorf, Ina Rivkin Characterization Sub Nyquist Implementation Optimization 11/04/2010.
Topics Multipliers..
Full Tree Multipliers All k PPs Produced Simultaneously Input to k-input Multioperand Tree Multiples of a (Binary, High-Radix or Recoded) Formed at Top.
A New Class of High Performance FFTs Dr. J. Greg Nash Centar ( High Performance Embedded Computing (HPEC) Workshop.
Comparison of Various Multipliers for Performance Issues 24 March Depart. Of Electronics By: Manto Kwan High Speed & Low Power ASIC
UNIT 2. ADDITION & SUBTRACTION OF SIGNED NUMBERS.
Multipliers. More complicated than addition accomplished via shifting and addition More time and more area Multiplication What should be the length of.
Reconfigurable Computing - Options in Circuit Design John Morris Chung-Ang University The University of Auckland ‘Iolanthe’ at 13 knots on Cockburn Sound,
L9 : Low Power DSP Jun-Dong Cho SungKyunKwan Univ. Dept. of ECE, Vada Lab.
Reconfigurable Computing - Options in Circuit Design John Morris Chung-Ang University The University of Auckland ‘Iolanthe’ at 13 knots on Cockburn Sound,
Full Adder Truth Table Conjugate Symmetry A B C CARRY SUM
Array Multiplier Haibin Wang Qiong Wu. Outlines Background & Motivation Principles Implementation & Simulation Advantages & Disadvantages Conclusions.
Array multiplier TU/e Processor Design 5Z032.
Multiplier Design [Adapted from Rabaey’s Digital Integrated Circuits, Second Edition, ©2003 J. Rabaey, A. Chandrakasan, B. Nikolic]
EEE4176 Applications of Digital Signal Processing
Pipelining and Retiming 1
By: Mohammadreza Meidnai Urmia university, Urmia, Iran Fall 2014
Serial Multipliers Prawat Nagvajara
Multipliers Multipliers play an important role in today’s digital signal processing and various other applications. The common multiplication method is.
DESIGN AND IMPLEMENTATION OF DIGITAL FILTER
Multiplier-less Multiplication by Constants
ARM implementation the design is divided into a data path section that is described in register transfer level (RTL) notation control section that is viewed.
Chapter 6 Discrete-Time System
Number Systems and Circuits for Addition
Topics Multipliers..
Digital Signal Processing
UNIVERSITY OF MASSACHUSETTS Dept
Lecture 9 Digital VLSI System Design Laboratory
Comparison of Various Multipliers for Performance Issues
XOR Function Logic Symbol  Description  Truth Table 
Real time signal processing
Half & Full Subtractor Half Subtractor Full Subtractor.
Half & Full Subtractor Half Subtractor Full Subtractor.
UNIVERSITY OF MASSACHUSETTS Dept
Presentation transcript:

High Speed, Low Power FIR Digital Filter Implementation Presented by, Praveen Dongara and Rahul Bhasin

Flow Chart Motivation Brief Discussion on FIR Full Adder Design Pipelined Multiplier (8 X 8) Pipelined adder (16 X 16) Results Trouble shooting

Motivation FIR - Finite Impulse Response Filter –Fundamental processing unit in DSP systems High frequency applications –Video imaging Low power applications –Wireless communications

Brief Discussion on FIR Filters An N-tap FIR filter can be described by: Direct Implementation T sample  T M + T A

FIR Discussion contd. 2-parallel FIR Filter Parallel Processing Advantage –Reduced power consumption –Or high speed Disadvantage –Overhead of area 2-parallel design

Why Pipelined/Parallel ? Pipelined to enable higher sampling rates –Sampling frequency can be increased n-fold if we have n pipeline stages. Parallel for low power

Why Pipelined/Parallel? Contd. - V can be decreased by  - C increases by L (L=2) - f can be decreased by L

Optimizations at various levels Improvement Achieved Levels of design hierarchy

Full Adder Dynamic Logic True Single Phase Clocking (TSPC)

Full Adder contd.

Pipelined Multiplier Baugh-Wooley Algorithm for 8 X 8 multiplication 2’s complement numbers

Pipelined Multiplier Floor Plan Latency of 12 cycles Partial product summing full adder array Vector merge adder Latch stages to skew the multiplier bits b0-b7 Deskewing latches for the product bits

Pipelined Multiplier Layout

Pipelined Multiplier contd. Example –Input vectors a1 X b If a=1 and b=1 we have the following output sequence. –

Pipelined Multiplier contd.

16 X 16 Pipelined Adder Latency of 8 cycles Triangular array of half adders Merging of two half adder rows –Leads to decrease in latency Advantage –Regularity of design –Fewer deskewing latches

16 X 16 Pipelined Adder Layout

FIR Filter Layout

Results

Trouble Shooting Convergence problems Elmore/Penfield analysis requires lot of disk space and time