NCTU, CS VLSI Information Processing Research Lab 研究生 : ABSTRACT Introduction NEW Recursive DFT/IDFT architecture Low computation cycle  1/2: Chebyshev.

Slides:

Advertisements

Similar presentations

Acceleration of Cooley-Tukey algorithm using Maxeler machine

Advertisements

David Hansen and James Michelussi

Cost-Effective Pipeline FFT/IFFT VLSI Architecture for DVB-H System Present by: Yuan-Chu Yu Chin-Teng Lin and Yuan-Chu Yu Department of Electrical and.

© KLMH Lienig 1 Impact of Local Interconnects and a Tree Growing Algorithm for Post-Grid Clock Distribution Jiayi Xiao.

Contents 1. Introduction 2. UWB Signal processing 3. Compressed Sensing Theory 3.1 Sparse representation of signals 3.2 AIC (analog to information converter)

1 Asynchronous Bit-stream Compression (ABC) IEEE 2006 ABC Asynchronous Bit-stream Compression Arkadiy Morgenshtein, Avinoam Kolodny, Ran Ginosar Technion.

ELEC692 VLSI Signal Processing Architecture Lecture 9 VLSI Architecture for Discrete Cosine Transform.

VLSI Arithmetic Adders Prof. Vojin G. Oklobdzija University of California

Chapter 15 Digital Signal Processing

May 29, Final Presentation Sajib Barua1 Development of a Parallel Fast Fourier Transform Algorithm for Derivative Pricing Using MPI Sajib Barua.

10/11/05ELEC / Lecture 121 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.

N. Karampetakis, S. Vologiannidis

IC-SOC STEAC: An SOC Test Integration Platform Cheng-Wen Wu.

Viterbi Decoder: Presentation #1 Omar Ahmad Prateek Goenka Saim Qidwai Lingyan Sun M1 Overall Project Objective: Design of a high speed Viterbi Decoder.

Comparison of LFSR and CA for BIST

BIN LI, HOUQIAN LI, LI LI, AND JINLEI ZHANG IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL.23, NO.9, SEPTEMBER

Low power and cost effective VLSI design for an MP3 audio decoder using an optimized synthesis- subband approach T.-H. Tsai and Y.-C. Yang Department of.

ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU A New Algorithm to Compute the Discrete Cosine Transform VLSI Signal Processing 台灣大學電機系.

DELAY INSERTION METHOD IN CLOCK SKEW SCHEDULING BARIS TASKIN and IVAN S. KOURTEV ISPD 2005 High Performance Integrated Circuit Design Lab. Department of.

Requirements Determine processor core Determine the number of hardware profiles and the benefits of each profile Determine functionality of each profile.

Introduction to Adaptive Digital Filters Algorithms

A Thermal-Aware Mapping Algorithm for Reducing Peak Temperature of an Accelerator Deployed in a 3D Stack A Thermal-Aware Mapping Algorithm for Reducing.

VLSI Arithmetic Adders & Multipliers Prof. Vojin G. Oklobdzija University of California

Graduate Category: Engineering and Technology Degree Level: Ph.D. Abstract ID# 122 On-Chip Spectral Analysis for Built-In Testing and Digital Calibration.

TEMPLATE DESIGN © Gate-Diffusion Input (GDI) Technique for Low Power CMOS Logic Circuits Design Yerkebulan Saparov, Aktanberdi.

ABSTRACT Introduction NEW Recursive DFT/IDFT architecture Low computation cycle  1/2: Chebyshev polynomial  2/N: Folded architecture High speed  Register-splitting.

Doc.: IEEE /0161r1 Submission doc.: IEEE /1131r0 Sept K. Ishihara et al.,(NTT) Slide 1 Sept Slide 1 Time-Domain CSI Compression.

Fast Memory Addressing Scheme for Radix-4 FFT Implementation Presented by Cheng-Chien Wu, Master Student of CSIE,CCU 1 Author: Xin Xiao, Erdal Oruklu and.

Implementation of Finite Field Inversion

Adviser：高永安 Student：林柏廷

Low-Power and Area-Efficient Carry Select Adder on Reconfigurable Hardware Presented by V.Santhosh kumar, B.Tech,ECE,4 th Year, GITAM University Under.

Reconfigurable Computing Using Content Addressable Memory (CAM) for Improved Performance and Resource Usage Group Members: Anderson Raid Marie Beltrao.

Low-Power H.264 Video Compression Architecture for Mobile Communication Student: Tai-Jung Huang Advisor: Jar-Ferr Yang Teacher: Jenn-Jier Lien.

Radix-2 2 Based Low Power Reconfigurable FFT Processor Presented by Cheng-Chien Wu, Master Student of CSIE,CCU 1 Author: Gin-Der Wu and Yi-Ming Liu Department.

Figure 1.a AVS China encoder [3] Video Bit stream.

Area: VLSI Signal Processing.

Paper Reading - A New Approach to Pipeline FFT Processor Presenter:Chia-Hsin Chen, Yen-Chi Lee Mentor:Chenjo Instructor:Andy Wu.

Fourier Analysis of Discrete Time Signals

ACCESS IC LAB Graduate Institute of Electronics Engineering, NTU Under-Graduate Project Case Study: Single-path Delay Feedback FFT Speaker: Yu-Min.

ON THE INTERMEDIATE SYMBOL RECOVERY RATE OF RATELESS CODES Ali Talari, and Nazanin Rahnavard IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 60, NO. 5, MAY 2012.

ECEN 679 Project 1 Topic 22: Fault Location Using Synchronized Phasors Po-Chen Chen Instructor: Dr. M. Kezunovic Mar. 3, 2014.

Multi-Split-Row Threshold Decoding Implementations for LDPC Codes

Priority encoder. Overview Priority encoder- theoretic view Other implementations The chosen implementation- simulations Calculations and comparisons.

Speaker: Darcy Tsai Advisor: Prof. An-Yeu Wu Date: 2013/10/31

Tae- Hyoung Kim, Hanyong Eom, John Keane Presented by Mandeep Singh

A New Class of High Performance FFTs Dr. J. Greg Nash Centar ( High Performance Embedded Computing (HPEC) Workshop.

Case Study: Implementing the MPEG-4 AS Profile on a Multi-core System on Chip Architecture R 楊峰偉 R 張哲瑜 R 陳宸.

Homework II Fast Discrete Cosine Transform Jain-Yi Lu ( 呂健益 ) Visual Communications Laboratory Department of Communication Engineering National Central.

Fast VLSI Implementation of Sorting Algorithm for Standard Median Filters Hyeong-Seok Yu SungKyunKwan Univ. Dept. of ECE, Vada Lab.

VLSI Design of 2-D Discrete Wavelet Transform for Area-Efficient and High- Speed Image Computing - End Presentation Presentor: Eyal Vakrat Instructor:

EEL 5722 FPGA Design Fall 2003 Digit-Serial DSP Functions Part I.

November 25Asian Test Symposium 2008, Nov 24-27, Sapporo, Japan1 Sequential Circuit BIST Synthesis using Spectrum and Noise from ATPG Patterns Nitin Yogi.

AN ENHANCED LOW POWER HIGH SPEED ADDER FOR ERROR TOLERANT APPLICATIONS BY K.RAJASHEKHAR, , VLSI Design.

FEC decoding algorithm overview VLSI 자동설계연구실 정재헌.

Array Multiplier Haibin Wang Qiong Wu. Outlines Background & Motivation Principles Implementation & Simulation Advantages & Disadvantages Conclusions.

CORDIC Based 64-Point Radix-2 FFT Processor

FFT corrections for tune measurements

Fang Fang James C. Hoe Markus Püschel Smarahara Misra

Interconnect and Packaging Chapter 1: Spectrum and Resonance (digital vs. analog) Chung-Kuan Cheng UC San Diego.

By: Mohammadreza Meidnai Urmia university, Urmia, Iran Fall 2014

Integer transform and Triangular matrix scheme

A New Approach to Pipeline FFT Processor

Discrete Cosine Transform (DCT)

2D DCT in ARM-based JPEG Processor

1-D DISCRETE COSINE TRANSFORM DCT

C Model Sim (Fixed-Point) -A New Approach to Pipeline FFT Processor

Speaker: Chris Chen Advisor: Prof. An-Yeu Wu Date: 2014/10/28

DSPs for Future Wireless Base-Stations

Rich QR Codes With Three-Layer Information Using Hamming Code

Presentation transcript:

NCTU, CS VLSI Information Processing Research Lab 研究生 : ABSTRACT Introduction NEW Recursive DFT/IDFT architecture Low computation cycle  1/2: Chebyshev polynomial  2/N: Folded architecture High speed  Register-splitting and computation-sharing scheme New Recursive Formula For DFT/IDFT Challenges: High-Performance and Area-Aware VLSI In this work, Proposed two design Core Type: N 2 /2 Folded Architecture: N,where The Proposed Recursive DFT formula: The Proposed Recursive IDFT formula: Cost & Speed The Proposed Core-Type DFT Architecture The Proposed Core-Type IDFT Architecture Achieve Data Buffer  bit word length complex data storage Control Unit  Clock Gated Control  Sequence Controller  Parameter Controller 32 PEs32 PEs  Pre-processing for S k and r k  TWO PEs for DST and DCT DFT Length (N) Input Word Length Critical Delay Time Active Chip Area Power Consumption Process Technology 212/106-point recursive DFT/IDFT Design  For DTMF Detector System Simulation and Implementation Results Lower round of error due to the fewest computation cycle AWGN Channel Comparisons Results ParametersSecond Order DFT/IDFT V-Y’s Structure [4] (Core Type) Y-C’s Structure [6] (FFR-DFT) Proposed Work1 (Core Type) Proposed Work2 (Folded Type) Conclusion A new recursive DFT/IDFT architecture based on the hybrid of Input strength reduction, Chebyshev polynomial and register-splitting schemes is proposed. The proposed VLSI algorithms lead to the fewest computation cycle and higher speed than others. The proposed core type and folding type recursive architecture with regular organization is certainly amenable to VLSI implementation. References [1] M. D. Felder, J. C. Mason, and B. L. Evans, “Efficient dual-tone multifrequency detection using the nonuniform discrete Fourier transform,” IEEE Signal Processing Lett., vol. 5, pp , Jul [2] G. Goertzel, “An algorithm for the evaluation of finite trigonometric series,” American Math. Monthly, vol. 65, pp , Jan [3] V. V. Cizek, “Recursive calculation of Fourier transform of discrete signal,” IEEE Int. Conf. Acoustics, Speech, and Signal Processing, May 1982, pp [4] L. D. Van and C. C. Yang, "High-speed area-efficient recursive DFT/IDFT architectures," in Proc. IEEE Int. Symp. Circuits Syst., May 2004, vol. 3, pp , Vancuover, Canada. [5] H. V. Sorensen, D. L. Jones, M. T. Heideman, C S. Burrus, “Real-valued fast Fourier transform algorithms,” IEEE Trans. Acoustics, Speech, and Signal Processing, vol. 35, pp , June [6] C. H. Chen, B. D. Liu, J. F. Yang, and J. L. Wang, “Efficient Recursive Structures for forward and inverse discrete cosine transform,” IEEE Trans. Signal Processing, vol. 52, pp , Sep 指導教授 : 范倫達博士 Regularity construct by the N/2 PEs in parallel No intermediate register bank needed Further reduce the computation cycle to N N = (N 2 /2) / (N/2) Processor latency: 64 clock (Computation cycles) Critical Path: T m +2T a The Proposed Folded Type DFT/IDFT Architecture