RICE UNIVERSITY “Joint” architecture & algorithm designs for baseband signal processing Sridhar Rajagopal and Joseph R. Cavallaro Rice Center for Multimedia.

Slides:



Advertisements
Similar presentations
Multiuser Detection for CDMA Systems
Advertisements

Lecture 4 Introduction to Digital Signal Processors (DSPs) Dr. Konstantinos Tatas.
VLSI Communication SystemsRecap VLSI Communication Systems RECAP.
Development of Parallel Simulator for Wireless WCDMA Network Hong Zhang Communication lab of HUT.
Real-Time DSP Multiprocessor Implementation for Future Wireless Base-Station Receivers Bryan Jones, Sridhar Rajagopal, and Dr. Joseph Cavallaro.
Data-Parallel Digital Signal Processors: Algorithm mapping, Architecture scaling, and Workload adaptation Sridhar Rajagopal.
EE360: Lecture 8 Outline Multiuser Detection
A Programmable Coprocessor Architecture for Wireless Applications Yuan Lin, Nadav Baron, Hyunseok Lee, Scott Mahlke, Trevor Mudge Advance Computer Architecture.
A Low-Power Low-Memory Real-Time ASR System. Outline Overview of Automatic Speech Recognition (ASR) systems Sub-vector clustering and parameter quantization.
A stack based tree searching method for the implementation of the List Sphere Decoder ASP-DAC 2006 paper review Presenter : Chun-Hung Lai.
An Energy-Efficient Reconfigurable Multiprocessor IC for DSP Applications Multiple programmable VLIW processors arranged in a ring topology –Balances its.
Implementation Issues for Channel Estimation and Detection Algorithms for W-CDMA Sridhar Rajagopal and Joseph Cavallaro ECE Dept.
DSPs in Wireless Communication Systems Vishwas Sundaramurthy Electrical and Computer Engineering Department, Rice University, Houston,TX.
1 Miodrag Bolic ARCHITECTURES FOR EFFICIENT IMPLEMENTATION OF PARTICLE FILTERS Department of Electrical and Computer Engineering Stony Brook University.
RICE UNIVERSITY Implementing the Viterbi algorithm on programmable processors Sridhar Rajagopal Elec 696
A bit-streaming, pipelined multiuser detector for wireless communications Sridhar Rajagopal and Joseph R. Cavallaro Rice University
Multiuser Detection (MUD) Combined with array signal processing in current wireless communication environments Wed. 박사 3학기 구 정 회.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro,
ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.
RICE UNIVERSITY DSPs for 4G wireless systems Sridhar Rajagopal, Scott Rixner, Joseph R. Cavallaro and Behnaam Aazhang This work has been supported by Nokia,
CHANNEL ESTIMATION FOR MIMO- OFDM COMMUNICATION SYSTEM PRESENTER: OYERINDE, OLUTAYO OYEYEMI SUPERVISOR: PROFESSOR S. H. MNENEY AFFILIATION:SCHOOL OF ELECTRICAL,
TI DSPS FEST 1999 Implementation of Channel Estimation and Multiuser Detection Algorithms for W-CDMA on Digital Signal Processors Sridhar Rajagopal Gang.
RICE UNIVERSITY SWAPs: Re-thinking mobile and base-station architectures Sridhar Rajagopal VLSI Signal Processing Group Center for Multimedia Communication.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal Srikrishna Bhashyam, Joseph R. Cavallaro,
RICE UNIVERSITY DSP architectures for wireless communications Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
RICE UNIVERSITY Advanced Wireless Receivers: Algorithmic and Architectural Optimizations Suman Das Rice University Department of Electrical and Computer.
RICE UNIVERSITY Flexible wireless communication architectures Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
RICE UNIVERSITY A real-time baseband communications processor for high data rate wireless systems Sridhar Rajagopal ECE Department Ph.D.
RICE UNIVERSITY DSPs for future wireless systems Sridhar Rajagopal.
DSP Architectural Considerations for Optimal Baseband Processing Sridhar Rajagopal Scott Rixner Joseph R. Cavallaro Behnaam Aazhang Rice University, Houston,
VIRGINIA POLYTECHNIC INSTITUTE & STATE UNIVERSITY MOBILE & PORTABLE RADIO RESEARCH GROUP MPRG Combined Multiuser Reception and Channel Decoding for TDMA.
Implementing algorithms for advanced communication systems -- My bag of tricks Sridhar Rajagopal Electrical and Computer Engineering This work is supported.
Pipelining and number theory for multiuser detection Sridhar Rajagopal and Joseph R. Cavallaro Rice University This work is supported by Nokia, TI, TATP.
RICE UNIVERSITY On the architecture design of a 3G W-CDMA/W-LAN receiver Sridhar Rajagopal and Joseph R. Cavallaro Rice University Center for Multimedia.
RICE UNIVERSITY Flexible wireless communication architectures Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
Implementing Multiuser Channel Estimation and Detection for W-CDMA Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro and Behnaam Aazhang Rice.
DSP base-station comparisons. Second generation (2G) wireless 2 nd generation: digital: last decade: 1990’s Voice and low bit-rate data –~14.4 – 28.8.
RICE UNIVERSITY Flexible wireless communication architectures Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
Muhammad Shoaib Bin Altaf. Outline Motivation Actual Flow Optimizations Approach Results Conclusion.
SR: 599 report Channel Estimation for W-CDMA on DSPs Sridhar Rajagopal ECE Dept., Rice University Elec 599.
Algorithms and Architectures for Future Wireless Base-Stations Sridhar Rajagopal and Joseph Cavallaro ECE Department Rice University April 19, 2000 This.
Application of Addition Algorithms Joe Cavallaro.
Architectural Effects on DSP Algorithms and Optimizations Sajal Dogra Ritesh Rathore.
RICE UNIVERSITY Handset architectures Sridhar Rajagopal ASICsProgrammable The support for this work in.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro,
An FFT for Wireless Protocols Dr. J. Greg Nash Centar ( HAWAI'I INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES Mobile.
RICE UNIVERSITY Flexible wireless communication architectures Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
RICE UNIVERSITY SWAPs: Re-thinking mobile and base-station architectures Sridhar Rajagopal VLSI Signal Processing Group Center for Multimedia Communication.
Optimal Sequence Allocation and Multi-rate CDMA Systems Krishna Kiran Mukkavilli, Sridhar Rajagopal, Tarik Muharemovic, Vikram Kanodia.
Channel Equalization in MIMO Downlink and ASIP Architectures Predrag Radosavljevic Rice University March 29, 2004.
Sridhar Rajagopal Bryan A. Jones and Joseph R. Cavallaro
CORDIC (Coordinate rotation digital computer)
Low-power Digital Signal Processing for Mobile Phone chipsets
A programmable communications processor for future wireless systems
Sridhar Rajagopal April 26, 2000
Optimal Sequence Allocation and Multi-rate CDMA Systems
How to ATTACK Problems Facing 3G Wireless Communication Systems
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
Modeling of RF in W-CDMA with SystemView
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
DSPs for Future Wireless Base-Stations
On-line arithmetic for detection in digital communication receivers
Sridhar Rajagopal COMP 625 April 17, 2000
Modeling of RF in W-CDMA with SystemView
Sridhar Rajagopal, Srikrishna Bhashyam,
DSPs in emerging wireless systems
DSP Architectures for Future Wireless Base-Stations
On-line arithmetic for detection in digital communication receivers
Suman Das, Sridhar Rajagopal, Chaitali Sengupta and Joseph R.Cavallaro
DSPs for Future Wireless Base-Stations
Presentation transcript:

RICE UNIVERSITY “Joint” architecture & algorithm designs for baseband signal processing Sridhar Rajagopal and Joseph R. Cavallaro Rice Center for Multimedia Communications This work has been supported by Nokia, TI, TATP and NSF

RICE UNIVERSITY Single-slide version of my talk Algorithms DSP VLSI FPGA IMAGINE Multiuser channel estimation Multiuser detection Task-partitioning Parallelism Pipelining Conventional arithmetic On-line arithmetic Instruction set extensions Co-processor support Functional unit design and usage Distant Past Recent Past Recent and Near Future

RICE UNIVERSITY Contents  Algorithms for channel estimation and detection  Conventional and on-line arithmetic designs  Programmable architecture design using the IMAGINE simulator

RICE UNIVERSITY Estimation - detection algorithms?  Sophisticated, computationally complex algorithms proposed for 3G - 4G standards  Typically need complex operations, huge matrix sizes, matrix inversions  Difficult for hardware implementation and for real- time performance

RICE UNIVERSITY Multiuser channel estimation algorithm  = {+1, -1} : training/tracking bits  = 8-bit integer (complex) : Received signal  N = spreading gain (typically fixed, e.g. 32)  K = number of users (variable, <=N)  = maximum likelihood channel estimate

RICE UNIVERSITY Iterative scheme for channel estimation  Bit-streaming : suitable for tracking (window length L)  Method of gradient descent  Stable convergence behavior  Simple fixed-point VLSI architecture [ASAP 2000]

RICE UNIVERSITY Comparisons  DSPs unable to exploit bit-level parallelism  Inefficient storage of bits  Replacing multiplications by additions/subtractions

RICE UNIVERSITY Multiuser detection innovations  Developed a simple architecture for asynchronous multiuser detection for CDMA [ +, x ]  Bit-streaming  reduced latency  eliminates window edge computations  lower memory requirements  Pipelined stages  higher throughput (with more hardware)

RICE UNIVERSITY Block Pipelined Detector Variable latency [Worst case (1st bit)  D*latency per bit] 2 extra edge bit computations per stage. 11 MF 22 Bits TIME 1 MF 12 Bits PIC 1211 PIC 22 1 PIC 1211 PIC 22 1 PIC 1211 PIC 22

RICE UNIVERSITY Bit-streaming multiuser detection Savings in memory by D 2

RICE UNIVERSITY Pipelining the multiuser detector Matched Filter (causal) PIC - Stage 1 PIC - Stage 2 PIC - Stage 3 TIME Latency = 2*latency per bit (D/2 speedup over block) eliminated edge bit computations. [ISCAS 2001]

RICE UNIVERSITY Contents  Algorithms for channel estimation and detection  Conventional and on-line arithmetic designs  Programmable architecture design using the IMAGINE simulator

RICE UNIVERSITY Matched filter with conventional arithmetic T ~ log(N) * log(d) N - dot product size d - precision

RICE UNIVERSITY Conventional MF using CSAs T ~ a + log(d+c) a,c - constants

RICE UNIVERSITY Key concept in on-line arithmetic  Conventional detection - high precision operations (8-32 bits) followed by testing for sign.  Actual detection dependent only on most significant digits (1-3 bits).  Use MSDF computation to find the sign and avoid computation of the successive digits. [Arith-15] Detection

RICE UNIVERSITY Comparisons of arithmetic schemes

RICE UNIVERSITY Using on-line arithmetic for detection Channel -1, Received Signal Amplitude (Normalized) Time taken for addition (Normalized)

RICE UNIVERSITY Equations Probability of error for optimal BPSK detection Probability of error for on-line BPSK detection r – radix of the number system p – number of digits

RICE UNIVERSITY Probability of error using on-line

RICE UNIVERSITY On-line MF implementation T ~ c c - constant

RICE UNIVERSITY Throughput comparisons

RICE UNIVERSITY Area comparisons

RICE UNIVERSITY Implementing higher modulation schemes

RICE UNIVERSITY Conclusions on arithmetic schemes  CSAs better than straightforward implementation  X speedup for 8-32 bit precision  X less area  If reduced precision computations, on-line still better  X speedup over CSA  X less area over CSA

RICE UNIVERSITY Contents  Algorithms for channel estimation and detection  Conventional and on-line arithmetic designs  Programmable architecture design using the IMAGINE simulator

RICE UNIVERSITY A programmable architecture?  Flexibility in the algorithm requirements  channel dependent computations  changing algorithms on-the-fly  seamless switching between wireless LAN and wideband CDMA -- RENE.  Simulator needed to test performance of algorithms  extensions/modifications for critical operations

RICE UNIVERSITY Algorithms needed for 3G base-band base-station implementation  Equalization  FFT  Viterbi decoding  Channel estimation  Multiuser detection  Viterbi/Turbo decoding  Multiple antennas  Long spreading codes  Space-Time codes Wireless LAN W-CDMA If you felt that life was too easy

RICE UNIVERSITY The IMAGINE architecture and simulator  IMAGINE is a media signal processor, built at Stanford.  Many common workload features  Good starting point to explore.  Local expertise - Dr. Scott Rixner ( )

RICE UNIVERSITY IMAGINE architecture  Great for media processing algorithms  1024 pt FFT in 7.4  s on a 500 MHz processor with a 8-cluster (48 units)  3.8W of power  Great for parallel, vector and streaming computations  Performance/extensions to sequential computation kernels such as Viterbi traceback needs to be investigated.

RICE UNIVERSITY Conclusions  Algorithm steps for designing communication systems  Design hardware-efficient versions  Fixed-point implementation  DSP implementation - bottlenecks  Task partitioning, pipelining, parallelism  Computer arithmetic ideas -- VLSI  Integration into a programmable processor