Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro,

Slides:



Advertisements
Similar presentations
OFDM Transmission over Wideband Channel
Advertisements

Introduction[1] •Three techniques are used independently or in tandem to improve receiver signal quality •Equalization compensates for.
Multiuser Detection for CDMA Systems
VLSI Communication SystemsRecap VLSI Communication Systems RECAP.
Comparison of different MIMO-OFDM signal detectors for LTE
Real-Time DSP Multiprocessor Implementation for Future Wireless Base-Station Receivers Bryan Jones, Sridhar Rajagopal, and Dr. Joseph Cavallaro.
Multiuser Detection in CDMA A. Chockalingam Assistant Professor Indian Institute of Science, Bangalore-12
Data-Parallel Digital Signal Processors: Algorithm mapping, Architecture scaling, and Workload adaptation Sridhar Rajagopal.
ISSPIT Ajman University of Science & Technology, UAE
EE360: Lecture 8 Outline Multiuser Detection
An FPGA Based Adaptive Viterbi Decoder Sriram Swaminathan Russell Tessier Department of ECE University of Massachusetts Amherst.
II. Medium Access & Cellular Standards. TDMA/FDMA/CDMA.
1 Lecture 9: Diversity Chapter 7 – Equalization, Diversity, and Coding.
Sub-Nyquist Sampling DSP & SCD Modules Presented by: Omer Kiselov, Daniel Primor Supervised by: Ina Rivkin, Moshe Mishali Winter 2010High Speed Digital.
Implementation Issues for Channel Estimation and Detection Algorithms for W-CDMA Sridhar Rajagopal and Joseph Cavallaro ECE Dept.
Introduction to Adaptive Digital Filters Algorithms
1 Techniques to control noise and fading l Noise and fading are the primary sources of distortion in communication channels l Techniques to reduce noise.
Efficient FPGA Implementation of QR
A bit-streaming, pipelined multiuser detector for wireless communications Sridhar Rajagopal and Joseph R. Cavallaro Rice University
Multiuser Detection (MUD) Combined with array signal processing in current wireless communication environments Wed. 박사 3학기 구 정 회.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro,
ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.
NTU Confidential Baseband Transceiver Design for the DVB-Terrestrial Standard Baseband Transceiver Design for the DVB-Terrestrial Standard Advisor : Tzi-Dar.
RICE UNIVERSITY DSPs for 4G wireless systems Sridhar Rajagopal, Scott Rixner, Joseph R. Cavallaro and Behnaam Aazhang This work has been supported by Nokia,
TI DSPS FEST 1999 Implementation of Channel Estimation and Multiuser Detection Algorithms for W-CDMA on Digital Signal Processors Sridhar Rajagopal Gang.
Ali Al-Saihati ID# Ghassan Linjawi
DSP Processors We have seen that the Multiply and Accumulate (MAC) operation is very prevalent in DSP computation computation of energy MA filters AR filters.
RICE UNIVERSITY SWAPs: Re-thinking mobile and base-station architectures Sridhar Rajagopal VLSI Signal Processing Group Center for Multimedia Communication.
Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal Srikrishna Bhashyam, Joseph R. Cavallaro,
RICE UNIVERSITY DSP architectures for wireless communications Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
RICE UNIVERSITY “Joint” architecture & algorithm designs for baseband signal processing Sridhar Rajagopal and Joseph R. Cavallaro Rice Center for Multimedia.
RICE UNIVERSITY Advanced Wireless Receivers: Algorithmic and Architectural Optimizations Suman Das Rice University Department of Electrical and Computer.
RICE UNIVERSITY Flexible wireless communication architectures Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
Motivation Wireless Communication Environment Noise Multipath (ISI!) Demands Multimedia applications  High rate Data communication  Reliability.
RICE UNIVERSITY A real-time baseband communications processor for high data rate wireless systems Sridhar Rajagopal ECE Department Ph.D.
A Semi-Blind Technique for MIMO Channel Matrix Estimation Aditya Jagannatham and Bhaskar D. Rao The proposed algorithm performs well compared to its training.
RICE UNIVERSITY DSPs for future wireless systems Sridhar Rajagopal.
DSP Architectural Considerations for Optimal Baseband Processing Sridhar Rajagopal Scott Rixner Joseph R. Cavallaro Behnaam Aazhang Rice University, Houston,
Implementing algorithms for advanced communication systems -- My bag of tricks Sridhar Rajagopal Electrical and Computer Engineering This work is supported.
Pipelining and number theory for multiuser detection Sridhar Rajagopal and Joseph R. Cavallaro Rice University This work is supported by Nokia, TI, TATP.
RICE UNIVERSITY On the architecture design of a 3G W-CDMA/W-LAN receiver Sridhar Rajagopal and Joseph R. Cavallaro Rice University Center for Multimedia.
RICE UNIVERSITY Flexible wireless communication architectures Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
Implementing Multiuser Channel Estimation and Detection for W-CDMA Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro and Behnaam Aazhang Rice.
Presented by Rajatha Raghavendra
RICE UNIVERSITY Flexible wireless communication architectures Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
Single Correlator Based UWB Receiver Implementation through Channel Shortening Equalizer By Syed Imtiaz Husain and Jinho Choi School of Electrical Engineering.
SR: 599 report Channel Estimation for W-CDMA on DSPs Sridhar Rajagopal ECE Dept., Rice University Elec 599.
Algorithms and Architectures for Future Wireless Base-Stations Sridhar Rajagopal and Joseph Cavallaro ECE Department Rice University April 19, 2000 This.
RICE UNIVERSITY Handset architectures Sridhar Rajagopal ASICsProgrammable The support for this work in.
RICE UNIVERSITY Flexible wireless communication architectures Sridhar Rajagopal Department of Electrical and Computer Engineering Rice University, Houston.
Optimal Sequence Allocation and Multi-rate CDMA Systems Krishna Kiran Mukkavilli, Sridhar Rajagopal, Tarik Muharemovic, Vikram Kanodia.
Channel Equalization in MIMO Downlink and ASIP Architectures Predrag Radosavljevic Rice University March 29, 2004.
Sridhar Rajagopal Bryan A. Jones and Joseph R. Cavallaro
Techniques to control noise and fading
A programmable communications processor for future wireless systems
Sridhar Rajagopal April 26, 2000
Optimal Sequence Allocation and Multi-rate CDMA Systems
How to ATTACK Problems Facing 3G Wireless Communication Systems
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
Modeling of RF in W-CDMA with SystemView
Sridhar Rajagopal and Joseph R. Cavallaro Rice University
DSPs for Future Wireless Base-Stations
On-line arithmetic for detection in digital communication receivers
Modeling of RF in W-CDMA with SystemView
Sridhar Rajagopal, Srikrishna Bhashyam,
EM based Multiuser detection in Fading Multipath Environments
DSP Architectures for Future Wireless Base-Stations
On-line arithmetic for detection in digital communication receivers
Suman Das, Sridhar Rajagopal, Chaitali Sengupta and Joseph R.Cavallaro
DSPs for Future Wireless Base-Stations
Presentation transcript:

Efficient VLSI architectures for baseband signal processing in wireless base-station receivers Sridhar Rajagopal, Srikrishna Bhashyam, Joseph R. Cavallaro, and Behnaam Aazhang This work is supported by Nokia, TI, TATP and NSF

Introduction Real-time VLSI architecture for multiuser channel estimation Multiuser channel estimation usually neglected – high computational complexity - DSPs infeasible –Single user sliding correlator structures used Iterative fixed point algorithm developed Area-Time tradeoffs presented –Area-Constrained,Time-Constrained, Area-Time efficient

Baseband signal processing Base-Station Receiver Channel estimation DetectionDecoding Multiple Users Antenna Detected Bits Tracking Training

Channel estimation Direct Path Reflected Path Noise +MAI User 1 User 2 Base Station compensate for unknown fading amplitudes and asynchronous delays.

Need for multiuser channel estimation Detector performance depends on accuracy of channel estimator Multiuser Channel Estimation –Jointly estimate parameters for all users –Better performance than single user estimates

Computing multiuser channel estimates Computed by sending a training sequence of known bits to the receiver. When absent, detected bits can be used to update estimates in a decision feedback mode for tracking. Importance of multiuser estimation usually neglected May exceed detector complexity

Multiuser Channel Estimation Algorithm = {+1, -1} : Training/Tracking bits = 8-bit integer (complex) : Received signal N = spreading gain (typically fixed,e.g: 32) K = number of users (variable, <=N) = Maximum Likelihood channel estimate

Implementation complexity Matrix inversions (size 32x32) per window Unable to meet real-time on DSPs [Asilomar’99] VLSI fixed-point architectures for matrix inversions –Difficult to design, Finite precision problems Typically, simpler single-user sliding correlator structures used.

Outline What is multiuser channel estimation? Need for multiuser channel estimation Implementation problems Algorithm enhancements VLSI architectures –Area-constrained,Time-constrained, Area-Time efficient Conclusions

Iterative scheme for channel estimation Bit-streaming : suitable for tracking (window length L) Method of gradient descent Stable convergence behavior Simple fixed-point VLSI architecture

Simulations - Static multipath channel SINR = 0 dB Paths =3 Preamble =150 Spreading N = 31 Users K = 15

Rayleigh Fading channel with tracking Doppler = 10 Kmph

Outline What is multiuser channel estimation? Need for multiuser channel estimation Implementation problems Algorithm enhancements VLSI architectures –Area-constrained,Time-constrained, Area-Time efficient Conclusions

Area-Time Tradeoffs Design for 32 users (K) and spreading code (N) 32 Target = 128 Kbps (4000 cycles at 500 MHz). Assume single cycle addition/multiplication Area-Constrained Architecture :Pico-cells/fewer users Time-Constrained Architecture : Maximum data rates Area-Time Efficient Architecture : Real-Time

Task decomposition: channel estimation IterateCorrelation Matrices (Per Bit) Pilot Bits Pilot MUXMUX Detected Bits Data MUXMUX A O(4K 2 N,8) R br O(2KN,8 ) R bb O(2K 2,8) TIME Channel Estimate to Detector b 0 (2K,1) Tracking Window r 0 (N,8) b(2K,1) r(N,8) L

Architecture design: auto-correlation b = {+1,-1} Multiplication is a XNOR operation Matrix updated using XNOR gates Auto-correlation matrix implemented as an UP/DOWN counter(s)

Architecture design: cross-correlation b = {+1,-1}, r = 8-bit integer vector (complex) Multiplications reduce to additions/subtractions Matrix (complex) can be updated with 8-bit adders Cross-correlation matrix stored as RAM.

Architecture design: channel estimate A = 8-bit integer matrix (complex) µ << 1 : Truncated multiplication [Schulte’93] Matrix-matrix (real-complex) multiplication of integers Forms the bottleneck (8-bit multipliers) Concentrate on multiplication for area-time tradeoffs!

Area-Constrained Architecture b0 b Channel Estimate

Area-constrained Architecture: Hardware Requirements

Time-constrained Architecture b*b T b 0 *b 0 T b b0b0 MUX R br MUXMUX r r0r0 MUXMUX R bb A Mult Subtract >> Subtract 2K*1 K(2K-1)*1 2K 2 *8 2KN*16 2KN*8 2K*1 N*8 2KN*8 Channel Estimate

Auto-correlation Update in Parallel R bb (i,j) Counter bb T (i,j) U/D# R bb (i,i) Counter 1 U/D# Array of XNORs a·ba·ca·d b·cb·d c·d bcda b (2K) bb T (K*{2K-1}*1) R bb (2K 2 *8) Array of Counters

Cross-Correlation Update in Parallel bcda b (2K*1) r (N*8) R br (2KN*8) r(j) R br (i,j) Adder b(i) Add/ Sub# 88 1

Time-constrained Architecture: Hardware Requirements

Area-Time efficient architecture design Area - constrained Architecture –Minimize area - single 8-bit multiplier –4K 2 N cycles (128,000 cycles ; 3.81 Kbps) Time-constrained Architecture –Minimize time - 4K 2 N 8-bit multipliers –Log 2 (2K) cycles (6 cycles ; Mbps) Aim : To meet real-time with min. area overhead Different parallelism levels for multipliers

Area-Time Efficient Architecture b*b T b 0 *b 0 T b b0b0 MUX MUXMUX r r0r0 Mult Subtract >> Subtract 2K*1 2K*8 1*16 1*8 1*1 1*8 N*8 1*8 R br Counters StoreLoad R bb A (i) DEMUX MUX A (i-1) 1*8 Adder 1*8 2K*1 2K*8 Channel Estimate

Area-Time Efficient Architecture: Hardware Requirements

Outline What is multiuser channel estimation? Need for multiuser channel estimation Implementation problems Algorithm enhancements VLSI architectures –Area-constrained,Time-constrained, Area-Time efficient Conclusions

Comparisons DSPs unable to exploit bit-level parallelism Inefficient storage of bits Replacing multiplications by additions/subtractions

Scalability of Architectures with K Disadvantages of VLSI architectures Design for maximum number of users in the system If there are fewer users, –Turn off functional units to reduce power –Reconfigure hardware for higher data rates (FPGA) –Dr. Cavallaro, don’t know to handle this Question properly –We never designed an architecture/algorithm for varying number of users dynamically. (Though we had started on it) –What should be included in future work? –Please give suggestions!!

Conclusions Real-Time VLSI architecture for multiuser channel estimation Iterative fixed-point algorithm developed to avoid matrix inversions Area-Time Tradeoffs presented –Area-Constrained, Time-Constrained, Area-Time efficient VLSI architectures exploit bit-level computations and parallelism to meet real-time.