Ariadne’s Thread Kristian Zarb Adami. Simulator Aims ۞ Provide the system architect a tool to visualise trade-offs in designs ۞ Provide the scientist.

Slides:



Advertisements
Similar presentations
Beamformer implementations (Mike Jones, Kris Zarb Adami, David Sinclair, Chris Shenton) Starting with top level considerations for now, ie Not, which FPGA.
Advertisements

SKAMP Square Kilometre Array Molonglo Prototype. Supporting Institutions  University of Sydney  Argus Technologies  ATNF  ICT Centre.
DSPs Vs General Purpose Microprocessors
Digital FX Correlator Nimish Sane Center for Solar-Terrestrial Research New Jersey Institute of Technology, Newark, NJ EOVSA Technical Design Meeting.
Schediwy: Advantages and Disadvantages of Sparse Aperture Arrays Fermilab: October 2009.
NATIONAL INSTITUTE OF SCIENCE & TECHNOLOGY Presented by: Susman Das Technical Seminar Presentation FPAA for Analog Circuit Design Presented by Susman.
18/05/2015 Calice meeting Prague Status Report on ADC LPC ILC Group.
DFT Filter Banks Steven Liddell Prof. Justin Jonas.
BDT Radio – 2b – CMV 2009/10/09 Basic Detection Techniques 2b (2009/10/09): Focal Plane Arrays Case study: WSRT System overview Receiver and.
Sascha D-PAD Sparse Aperture Array.
Prototype SKA Technologies at Molonglo: 3. Beamformer and Correlator J.D. Bunton Telecommunications and Industrial Physics, CSIRO. Australia. Correlator.
Introduction to Analog-to-Digital Converters
STARLight PDR 3 Oct ‘01H.1 Miller STARLight Sensor Signal Processing Ryan Miller STARLight Electrical Engineer (734)
Signal Processing for Aperture Arrays. AAVS1 256 antenna elements distributed over –4 stations –64 elements each.
Dec 2010 AAVP Cambridge workshop AAVP AAVS1/2-low demonstrators Jan Geralt Bij de Vaate.
Keith Grainge Calibration requirementsAAVP 2010 Calibration and Imaging Requirements Keith Grainge (plus teams at ASTRON, Oxford, Cambridge)
The World Leader in High Performance Signal Processing Solutions Audio ADC/DACs Primer David Hossack.
Ninth Synthesis Imaging Summer School Socorro, June 15-22, 2004 Cross Correlators Walter Brisken.
Use of FOS for Airborne Radar Target Detection of other Aircraft Example PDS Presentation for EEE 455 / 457 Preliminary Design Specification Presentation.
LOFAR Antenna Systems Dion Kant, Wim van Cappellen AAVP – 10 December 2010, Cambridge, UK.
Name1 SKA(DS) System Design Aspects 4 th SKADS Workshop, Lisbon, 2-3 October 2008 SKA(DS) System Design Aspects: building a system Laurens Bakker.
GallagherP188/MAPLD20041 Accelerating DSP Algorithms Using FPGAs Sean Gallagher DSP Specialist Xilinx Inc.
Low Cost TDC Using FPGA Logic Cell Delay Jinyuan Wu, Z. Shi For CKM Collaboration Jan
An Energy-Efficient Reconfigurable Multiprocessor IC for DSP Applications Multiple programmable VLIW processors arranged in a ring topology –Balances its.
DARPA Digital Audio Receiver, Processor and Amplifier Group Z James Cotton Bobak Nazer Ryan Verret.
AA-Low Technical Progress Meeting, October 2012, Medicina, Italy AAVS0 & AAVS0.5: System Design and Test Plan Nima Razavi-Ghods Eloy de Lera Acedo.
Backend electronics for radioastronomy G. Comoretto.
Digital Filters Mike Davis. Requirements Avoid non-linearity up to and through the analog to digital (A/D) converter Use enough bits to adequately represent.
By Grégory Brillant Background calibration techniques for multistage pipelined ADCs with digital redundancy.
Computational Technologies for Digital Pulse Compression
Phased Array Feeds John O’Sullivan SKANZ 2012 CSIRO Astronomy and Space Science,
FPGA-based Dedispersion for Fast Transient Search John Dickey 23 Nov 2005 Orange, NSW.
NTD/xNTD Signal Processing Presented by: John Bunton Signal Processing team: Joseph Pathikulangara, Jayasri Joseph, Ludi de Souza and John Bunton Plus.
XNTD/SKAMP/LFD Correlator 4th RadioNet Engineering Forum Workshop Next Generation Correlators for Radio Astronomy and Geodesy June 2006, Groningen,
The BEST-2 SKA pathfinder is composed by 32 receivers. The front ends are installed on the focal lines of 8 cylinders, offering a collecting area of about.
Casper Signal Processing Workshop 2009 SKA Signal Processing (Preliminary) Wallace Turner Domain Specialist for Signal Processing.
Correlator Growth Path EVLA Advisory Committee Meeting, March 19-20, 2009 Michael P. Rupen Project Scientist for WIDAR.
FE8113 ”High Speed Data Converters”. Part 2: Digital background calibration.
Casper 2010Marc Torres Part 2: Building blocks for the next generation.
Integrated receivers for mid-band SKA Suzy Jackson Engineer, Australia Telescope National Facility SKADS FP6 Meeting – Chateau de Limelette – 4-6 November,
1 Advanced ATNF Correlators Dick Ferris AT Electronics Development Group May 2003.
Andrew Faulkner 2 nd SKADS Workshop October 2007 SKADS Benchmark Scenario Andrew Faulkner.
1 Reconfigurable Acceleration of Microphone Array Algorithms for Speech Enhancement Ka Fai Cedric Yiu, Yao Lu, Xiaoxiang Shi The Hong Kong Polytechnic.
Australian Astronomy MNRF Development of Monolithic Microwave Integrated Circuits (MMIC) ATCA Broadband Backend (CABB)
SkauConfidential and Proprietary Honeywell Page 1 Algorithm Flow: GMTI ECCM Beamforming (Jammer Nulling) 3.4 GOPS Adaptive Weights 0.12 GOPS Matrix Formation.
Adaptive Filters for RFI Mitigation in Radioastronomy
The Correlators ( Spectrometers ) Mopra Induction - May 2005.
ASKAP: Setting the scene Max Voronkov ASKAP Computing 23 rd August 2010.
AAVS processing: Uniboard implementation. UNIBOARD Jive led FP7 project UniBoard, high integration density >> processing / m3
Andrew Faulkner1 DS4 Deliverables 4 th SKADS Workshop, Lisbon DS4 Deliverables Andrew Faulkner.
A real-time software backend for the GMRT : towards hybrid backends CASPER meeting Capetown 30th September 2009 Collaborators : Jayanta Roy (NCRA) Yashwant.
Graphical Design Environment for a Reconfigurable Processor IAmE Abstract The Field Programmable Processor Array (FPPA) is a new reconfigurable architecture.
Philippe Picard 2 nd SKADS Workshop October 2007 Station Processing Philippe Picard Observatoire de Paris Meudon, 11th October 2007.
1 Advanced ATNF Correlators Dick Ferris AT Electronics Group.
Low Power, High-Throughput AD Converters
S. Bota – Calorimeter Electronics overview - July 2002 Status of SPD electronics Very Front End Review of ASIC runs What’s new: RUN 4 and 5 Next Actions.
An FFT for Wireless Protocols Dr. J. Greg Nash Centar ( HAWAI'I INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES Mobile.
0 /59 Nyquist Rate ADCs Dr. Hossein Shamsi ECE Dept, K.N. Toosi University of Technology.
Digital Signal Processor HANYANG UNIVERSITY 학기 Digital Signal Processor 조 성 호 교수님 담당조교 : 임대현
Netherlands Institute for Radio Astronomy 1 APERTIF beamformer and correlator requirements Laurens Bakker.
FP7 Uniboard project Digital Receiver G. Comoretto, A. Russo, G. Tuccari, A Baudry, P. Camino, B. Quertier Dwingeloo, February 27, 2009.
JIVE UniBoard Correlator External Review
Introduction.
OSKAR station simulator
2PAD’s Beamforming Software
Chapter 1 Introduction.
Aperture Array Station Processing
Technical Foundations & Enabling Technologies – DS4
Aperture Array Simulations
Fixed-point Analysis of Digital Filters
Presentation transcript:

Ariadne’s Thread Kristian Zarb Adami

Simulator Aims ۞ Provide the system architect a tool to visualise trade-offs in designs ۞ Provide the scientist top-level performance numbers for a given system ۞ Maintain a close relationship with the cost-model so that cost estimates can be derived from the model ۞ Maintain a close relationship with the telescope simulation environments (eg. MeqTrees) ۞ Provide an easy route to implementation

The Specification hierarchy … Line Sensitivity Continuum Sensitivity Survey Speed FoV Dynamic Range BandwidthA eff T sys Dynamic Range # of bits# of channels # of beams Front-End Analog Antenna Efficiency Digital Processing Signal Transport CORRELATION & IMAGE PIPE Cost & Functional Model Sky Science Engineering Software

Digital Aperture Array Beam Pattern showing interference cancellation

2-Pad System Example ۞ Log-Periodic Dipole Array between 0.3 – 1GHz ۞ Demonstrate All-Digital Aperture Array for beam- forming, calibration and RFI mitigation ۞ Demonstrate scalability in terms of power consumption, cost and performance ۞ Demonstrate functionality in terms of bits, bandwidth and beams... (0... One-tile sub-array antenna unit of 8x8x2 LP dipoles Main LN Amplifiers and WB Filters 63)x2 8x8x2 ADC... (063)x2 (063)x2... Frequency splitting onto 1024 subbands... (0 1023)x2... (01023)x2 Frequency splitting onto 1024 subbands 2.4GS/s 4-bit real 4-bit imag Equaliser 2.4MS/s 4-bit real 4-bit imag 1024x2 8-bit preset coefficients... Polarisation correction of each H-V pair... Polarisation correction of each H-V pair... (0 1023)x2 (0... 8x8x2 2D FFT 0 8x8x2 2D FFT (0 63)x Beam selector and beam- steering 8 Beam selector and beam- steering... (0 8)x2 FOV correction FOV correction... (0 8)x2... (0 8)x2 Analog Processing Digital Processing

Analog Model Outside Station

Digitisation ۞1 st filter response ۞Channel Selection ۞4-bit ‘Ideal’ ADC

A/D Converters ۞ 4-bit 2.4 GS/s CMOS already available at sample level ۞ Front-End bit-count can be increased to 8 (~50dB of Dynamic Range) for RFI mitigation, with reduced bit- count at the back-end ۞ Equalization ‘may’ also be included in this block ۞ Decimation and digital formatting can be integrated into back-end of A/D

Channelization ۞ Form channels from each ADC ۞ Different channelisation algorithms can be tried ۞ Equalisation can be implemented as part of the channelisation block ۞ Data is now moved to the beamformer (windowing can occur in beamforming block)

Channelization Algorithms Algorithm Decomposition into 1024 sub-bands No. of GMACs Per polarisation per element DSP Operations Direct time DecimatorFor 5-taps = 1272 GOps FFT (no-filtering)For Radix-2 = 24144GOps Polyphase + FFT5-taps FIR filter + FFT (Radix-2) 216GOps Analysis Filter Bank (Wavelets) For 10 stages = 4802,880GOps

Cost/Power Calculations for 1 st stage Total RequirementsStream XXX (register files)XXX (on-chip mem.) Chips to sustain Ops1 11 Power for Operations (FIR)1.92W Power for Operations (FFT)1.92W Chips to sustain Bandwidth2 11 Total Power required for Bandwidth3.00E-01W Chips required2.00E E+00 Power/Chip for Operations (FIR)9.60E-01W 9.60E+01 Power/Chip for Operations (FFT)9.60E-01W 9.60E+01 Power/Chip for Bandwidth1.50E-01W3.00E-01 Total Power per Chip (FIR)1.11W (FFT)1.11W Cost per Chip ($)149$100 Cost per GMAC ($)1.16$0.78 Total Power2.22W Total Cost298$100.00

۞ Side-lobes considerably reduced with increasing number of bits ۞ Trade-off number of taps with coefficient accuracy ۞ Aim to relate this with science goals Poly-phase filtering Response

FFT Beamformer ۞ Form many beams at once ۞ Different beams in different directions ۞ Windowing function can be used to suppress sidelobes (especially 13.0dB from rectangular window) ۞ Trade-off windowing with sidelobe height and A eff /Gain

8-pt FFT ۞ 8-pt FFT real-time model implementation ۞ Twiddles can be retrieved from memory ۞ Easy mapping onto FPGA/RPA interface

Fixed vs. Floating Point ۞ Bit-growth per stage ۞ 4-bit FFT vs. floating point

Windowing for Sidelobe Cancellation

Reconfigurable Processor Arrays ۞ Heterogeneous Array of Instruction cells ۞ Rate controller to control delay in data paths ۞ I/O Ports mapped as instruction cells ۞ Low Power / Area ۞ Highly Configurable ۞ High I/O Bandwidth Configuration stream Program counter Data Memory or ADC I/O Ports Program Memory Rate Control + << x

Example Code void oned_dct (int *coeff,int *block) { b0 = coeff[0]; b1 = coeff[1]; b2 = coeff[2]; b3 = coeff[3]; b4 = coeff[4]; b5 = coeff[5]; b6 = coeff[6]; b7 = coeff[7]; e = b1 * const_f7 - b7 * const_f1; f = b5 * const_f3 - b3 * const_f5; c4 = e + f; c5 = e - f; h = b7 * const_f7 + b1 * const_f1; g = b3 * const_f3 + b5 * const_f5; c6 = h - g; c7 = h + g; c0 = (b0 + b4) * const_f4; c1 = (b0 - b4) * const_f4; c2 = b2 * const_f6 - b6 * const_f2; c3 = b6 * const_f6 + b2 * const_f2; b5 = (c6 - c5) * const_f0; b6 = (c6 + c5) * const_f0; b0 = c0 + c3; b1 = c1 + c2; b2 = c1 - c2; b3 = c0 - c3; block[0] = b0 + c7; block[1] = b1 + b6; block[6] = b1 - b6; block[7] = b0 - c7; block[2] = b2 + b5; block[3] = b3 + c4; block[4] = b3 - c4; block[5] = b2 - b5; } Datapath example: Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 2 ns 1 ns 4 ns 2 ns 3 ns 1 ns

RPA Solution - Datapath Illustration Step 1 Step 2 Step 3 Step 4 Step 5 Step 6 Configuration stream Program counter Data Memory or ADC I/O Ports Program Memory Rate Control + << x Off-chip bandwidth ~ 10 Gbit/s Intra-cell bandwidth ~ 100 Gbit/s

Future Work ۞ Include clock delays for each block to ensure real-time data tracking (as well as JITTER !!!) ۞ Include a calibration model for Gain/Phase each antenna ۞ Relate beam-forming to FoV to work out the trade-off between number of beams vs. no. of bits/ beam ۞ Relate DSP-induced noise to noise temperature to work out contribution to T sys ۞ Relate side-lobe suppression models to Dynamic Range and Image Fidelity ۞ Produce Cost / Power benchmarks for each block for input into the Cost Model ۞ Extend model to the full-frequency coverage of a station ۞ Produce technology and architecture roadmaps including cable architecture

Extra slides Science Goals Technology Capabilities $$ Software Capabilities Simulation s

Error Modelling ۞ Quantization error dependent on bit-width ۞ Scaling considerations are IMPORTANT