Digital FX Correlator Nimish Sane Center for Solar-Terrestrial Research New Jersey Institute of Technology, Newark, NJ EOVSA Preliminary Design Review.

Slides:



Advertisements
Similar presentations
West Coast Spectrometer Team Mark Wagner, Berkeley project manager, FPGA designer Terry Filiba, data transport: FPGA --> CPU --> GPU Suraj Gowda, boosting.
Advertisements

Digital FX Correlator Nimish Sane Center for Solar-Terrestrial Research New Jersey Institute of Technology, Newark, NJ EOVSA Technical Design Meeting.
Digital RF Stabilization System Based on MicroTCA Technology - Libera LLRF Robert Černe May 2010, RT10, Lisboa
David Hawkins Exascale Signal Processing for Millimeter-Wavelength Radio Interferometers David Hawkins
Digital FX Correlator Nimish Sane Center for Solar-Terrestrial Research New Jersey Institute of Technology, Newark, NJ EOVSA Technical Design Meeting.
Implement a 2x2 MIMO OFDM-based channel measurement system (no data yet) at 2.4 GHz Perform baseband processing and digital up and down conversion on Nallatech.
Dale E. Gary Professor, Physics, Center for Solar-Terrestrial Research New Jersey Institute of Technology 1 3/15/2012OVSA Preliminary Design Review Meeting.
Digital FX correlator Samuel Tun. FASR Subsystem Testbed (FST) 1-9 GHz in 500 MHz band recorded at 1 GS/s from each antenna. Correlation carried out offline.
Dale E. Gary Professor, Physics, Center for Solar-Terrestrial Research New Jersey Institute of Technology 1 11/7/2011OVSA Technical Design Meeting.
EOVSA Data Volume / Data Rates Gordon Hurford Nov 8, 2011 EOVSA Technical Design Meeting - NJIT.
A SINGLE FREQUENCY GPS SOFTWARE RECEIVER
FF-1 9/30/2003 UTD Practical Priority Contention Resolution for Slotted Optical Burst Switching Networks Farid Farahmand The University of Texas at Dallas.
Prototype SKA Technologies at Molonglo: 3. Beamformer and Correlator J.D. Bunton Telecommunications and Industrial Physics, CSIRO. Australia. Correlator.
Programmable logic and FPGA
Configuration. Mirjana Stojanovic Process of loading bitstream of a design into the configuration memory. Bitstream is the transmission.
Ninth Synthesis Imaging Summer School Socorro, June 15-22, 2004 Cross Correlators Walter Brisken.
Dale E. Gary Professor, Physics, Center for Solar-Terrestrial Research New Jersey Institute of Technology 1 03/16/2012OVSA Preliminary Design Review.
System Architecture A Reconfigurable and Programmable Gigabit Network Interface Card Jeff Shafer, Hyong-Youb Kim, Paul Willmann, Dr. Scott Rixner Rice.
Dale E. Gary Professor, Physics, Center for Solar-Terrestrial Research New Jersey Institute of Technology 1 09/24/2012Prototype Review Meeting.
GallagherP188/MAPLD20041 Accelerating DSP Algorithms Using FPGAs Sean Gallagher DSP Specialist Xilinx Inc.
Transition Converter " Supply signals from new antennas to old correlator. " Will be discarded or abandoned in place when old correlator is turned off.
Highly Scalable Packetised correlators Jason Manley CASPER workshop 2009.
Backend electronics for radioastronomy G. Comoretto.
Sub-Nyquist Reconstruction Final Presentation Winter 2010/2011 By: Yousef Badran Supervisors: Asaf Elron Ina Rivkin Technion Israel Institute of Technology.
Dale E. Gary Professor, Physics, Center for Solar-Terrestrial Research New Jersey Institute of Technology 1 9/25/2012Prototype Review Meeting.
Dale E. Gary Professor, Physics, Center for Solar-Terrestrial Research New Jersey Institute of Technology 1 9/26/2012Prototype Review Meeting.
DLS Digital Controller Tony Dobbing Head of Power Supplies Group.
Prototype Test of SPring-8 FADC Module Da-Shung Su Wen-Chen Chang 02/07/2002.
Nov 1, 2011 RN - 1 Jet Propulsion Laboratory California Institute of Technology Implementation Issues and Choices for VLBI data Acquisition System in DSN.
The GNU in RADIO Shravan Rayanchu. SDR Getting the code close to the antenna –Software defines the waveform –Replace analog signal processing with Digital.
Accelerating a Software Radio Astronomy Correlator By Andrew Woods Supervisor: Prof. Inggs & Dr Langman.
8/12/201514th RHESSI Workshop 1 Dale E. Gary, Center for Solar-Terrestrial Research, Physics Department, NJIT.
Digital Packaging Processor Gordon Hurford Jim McTiernan EOVSA PDR 15-March-2012.
EOVSA Data and Database System Gordon Hurford and Jim McTiernan EOVSA Prototype Readiness Review 24-Sept-2012.
Correlator Growth Path EVLA Advisory Committee Meeting, March 19-20, 2009 Michael P. Rupen Project Scientist for WIDAR.
Casper 2010Marc Torres Part 2: Building blocks for the next generation.
CPT Week, April 2001Darin Acosta1 Status of the Next Generation CSC Track-Finder D.Acosta University of Florida.
© ASTRON On the Fly LOFAR Station Correlator André W. Gunst.
CASPER Packetised correlators Jason Manley CASPER workshop 2009.
Overclocking the V Why we want to overclocking the V1495? Data compression and transfer over multiple clock cycles allows more detailed information.
MeerKAT DBE Past Present Future Alan Langman 28 September 2009.
Features of the new Alibava firmware: 1. Universal for laboratory use (readout of stand-alone detector via USB interface) and for the telescope readout.
Dale E. Gary Professor, Physics, Center for Solar-Terrestrial Research New Jersey Institute of Technology 1 9/25/2012Prototype Review Meeting.
An FX software correlator for VLBI Adam Deller Swinburne University Australia Telescope National Facility (ATNF)
S.Montebugnoli BEST 4 th SKADS Workshop, Lisbon, 2-3 October 2008 Stelio Montebugnoli IRA -INAF- Bologna –Italy- SKADS 4th -Lisbon- Oct 2-3, 2008.
Jason Manley, Aaron Parsons, Don Backer, Henry Chen, Terry Filiba, David MacMahon, Peter McMahon, Arash Parsa, Andrew Siemion, Dan Werthimer, Mel Wright.
Final Presentation Final Presentation OFDM implementation and performance test Performed by: Tomer Ben Oz Ariel Shleifer Guided by: Mony Orbach Duration:
Chinese Real Time VLBI Correlator Xiang Ying, Xu Zhijun, Zhu Renjie, Zhang Xiuzhong, Shu Fengchun, Zheng Weimin Shanghai Astronomical Observatory China.
Short introduction Pulsar Parkes. Outline PDFB – Single beam pulsar timing system CASPER – Single beam pulsar coherent dedispersion system.
Digital Packaging Processor - Overview Gordon Hurford Nov 7, 2011 EOVSA Technical Design Meeting - NJIT.
Travis Newton LO-IF Engineer EVLA LO/IF PDR January IF Downconverter Travis Newton LO/IF Engineer.
Philippe Picard 2 nd SKADS Workshop October 2007 Station Processing Philippe Picard Observatoire de Paris Meudon, 11th October 2007.
Queuing Delay 1. Access Delay Some protocols require a sender to “gain access” to the channel –The channel is shared and some time is used trying to determine.
1 Implementation of Polymorphic Matrix Inversion using Viva Arvind Sudarsanam, Dasu Aravind Utah State University.
LKr readout and trigger R. Fantechi 3/2/2010. The CARE structure.
1 Status report 2011/8/12 Atsushi Nukariya. 2 Progress ・ FPGA -> Revision is completed. -> Obtained data from front-end chip. ・ Software.
Atacama Large Millimeter/submillimeter Array Karl G. Jansky Very Large Array Robert C. Byrd Green Bank Telescope Very Long Baseline Array ALMA Correlator.
Possible Instrumentation Development Items for SKA at ASIAA Chau-Ching Chiong (ASIAA) and Yuh-Jing Hwang, Homin Jiang, Chao-Te Li.
Netherlands Institute for Radio Astronomy 1 APERTIF beamformer and correlator requirements Laurens Bakker.
FP7 Uniboard project Digital Receiver G. Comoretto, A. Russo, G. Tuccari, A Baudry, P. Camino, B. Quertier Dwingeloo, February 27, 2009.
Enhancement Presentation Carlos Abellan Barcelona September, 9th 2009.
JIVE UniBoard Correlator External Review
Digital Down Converter (DDC)
LLRF Research and Development at STF-KEK
Mark 5 / VLBA Correlator Topics
JIVE UniBoard Correlator (JUC) Firmware
FIT Front End Electronics & Readout
Correlator – Backend System Overview
5 Gsps ADC Developement and Future Collaboration
Correlator Growth Path
Presentation transcript:

Digital FX Correlator Nimish Sane Center for Solar-Terrestrial Research New Jersey Institute of Technology, Newark, NJ EOVSA Preliminary Design Review March 15-17, 2012

Overview Nimish Sane, NJIT2 No. of antennas16 No. of polarizations2 No. of frequency channels (subbands)4096 Integration time (ms)20 (possibly, tunable) IF (MHz)600 ADCF-EngineX-Engine P, P 2 Calculation

Hardware KatADC Roach-2 board [1] – Virtex-6 SX475T FPGA (XC6VSX475T-1FFG1759C) – PowerPC 440EPx stand-alone processor to provide control functions – 2 x Multi-gigabit transceiver break out card slots, supporting up to 8x10Ge links which may be CX4 or SFP+ 8 boards with 2 antennas (dual-polarization) per board Nimish Sane, NJIT3

KatADC Hardware – 20dB Gain Block (50.0MHz MHz) [RF front-end can be upgraded with a higher frequency device (SBB-5089Z: 50.0MHz - 6.0GHz)](SBB-5089Z: 50.0MHz - 6.0GHz) – 0dB to 31.5dB Variable Attenuator (controllable in 0.5dB steps) – Non-reflective 50ohm RF switch to disconnect input – Provision for a fixed attenuator (LAT-series) Software Library (“Yellow Block”) – Available from SKA, South Africa group – Not clear how to control attenuation and enable inputs (problems when using software registers as inputs to this yellow block) Nimish Sane, NJIT4

F-Engine Coarse delay: The maximum value supported should be actually ADC samples (corresponding to 10000ns) For polarimetry: We will need values for each frequency channel for each sky frequency band (34 x 4096) While converting to circular polarization, the factor of 1/sqrt(2) has not been included. This is taken care of by diving the result of P or X-correlation by 2 (right shift by 1 bit in digital hardware). This results in 5-bit output in case circular polarization is used. When using linear polarization, the MSB is always zero. g x, g y, D x, D y Phase Switching Pattern Phase Switching Pattern delay [0, 10000] delay [0, 10000] To DPP Nimish Sane, NJIT5

F-Engine Comments We decided not to do fine delay correction in the correlator. Data rates: – No. of F-engines per Roach board = 4 (2 antennas dual polarization) – P (32-bit): Data rate: 32 * 4096 * 50 * 4 bits/sec – P 2 (64-bit): Data rate: 64 * 4096 * 50 * 4 bits/sec – Total data rate per Roach board (F-Engine to DPP): 96 * 4096 * 50 * 4 ≈ 78.6 Mbps – Data from F-engine to X-engine per Roach board = 20 x 4 bits/clock cycle = 24 Gbps At what point do we throw 100 MHz? (≈ 672 channels) --- For now, in DPP/ downstream of Correlator Nimish Sane, NJIT6

F-Engine: Current Status ADC Phase Switch ing Coarse Delay PFB FFT (4096 channels) Polari metry Linear/Ci rcular Polarizati on* Power (P), P^2 g x, g y, D x, D y Quantiz ation to 5 bits* To X- engine Phase Switching Pattern Phase Switching Pattern delay [0, 10000] delay [0, 10000] To DPP FPGA ResourceUtilization (%) Occupied slices20 BRAM (36 x 36)16 BRAM (18 x 18)12 DSP48E1s28 Slice LUTs19 Slice registers11 FPGA Clock frequency of 150 MHz Nimish Sane, NJIT7

F-Engine: Issues Hardware – KatADC hardware upgrade Software/Implementation – KatADC software library block and control – Compiling design at 300 MHz FPGA clock – Compiling the design with scheme to have 34 x 4096 values of each of the coefficients required for polarimetry Synchronization/Timing Power calculation and feedback to ADC Data transfer to X-engine and DPP Nimish Sane, NJIT8

X-Engine Nimish Sane, NJIT9 XAXA YAYA XBXB YBYB X test Y test XAXBXAXB YAYBYAYB XAYBXAYB YAXBYAXB XAX1XAX1 YAY1YAY1 XAY1XAY1 YAX1YAX1 X B X test Y B Y test X B Y test Y B X test X1X2X1X2 Y1Y2Y1Y2 X 13 X test Y 13 Y test Y1Y3Y1Y3 X1X3X1X3 Visibility 0 Visibility 28 Visibility Visibility 29 Visibility 119 Visibility Baselines that include at least one 27-m antenna (Antenna # A and Antenna # B) Baselines that include at least one 27-m antenna (Antenna # A and Antenna # B) EOVSA Design Each X-engine (one per each Roach board) processes 4096/8 = 512 spectral channels. X1X1 Y1Y1

X0X0 Y0Y0 X1X1 Y1Y1 X3X3 Y3Y3 X0X1X0X1 Y0Y1Y0Y1 X0Y1X0Y1 Y0X1Y0X1 X0X2X0X2 Y0Y2Y0Y2 X0Y2X0Y2 Y0X2Y0X2 X0X3X0X3 Y0Y3Y0Y3 X0Y3X0Y3 Y0X3Y0X3 X2X3X2X3 Y2Y3Y2Y3 Visibility 0 Visibility 2 Visibility 1 Visibility 5 Baselines that can include at least one 27-m antenna (Antenna # 0 and Antenna # 1) Baselines that can include at least one 27-m antenna (Antenna # 0 and Antenna # 1) EOVSA 4-antenna Prototype Design EOVSA 4-antenna Prototype Design X2X2 Y2Y2 Y1Y2Y1Y2 X1Y2X1Y2 Y1X2Y1X2 Visibility 3 Y1Y3Y1Y3 X1Y3X1Y3 Y1X3Y1X3 Visibility 4 X1X2X1X2 X1X3X1X3 Each X-engine (one per each Roach board) processes 4096/2 = 2048 spectral channels. X-Engine

Each X-engine will handle 4096/8 = 512 frequency channels (256 even and 256 odd channels) Each input (X or Y) is Fix 5_3 Output of multiplication (and division by 2 if converted to circular polarization) is a Fix 10_3 real part and Fix 10_3 imaginary part Accumulating with maximum accumulation length of 2^16, the output of vector accumulator is a Fix 26_6 real part and Fix 26_6 imaginary part Each output (XX, YY, XY, YX) is Fix 26_6 * 2 (real and imaginary) * 2 (odd and even channel) = 104 bits Nimish Sane, NJIT11 X0X0 Y0Y0 X1X1 Y1Y1 X0X1X0X1 Y0Y1Y0Y1 X0Y1X0Y1 Y0X1Y0X1

X-Engine: Comments Nimish Sane, NJIT12 X-Engine Output 4-antenna prototype EOVSA design (16- antenna) Visibilities with XX, YY, XY, and YX outputs 529 Data per accumulation (bits) 104 * 4 * 5 * 1024 = 2080 K 104 * 4 * 29 * 256 = 3016 K Visibilities with XX, and YY outputs only 191 Data per accumulation (bits) 104 * 2 * 1 * 1024 = 208 K 104 * 2 * 91 * 256 = 4732 K Total data per accumulation (bits) 2288 K7748 K Total data rate (Mbps) (20 ms accumulation time)

X0X0 Y0Y0 X1X1 Y1Y1 X3X3 Y3Y3 X0X1X0X1 Y0Y1Y0Y1 X0Y1X0Y1 Y0X1Y0X1 X0X2X0X2 Y0Y2Y0Y2 X0Y2X0Y2 Y0X2Y0X2 X0X3X0X3 Y0Y3Y0Y3 X0Y3X0Y3 Y0X3Y0X3 X2X3X2X3 Y2Y3Y2Y3 EOVSA 4-antenna Prototype Design EOVSA 4-antenna Prototype Design X2X2 Y2Y2 Y1Y2Y1Y2 X1Y2X1Y2 Y1X2Y1X2 Y1Y3Y1Y3 X1Y3X1Y3 Y1X3Y1X3 X1X2X1X2 X1X3X1X3 X-Engine: Current Status FPGA ResourceUtilization (%) Occupied slices4 BRAM (36 x 36)8 BRAM (18 x 18)0 DSP48E1s2 Slice LUTs4 Slice registers1 FPGA Clock frequency of 150 MHz

X-Engine: Issues Software/Implementation – X-Engine with 120 visibilities has been implemented, but it did not compile successfully – Compiling design at 300 MHz FPGA clock Data transfer from X-engine to DPP Nimish Sane, NJIT14

F and X-engine Connections For prototype, we should be able to fit both F and X engines on the same Roach board For the final design – we will have to check availability of BRAM resources (more work is needed to come to a conclusion) Use full-duplex bidirectional capacity of 10 GbE link: Send output of F – engine to a switch that will distribute it to X – engines (even if F and X are on the same board) All Roach boards have identical design Nimish Sane, NJIT15

F-X-DPP Interconnection Nimish Sane, NJIT16 F F P P Q Q DPP Mbps 78.6 Mbps EOVSA 4-antenna Prototype Design EOVSA 4-antenna Prototype Design 6 Gbps F F Q Q P P X X X X 10 GbE port

F-X-DPP Interconnections: Possible Variations Nimish Sane, NJIT17 F F P P Q Q DPP Mbps 78.6 Mbps 6 Gbps F F Q Q P P X X X X 293 Mbps 10 GbE port 1 GbE Link Switc h EOVSA 4-antenna Prototype Design EOVSA 4-antenna Prototype Design

F-X-DPP Interconnection Nimish Sane, NJIT18 DPP EOVSA Design EOVSA Design Switc h F0 F2 F3 F4 F1 F5 F6 F7 X0 X2 X3 X4 X1 X5 X6 X7

F – X Packets F to X: Each port sends 1024 channels in a single 4096-channel frame – Accumulation length = No. of data frames per accumulation = 2930 – 6 frames per packet – No. of packets/acc = 2930 / 6 = # of packets in an accumulation can exceed 256 (but does not affect DPP) All packets may not have same number of frequency channels (but does not affect DPP) Nimish Sane, NJIT19

P to DPP (2 antenna dual polarization) – No. of bytes per accumulation = 96 * 4096 * 4 / 8 = 192 K – A possible packet size: 6 KB – # packets/accumulation = 32 – # of frequency channels / packet = 4096 / 32 = 128 X to DPP Nimish Sane, NJIT20 F/X-DPP Packets EOVSA 4-antenna Prototype Design EOVSA 4-antenna Prototype Design Output precision for XX/YY/XY/YX Fix 26_6Fix 24_4 Fix 22_6Fix 32_6 Accumulation size (KB) Possible packet size (KB) # packets/accumulation # Frequency channels/packet

F-X-DPP Interconnections Issues Architecture Use of 1 GbE vs 10 GbE ports Size of a Switch and choice of Switch DPP input ports Collision while sending data to the same DPP input port Nimish Sane, NJIT21

10 GbE Packet Header Header in a packet should include – Header length (1 byte) – Accumulation length (2 bytes) – Packet number within an accumulation (1 byte) – Accumulation number (global) (4 bytes) – Accumulation number (within 0 and 49) (4 bytes) This is used to align with the 1 pps signal – Delay0, Delay1, Delay2, Delay 3 (4 bytes each) – FFT Shift (4 bytes) – ADC overflow count (4 bytes) – For P/P^2 Antenna number (1 byte) Polarization (X, Y, R, L) (1 byte) – For X-engine output, Visibility (2 bytes: 1 byte for each antenna number) Roach board number or Engine number (1 byte) – Whether it is P/P^2 information or X-corr information (1 byte) (?) – ? A length of 40 bytes should suffice Nimish Sane, NJIT22

Miscellaneous Issues What happens when a signal exceeds the 4-bit quantization? – Data is lost. – Dale: We may be better off scaling for 3 bits and leaving at least 1 bit of headroom. At least the Van Vleck correction can apply to fewer bits, and while we lose efficiency we do not lose the data itself. Nimish Sane, NJIT23

Nimish Sane, NJIT24 References P. McMahon, et al. “CASPER Memo 017: Packetized FX Correlator Architectures,” September 2007.