Korea Astronomy and Space Science Institute

Slides:



Advertisements
Similar presentations
E-VLBI down-under Tasso Tzioumis ATNF, CSIRO July 2005 Towards e-VLBI.
Advertisements

Digital FX Correlator Nimish Sane Center for Solar-Terrestrial Research New Jersey Institute of Technology, Newark, NJ EOVSA Technical Design Meeting.
Digital Correlator Design Using Vertex-2 FPGAs
ALMA Cycle 2 Capability Jongsoo Kim ALMA EA Korea node.
E-VLBI progress in the South Tasso Tzioumis ATNF, CSIRO September 2006 “Towards e-VLBI”  “e-VLBI”
Supercomputing Challenges at the National Center for Atmospheric Research Dr. Richard Loft Computational Science Section Scientific Computing Division.
A supercomputer-based software correlator at the Swinburne University of Technology Tingay, S.J. and Deller, A.
Low-Frequency Pulsar Surveys and Supercomputing Matthew Bailes.
Decadal Plan UNSW, 17 Dec 2004 Australia Telescope Operated by CSIRO as a National Facility since 1990 As a National Facility, the Australia Telescope.
Signal Processing for Aperture Arrays. AAVS1 256 antenna elements distributed over –4 stations –64 elements each.
VLBI in Africa Michael Bietenholz Hartebeesthoek Radio Observatory.
DiFX: Software Correlation at Swinburne for the LBA Adam Deller Swinburne University/CSIRO Australia Telescope National Facility Supervisors: A/Prof Steven.
Backend electronics for radioastronomy G. Comoretto.
Status of Korean Participation in the SKA Project Minho Choi (KASI) SKA Japan Workshop NAOJ, Mitaka.
ASKAP Signal Processing Overview DIFX Users and Developers Meeting
FPGA-based Dedispersion for Fast Transient Search John Dickey 23 Nov 2005 Orange, NSW.
SKA Introduction Jan Geralt Bij de Vaate Andrew Faulkner, Andre Gunst, Peter Hall.
Atacama Large Millimeter/submillimeter Array Expanded Very Large Array Robert C. Byrd Green Bank Telescope Very Long Baseline Array The VLBA Sensitivity.
Design of a Software Correlator for the Phase I SKA Jongsoo Kim Cavendish Lab., Univ. of Cambridge & Korea Astronomy and Space Science Institute Collaborators:
Paul Alexander & Jaap BregmanProcessing challenge SKADS Wide-field workshop SKA Data Flow and Processing – a key SKA design driver Paul Alexander and Jaap.
Casper Signal Processing Workshop 2009 SKA Signal Processing (Preliminary) Wallace Turner Domain Specialist for Signal Processing.
Atacama Large Millimeter/submillimeter Array Expanded Very Large Array Robert C. Byrd Green Bank Telescope Very Long Baseline Array Software correlators.
EVLBI and the Australian SKA Pathfinder (ASKAP) Prof. Steven Tingay Curtin University of Technology Perth, Australia 7th eVLBI workshop SHAO, 2008 June.
Next Generation Digital Back-ends at the GMRT Yashwant Gupta Yashwant Gupta National Centre for Radio Astrophysics Pune India CASPER meeting Cambridge.
GPU DAS CSIRO ASTRONOMY AND SPACE SCIENCE Chris Phillips 23 th October 2012.
SKA System Design Kobus Cloete 9 December 2010 AAVP Workshop "Exploring the Universe with the world's largest radio telescope"
Australian Astronomy MNRF Development of Monolithic Microwave Integrated Circuits (MMIC) ATCA Broadband Backend (CABB)
An FX software correlator for VLBI Adam Deller Swinburne University Australia Telescope National Facility (ATNF)
GSB : A real-time Software back-end for the GMRT Jayanta Roy National Centre for Radio Astrophysics Pune, India 12 th December 2008 Collaborators.
Answers from the Working Group on AGN and jets G. Moellenbrock, J. Romney, H. Schmitt, V. Altunin, J. Anderson, K. Kellermann, D. Jones, J. Machalski,
Review of developments in Australasia and mainland Asia Steven Tingay Swinburne University of Technology Next Generation correlator meeting, JIVE 27 -
A real-time software backend for the GMRT : towards hybrid backends CASPER meeting Capetown 30th September 2009 Collaborators : Jayanta Roy (NCRA) Yashwant.
Paul AlexanderSKA Computational Challenges Square Kilometre Array Computational Challenges Paul Alexander.
Philippe Picard 2 nd SKADS Workshop October 2007 Station Processing Philippe Picard Observatoire de Paris Meudon, 11th October 2007.
ATCA GPU Correlator Strawman Design ASTRONOMY AND SPACE SCIENCE Chris Phillips | LBA Lead Scientist 17 November 2015.
1 Next Generation Correlators, June 26 th −29 th, 2006 The LOFAR Blue Gene/L Correlator Stichting ASTRON (Netherlands Foundation for Research in Astronomy)
Cross correlators are a highly computationally intensive part of any radio interferometer. Although the number of operations per data sample is small,
DiFX at KASI Jongsoo Kim and Taehyun Jung Korea Astronomy and Space Science Institute
Square Kilometre Array eInfrastructure: Requirements, Planning, Future Directions Duncan Hall SPDO Software and Computing EGEE 2009.
M.P. Rupen, Synthesis Imaging Summer School, 18 June Cross Correlators Michael P. Rupen NRAO/Socorro.
VLBI Developments in Australia ASTRONOMY AND SPACE SCIENCE Chris Phillips| LBA Lead Scientist 11 November 2014.
Netherlands Institute for Radio Astronomy 1 APERTIF beamformer and correlator requirements Laurens Bakker.
VGOS GPU Based Software Correlator Design Igor Surkis, Voytsekh Ken, Vladimir Mishin, Nadezhda Mishina, Yana Kurdubova, Violet Shantyr, Vladimir Zimovsky.
SAGE meeting Socorro, May 22-23, 2007 WIDAR Correlator Overview Michael P. Rupen Project Scientist for WIDAR & Software.
Andrew Faulkner April 2016 STFC Industry Day: Low Frequency Aperture Array Andrew Faulkner Project Engineer.
Netherlands Institute for Radio Astronomy 1 ASTRON is part of the Netherlands Organisation for Scientific Research (NWO) Square Kilometer Array Low Central.
First result with PAF on a big single-dish radio telescope X. Deng, A. Chippendale, S. Johnston, G. Hobbs, D. George, R. Karuppusamy ASTRONOMY AND SPACE.
ACA TP Spectrometer Manabu Watanabe (NAOJ)
JIVE UniBoard Correlator External Review
Network Connected Multiprocessors
RFI Protection Activities in IAA RAS
The Low Frequency Array (LOFAR)
RFI Protection Activities in IAA RAS
Mid Frequency Aperture Arrays
RadioNet FP7 Joint Research Activity
The UniBoard Generic Hardware for Radio Astronomy Signal Processing
Signal Processing for Aperture Arrays
Modern Interferometric Arrays
JIVE UniBoard Correlator (JUC) Firmware
Imaging and Calibration Challenges
5 Gsps ADC Developement and Future Collaboration
Aperture Array Station Processing
Rick Perley National Radio Astronomy Observatory
Observational Astronomy
Observational Astronomy
Technical Foundations & Enabling Technologies – DS4
The Uniboard  FPGA Processing for Astronomy
SKA PEP - Dish Array WP5 Phase 1 Dishes with Single Pixel Feed Systems
Correlator Growth Path
17/04/2019 Future VLBI systems Tasso Tzioumis| Facilities Program Director– Technologies for Radio Astronomy CSIRO Astronomy and Space Science IVTW.
Presentation transcript:

Korea Astronomy and Space Science Institute Software Correlator Jongsoo Kim Korea Astronomy and Space Science Institute

Future Correlator Projects SKA phase I and phase II Test Correlator Verification Correlator SKA1 AIP and SKA2 Upgrade plan of a correlator for ACA (ALMA compact array) Korea, Japan, Taiwan are interested in correlators.

Correlators for Radio Interferometry ASIC (Application-Specific Integrated Circuit): FPGA (Field-Programmable Gate Arrays): Japan, Taiwan, … Software (high level-languages, e.g., C/C++): Korea Rapid development Expandability …

Current Status of SC LBA (Australian Long Baseline Array) 8 antennas (Parkes, … 22-64m, 1.4-22GHz) DiFX software correlator (2006; Deller et al. 2007, 2011) VLBA (Very Long Baseline Array) 10 antennas (25m, 330MHz - 86GHz) DiFX MPIfR (the Max Planck Institute for Radio-astronomy) Mark4  DiFX

Current Status of SC (cont.) GMRT (Giant Metrewave Radio Telescope) 30 antennas (45m, 50MHz-1.5GHz), 32MHz ASIC  software correlator (Roy et al. 2010) LOFAR (Low Frequency Array) LBA (Low Band Antennae) 10-90MHz HBA (High Band Antennae) 110 – 250MHz IBM BlueGene/P: software correlation

Correlation Theorem, FX-correlator F-step (FT): ~log2(Nc) operations per sample X-step (CMAC): ~ N operations per sample

FLOPS of the X-step in FX correlator 4 is from 8 is from 4 multiplications and 4 additions: N(N+1)/2 is the number of auto- and cross- correlations with antenna (station) N Dish array (N=250, B = 1 GHz, Nb=1)  16x2502 GFLOPS = 1 PFLOPS Sparse AA (N=50, B=380MHz, Nb=160)  16x502x160x0.38 GFLOPS = 2.43 PFLOPS

top500

Tesla Roadmap

Tesla K10 (Kepler) Single precision: 2x2.288 TFLOPS CUDA cores: 2x1536 Memory Bandwidth: 2x160GB/sec Memory: 2x4GB

CoDR of a Software Correlator for the dish array 250 dishes 250 nodes 100 Gb/s Ethernet CPUs+(GPUs) CPUs+(GPUs) CPUs+(GPUs) Required BW>32Gb/s CPUs+(GPUs) CPUs+(GPUs) >4 TFLOPS CPUs+(GPUs) 2x4x2x1GHz =16Gb/s 2 pols, 4bit sampling, Nyquist, BW

CoDR of a Software Correlator for the sparse AA 50 stations 16 subclusters 100 Gb/s Ethernets station subcluster subcluster station subcluster station station subcluster station subcluster >150 TFLOPS station subcluster 60Gb/s x 16

Conclusions Korea (software), Japan, Taiwan (FPGA) have common interest in correlators for not only the SKA but also, probably, the ACA. Japan (NAOJ) is also developing a software correlator for VLBI. We (KJT) might take part in the CSP (Central Signal Processor) domain of the SKA project together.

CoDR for SKA Phase I, Memo 125 Key Sciences: H I and Pulsars Sparse Aperture Array 70-450 MHz, A/Tsys = 2000m2/K, Lmax=100Km Dish Array 0.45-3 GHz, A/Tsys=1000m2/K, 250 15m dishes single-pixel feeds, Lmax=100Km Construction: 2016-19 Budget: 350M Euros

Design goals Connect antennas and computer nodes with simple network topology Use future technology development of HPC clusters Simplify programming

Cost and Power Estimates of SCs # of nodes Cost per node [kEuros] Cost of IB per port [kEuros] Power per node [kW] Total cost [M Euros] Total power [MW] Dish Array 250 5 1 1.0 1.5 0.25 Sparse AA 800 4.8 0.80 Total 1050 6.3 1.05

Data Rate per Dish Pure data: Encoding overhead 2 (pol) x 4 (bit/sample) x 2 (Nyquist) x 1GHz (BW) =16Gb/s Encoding overhead 20% (8b/10b; PCIe 2.0)  1.5% (128b/130b; PCIe 3.0) UDP (User Data protocol) overhead < 1% (28 bytes for headers / 65,507 bytes data length) Oversample overhead ~ 20% (memo 130)

Communication between Computer Nodes I after FFT 1 Nc Node 1 FFT 2x4x2xB 2x8x2xB Node 2 FFT 2 Nc

Communication between Computer Nodes II after communication 1 Nc/2 Node 1 FFT 1 2 2x4x2xB 2x8x2xB Node 2 FFT 1 2 Nc/2 2

All-to-All Communication between Computer Nodes III after communication Nc/4 2x4x2xB 2x8x2xB 1 N>>1 BW(interconnect) =2 (2x4x2xB) =32B Node 1 FFT 2 1 3 4 Node 2 FFT 2 Node 2 FFT 3 Nc/4 1 Node 2 FFT 2 4 3 4

Data Rate per Station Pure data: Encoding overhead 2 (pol) x 4 (bit/sample) x 2 (Nyquist) x 0.38GHz (BW) x 160 (beams) =972.8 Gb/s Encoding overhead 20% (8b/10b; PCIe 2.0)  1.5% (128b/130b; PCIe 3.0) UDP (User Data protocol) overhead < 1% (28 bytes for headers / 65,507 bytes data length) Oversample overhead ~ 20% (memo 130)

Channelized data in a station Nchan subcluster1 subcluster2 subcluster16 160 beams Nchan/16

Connectivity between stations and nodes in a subcluster 100 Gb/s Ethernets station node station node node station Required BW >60Gb/s station node station node >3 TFLOPS station node ~ 60Gb/s

AI (Arithmetic Intensity) Definition: number of operations (flops) per byte AI = 8flops/16bytes (Ri,Rj) = 0.5 AI = 32 flops/32bytes (Ri, Li, Rj, Lj) = 1.0 for 1x1 tile AI = 2.4 for 3x2 tiles Since AIs are small numbers, correlation calculations are bounded by the memory bandwidth. Performance: AI x memory BW (=102GB/s) Wayth+ (2009), van Nieuwpoort+ (2009)

Performance of Tesla C1060 as a function of AI Performance is, indeed, memory-bounded. Maximum performance is about 1/3 of the peak performance.

SKA Phase I: Preliminary System, Memo 130 Sparse Aperture array 50 stations Bandwidth: 380 MHz (70-450 MHz) Number of beam: 480 (160) Dual polarizations Bits per sample: 4 bits