Presentation is loading. Please wait.

Presentation is loading. Please wait.

SCARIe FABRIC A pilot study of distributed correlation Huib Jan van Langevelde Ruud Oerlemans Nico Kruithof Sergei Pogrebenko and many others…

Similar presentations


Presentation on theme: "SCARIe FABRIC A pilot study of distributed correlation Huib Jan van Langevelde Ruud Oerlemans Nico Kruithof Sergei Pogrebenko and many others…"— Presentation transcript:

1 SCARIe FABRIC A pilot study of distributed correlation Huib Jan van Langevelde Ruud Oerlemans Nico Kruithof Sergei Pogrebenko and many others…

2 huib 02/11/06 2/17GiGaPort meeting SURF Utrecht 2 Nov 2006 What correlators do… Synthesis imaging simulates a very large telescope by measuring Fourier components of sky brightness on each baseline pair Sensitivity is proportional to bandwidth optimal use of available recording bandwidth by sampling 2 bits (4 level) at Nyquist rate Correlator calculates ½N(N-1) baseline outputs after compensating for the geometry of array Integrates output signal to something relatively slow and samples with delay/frequency resolution

3 huib 02/11/06 3/17GiGaPort meeting SURF Utrecht 2 Nov 2006 EVN MkIV data processor at JIVE Implements this in custom silicon 16 stations input from tapes now hard-disks and fibres Input data is 1 Gb/s max 1 or 2 bit sampled up to 16 sub-bands format includes time codes Super computer 1024 chips 256 complex correlations each at 32 MHz clock Around 100 T-operations/sec 2 bit only! Depends a bit how you do it Should next correlator also use special hardware?

4 huib 02/11/06 4/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Next generation… Can be implemented on standard computing? Time critical, keep up with input example: LOFAR on BlueGene Higher precision and new applications Better sensitivity, interference mitigation, spacecraft navigation Can CPU cycles be found on the Grid? From 16 1Gb/s (eVLBI) And growing… To 1000s at 100 Gb/s (SKA) Pilot projects FABRIC & SCARIe Connectivity, workflow Real-time resource allocation LOFAR central processor FABRIC eVLBI SKA inner core (5km)

5 huib 02/11/06 5/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Tflops, Pflops… 2 bit operations floating point Results in enormous computing tasks Very few operations / bit Some could be associated with telescope Rough estimate based on XF correlation SKA not even in here…

6 huib 02/11/06 6/17GiGaPort meeting SURF Utrecht 2 Nov 2006 SCARIe FABRIC EC funded project EXPReS (03/2006) To turn eVLBI into an operational system Plus: Joint Research Activity: FABRIC Future Arrays of Broadband Radio-telescopes on Internet Computing One work-package on 4Gb/s data acquisition and transport (Jodrell Bank, Metsahovi, Onsala, Bonn, ASTRON) One work-package on distributed correlation (JIVE, PSNC Poznan) Dutch NWO funded project SCARIe (10/2006) Software Correlator Architecture Research and Implementation for eVLBI Collaboration with SARA and UvA Use Dutch Grid with configurable high connectivity: StarPlane Software correlation with data originating from JIVE Complementary projects with matching funding International and national expertise from other partners Total of 9 man year at JIVE, plus some matching from staff plus similar amount at partners

7 huib 02/11/06 7/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Aim of the project Research the possibility of distributed correlation Using the Grid for getting the CPU cycles Can it be employed for the next generation VLBI correlation? Exercise the advantages of software correlation Using floating point accuracy and special filtering Explore (push) the boundaries of the Grid paradigm Real time applications, data transfer limitations To lead to a modest size demo With some possible real applications: Monitoring EVN network performance Continuous available eVLBI network with few telescopes Monitoring transient sources Astrometry, possibly of spectral line sources Special correlator modes: spacecraft navigation, pulsar gating Test bed for broadband eVLBI research Something to try on the roadmap for the next generation correlator, even if you do not believe it is the solution…

8 huib 02/11/06 8/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Previous experience on Software correlation Builds on previous experience at JIVE regular and automated network performance tests Using Japanese software correlator from NICT Huygens extreme narrow band correlation Home grown superFX with sub- Hz resolution

9 huib 02/11/06 9/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Work packages Grid resource allocation Grid workflow management Tool to allocate correlator resources and schedule correlation Data flow from telescopes to appropriate correlator resources Expertise from the Poznan group in Virtual Laboratories Will this application fit on Grid? As it is very data intensive And time-critical if not real-time Software correlation correlator algorithm design High precision correlation on standard computing Scalable to cluster computers Portable for grid computers and interfaced to standard middleware Interactive visualization and output definition Collect & merge data in EVN archive Standard format and proprietary rights

10 huib 02/11/06 10/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Basic idea Use the Grid for correlation CPU cycles on compute nodes The Net could be crossbar switch? Correlation will be asynchronous Based on floating point arithmetic Portable code, standard environment

11 huib 02/11/06 11/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Workflow Management Must interact with normal VLBI schedules Divide data, route to compute nodes, setup correlation Dynamic resource allocation, keep up with incoming data! Effort from Poznan, based on their Virtual Lab.

12 huib 02/11/06 12/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Topology Slice in time Every node gets an interval A new correlator for every time slice Employ clusters computers at nodes Minimizes total data transport Bottleneck at compute node Probably good connectivity at Grid nodes anyway Scales perfectly Easily estimated how many nodes are needed Works with heterogeneous nodes But leaves sorting to compute nodes Memory access may limit effectiveness Slice in baseline Assign a (or a range of) products to a certain node E.g. two data streams meet in some place Transport Bottleneck at sources (telescopes) Maybe curable with multicast transport mechanism which forks at network nodes Some advantage when local nodes at telescopes Does not scale very simply Simple schemes for ½N 2 nodes Need to re-sort output But reduces the compute problem Using the network as the cross-bar switch

13 huib 02/11/06 13/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Work packages Grid resource allocation Grid workflow management Tool to allocate correlator resources and schedule correlation Data flow from telescopes to appropriate correlator resources Expertise from the Poznan group in Virtual Laboratories Will this application fit on Grid? As it is very data intensive And time-critical if not real-time Software correlation correlator algorithm design High precision correlation on standard computing Scalable to cluster computers Portable for grid computers and interfaced to standard middleware Interactive visualization and output definition Collect & merge data in EVN archive Standard format and proprietary rights

14 huib 02/11/06 14/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Broadband software correlation Raw data 16 MHz, Mk4 format on linux disk Channel extraction Extracted data Delay corrections Delay corrected data Station 1Station 2Station N Correlation. SFXC Data Product Pre-calculated,Delay tables From Mk5 to linux disk Raw data BW=16 MHz, Mk4 format on Mk5 disk DIM,TRM, CRM DCM,DMM, FR SU Correlator Chip EVN Mk4 equivalents

15 huib 02/11/06 15/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Better SNR than Mk4 hardware

16 huib 02/11/06 16/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Software correlation Working on benchmarking Single core processors so far Different CPUs available Already quite efficient More work on memory performance Must deploy on cluster computers And then on Grid Organize the output to be used for astronomy

17 huib 02/11/06 17/8NRI eSciences 2 Nov 2006 Side step: Data intensive processing Radio-astronomy can be extreme User data sets can be large Few – 100 GB now Larger: LOFAR, eVLBI, APERTIF, SKA All data enter imaging Iterative calibration schemes Few operations per Byte Parallel computing: not obviously suited for messaging systems Task (data oriented) parallelization Processing traditionally done interactively on user platform More and more pipeline approaches Addressed in RadioNet Project ALBUS resulted in Python for AIPS Looking for extension in FP7 Interoperability with ALMA, LOFAR But for user domain

18

19 huib 02/11/06 19/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Goal of the project Develop: methods for high data rate e-VLBI using distributed correlation High data rate eVLBI data acquisition and transport Develop a scalable prototype for broadband data acquisition Prototype acquisition system Establish a transportation protocol for broadband e-VLBI Build into prototype, establish interface normal system Interface e-VLBI public networks with LOFAR and e-MERLIN dedicated networks Correlate wide band Onsala data on eMERLIN Demonstrate LOFAR connectivity Distributed correlation Setup data distribution over Grid Workflow management tool Develop a software correlator Run a modest distributed eVLBI experiment

20 huib 02/11/06 20/17GiGaPort meeting SURF Utrecht 2 Nov 2006 Current eVLBI practice observing schedule in VEX format user correlator parameters earth orientation parameters correlator control including model calculation field system controls antenna and acquisition BBC & samplers Mk4 formatter Mk5 playback Mk5 recorder Mk4 data in Mk5prop form over TCPIP output data

21 huib 02/11/06 21/17GiGaPort meeting SURF Utrecht 2 Nov 2006 FABRIC = The GRID FABRIC components observing schedule in VEX format user correlator parameters GRID resources data correlator control including model calculation field system controls antenna and acquisition DBBC VSI VSIe?? on?? output data earth orientation parameters PC-EVN #2 resource allocation and routing


Download ppt "SCARIe FABRIC A pilot study of distributed correlation Huib Jan van Langevelde Ruud Oerlemans Nico Kruithof Sergei Pogrebenko and many others…"

Similar presentations


Ads by Google