Presentation is loading. Please wait.

Presentation is loading. Please wait.

VOMegaPlot Efficient Plotting of Large VOTable Datasets.

Similar presentations


Presentation on theme: "VOMegaPlot Efficient Plotting of Large VOTable Datasets."— Presentation transcript:

1 VOMegaPlot Efficient Plotting of Large VOTable Datasets

2 VOPlot  VOPlot is a tool for visualizing astronomical data that is available in the VOTable format.VOTable  VOPlot reads the xml file in order to load entire data into memory and then processes it to draw various types of plots.  This approach of loading the entire data into memory cannot be used for VOTable files that are very large.

3 Approach for VOMegaPlot  VOMegaPlot preprocesses the xml file to create intermediate files which are subsequently used for plotting.  Entire data is divided into fixed sized blocks and individual blocks are loaded into memory thus reducing the memory requirement.  The number of intermediate files created is equal to the number of columns present in the xml file.

4 Pre-processing operation Creation of array blocks Col 1Col m ……. 1 2 n.... Col 2 File 2 ……. Original xml file with m columns and n rows Block 1 File 1 Block 2 Block k Block 1 Block 2 Block k File m Block 1 Block 2 Block k Intermediate files on disk

5 Algorithm for drawing a scatter plot 1)Input the columns to be plotted, say A vs. B 2)Load a set of corresponding blocks for both columns, A and B. 3)Take corresponding data elements from both the blocks and plot them. 4)After plotting all the points, discard the blocks. 5)If there exist more blocks of data repeat step 2, else stop.

6 Advantages  The complexity for plotting is to O(2n) where n is the no. of rows. This complexity is independent of the no. of columns in the xml file.  If the user has to plot only a subset of data (as in case of zoom operation) then there exists another set of files which can be used for this purpose.

7 Dealing with subset of data  Data for every column is stored in an indexed fashion.  This helps in accessing the subset of data without having to go through the entire set of data.  As a result, operations like zoom become much faster.

8 Pre-processing operation Creation of tree blocks Col 1Col m ……. 1 2 n.... Col 2 Indexed File for col 1 0-10 10-20...... Indexed File for col 2 2-4 4-6...... ……. Original xml file with m columns and n rows Intermediate files with indexed data 0.1–0.3 0.3–0.6...... Indexed File for col m

9 Pre-processing operation Creation of tree blocks (contd) 0-2020-4040-60 20-3030-40 30-3535-40 … Indexed file for a column

10 Results TychoTycho-2UCAC2 Data size1 million rows and 56 columns 2.5 million rows and 32 columns 48.3 million rows and 9 columns Pre- processing time 18 minutes30 minutes3 hours 26 minutes Plotting time for scatter plot 9 seconds22 seconds5 minutes 46 seconds

11 Features of VOMegaPlot  Scatter Plot with zoom, reverse axis and logged axis  Projection Plot  Density Plot  Histogram

12 Scatter Plot Tycho-1 catalogue ( RA vs. Vmag)

13 Density Plot Tycho-1 catalogue ( RA vs. Vmag)

14 Density Plot Tycho-2 catalogue ( DEC vs RA)

15 Scatter Plot UCAC2 Catalogue (2m_J vs. U2Rmag)

16 Density Plot UCAC2 Catalogue (2m_J vs. U2Rmag)

17 Future Enhancements  Support for reading data stored in binary format  Block level compression while creating intermediate files  Client Server version

18 References  VOTable http://www.ivoa.net/Documents/latest/VOT.html http://www.ivoa.net/Documents/latest/VOT.html  VOPlot http://vo.iucaa.ernet.in/~voi/voplot.htm  VOMegaPlot http://vo.iucaa.ernet.in/~voi/vomegaplot.h tm http://vo.iucaa.ernet.in/~voi/vomegaplot.h tm  IUCAA http://www.iucaa.ernet.in  Persistent Systems Pvt. Ltd. http://www.persistentsys.com

19 Sample VOTable 12.4524 34.2331 25.1321 47.9055 18.0723 33.5802 Back


Download ppt "VOMegaPlot Efficient Plotting of Large VOTable Datasets."

Similar presentations


Ads by Google