Multidimensional classification of burst triggers from LIGO S5 run Soma Mukherjee for the LSC Center for Gravitational Wave Astronomy University of Texas.

Slides:



Advertisements
Similar presentations
A walk through some statistic details of LSC results.
Advertisements

Clustering Basic Concepts and Algorithms
GWDAW-8 (December 17-20, 2003, Milwaukee, Wisconsin, USA) Search for burst gravitational waves with TAMA data Masaki Ando Department of Physics, University.
R. Stone for the LSC Center for Gravitational Wave Astronomy Department of Physics and Astronomy University of Texas at Brownsville. GWDAW, Boston, MA,
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Noise Floor Non-stationarity Monitor Roberto Grosso*, Soma Mukherjee + + University of Texas at Brownsville * University of Nuernburg, Germany Detector.
FLANN Fast Library for Approximate Nearest Neighbors
Evaluating Performance for Data Mining Techniques
Image Segmentation by Clustering using Moments by, Dhiraj Sakumalla.
LIGO-G Z Coherent Coincident Analysis of LIGO Burst Candidates Laura Cadonati Massachusetts Institute of Technology LIGO Scientific Collaboration.
LIGO-G Z Detector characterization for LIGO burst searches Shourov K. Chatterji for the LIGO Scientific Collaboration 10 th Gravitational Wave.
Presented by Tienwei Tsai July, 2005
21 Feb 2002Soumya D. Mohanty, AEI1 DCR Plan of presentation Soumya Mohanty: Overview, aims & work done R. Balasubramanian: Details of Hardware, Database.
Soma Mukherjee GWDAW8, Milwaukee, Dec '03 Interferometric Data Modeling: Issues in realistic data generation. Soma Mukherjee CGWA University of Texas.
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
1 Lecture 10 Clustering. 2 Preview Introduction Partitioning methods Hierarchical methods Model-based methods Density-based methods.
S.Klimenko, G Z, December 21, 2006, GWDAW11 Coherent detection and reconstruction of burst events in S5 data S.Klimenko, University of Florida.
Data Characterization in Gravitational Waves Soma Mukherjee Max Planck Institut fuer Gravitationsphysik Golm, Germany. Talk at University of Texas, Brownsville.
| October LIGO Document ID: LIGO-G Multivariate Veto Analysis for Real-time Gravitational Wave.
LIGO-G Z LIGO Scientific Collaboration 1 Upper Limits on the Rate of Gravitational Wave Bursts from the First LIGO Science Run Edward Daw Louisiana.
Proposal to add three new members to the existing UTB-LIGO MOU Soma Mukherjee on behalf of the UTB LSC members LIGO Tech Doc # G LSC meeting,
LIGO-G Z Peter Shawhan, for the LIGO Scientific Collaboration APS Meeting April 25, 2006 Search for Gravitational Wave Bursts in Data from the.
Analysis of nonstationarity in LIGO S5 data using the NoiseFloorMon output A proposal for a seismic Data Quality flag. R. Stone for the LSC The Center.
Multidimensional classification of burst triggers from the fifth science run of LIGO Soma Mukherjee CGWA, UTB GWDAW11, Potsdam, 12/18/06 LIGO-G
LIGO-G Z A Coherent Network Burst Analysis Patrick Sutton on behalf of Shourov Chatterji, Albert Lazzarini, Antony Searle, Leo Stein, Massimo.
S.Klimenko, December 2003, GWDAW Performance of the WaveBurst algorithm on LIGO S2 playground data S.Klimenko (UF), I.Yakushin (LLO), G.Mitselmakher (UF),
Searching for Gravitational Waves with LIGO Andrés C. Rodríguez Louisiana State University on behalf of the LIGO Scientific Collaboration SACNAS
LIGO-G Z April 2006 APS meeting Igor Yakushin (LLO, Caltech) Search for Gravitational Wave Bursts in LIGO’s S5 run Igor Yakushin (LLO, Caltech)
1 Hierarchical glitch classification analysis. Soma Mukherjee CGWA, UTB LSC/Virgo Det Char, Baton Rouge, March 21, 2007 LIGO-G
Large scale clustering of simulated gravitational wave triggers using S-means and Constrained Validation methods. L.R. Tang, H. Lei, S. Mukherjee, S. D.
Dec 16, 2005GWDAW-10, Brownsville Population Study of Gamma Ray Bursts S. D. Mohanty The University of Texas at Brownsville.
G030XXX-00-Z Excess power trigger generator Patrick Brady and Saikat Ray-Majumder University of Wisconsin-Milwaukee LIGO Scientific Collaboration.
7 th Edoardo Amaldi Conference on Gravitational Waves 8-14 July 2007 Sydney Australia Data from gravitational wave detectors like LIGO will be analysed.
21 Feb 2002Soumya D. Mohanty, AEI1 Detector/Data Characterization Robot Towards Data Mining Soumya D. Mohanty Max Planck Institut für Gravitationsphysik.
Searching for Gravitational Waves from Binary Inspirals with LIGO Duncan Brown University of Wisconsin-Milwaukee for the LIGO Scientific Collaboration.
LIGO-G Z Status of MNFTmon Soma Mukherjee +, Roberto Grosso* + University of Texas at Brownsville * University of Nuernberg, Germany LSC Detector.
1 Status of Search for Compact Binary Coalescences During LIGO’s Fifth Science Run Drew Keppel 1 for the LIGO Scientific Collaboration 1 California Institute.
LIGO-G Z The Q Pipeline search for gravitational-wave bursts with LIGO Shourov K. Chatterji for the LIGO Scientific Collaboration APS Meeting.
S.Klimenko, G Z, December 2006, GWDAW11 Coherent detection and reconstruction of burst events in S5 data S.Klimenko, University of Florida for.
LIGO-G Z Confidence Test for Waveform Consistency of LIGO Burst Candidate Events Laura Cadonati LIGO Laboratory Massachusetts Institute of Technology.
Data Analysis Algorithm for GRB triggered Burst Search Soumya D. Mohanty Center for Gravitational Wave Astronomy University of Texas at Brownsville On.
Multidimensional classification analysis of kleine Welle triggers in LIGO S5 run Soma Mukherjee for the LSC University of Texas at Brownsville GWDAW12,
S.Klimenko, March 2003, LSC Burst Analysis in Wavelet Domain for multiple interferometers LIGO-G Z Sergey Klimenko University of Florida l Analysis.
Cluster Analysis Dr. Bernard Chen Assistant Professor Department of Computer Science University of Central Arkansas.
LIGO-G Z GWDAW9 December 17, Search for Gravitational Wave Bursts in LIGO Science Run 2 Data John G. Zweizig LIGO / Caltech for the LIGO.
LIGO-G All-Sky Burst Search in the First Year of the LSC S5 Run Laura Cadonati, UMass Amherst For the LIGO Scientific Collaboration GWDAW Meeting,
Peter Shawhan The University of Maryland & The LIGO Scientific Collaboration Penn State CGWP Seminar March 27, 2007 LIGO-G Z Reaching for Gravitational.
CZ5211 Topics in Computational Biology Lecture 4: Clustering Analysis for Microarray Data II Prof. Chen Yu Zong Tel:
Validation of realistically simulated non-stationary data. Soma Mukherjee Centre for Gravitational Wave Astronomy University of Texas at Brownsville. GWDAW9,
Soma Mukherjee GWDAW8, Milwaukee, December'03 Interferometric Data Modeling: Issues in realistic data generation. Soma Mukherjee CGWA Dept. of Physics.
Soma Mukherjee LSC, Hanford, Aug.19 '04 Generation of realistic data segments: production and application. Soma Mukherjee Centre for Gravitational Wave.
Cluster Analysis What is Cluster Analysis? Types of Data in Cluster Analysis A Categorization of Major Clustering Methods Partitioning Methods.
MNFT : Robust detection of slow nonstationarity in LIGO Science data Soma Mukherjee Max Planck Institut fuer Gravitationsphysik Germany. LSC Meeting, Livingston,
LIGO-G Z The Q Pipeline search for gravitational-wave bursts with LIGO Shourov K. Chatterji for the LIGO Scientific Collaboration APS Meeting.
LIGO-G05????-00-Z Detector characterization for LIGO burst searches Shourov K. Chatterji For the LIGO Scientific Collaboration 10 th Gravitational Wave.
Search for gravitational waves from binary inspirals in S3 and S4 LIGO data. Thomas Cokelaer on behalf of the LIGO Scientific Collaboration.
SC03 failed results delayed FDS: parameter space searches
A 2 veto for Continuous Wave Searches
Bounding the strength of gravitational radiation from Sco-X1
The Q Pipeline search for gravitational-wave bursts with LIGO
Soma Mukherjee Centre for Gravitational Wave Astronomy
LIGO Scientific Collaboration meeting
Soma Mukherjee for LIGO Science Collaboration
Topic 3: Cluster Analysis
WaveMon and Burst FOMs WaveMon WaveMon FOMs Summary & plans
Targeted Searches using Q Pipeline
Excess power trigger generator
Coherent Coincident Analysis of LIGO Burst Candidates
Topic 5: Cluster Analysis
Presentation transcript:

Multidimensional classification of burst triggers from LIGO S5 run Soma Mukherjee for the LSC Center for Gravitational Wave Astronomy University of Texas at Brownsville, USA. Amaldi Conference, Sydney, July 8-14, 2007 LIGO-G

A quick walk through … This study involves implementation of classification methods (non-parametric/hierarchical and parametric, k-means) to see presence of structures in higher dimensional parameter space. Often features embedded in higher dimensions are not elucidated in simple 1 or 2 dimensional study. References for LIGO classification analysis: S. Mukherjee : Past LSC, F2F and GWDAW talks, Burst and Glitch group telecons, published paper in CQG). kleine Welle (Blackburn, L. et. al LIGO Z) is an algorithm that picks up burst triggers from the gravitational wave, auxiliary and environmental channels in LIGO. It generates several gigabytes of trigger database containing information about the physical properties of the burst triggers. The purpose of this analysis is to mine the trigger database to see if the triggers can be categorized in different groups that share common properties. This will lead to effective dimensionality reduction of the problem since the number expected groups will be a countable small number and each group, to some extent, uniform in character. The physical motivation here is that this could become a powerful veto mechanism.

Pipeline : What is new since March 2007 LSC meeting a. We start with the kleineWelle trigger database from all channels. b. Data conditioning is applied to clean the narrowband features and filter in the appropriate frequency bands. c. Analysis database is constructed with logarithmically transformed  t, frequency and snr and shape data. Shape information of the triggers taken into account from the processed time series. d. A hierarchical classification algorithm and a time series similarity-based classification is applied to the reconstructed database. e. At the end of the pipeline, we have information on existing classes (statistics, members,properties ) from all channels. We directly access raw data uninterruptedly. This involves : a.Fast connection for rapid data transfer. b.Data storage. c.Storage for pipeline output. a. Identification of the class properties. b. Correlation between channels. c. Formation of class basis. d. Final step towards direct classification based on pattern recognition. Veto applications.

Kleine Welle triggers from S5 (>350 channels from H1, H2 And L1) Multidimensional Classification algorithm (non-parametric hierarchical); Similarity based time series classification. Duration, frequency, SNR (a cut may be applied) Shape information Identification of trigger classes (time series, time-frequency analysis) Correlation across channels (  t, f, snr, shape correlation) Source recognition (pattern recognition) Within matlab code Web based access Local storage Post processing …….. Data conditioning

Algorithm : Hierarchical classification Example The algorithm is based on computation of distances between data points in the multi- dimensional space. A variance minimization criterion is used to group the objects into statistically distinct classes. The metric may be chosen in several different ways. The group formation stage is guided by complete-linkage criterion, i.e. largest distance between objects in two groups.

Algorithm : Similarity driven time series classification This algorithm is based on k-means which classifies data by assignment of k centroids chosen a priori and then partitioning data based on association of data points to the nearest centroid. In the final step, the algorithm minimizes an objective function, which in this case is a squared error function. The method is an iterative one where the centroids are re-calculated until stability is reached. Simulated example

Groups=2, p<0.003, r=0.93 H1:LSC-DARM_ERR

H2:LSC-DARM_ERR Group=2, p<0.004, r=0.81

Groups=2, p<0.04, r=0.80 L1:LSC-DARM_ERR

Detector : H0 Channel : BSC6_MAGZ Number of groups =2 p < 10e-6 r=0.92 n=26304 snr cut = 6 n-surviving = 7669 Detector : H0 Channel : BSC6_MAGZ Number of groups =2 p < 10e-6 r=0.92 n=26304 snr cut = 6 n-surviving = 7669

Detector : H1 Channel : COIL_MAGZ Number of groups =2 p < 10e-8 r=0.70 n=31086 snr cut = 8 n-surviving = 2963 Detector : H1 Channel : COIL_MAGZ Number of groups =2 p < 10e-8 r=0.70 n=31086 snr cut = 8 n-surviving = 2963

Detector : H1 Channel : COIL_MAGY Number of groups =2 p < 10e-19 r=0.70 n=64220 snr cut = 20 n-surviving = 5270 Detector : H1 Channel : COIL_MAGY Number of groups =2 p < 10e-19 r=0.70 n=64220 snr cut = 20 n-surviving = 5270

Detector : H1 Channel : MICH_CTRL Number of groups =2 p < 10e-19 r=0.94 n=17068 snr cut = 5 n-surviving = 2364 Detector : H1 Channel : MICH_CTRL Number of groups =2 p < 10e-19 r=0.94 n=17068 snr cut = 5 n-surviving = 2364

Detector : H2 Channel : POB_I Number of groups =2 p < 10e-15 r=0.82 n=17096 snr cut = 5 n-surviving = 1331 Detector : H2 Channel : POB_I Number of groups =2 p < 10e-15 r=0.82 n=17096 snr cut = 5 n-surviving = 1331

Detector : H1 Channel : POB_I Number of groups =2 p < 10e-8 r=0.88 n=17779 snr cut = 5 n-surviving = 1706 Detector : H1 Channel : POB_I Number of groups =2 p < 10e-8 r=0.88 n=17779 snr cut = 5 n-surviving = 1706

Detector : L1 Channel : ITMX_Y Number of groups =2 p < 10e-9 r=0.74 n=16508 snr cut = 5 n-surviving = 3609 Detector : L1 Channel : ITMX_Y Number of groups =2 p < 10e-9 r=0.74 n=16508 snr cut = 5 n-surviving = 3609

L1:LSC-DARM_ERR GPS : s. 6 distinct groups found by Similarity driven classification algorithm. These classes are based on the shape of the triggers obtained by retaining 128 Hz around the central frequency and band passing. SNR cut= 30

L1:PEM-LVEA_MAGZ GPS : s. 4 distinct groups found by Similarity driven classification algorithm. These classes are based on the shape of the triggers obtained by retaining 128 Hz around the central frequency and band passing. SNR cut= 15

L0:PEM-EY_MAGX GPS : s. 6 distinct groups found by Similarity driven classification algorithm. These classes are based on the shape of the triggers obtained by retaining 32 Hz around the central frequency and band passing. SNR cut= 5

Conclusions Hierarchical classification algorithm shows existence of more than one statistically significant classes in the kleine- Welle trigger database. Development of time series classification algorithms and incorporation of trigger shape into the analysis shows more structures present in the multi-dimensional space. Knowledge from both the analyses being used for pattern recognition across all channels. (In progress)

Target & timeline : what’s been met and what next Main hierarchical classification code in Matlab. Hardware/software/bandwidth for uninterrupted data transfer. Data storage. Code modification to access data for shape information. Development of similarity based time series shape classification. End-to-end test. Extend study to coincident triggers. Archiving of results in a password protected web page accessible to the collaboration. [October 2007 LSC]. BNS trigger classification. Trigger identification. Automation. Veto. (November/December 2007) Acknowledgment Similarity based clustering algorithm is developed by H.Lei, L. R. Tang, S. Mukherjee, S. D. Mohanty, Use of LIGO S5 data is gratefully acknowledged.