Automated Macular Pathology Diagnosis in Retinal OCT Images Using Multi-Scale Spatial Pyramid with Local Binary Patterns. Yu-Ying Liu, James M. Rehg

Automated Macular Pathology Diagnosis in Retinal OCT Images Using Multi-Scale Spatial Pyramid with Local Binary Patterns
Yu-Ying Liu, James M. Rehg (School of Interactive Computing, Georgia Institute of Technology)
Mei Chen (Intel Labs Pittsburgh)
Hiroshi Ishikawa, Gadi Wollstein, Joel S. Schuman (UPMC Eye Center, University of Pittsburgh Medical Center; Department of Bioengineering, University of Pittsburgh)

OCT Imaging in Ophthalmology
OCT (Optical Coherence Tomography)
– Non-contact, non-invasive 3D imaging
– Becoming a standard of care since 1991
Working principle:
– Emit light into the eye and measure the reflectivity of the tissues within a target cube
– Render the measurements to visualize inner structures
[Figure: an OCT slice and an OCT volume, with x, y, z axes]

Motivation for Automated Pathology Diagnosis
– Protecting vision requires regular, large-scale screening, which in turn requires a CAD tool to improve efficiency
– Ophthalmologists have no access to radiologists; a CAD tool can help alleviate the burden
– In the U.S., 30% of 75-year-olds suffer gradual loss of central vision (AMD); regular screening helps detect early pathology

Prior Work in Analyzing Ocular OCT
– Intra-retinal layer segmentation [Garvin MK, et al., TMI’08]
– Top and bottom layer segmentation [Tapio, et al., Opt Express’09]
– Fluid-filled column segmentation [G. Quellec, TMI’10]
– Optic disc segmentation [Lee K, et al., TMI’10]
Most prior work focused on segmentation tasks

Our Goal: Automated Pathology Diagnosis
No prior work on computer-aided diagnosis of macular pathology
Our goal: given the foveal slice from a 3D macular scan, automatically determine the presence of normal macula (NM) and of each of three pathologies (MH, ME, AMD)
– The pathologies can coexist
Example: macular scan -> foveal slice -> automated diagnosis -> presence
– Normal macula (NM)? NO
– Macular hole (MH)? YES
– Macular edema (ME)? YES
– Age-related macular degeneration (AMD)? NO

Examples of Normal Macula and Macular Pathology
– Normal macula (NM): a smooth depression around the center, with no abnormal tissue embedded
– Macular hole (MH): a full or partial (pseudo) hole around the center
– Macular edema (ME): retinal thickening or fluid accumulation (black blobs)
– Age-related macular degeneration (AMD): irregular shape of the bottom retinal layer
High variation within each pathology!
[Figure: example OCT slices for NM, MH, ME, and AMD]

Challenges in Analyzing Ocular OCT
1. Multiple pathologies coexist
2. Proliferated or deformed tissues cover the top layer or the hole
3. Shadowing effects caused by blood vessels or opaque media
[Figure: example slices labeled MH+ME and ME+AMD]
Handcrafting high-level rules is unlikely to generalize well
We use low-level features and a data-driven approach for robust analysis

Overview of Our Learning-based Approach
Training:
– Input: labeled foveal-slice set drawn from a large OCT scan set (e.g. NM: NO, ME: YES, MH: NO, AMD: YES)
– Feature extraction, then SVM classifier training
– Output: one classifier per category (NM, MH, ME, AMD)
Testing:
– Input: foveal slice
– Feature extraction, then pathology presence classification
– Output: automated diagnosis (e.g. NM: NO, ME: YES, MH: YES, AMD: NO)

Overview of Algorithm
Foveal slice -> feature extraction (preprocessing, image representation, descriptor generation) -> classifier training / classification -> present / absent

Preprocessing: Retina Alignment (1/2)
Purpose: reduce the appearance variations across scans
– Scans show large variations in retina position and curvature
– Alignment removes the curvature and centers the retina
[Figure: original image and aligned image]

Preprocessing: Retina Alignment (2/2)
Alignment process: find the retinal area, then curve-fit and warp the retina to be roughly horizontal
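The alignment steps on this slide can be illustrated with a small pure-Python sketch. The slides do not say how the retinal area is found or which curve is fitted, so everything here is an assumption made for illustration: a fixed intensity `threshold` to locate tissue, a quadratic curve fitted by least squares, and a per-column vertical shift as the warp. The names `fit_quadratic` and `align_retina` are hypothetical.

```python
def fit_quadratic(xs, ys):
    """Least-squares fit of y = a*x^2 + b*x + c via the 3x3 normal equations."""
    s = [sum(x ** k for x in xs) for k in range(5)]               # sums of x^0..x^4
    t = [sum(y * x ** k for x, y in zip(xs, ys)) for k in range(3)]
    # Augmented system M @ [c, b, a] = t, with M[i][j] = s[i + j]
    m = [[s[i + j] for j in range(3)] + [t[i]] for i in range(3)]
    # Gauss-Jordan elimination with partial pivoting
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(3):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [vr - f * vc for vr, vc in zip(m[r], m[col])]
    c, b, a = (m[i][3] / m[i][i] for i in range(3))
    return a, b, c


def align_retina(image, threshold):
    """Flatten the retina in `image` (a list of rows of intensities).

    Assumed procedure: threshold to find tissue, fit a quadratic to the
    per-column tissue centre, then shift each column so the fitted curve
    lands on the middle row.
    """
    height, width = len(image), len(image[0])
    xs, ys = [], []
    for x in range(width):
        rows = [y for y in range(height) if image[y][x] >= threshold]
        if rows:                                  # column contains tissue
            xs.append(x)
            ys.append(sum(rows) / len(rows))      # vertical centre of the tissue
    a, b, c = fit_quadratic(xs, ys)
    target = height // 2
    aligned = [[0] * width for _ in range(height)]
    for x in range(width):
        shift = target - round(a * x * x + b * x + c)
        for y in range(height):
            src = y - shift
            if 0 <= src < height:
                aligned[y][x] = image[src][x]
    return aligned
```

Shifting whole columns keeps the retinal thickness in each column unchanged, which is one simple way to remove curvature without resampling; the authors' actual warp may differ.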

Image Representation
A good representation for ocular OCT should consider:
1. Spatial location: pathology locality
2. Global context: overall appearance for correct interpretation
3. Multiple scales: small and large-scale changes
[Figure: example slice labeled ME+AMD]

Image Representation: Multi-Scale Spatial Pyramid (MSSP)
Multi-Scale Spatial Pyramid (MSSP) [Wu & Rehg, CVPR’08]: preserves the spatial organization of local features at multiple scales and spatial granularities
– 3-level MSSP: levels 2, 1, 0, from finer to coarser spatial resolution
– Global descriptor: concatenate the local features in a fixed order
– Covers all three requirements: spatial location, global context, multiple scales
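One way to picture the MSSP is the sketch below: build an image pyramid, split the image at level l into a 2^l x 2^l grid, and concatenate one local descriptor per block in a fixed order. This is a simplified reading of the slide, not the authors' implementation. The 2x2 averaging used for downsampling, the non-overlapping block layout, and the names `downsample2`, `split_blocks`, and `mssp_descriptor` are all assumptions.

```python
def downsample2(image):
    """Halve both dimensions by averaging each 2x2 neighbourhood."""
    h, w = len(image) // 2, len(image[0]) // 2
    return [[(image[2 * y][2 * x] + image[2 * y][2 * x + 1]
              + image[2 * y + 1][2 * x] + image[2 * y + 1][2 * x + 1]) / 4.0
             for x in range(w)] for y in range(h)]


def split_blocks(image, n):
    """Split an image into an n x n grid of blocks, in row-major order."""
    h, w = len(image), len(image[0])
    bh, bw = h // n, w // n
    return [[row[bx * bw:(bx + 1) * bw] for row in image[by * bh:(by + 1) * bh]]
            for by in range(n) for bx in range(n)]


def mssp_descriptor(image, local_descriptor, levels=3):
    """Concatenate block descriptors from level 0 (coarse) to the finest level.

    `local_descriptor` is any function mapping a block to a list of numbers.
    """
    pyramid = [image]
    for _ in range(levels - 1):
        pyramid.append(downsample2(pyramid[-1]))
    pyramid.reverse()                 # pyramid[0] is now the smallest image
    feature = []
    for level, scaled in enumerate(pyramid):
        for block in split_blocks(scaled, 2 ** level):
            feature.extend(local_descriptor(block))
    return feature
```

With three levels this yields 1 + 4 + 16 = 21 blocks, and every block covers the same number of pixels because the coarser levels are computed on a downsampled image. That is what lets the same local descriptor capture both small and large structures.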

Local Descriptors: LBPpca
LBPpca [Wu and Rehg, CVPR’08] encodes micro-structures:
– Intensity quantization: suppresses pixel noise
– Local binary pattern histogram: 256 bins
– PCA: dimension reduction to 32 dimensions
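The 256-bin histogram on this slide can be sketched in a few lines of pure Python. The sketch assumes the basic 8-neighbour LBP on a 3x3 neighbourhood with a `>=` comparison against the centre pixel; the neighbour ordering and the names `lbp_histogram` and `project` are illustrative choices. The PCA step is shown only as a projection onto a `mean` and `components` that are assumed to have been learned from training histograms beforehand.

```python
def lbp_histogram(block):
    """256-bin histogram of 8-neighbour LBP codes over the block interior."""
    # Neighbour offsets (dy, dx), clockwise starting at the top-left pixel
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    hist = [0] * 256
    for y in range(1, len(block) - 1):
        for x in range(1, len(block[0]) - 1):
            centre = block[y][x]
            code = 0
            for bit, (dy, dx) in enumerate(offsets):
                if block[y + dy][x + dx] >= centre:
                    code |= 1 << bit      # neighbour at least as bright: set bit
            hist[code] += 1
    return hist


def project(hist, mean, components):
    """Project a histogram onto precomputed PCA components (one list each)."""
    centred = [h - m for h, m in zip(hist, mean)]
    return [sum(c * v for c, v in zip(comp, centred)) for comp in components]
```

Passing 32 component vectors of length 256 to `project` gives the 32-dimensional descriptor the slide refers to.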

Review of Algorithm
Foveal slice -> feature extraction (alignment, multi-scale spatial pyramid, LBPpca) -> classifier training / classification

Classifier Training: Support Vector Machine
– Non-linear SVM with RBF kernel and probability output
– Training: features extracted from slices labeled present / absent are used to train the SVM classifier
– Testing: the SVM outputs a probability, which is compared with a decision threshold t to answer "present?" (YES/NO)
– Varying t traces the ROC curve (sensitivity vs. 1 - specificity)
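The link between the decision threshold t and the ROC curve can be made concrete with a short sketch. It takes the probability outputs as given (the SVM itself is not reproduced here) and sweeps t over the observed probabilities. The names `roc_points` and `auc` are hypothetical, and the trapezoidal rule is one common way to compute the area, not necessarily the one the authors used.

```python
def roc_points(probs, labels):
    """Return (1 - specificity, sensitivity) pairs, one per threshold t.

    `labels` holds 1 for "present" and 0 for "absent"; a slice is called
    "present" when its probability is >= t. Needs both classes in `labels`.
    """
    pos = sum(labels)
    neg = len(labels) - pos
    points = [(0.0, 0.0)]
    for t in sorted(set(probs), reverse=True):
        tp = sum(1 for p, l in zip(probs, labels) if p >= t and l == 1)
        fp = sum(1 for p, l in zip(probs, labels) if p >= t and l == 0)
        points.append((fp / neg, tp / pos))
    return points


def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2.0
               for (x0, y0), (x1, y1) in zip(points, points[1:]))
```

For example, `auc(roc_points([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0]))` evaluates to 0.75, because three of the four (present, absent) pairs are ranked correctly.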

Dataset and Experiments
OCT dataset
– We collected 326 macular OCT scans from 136 subjects
– Ground truth: foveal slices and labels from one ophthalmologist
Experiment design
– 10-fold cross-validation at subject level
– Area under ROC curve (AUC) as metric
Experiment result
– AUC: 0.991, 0.962, 0.894, for NM, ME, MH, AMD
Validation: 3 sets of experiments for LBPpca, MSSP
[Table: number of scans and subjects for NM, ME, MH, AMD]
[Figure: ROC curve (sensitivity vs. 1 - specificity) with AUC]
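Cross-validation "at subject level" means that all scans from one subject fall in the same fold, so no subject contributes to both training and testing. A minimal sketch follows. The round-robin assignment over sorted subject identifiers and the name `subject_level_folds` are assumptions, since the slides do not describe how subjects were assigned to folds.

```python
def subject_level_folds(scan_subjects, k=10):
    """Return a fold index per scan so that no subject spans two folds.

    `scan_subjects[i]` is the subject identifier of scan i.
    """
    subjects = sorted(set(scan_subjects))
    fold_of = {s: i % k for i, s in enumerate(subjects)}   # round-robin
    return [fold_of[s] for s in scan_subjects]
```

In practice the subject order would usually be shuffled first. The point of the sketch is only that folds are assigned per subject rather than per scan.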

Validation of LBPpca (1/2)
Performance comparison to other LBP-based methods:
– LBP (dim: 256)
– Uniform LBP histogram (LBPu2) (dim: 59): models the distribution of patterns with infrequent bitwise changes (uniform patterns) [Ojala, TPAMI’02; T. Ahonen, TPAMI’06; A. Oliver, MICCAI’07]
– LBPpca (dim: 32)
[Table: AUC for NM, ME, MH, AMD, and average, for LBPpca (32), LBPu2 (59), LBP (256)]
Findings:
– LBPpca, LBPu2 >> LBP (0.93x vs. 0.81)
– For AMD, LBPpca > LBPu2: PCA preserves the irregular shapes of AMD better!

Validation of LBPpca (2/2)
Performance comparison to other popular local descriptors:
[Table: AUC for NM, ME, MH, AMD, and average, for LBPpca (32), mean + std (2), intensity histogram (32), orientation histogram (32)]
For MH and AMD, LBPpca >> the others: texture cues encoded by LBP are relatively more effective!

Validation of MSSP (1/2)
Compare MSSP to other spatial representations (SP, SL):
– MSSP [Wu & Rehg, CVPR’08]: multiple scales, multiple spatial granularities
– SP [S. Lazebnik, CVPR’06]: single scale, multiple spatial granularities
– SL [T. Ahonen, TPAMI’06; A. Oliver, MICCAI’07]: single scale, single spatial granularity

Validation of MSSP (2/2)
Performance comparison to “Spatial pyramid (SP)” and “Single level (SL)”:
[Table: AUC for NM, ME, MH, AMD, and average, for MSSP, SP, SL]
For AMD, MSSP >> SP and SL (0.888 vs. 0.84x): multi-scale modeling is beneficial!

Conclusion
Addressed a novel problem
– Automated macular pathology diagnosis in OCT images
Developed an effective learning-based approach
– A large labeled OCT dataset of 326 scans
– Promising result: 0.991, 0.962, 0.894, for NM, ME, MH, AMD
– Multi-scale global feature representation with LBPpca can effectively encode the geometry and texture of the retina
Future work
– Exploring shape with texture features for better performance

Thank You!

Reference
Prior work in analyzing ocular OCT images
– M.K. Garvin, et al., “Intraretinal layer segmentation of macular optical coherence tomography images using optimal 3-D graph search”, TMI 2008
– S.M. Tapio Fabritius, et al., “Automated segmentation of the macula by optical coherence tomography”, Opt Express 2009
– G. Quellec, “Three-dimensional analysis of retinal layer texture: Identification of fluid-filled regions in SD-OCT of the macula”, TMI 2010
Local binary patterns (LBP)
– T. Ojala, et al., “Multiresolution gray-scale and rotation invariant texture classification with local binary patterns”, TPAMI 2002
LBP applications
– T. Ahonen, et al., “Face description with local binary patterns: Application to face recognition”, TPAMI 2006
– A. Oliver, et al., “False positive reduction in mammographic mass detection using local binary patterns”, MICCAI 2007
– L. Sorensen, et al., “Texture classification in lung CT using local binary patterns”, MICCAI 2008
Spatial pyramid
– S. Lazebnik, et al., “Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories”, CVPR 2006
Multi-scale spatial pyramid (MSSP), LBP+PCA
– J. Wu, J. M. Rehg, “Where am I: Place instance and category recognition using spatial PACT”, CVPR 2008

Backup Slides

Local Descriptor: Alternative: Uniform LBP (LBPu2)
– Separate all 256 patterns into uniform (58) and non-uniform (198) patterns [Ojala, TPAMI’02]
– Bin selection and merging: 256 bins -> 59 bins (58 uniform + 1 non-uniform)
– LBPu2 retains the distribution of uniform patterns only, since they are the majority in pixel counts (>90%) [Ojala, TPAMI’02]
– Often used in the literature [T. Ahonen, TPAMI’06; A. Oliver, MICCAI’07]
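The 58 / 198 split follows from the usual definition of a uniform pattern: an 8-bit code whose circular bit string has at most two 0/1 transitions. The sketch below counts those patterns and builds the 256-to-59 bin mapping. The function names and the choice of index 58 for the shared non-uniform bin are illustrative assumptions.

```python
def transitions(code):
    """Number of 0/1 changes in the circular 8-bit pattern of `code`."""
    bits = [(code >> i) & 1 for i in range(8)]
    return sum(bits[i] != bits[(i + 1) % 8] for i in range(8))


def uniform_bin_map():
    """Map each of the 256 LBP codes to one of 59 LBPu2 bins."""
    mapping, next_bin = {}, 0
    for code in range(256):
        if transitions(code) <= 2:
            mapping[code] = next_bin      # one bin per uniform pattern
            next_bin += 1
        else:
            mapping[code] = 58            # single shared bin for non-uniform codes
    return mapping
```

For instance, `0b00001111` has two transitions and keeps its own bin, while `0b01010101` has eight and falls into the shared bin, which is where rapidly alternating intensity patterns end up.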

Local Descriptor: Non-Uniform Patterns Can Be Important
We argue that LBPpca is better than LBPu2 when frequent intensity changes are important (e.g. AMD)!
Visualization: non-uniform patterns reside mostly at edge contours (likely important features!)
[Figure: uniform patterns vs. all non-uniform patterns]

Zeiss Cirrus HD-OCT Machine