High Performance Computing for Tissue MicroArray Analysis Dr Yinhai Wang David McCleary, Ching-Wei Wang, Jackie James, Dean Fennell, Peter Hamilton.

Slides:



Advertisements
Similar presentations
Image analysis In our test case we assessed ER in TMAs that have been created using tissue samples from a large epidemiological study (UK Anglian Breast.
Advertisements

Multimedia: Digitised Sound Data Section 3. Sound in Multimedia Types: Voice Overs Special Effects Musical Backdrops Sound can make multimedia presentations.
Materialization and Cubing Algorithms. Cube Materialization Each cell of the data cube is a view consisting of an aggregation of interest. The values.
SLA-Oriented Resource Provisioning for Cloud Computing
Professor Geraint F. Lewis Sydney Institute for Astronomy School of Physics The University of Sydney.
Outline Introduction Image Registration High Performance Computing Desired Testing Methodology Reviewed Registration Methods Preliminary Results Future.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
1 The Problem of Power Consumption in Servers L. Minas and B. Ellison Intel-Lab In Dr. Dobb’s Journal, May 2009 Prepared and presented by Yan Cai Fall.
MATE-EC2: A Middleware for Processing Data with Amazon Web Services Tekin Bicer David Chiu* and Gagan Agrawal Department of Compute Science and Engineering.
High Volume Slide Scanning Architecture and Applications
Digital Cameras  With a digital camera you can get instant results and pictures to your friends.  A digital camera is a type of camera that stores.
Klas Skogmar, Lund Institute of Technology Real-time Video Effects Using Programmable Graphics Cards Master of Science Thesis Klas Skogmar
Cluster computing facility for CMS simulation work at NPD-BARC Raman Sehgal.
© 2013 Mellanox Technologies 1 NoSQL DB Benchmarking with high performance Networking solutions WBDB, Xian, July 2013.
Report : Zhen Ming Wu 2008 IEEE 9th Grid Computing Conference.
Bob Thome, Senior Director of Product Management, Oracle SIMPLIFYING YOUR HIGH AVAILABILITY DATABASE.
1 Advanced Storage Technologies for High Performance Computing Sorin, Faibish EMC NAS Senior Technologist IDC HPC User Forum, April 14-16, Norfolk, VA.
Algorithm Design & Analysis – CS632 Group Project Group Members Bijay Nepal James Hansen-Quartey Winter
Chapter 13 File Structures. Understand the file access methods. Describe the characteristics of a sequential file. After reading this chapter, the reader.
University of Maryland Compiler-Assisted Binary Parsing Tugrul Ince PD Week – 27 March 2012.
Introduction Overview Static analysis Memory analysis Kernel integrity checking Implementation and evaluation Limitations and future work Conclusions.
Computationally Efficient Histopathological Image Analysis: Use of GPUs for Classification of Stromal Development Olcay Sertel 1,2, Antonio Ruiz 3, Umit.
Binary Image Compression via Monochromatic Pattern Substitution: A Sequential Speed-Up Luigi Cinque and Sergio De Agostino Computer Science Department.
COMPUTER MODELS FOR SKY IMAGE ANALYSIS OF THE INASAN ZVENIGOROD OBSERVATORY Sergei Pirogov ( Institute of Astronomy, Russian Academy of Sciences) VIIth.
An Approach for Processing Large and Non-uniform Media Objects on MapReduce-Based Clusters Rainer Schmidt and Matthias Rella Speaker: Lin-You Wu.
CHAPTER FOUR COMPUTER SOFTWARE.
3. Multimedia Systems Technology
Module 2: Information Technology Infrastructure Chapter 1: Hardware and Software.
Parallelizing Security Checks on Commodity Hardware E.B. Nightingale, D. Peek, P.M. Chen and J. Flinn U Michigan.
Today: -Assign Computers -Mac and Ps Overview video -Practice with macs and ps.
Uncovering the Multicore Processor Bottlenecks Server Design Summit Shay Gal-On Director of Technology, EEMBC.
Digital Image Processing & Analysis Spring Definitions Image Processing Image Analysis (Image Understanding) Computer Vision Low Level Processes:
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. LogKV: Exploiting Key-Value.
The Limitation of MapReduce: A Probing Case and a Lightweight Solution Zhiqiang Ma Lin Gu Department of Computer Science and Engineering The Hong Kong.
F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal Grid eXplorer (GdX) An Instrument for eXploring the GRID F. Cappello,
OSP310. What is a SharePoint® Farm? A collection of one or more SharePoint Servers and SQL Servers® providing a set of basic SharePoint.
Click to edit Master subtitle style 2/23/10 Time and Space Optimization of Document Content Classifiers Dawei Yin, Henry S. Baird, and Chang An Computer.
VTurbo: Accelerating Virtual Machine I/O Processing Using Designated Turbo-Sliced Core Embedded Lab. Kim Sewoog Cong Xu, Sahan Gamage, Hui Lu, Ramana Kompella,
GEM: A Framework for Developing Shared- Memory Parallel GEnomic Applications on Memory Constrained Architectures Mucahid Kutlu Gagan Agrawal Department.
Digital Imagery Quiz – Review Plank Jr. High – DIWD Class Mr. Brown.
Characteristics of a Digital Image It can be captured _____1_____ ways: –By camera –By an image scanner –By Cell phone Is stored on a Flash Drive, Memory.
Full and Para Virtualization
Sunpyo Hong, Hyesoon Kim
Map image compression for real-time applications UNIVERSITY OF JOENSUU DEPARTMENT OF COMPUTER SCIENCE Image Compression Research group:
Load Rebalancing for Distributed File Systems in Clouds.
1 Hierarchical Parallelization of an H.264/AVC Video Encoder A. Rodriguez, A. Gonzalez, and M.P. Malumbres IEEE PARELEC 2006.
Bigtable: A Distributed Storage System for Structured Data Google Inc. OSDI 2006.
Copyright © 2010 Hitachi Data Systems. All rights reserved. Confidential – NDA Strictly Required Hitachi Storage Solutions Hitachi HDD Directions HDD Actual.
Introduction to Performance Tuning Chia-heng Tu PAS Lab Summer Workshop 2009 June 30,
FILE COMPRESSION Lossy vs Lossless. Why compress a file? To save storage space. To speed up data transmission.
Daniele Lezzi Execution of scientific workflows on federated multi-cloud infrastructures IBERGrid Madrid, 20 September 2013.
Accelerating K-Means Clustering with Parallel Implementations and GPU Computing Janki Bhimani Miriam Leeser Ningfang Mi
Digital Pathology, An Introduction From glass slides to digital files and there after.
IMAGE PROCESSING is the use of computer algorithms to perform image process on digital images   It is used for filtering the image and editing the digital.
Lecture 2: Performance Evaluation
Introduction to Parallel Computing: MPI, OpenMP and Hybrid Programming
Pathology Spatial Analysis February 2017
From Algorithm to System to Cloud Computing
CLUSTER COMPUTING Presented By, Navaneeth.C.Mouly 1AY05IS037
Mapping the Data Warehouse to a Multiprocessor Architecture
Yu Su, Yi Wang, Gagan Agrawal The Ohio State University
Design Unit 26 Design a small or home office network
Parallel I/O System for Massively Parallel Processors
ICCCN14 CPS Cloud Panel Jun Wang
KISS-Tree: Smart Latch-Free In-Memory Indexing on Modern Architectures
CLUSTER COMPUTING.
Declarative Transfer Learning from Deep CNNs at Scale
Subject Name: Operating System Concepts Subject Number:
Presentation transcript:

High Performance Computing for Tissue MicroArray Analysis Dr Yinhai Wang David McCleary, Ching-Wei Wang, Jackie James, Dean Fennell, Peter Hamilton

Introduction

Tissue Microarrays Key technique for high throughput single assay platform for tissue biomarker research and discovery. *Dolled-Filhart and Rimm, Principles and Practice of Oncology, 7 th Edition, Chapter 7, 2004.

232 Tissue Cores The Bottleneck Relies on visual scoring of tissue biomarkers by pathologists. It is time consuming, subjective and prone to error.

103,790×58,586 pixels 17GB Image Analysis of TMA Virtual Slides A TMA virtual slide is an ultra-large digital image, scanned at a high magnification (40X). Computer assisted analysis using TMA virtual slides. Objective and reproducible. Speed?

Objective

Objective Automate TMA analysis Genuine high throughput platform Reduce pathologists workload Speedup biomarker discovery

Materials and Methods

Hewlett-Packard Blade Server. Intel Xeon quad-core x86_64 processors. >9,000 processor-cores available GB memory per node (8 cores). Gigabit Ethernet connection. Fibre connection to hard disks (SAN). High Performance Computing (HPC) Platform

Parallel Processing Module Glass slide Image generation Digital Slide Serving Module Image File Access Module Analytical Module TMA Database Viewing Instructions Results High resolution image HPC Platform Visualisation High Performance Computing

Colour conversion Vendor format independent B GRBG R Region extraction Raw image JPEG decoder Compressed? uncompressed data R GBGB Pixel (n,0) Pixel (0,1) Hamamatsu virtual slide No Yes Region extraction Raw image JPEG decoder Compressed? Aperio virtual slide No Yes uncompressed data BGRGR Pixel (0,0) Pixel (0,1)  Format? JPEG 2000 decoder JPEG 2000 JPEG Region extraction Raw image JPEG decoder Compressed? Carl Zeiss virtual slide No Yes uncompressed data RGBRG Pixel (0,0) Pixel (0,1) B Colour conversion HPC: Image File Access Module

Master Worker 1Worker 2Worker 3Worker 4 Database Request for core coordinates TMA core location (x, y) at TMA virtual slide Assign to available workers Storage Locate Image in Storage and Core Sub-image 4 Retrieve and Load Core Sub-image 5 Analyse and return results 6 7 Analyse and return results 6 Informs Master it is now available Centralised Dynamic Load Balancing HPC: Parallel Processing Module

HPC: Analytic Module Texture feature calculation ◦ Tumour pattern recognition ◦ Tumour region identification

HPC: Analytic Module Automated quantisation of biomarker IHC density on TMA core images, using colour decomposition.

HPC: Digital Slide Serving Module

Results

106,290×65,017 pixels 19.3GB 229 Tissue Cores Speedup=(Fastest Sequential Code)/(Parallel Code)=42.58 Texture Pattern Calculation for TMA Slides

Time for Loading vs. Saving Time for actual Texture Feature Calculation Loading, Storing, Texture

Processing time: 30minutes  77seconds Speedup=22.19 Biomarker Quantification

Multi TMA Slides There are >9000 processor-cores available The processing of 1 TMA virtual slide uses <100 processor-cores. >90 TMA virtual slides can be processed simultaneously (≈1 minute). Genuine high throughput platform for multiplex multi-TMA studies.

Conclusion A novel high performance computing platform for the rapid analysis of TMA virtual slides. The centralised load balancing approach is proven to be robust. It significantly speedups up the analysis of TMAs, removing the bottleneck. Valuable platform for TMA research & biomarker discovery. High performance platform for the algorithm prototyping, development & evaluation.

Acknowledgements

Thanks Dr Yinhai Wang 0044-(0)