HPCLatAm 2013 HPCLatAm 2013 Permutation Index and GPU to Solve efficiently Many Queries AUTORES  Mariela Lopresti  Natalia Miranda  Fabiana Piccoli.

Slides:



Advertisements
Similar presentations
Proximity Searching in High Dimensional Spaces with a Proximity Preserving Order Edgar Chávez Karina Figueroa Gonzalo Navarro UNIVERSIDAD MICHOACANA, MEXICO.
Advertisements

Scalable Multi-Cache Simulation Using GPUs Michael Moeng Sangyeun Cho Rami Melhem University of Pittsburgh.
Exploiting Graphics Processors for High- performance IP Lookup in Software Routers Author: Jin Zhao, Xinya Zhang, Xin Wang, Yangdong Deng, Xiaoming Fu.
An Effective GPU Implementation of Breadth-First Search Lijuan Luo, Martin Wong and Wen-mei Hwu Department of Electrical and Computer Engineering, UIUC.
Sparse LU Factorization for Parallel Circuit Simulation on GPU Ling Ren, Xiaoming Chen, Yu Wang, Chenxi Zhang, Huazhong Yang Department of Electronic Engineering,
ADBIS 2003 Revisiting M-tree Building Principles Tomáš Skopal 1, Jaroslav Pokorný 2, Michal Krátký 1, Václav Snášel 1 1 Department of Computer Science.
Acceleration of the Smith– Waterman algorithm using single and multiple graphics processors Author : Ali Khajeh-Saeed, Stephen Poole, J. Blair Perot. Publisher:
2009/04/07 Yun-Yang Ma.  Overview  What is CUDA ◦ Architecture ◦ Programming Model ◦ Memory Model  H.264 Motion Estimation on CUDA ◦ Method ◦ Experimental.
Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009.
DCABES 2009 China University Of Geosciences 1 The Parallel Models of Coronal Polarization Brightness Calculation Jiang Wenqian.
Weekly Report Start learning GPU Ph.D. Student: Leo Lee date: Sep. 18, 2009.
1 An Empirical Study on Large-Scale Content-Based Image Retrieval Group Meeting Presented by Wyman
A Resource-level Parallel Approach for Global-routing-based Routing Congestion Estimation and a Method to Quantify Estimation Accuracy Wen-Hao Liu, Zhen-Yu.
All-Pairs-Shortest-Paths for Large Graphs on the GPU Gary J Katz 1,2, Joe Kider 1 1 University of Pennsylvania 2 Lockheed Martin IS&GS.
Accelerating Machine Learning Applications on Graphics Processors Narayanan Sundaram and Bryan Catanzaro Presented by Narayanan Sundaram.
Gregex: GPU based High Speed Regular Expression Matching Engine Date:101/1/11 Publisher:2011 Fifth International Conference on Innovative Mobile and Internet.
HPEC_GPU_DECODE-1 ADC 8/6/2015 MIT Lincoln Laboratory GPU Accelerated Decoding of High Performance Error Correcting Codes Andrew D. Copeland, Nicholas.
Debunking the 100X GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU Presented by: Ahmad Lashgar ECE Department, University of Tehran.
GPGPU platforms GP - General Purpose computation using GPU
Accelerating SQL Database Operations on a GPU with CUDA Peter Bakkum & Kevin Skadron The University of Virginia GPGPU-3 Presentation March 14, 2010.
Efficient Lists Intersection by CPU-GPU Cooperative Computing Di Wu, Fan Zhang, Naiyong Ao, Gang Wang, Xiaoguang Liu, Jing Liu Nankai-Baidu Joint Lab,
Motivation “Every three minutes a woman is diagnosed with Breast cancer” (American Cancer Society, “Detailed Guide: Breast Cancer,” 2006) Explore the use.
High-Performance Packet Classification on GPU Author: Shijie Zhou, Shreyas G. Singapura and Viktor K. Prasanna Publisher: HPEC 2014 Presenter: Gang Chi.
An approach for solving the Helmholtz Equation on heterogeneous platforms An approach for solving the Helmholtz Equation on heterogeneous platforms G.
GPU-accelerated Evaluation Platform for High Fidelity Networking Modeling 11 December 2007 Alex Donkers Joost Schutte.
COLLABORATIVE EXECUTION ENVIRONMENT FOR HETEROGENEOUS PARALLEL SYSTEMS Aleksandar Ili´c, Leonel Sousa 2010 IEEE International Symposium on Parallel & Distributed.
CuMAPz: A Tool to Analyze Memory Access Patterns in CUDA
SCHOOL OF ELECTRICAL AND COMPUTER ENGINEERING | GEORGIA INSTITUTE OF TECHNOLOGY Accelerating Simulation of Agent-Based Models on Heterogeneous Architectures.
Performance Tuning on Multicore Systems for Feature Matching within Image Collections Xiaoxin Tang*, Steven Mills, David Eyers, Zhiyi Huang, Kai-Cheung.
1 On Querying Historical Evolving Graph Sequences Chenghui Ren $, Eric Lo *, Ben Kao $, Xinjie Zhu $, Reynold Cheng $ $ The University of Hong Kong $ {chren,
« Pruning Policies for Two-Tiered Inverted Index with Correctness Guarantee » Proceedings of the 30th annual international ACM SIGIR, Amsterdam 2007) A.
YOU LI SUPERVISOR: DR. CHU XIAOWEN CO-SUPERVISOR: PROF. LIU JIMING THURSDAY, MARCH 11, 2010 Speeding up k-Means by GPUs 1.
Evaluating FERMI features for Data Mining Applications Masters Thesis Presentation Sinduja Muralidharan Advised by: Dr. Gagan Agrawal.
Fast Support Vector Machine Training and Classification on Graphics Processors Bryan Catanzaro Narayanan Sundaram Kurt Keutzer Parallel Computing Laboratory,
GPU Architecture and Programming
Autonomic scheduling of tasks from data parallel patterns to CPU/GPU core mixes Published in: High Performance Computing and Simulation (HPCS), 2013 International.
Parallelization and Characterization of Pattern Matching using GPUs Author: Giorgos Vasiliadis 、 Michalis Polychronakis 、 Sotiris Ioannidis Publisher:
Accelerating Error Correction in High-Throughput Short-Read DNA Sequencing Data with CUDA Haixiang Shi Bertil Schmidt Weiguo Liu Wolfgang Müller-Wittig.
IIIT Hyderabad Scalable Clustering using Multiple GPUs K Wasif Mohiuddin P J Narayanan Center for Visual Information Technology International Institute.
Introduction What is GPU? It is a processor optimized for 2D/3D graphics, video, visual computing, and display. It is highly parallel, highly multithreaded.
Jie Chen. 30 Multi-Processors each contains 8 cores at 1.4 GHz 4GB GDDR3 memory offers ~100GB/s memory bandwidth.
Similarity Searching in High Dimensions via Hashing Paper by: Aristides Gionis, Poitr Indyk, Rajeev Motwani.
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
QCAdesigner – CUDA HPPS project
SCHOOL OF ELECTRICAL AND COMPUTER ENGINEERING | GEORGIA INSTITUTE OF TECHNOLOGY HPCDB Satisfying Data-Intensive Queries Using GPU Clusters November.
Authors – Jeahyuk huh, Doug Burger, and Stephen W.Keckler Presenter – Sushma Myneni Exploring the Design Space of Future CMPs.
 Genetic Algorithms  A class of evolutionary algorithms  Efficiently solves optimization tasks  Potential Applications in many fields  Challenges.
Euripides G.M. PetrakisIR'2001 Oulu, Sept Indexing Images with Multiple Regions Euripides G.M. Petrakis Dept. of Electronic.
GPU Accelerated MRI Reconstruction Professor Kevin Skadron Computer Science, School of Engineering and Applied Science University of Virginia, Charlottesville,
Radix Sort and Hash-Join for Vector Computers Ripal Nathuji 6.893: Advanced VLSI Computer Architecture 10/12/00.
Introduction.  Administration  Simple DBMS  CMPT 454 Topics John Edgar2.
ACCELERATING QUERY-BY-HUMMING ON GPU Pascal Ferraro, Pierre Hanna, Laurent Imbert, Thomas Izard ISMIR 2009 Presenter: Chung-Che Wang (Focus on the performance.
Ranking of Database Query Results Nitesh Maan, Arujn Saraswat, Nishant Kapoor.
Debunking the 100X GPU vs. CPU Myth An Evaluation of Throughput Computing on CPU and GPU Present by Chunyi Victor W Lee, Changkyu Kim, Jatin Chhugani,
Sudhanshu Khemka.  Treats each document as a vector with one component corresponding to each term in the dictionary  Weight of a component is calculated.
Geant4 on GPU prototype Nicholas Henderson (Stanford Univ. / ICME)
Onlinedeeneislam.blogspot.com1 Design and Analysis of Algorithms Slide # 1 Download From
GFlow: Towards GPU-based High- Performance Table Matching in OpenFlow Switches Author : Kun Qiu, Zhe Chen, Yang Chen, Jin Zhao, Xin Wang Publisher : Information.
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
3/12/2013Computer Engg, IIT(BHU)1 CUDA-3. GPGPU ● General Purpose computation using GPU in applications other than 3D graphics – GPU accelerates critical.
Introduction to Algorithms: Brute-Force Algorithms.
Accelerating K-Means Clustering with Parallel Implementations and GPU Computing Janki Bhimani Miriam Leeser Ningfang Mi
Exploiting Graphics Processors for High-performance IP Lookup in Software Routers Jin Zhao, Xinya Zhang, Xin Wang, Yangdong Deng, Xiaoming Fu IEEE INFOCOM.
Massive Spatial Query on the Kepler Architecture
Efficient Image Classification on Vertically Decomposed Data
Accelerating MapReduce on a Coupled CPU-GPU Architecture
Presented by: Isaac Martin
Efficient Image Classification on Vertically Decomposed Data
Pyramid Sketch: a Sketch Framework
Presentation transcript:

HPCLatAm 2013 HPCLatAm 2013 Permutation Index and GPU to Solve efficiently Many Queries AUTORES  Mariela Lopresti  Natalia Miranda  Fabiana Piccoli  Nora Reyes UNIVERSIDAD NACIONAL DE SAN LUIS 1

OBJETIVES  Speed up multimedia database queries through search index using High Performance Computing.  Search Index: Permutation.  High Performance Computing: Parallel programming on NVIDIA GPU. 2

INTRODUCTION  Multimedia Data.  How to resolve queries?  Similarty Search.  Metric Space Model: is a paradigm that allows to modelize all the similarity search problems.  Metric Data Base: store objects of a metric space and let resolve similiraty search. 3

INTRODUCTION  A metric space (X, d) is composed of a universe of valid objects X and a distance function d : X × X → R + defined among them.  The distance function determines the similarity (or dissimilarity) between two given objects and satisfies several properties which make it a metric.  Similarity Search: given a dataset of | U |= n objects, a query can be trivially answered by performing n distance evaluations.  There are two main queries of interest:  Range Searching.  The k Nearest Neighbors(k-NN). 4

SEARCH INDEX  The saved information in the index can vary, some indices store a subset of distances between objects, others maintain just a range of distance values.  The goal is to preprocess the dataset such that queries can be answered with as few distance computations as possible.  One of these indices is the Permutation Index. 5

INDEX: PERMUTATION  The algorithm based on permutation is a probabilistic algorithm.  Predict proximity between elements, using its permutations.  If two elements are similar then their permutations are similar.  Preprocessing step: compute the permutation of each element of the database.  All permutations are stored to form the index. 6

GPU - CUDA  GPU was developed with a highly parallel structure, high memory bandwidth.  GPU has high throughput becouse of the compute capability of thousands of threads.  GPU characteristics:  Several streams multiprocessors.  CPU – GPU memory hierarchy.  Threads running in parallel.

PERMUTATION ON GPU  Build a searching index: Permutants  Solving similarity queries on a Data Base. 8

GPU-CUDA PERMUTATION INDEX 9 The Indexing process has two stages: 1- Calculates the distance among every object in database and the permutants. 2- Sets up the signatures of all objects in database, i.e. all object permutations. Each thread compute an object permutation.

SOLVING APPROXIMATE QUERIES 1- Compute the permutation of query object. Each thread compute one permutation. 2- Contrast the permutation of query object with the index, according to footrule distance. 3- Sort the Footrule distances. They are sort with the quicksort implemented in parallel. 10

SOLVING APPROXIMATE QUERIES 4- Depending on the type of query we evaluate the selected object Range search: select items whose distance is less than a reference range KNN search: 4.2.1: compute de edit distance : sort the distances with the quicksort and select the k first items of the sorted list. 11

SOLVING PARALLELY MANY QUERIES 12  It is not enough to speed up the time to answer only one query, but it is necessary to leverage the capabilities of the GPU to parallely answer several queries.  The permutation index is built once and then is used to answer many queries.  GPU receives the queries set and it has to solve all of them.

ANALYSIS OF EXPERIMENTAL RESULTS We did experiments on:  Size of Data Base: 4KB, 29KB y 84KB.  Metric Data Base: English Words.  Distance Function: Edit Distance. CPU characteristics: Intel corei3, 2.13 GHz, 3 GB of memory. 13

ANALYSIS OF EXPERIMENTAL RESULTS  GPU CHARACTERISTICS: Ge Force GPU Global Memory SMSPClock Rate Computing capacity GTX MB GHz1.2 GTX550Ti1024 MB GHz2.1 GTX520MX1024 MB GHz2.1 14

ANALYSIS OF EXPERIMENTAL RESULTS 15 #permutantesGT520MXGTX550TiGTX #permutantesGT520MXGTX550TiGTX Range Search Throughput Knn Throughput

ANALYSIS OF EXPERIMENTAL RESULTS 16  The next figure show the obteined acceleration in range queries and K-NN queries for 80 queries solved in parallel.  Range queries show improvements respect to k-NN queries.  The best case is for largest database and maximun number of permutant.

ANALYSIS OF EXPERIMENTAL RESULTS 17 Speedup of Range search Queries on three different GPUs. Speedup of k-NN Search Queries for different number of parallel queries

ANALYSIS OF EXPERIMENTAL RESULTS 18 Speedup of GPU-Qsort and Thrust on three different GPUs  Our implementation obtains better speed up than the solution using thrust library.  it is important to notice the independence of GPU-Qsort from GPU characteristics, it works fine in all GPU

CONCLUSIONS  Implementation of an Index: Permutantes used to approximate similarity searches in databases of words.  Empirical Evaluation: improvements obteined in different architectures considered. 19

FUTURE WORK  We plan to make an exhaustive experimental evaluation considering others kinds of database, comparing with other solutions that apply GPU in the scenario of metric space similarity searches.  We need also to evaluate retrieval effectiveness of the answer of the Permutation Index, as the number of objects directly compared with the query grows, by using Recall and precission measures.  Exploiting the power of GPUs using optimization techniques to increase performance in solving many parallel query. 20

THANKS FOR YOUR ATTENTION Questions? 21 Mariela Lopresti: Natalia Fabiana Nora