1 Biometric Databases. 2 Overview Problems associated with Biometric databases Some practical solutions Some existing DBMS.

Slides:



Advertisements
Similar presentations
CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
Advertisements

© Copyright 2012 STI INNSBRUCK Apache Lucene Ioan Toma based on slides from Aaron Bannert
Big Data Working with Terabytes in SQL Server Andrew Novick
Bio Michel Hanna M.S. in E.E., Cairo University, Egypt B.S. in E.E., Cairo University at Fayoum, Egypt Currently is a Ph.D. Student in Computer Engineering.
1. Aim High with Oracle Real World Performance Andrew Holdsworth Director Real World Performance Group Server Technologies.
Advanced Database Systems September 2013 Dr. Fatemeh Ahmadi-Abkenari 1.
IS 4420 Database Fundamentals Chapter 6: Physical Database Design and Performance Leon Chen.
Database Management: Getting Data Together Chapter 14.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 11 Database Performance Tuning and Query Optimization.
8-1 Outline  Overview of Physical Database Design  File Structures  Query Optimization  Index Selection  Additional Choices in Physical Database Design.
Definition of terms Definition of terms Explain business conditions driving distributed databases Explain business conditions driving distributed databases.
Chapter 8 Physical Database Design. McGraw-Hill/Irwin © 2004 The McGraw-Hill Companies, Inc. All rights reserved. Outline Overview of Physical Database.
Architecting a Large-Scale Data Warehouse with SQL Server 2005 Mark Morton Senior Technical Consultant IT Training Solutions DAT313.
IST Databases and DBMSs Todd S. Bacastow January 2005.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
Systems analysis and design, 6th edition Dennis, wixom, and roth
By: Blake Peters.  OODB- Object Oriented Database  An OODB is a database management system in which information is represented in the form of objects.
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 11 Database Performance Tuning and Query Optimization.
Database Systems Design, Implementation, and Management Coronel | Morris 11e ©2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Database Performance Tuning and Query Optimization.
IT The Relational DBMS Section 06. Relational Database Theory Physical Database Design.
1 Physical Data Organization and Indexing Lecture 14.
1 © Prentice Hall, 2002 Physical Database Design Dr. Bijoy Bordoloi.
HBase A column-centered database 1. Overview An Apache project Influenced by Google’s BigTable Built on Hadoop ▫A distributed file system ▫Supports Map-Reduce.
Data File Access API : Under the Hood Simon Horwith CTO Etrilogy Ltd.
Physical Database Design & Performance. Optimizing for Query Performance For DBs with high retrieval traffic as compared to maintenance traffic, optimizing.
Database Management. ICT5 Database Administration (DBA) The DBA’s tasks will include the following: 1. The design of the database. After the initial design,
DBSQL 14-1 Copyright © Genetic Computer School 2009 Chapter 14 Microsoft SQL Server.
MySQL. Dept. of Computing Science, University of Aberdeen2 In this lecture you will learn The main subsystems in MySQL architecture The different storage.
Chapter 6 1 © Prentice Hall, 2002 The Physical Design Stage of SDLC (figures 2.4, 2.5 revisited) Project Identification and Selection Project Initiation.
PowerPoint Presentation for Dennis, Wixom, & Tegarden Systems Analysis and Design with UML, 4th Edition Copyright © 2009 John Wiley & Sons, Inc. All rights.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
ICPP 2012 Indexing and Parallel Query Processing Support for Visualizing Climate Datasets Yu Su*, Gagan Agrawal*, Jonathan Woodring † *The Ohio State University.
DBMS Implementation Chapter 6.4 V3.0 Napier University Dr Gordon Russell.
Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.
Storing Organizational Information - Databases
1 Design Issues in XML Databases Ref: Designing XML Databases by Mark Graves.
Lecture2: Database Environment Prepared by L. Nouf Almujally 1 Ref. Chapter2 Lecture2.
Set Containment Joins: The Good, The Bad and The Ugly Karthikeyan Ramasamy Jointly With Jignesh Patel, Jeffrey F. Naughton and Raghav Kaushik.
Database Management COP4540, SCS, FIU Physical Database Design (ch. 16 & ch. 3)
1 Chapter 10 Joins and Subqueries. 2 Joins & Subqueries Joins – Methods to combine data from multiple tables – Optimizer information can be limited based.
© All rights reserved. U.S International Tech Support
GIS Data Models GEOG 370 Christine Erlien, Instructor.
Object Oriented Database By Ashish Kaul References from Professor Lee’s presentations and the Web.
Marwan Al-Namari Hassan Al-Mathami. Indexing What is Indexing? Indexing is a mechanisms. Why we need to use Indexing? We used indexing to speed up access.
SQL/Lesson 7/Slide 1 of 32 Implementing Indexes Objectives In this lesson, you will learn to: * Create a clustered index * Create a nonclustered index.
Database Indexing 1 After this lecture, you should be able to:  Understand why we need database indexing.  Define indexes for your tables in MySQL. 
Physical Database Design Purpose- translate the logical description of data into the technical specifications for storing and retrieving data Goal - create.
Chapter 8 Physical Database Design. Outline Overview of Physical Database Design Inputs of Physical Database Design File Structures Query Optimization.
Chapter 5 Index and Clustering
Session 1 Module 1: Introduction to Data Integrity
Mapping the Data Warehouse to a Multiprocessor Architecture
Basics of JDBC Session 14.
PowerPoint Presentation for Dennis, Wixom, & Tegarden Systems Analysis and Design with UML, 5th Edition Copyright © 2015 John Wiley & Sons, Inc. All rights.
Relational Operator Evaluation. overview Projection Two steps –Remove unwanted attributes –Eliminate any duplicate tuples The expensive part is removing.
CS 440 Database Management Systems Stored procedures & OR mapping 1.
SQL Basics Review Reviewing what we’ve learned so far…….
Oracle Announced New In- Memory Database G1 Emre Eftelioglu, Fen Liu [09/27/13] 1 [1]
LEC-8 SQL. Indexes The CREATE INDEX statement is used to create indexes in tables. Indexes allow the database application to find data fast; without reading.
Data Integrity & Indexes / Session 1/ 1 of 37 Session 1 Module 1: Introduction to Data Integrity Module 2: Introduction to Indexes.
SQL IMPLEMENTATION & ADMINISTRATION Indexing & Views.
Introduction to Partitioning in SQL Server
MongoDB Er. Shiva K. Shrestha ME Computer, NCIT
Database Performance Tuning and Query Optimization
CHAPTER 5: PHYSICAL DATABASE DESIGN AND PERFORMANCE
Physical Database Design
Database Management System
Chapter 11 Database Performance Tuning and Query Optimization
Advanced Database Topics
Presentation transcript:

1 Biometric Databases

2 Overview Problems associated with Biometric databases Some practical solutions Some existing DBMS

3 Problems Maintaining a huge Biometric database may cause scalability problems Matching time increases with the increase in database sizes Biometric data has no natural ordering Matching should be fast for a real-time system

4 Need for a DBMS in Biometrics Every large scale Biometrics Solution requires a RDBMS for efficient storage and access of data Examples : AIFS – contains 400 million fingerprints Point-of-sale Biometric identification system (100 million entries)

5 Indexing Why indexing data?  To accelerate Query Execution  Reduce the number of disk access Many solutions to speed up query processing:  Summary Tables (Not good for Ad-Hoc Queries)  parallel Machines (add additional Hardware --> cost)  Indexes (The Key to achieve this objective) Strong demand for efficient processing of complex queries on huge databases.

6 Indexing Issues contd.. Factors used to determine which indexing technique should be built on a Column:  Characteristics of indexed column oCardinality Data oDistribution oValue range  Understanding the Data and the Usage Developing a new Indexing technique for Data warehouse’s Queries  The index should be small and utilize space efficiently.  The Index should be able to operate with other indexes.  The Index should support Ad-Hoc and complex Queries and speed up join operations  The Index should be easy to build implement and maintain.

7 Binning Originates from network information theory It is division of set of code words (or templates) into subsets(“bins”) such that each bin satisfies some properties depending upon the application..is a way to segment the biometric templates, e.g., Male/Female Particular Finger Loop vs. whorl vs. arch may be another biometric

8 Binning --contd.. Increases search performance, may reduce search accuracy(increases false non match ratio) Search for a matching template may fail owing to an incorrect bin placement May have to include the same template in different bins Bin error rate is related to confidence in binning strategy

9 Architecture Details Loose to Tight Integration

10 Using the RDBMS Loose Integration – Use the RDBMS only for storage of templates – Match performed against in-memory structures created from the stored templates – Users use Biometric vendor-specific API or BioAPI Tight Integration – Use the RDBMS for storage of templates as well as for performing the match – Users use SQL queries directly against database tables

11 Loose Integration Biometric data is loaded from a database table into memory Matching done on custom-built memory-based structures – (+) Results in fast matching – (-) The solution is memory-bound Further scalability, achieved by using Server Farms – (-) Vendor-centric solution – (-) Can not be easily extended to support multi- modal systems

12 Tight Integration Template matching is implemented within the RDBMS and performed using SQL Allows Biometric Vendor to exploit full capabilities of RDBMS including – Security – Scalability and availability – Parallelism

13 Tight Integration – Template Storage A Biometric Template can be stored in a table column as – RAW data type – Simple Object data type – XML data type – Full Common Biometric Exchange File Format-compliant (CBEFF) data type

14 Tight Integration – A basic approach Biometric Vendors define SQL operators – IdentifyMatch() Given an input template, returns all the templates which match the input within a certain threshold (defined as primary operator) – Score() Returns the degree of match of the input template with a stored template (defined as ancillary to IdentifyMatch operator) Biometric Vendors define implementations for these operators which are specific to their biometric

15 Tight Integration - Indexing Biometric Vendors define an indexing scheme (indextype) for fast evaluation of the IdentifyMatch() operator Defining an indexing scheme involves – Developing a filter(s) which will quickly eliminate a large number of non-matching templates – An exact match is performed against the resulting (smaller) set of templates

16 A Fingerprint Example Create a table to store employee data along with their fingerprint template CREATE TABLE Employees (name VARCHAR2(128), employee_id INTEGER, dept VARCHAR2(30), fingerprint_template RAW(1024)); Index the column storing fingerprint data, for faster access CREATE INDEX FingerprintIndex ON employees (fingerprint_template) INDEXTYPE IS FingerprintIndexType; Retrieve the names and match scores for all employees whose fingerprint matches the input fingerprint SELECT name, Score(1) FROM Employees WHERE IdentifyMatch(fingerprint_template,, 1) > 0;

17 Fingerprint Indexing Possible indexing approach involves – classifying the fingerprints as (Left Loop, Right Loop, Whorl, and other) types Query involves – classifying the input fingerprint into one of these classes – performing exact matches against fingerprints of that class

18 Basic Indexing approach Build an auxiliary structure (table) that stores extracted portions of the template information along with the unique row identifiers of the base table Build native bitmap or B-tree indexes on the auxiliary structure A query on this table models the filter that returns a set of row identifiers for which the pair-wise match is performed

19 Indexing Challenges It may not always be possible to develop filter(s) to reduce the search space It might be difficult to beat in-memory matching algorithm

20 Supporting Multi Biometric Applications Why multi-modal biometrics? – Accuracy of a single biometric may be less than desired – If one of the traits is altered, user can still be recognized based on other traits

21 Combining Scores in Multi Biometrics CREATE TABLE Employees (id INTEGER, fingerprint_template RAW(1024),face_template RAW(1024)); SELECT Score(1), Score(2) FROM Employees WHERE IdentifyMatch (fingerprint_template,, 1) >0 AND IdentifyMatch(face_template,, 2) > 0; SELECT Score(1), Score(2) FROM Employees WHERE (IdentifyMatch(fingerprint_template,, 1) >0 OR IdentifyMatch(face_template,, 2) > 0) AND Score(1) + Score(2) >1;

22 Loose Vs. Tight Integration Loose Memory-based solution; can be fairly efficient and make use of pointers Memory bound Must custom-build features for large scale handling Does not need to know about additional DBMS features Tight Caching tables/indexes can help; however incurs buffer cache overhead Not memory bound Can exploit the features of RDBMS, such as Partitioning, Parallelism, and Security Requires understanding of DBMS functionality and extensibility

23 Loose vs. Tight Integration (cont.) Index structures can be pure memory-based structures Difficult to combine relational predicates Difficult to support multimodal applications Coming up with index structures can be challenging Can combine with relational predicates Easily extends to handle multi-modal applications