Getting Biologists off ACID Ryan Verdon 3/13/12. Outline Thesis Idea Specific database Effects of losing ACID What is a NoSQL database Types of NoSQL.

Slides:



Advertisements
Similar presentations
Data Management in the Cloud Paul Szerlip. The rise of data Think about this o For the past two decades, the largest generator of data was humans -- now.
Advertisements

Jennifer Widom NoSQL Systems Overview (as of November 2011 )
Relational Database Alternatives NoSQL. Choosing A Data Model Relational database underpin legacy applications and meet business needs However, companies.
NoSQL Databases: MongoDB vs Cassandra
Reporter: Haiping Wang WAMDM Cloud Group
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
NoSQL Database.
CS 405G: Introduction to Database Systems 24 NoSQL Reuse some slides of Jennifer Widom Chen Qian University of Kentucky.
Group 11 Sameera Shah & Fatemah Husain [10/31/13].
Inexpensive Scalable Information Access Many Internet applications need to access data for millions of concurrent users Relational DBMS technology cannot.
Thanks to our Sponsors! To connect to wireless 1. Choose Uguest in the wireless list 2. Open a browser. This will open a Uof U website 3. Choose Login.
Massively Parallel Cloud Data Storage Systems S. Sudarshan IIT Bombay.
1 Yasin N. Silva Arizona State University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Distributed Data Stores and No SQL Databases S. Sudarshan IIT Bombay.
AN INTRODUCTION TO NOSQL DATABASES Karol Rástočný, Eduard Kuric.
:: Conférence :: NoSQL / Scalabilite Etat de l’art Samuel BERTHE10 Mars 2014Epitech Nantes.
HBase A column-centered database 1. Overview An Apache project Influenced by Google’s BigTable Built on Hadoop ▫A distributed file system ▫Supports Map-Reduce.
NoSQL continued CMSC 461 Michael Wilson. MongoDB  MongoDB is another NoSQL solution  Provides a bit more structure than a solution like Accumulo  Data.
Alternatives to relational DBs What and Why…. Relational DBs  SQL - Fixed schema, row oriented & optimized  SQL - Rigid 2Phase transactions, with locking.
WTT Workshop de Tendências Tecnológicas 2014
Modern Databases NoSQL and NewSQL Willem Visser RW334.
Apache Cassandra - Distributed Database Management System Presented by Jayesh Kawli.
Changwon Nati Univ. ISIE 2001 CSCI5708 NoSQL looks to become the database of the Internet By Lawrence Latif Wed Dec Nhu Nguyen and Phai Hoang CSCI.
Methodological Foundations of Biomedical Informatics (BMSC-GA 4449) Himanshu Grover.
When bet365 met Riak and discovered a true, “always on” database.
CS 347Lecture 9B1 CS 347: Parallel and Distributed Data Management Notes 13: BigTable, HBASE, Cassandra Hector Garcia-Molina.
By Vaibhav Nachankar Arvind Dwarakanath.  HBase is an open-source, distributed, column- oriented and sorted-map data storage.  It is a Hadoop Database;
MongoDB Jer-Shuan Lin.
NOSQL Implementation and examples Maciej Matuszewski.
Modeling MongoDB with Relational Model Proposed by Christopher Polanco.
NoSQL Or Peles. What is NoSQL A collection of various technologies meant to work around RDBMS limitations (mostly performance) Not much of a definition...
NoSQL Systems Motivation. NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not every data management/analysis.
NOSQL DATABASE Not Only SQL DATABASE
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT IT Monitoring WG Technology for Storage/Analysis 28 November 2011.
Data and Information Systems Laboratory University of Illinois Urbana-Champaign Data Mining Meeting Mar, From SQL to NoSQL Xiao Yu Mar 2012.
CPT-S Advanced Databases 11 Yinghui Wu EME 49.
NoSQL databases A brief introduction NoSQL databases1.
CMPE 226 Database Systems May 3 Class Meeting Department of Computer Engineering San Jose State University Spring 2016 Instructor: Ron Mak
Context Aware RBAC Model For Wearable Devices And NoSQL Databases Amit Bansal Siddharth Pathak Vijendra Rana Vishal Shah Guided By: Dr. Csilla Farkas Associate.
Introduction to NoSQL Databases Chyngyz Omurov Osman Tursun Ceng,Middle East Technical University.
Department of Computer Science, Johns Hopkins University EN Instructor: Randal Burns 24 September 2013 NoSQL Data Models and Systems.
Group members: Phạm Hoàng Long Nguyễn Huy Hùng Lê Minh Hiếu Phan Thị Thanh Thảo Nguyễn Đức Trí 1 BIG DATA & NoSQL Topic 1:
Look Mom! – NoSQL Charles Nurse | DotNetNuke Corp.
BIG DATA. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database.
Abstract MarkLogic Database – Only Enterprise NoSQL DB Aashi Rastogi, Sanket V. Patel Department of Computer Science University of Bridgeport, Bridgeport,
Dive into NoSQL with Azure Niels Naglé Hylke Peek.
Database Applications (15-415) NoSQL Databases Lecture 26, April 19, 2016 Mohammad Hammoud.
NoSql An alternative option in the DevEvenings ORM Smackdown Tarn Barford
Database Applications (15-415) NoSQL Databases Lecture 24, November 27, 2016 Mohammad Hammoud.
CS 405G: Introduction to Database Systems
NO SQL for SQL DBA Dilip Nayak & Dan Hess.
and Big Data Storage Systems
CS122B: Projects in Databases and Web Applications Winter 2017
Based on: NoSQL Databases Based on:
Introduction In the computing system (web and business applications), there are enormous data that comes out every day from the web. A large section of.
NoSQL Database and Application
Modern Databases NoSQL and NewSQL
NOSQL.
Christian Stark and Odbayar Badamjav
NOSQL databases and Big Data Storage Systems
NoSQL Systems Overview (as of November 2011).
Storage Systems for Managing Voluminous Data
Massively Parallel Cloud Data Storage Systems
1 Demand of your DB is changing Presented By: Ashwani Kumar
NOSQL and CAP Theorem.
NoSQL Databases An Overview
NoSQL Databases Antonino Virgillito.
CMPE 280 Web UI Design and Development March 14 Class Meeting
NoSQL databases An introduction and comparison between Mongodb and Mysql document store.
Presentation transcript:

Getting Biologists off ACID Ryan Verdon 3/13/12

Outline Thesis Idea Specific database Effects of losing ACID What is a NoSQL database Types of NoSQL databases with examples

Thesis Idea Amount of biological data is growing exponentially Hardware cannot keep up with the growth

Database Survey In 2003, a survey was conducted of 111 biological databases. 96 (i.e. 87%) of the 111 considered databases have Hypertext references to other databases, 40 to 44 (i.e. 36% to 40%) are implemented as flat files, 41 (or 42) (i.e. 37%) are implemented using a relational database management system, 7 (i.e. 6%) use an object database management system, 3 (.i.e. 3%) use an object-relational database management system, and all databases collect data from different sources.

Specific Database

Flybase Primary biology database on the insect family Drosophilidae (fruit fly) Important research from fruit flies includes – Genes – Recombination – Signaling networks (important for major diseases) – Stem cells – Growth control

Info on the Flybase database Single server relational database Over 135 tables Uses Chado schema Gives data a “type” Creates a graph that relates the different types

Effects of losing ACID

ACID Atomicity Consistency Isolation Durability

CAP Theorem Consistency Availability Partition tolerance

BASE consistency model Basically Available Soft state Eventually consistent Given enough time where no changes are made, all replicas will see the same data

NoSQL

What is a NoSQL database Any non-relational database – Hierarchical – Graph – Object oriented – Etc.

Why were NoSQL databases created To overcome limitations of relational databases – Predefined layout – Scaling – SQL – Large feature set

Major types of NoSQL databases Key-value stores Column-oriented databases Document based stores

Key-value stores Stored values are indexed for retrieval by keys Can store unstructured or structured data Similar to DHTs

Example: Dynamo Created by Amazon, only used internally Highly available even in the face of continual failures Amazon realized many services only need primary-key access – Best seller lists – Shopping carts – Session management

Column-oriented databases Contain extendable columns of closely related data Can greatly benefit from compression

Example: HBase Apache open-source project Based off of Google’s Bigtable database Ties in closely with Hadoop and map-reduce

Document based stores Data stored as a collection of documents Documents can have any number of fields and any length Documents are accessed via a unique key Capabilities of the query language heavily depend on the implementation

Example: MongoDB Started by 10gen Supports – Replication – Map-reduce – Sharding Used by lots of large companies including – Disney – Craigslist

Questions ?