NoSQL Systems Motivation. NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not every data management/analysis.

Slides:



Advertisements
Similar presentations
No SQL is not about SQL No SQL is a Zoo.. Key-Value Stores Wide Column Stores Document Stores Graph Databases.
Advertisements

Data Management in the Cloud Paul Szerlip. The rise of data Think about this o For the past two decades, the largest generator of data was humans -- now.
Map/Reduce in Practice Hadoop, Hbase, MongoDB, Accumulo, and related Map/Reduce- enabled data stores.
计算机学院 数据库系统原理 1 Introduction to Databases 杨宁 1/23.
NoSQL and Review 1. Big Data (some old numbers) Facebook:  130TB/day: user logs  TB/day: 83 million pictures Google: > 25 PB/day processed data.
Jennifer Widom NoSQL Systems Overview (as of November 2011 )
Reporter: Haiping Wang WAMDM Cloud Group
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
NoSQL Database.
CS 405G: Introduction to Database Systems 24 NoSQL Reuse some slides of Jennifer Widom Chen Qian University of Kentucky.
Hadoop Ecosystem Overview
Massively Parallel Cloud Data Storage Systems S. Sudarshan IIT Bombay.
HADOOP ADMIN: Session -2
1 Yasin N. Silva Arizona State University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Distributed Data Stores and No SQL Databases S. Sudarshan IIT Bombay.
Introduction to NOSQL Databases
AN INTRODUCTION TO NOSQL DATABASES Karol Rástočný, Eduard Kuric.
NoSQL for the SQL Server Pro
SQL vs NOSQL Discussion
Jeffrey D. Ullman Stanford University. 2 Chunking Replication Distribution on Racks.
Distributed Data Stores and No SQL Databases S. Sudarshan Perry Hoekstra (Perficient) with slides pinched from various sources such as Perry Hoekstra (Perficient)
HBase A column-centered database 1. Overview An Apache project Influenced by Google’s BigTable Built on Hadoop ▫A distributed file system ▫Supports Map-Reduce.
MapReduce – An overview Medha Atre (May 7, 2008) Dept of Computer Science Rensselaer Polytechnic Institute.
Getting Biologists off ACID Ryan Verdon 3/13/12. Outline Thesis Idea Specific database Effects of losing ACID What is a NoSQL database Types of NoSQL.
WTT Workshop de Tendências Tecnológicas 2014
NOSQL By: Joseph Cooper MIS 409 MIS 409
MapReduce: Hadoop Implementation. Outline MapReduce overview Applications of MapReduce Hadoop overview.
Distributed Indexing of Web Scale Datasets for the Cloud {ikons, eangelou, Computing Systems Laboratory School of Electrical.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Introduction to Hadoop and HDFS
Modern Databases NoSQL and NewSQL Willem Visser RW334.
The Lightning Way XIV Encontro da comunidade SQLPort LX
NoSQL Databases NoSQL Concepts SoftUni Team Technical Trainers Software University
NoSQL. ACID Semantics Atomicity: All or nothing. Consistency: Consistent state of data and transactions. Isolation: Transactions are isolated from each.
Changwon Nati Univ. ISIE 2001 CSCI5708 NoSQL looks to become the database of the Internet By Lawrence Latif Wed Dec Nhu Nguyen and Phai Hoang CSCI.
Cloud Computing Clase 8 - NoSQL Miguel Johnny Matias
Introduction to Databases
Jennifer Widom NoSQL Systems Motivation. Jennifer Widom NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not.
By Vaibhav Nachankar Arvind Dwarakanath.  HBase is an open-source, distributed, column- oriented and sorted-map data storage.  It is a Hadoop Database;
Lecture 8: Databases and Data Infrastructure CS 6071 Big Data Engineering, Architecture, and Security Fall 2015, Dr. Rozier.
NOSQL Implementation and examples Maciej Matuszewski.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
IBM Research ® © 2007 IBM Corporation A Brief Overview of Hadoop Eco-System.
NoSQL Or Peles. What is NoSQL A collection of various technologies meant to work around RDBMS limitations (mostly performance) Not much of a definition...
Nov 2006 Google released the paper on BigTable.
NOSQL DATABASE Not Only SQL DATABASE
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT IT Monitoring WG Technology for Storage/Analysis 28 November 2011.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
NoSQL: Graph Databases. Databases Why NoSQL Databases?
Data and Information Systems Laboratory University of Illinois Urbana-Champaign Data Mining Meeting Mar, From SQL to NoSQL Xiao Yu Mar 2012.
NoSQL databases A brief introduction NoSQL databases1.
Introduction to NoSQL Databases Chyngyz Omurov Osman Tursun Ceng,Middle East Technical University.
Department of Computer Science, Johns Hopkins University EN Instructor: Randal Burns 24 September 2013 NoSQL Data Models and Systems.
Group members: Phạm Hoàng Long Nguyễn Huy Hùng Lê Minh Hiếu Phan Thị Thanh Thảo Nguyễn Đức Trí 1 BIG DATA & NoSQL Topic 1:
1 Analysis on the performance of graph query languages: Comparative study of Cypher, Gremlin and native access in Neo4j Athiq Ahamed, ITIS, TU-Braunschweig.
Introduction to Mongo DB(NO SQL data Base)
CS 405G: Introduction to Database Systems
NoSQL: Graph Databases
and Big Data Storage Systems
Hadoop.
CS122B: Projects in Databases and Web Applications Winter 2017
Modern Databases NoSQL and NewSQL
NOSQL.
NOSQL databases and Big Data Storage Systems
NoSQL Systems Overview (as of November 2011).
Massively Parallel Cloud Data Storage Systems
NOSQL and CAP Theorem.
NoSQL Systems Motivation.
NoSQL Databases Antonino Virgillito.
Cloud Computing for Data Analysis Pig|Hive|Hbase|Zookeeper
Presentation transcript:

NoSQL Systems Motivation

NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not every data management/analysis problem is best solved using a traditional relational DBMS  “NoSQL” = “No SQL” = Not using traditional relational DBMS  “No SQL”  Don’t use SQL language NoSQL Systems: Motivation

NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not every data management/analysis problem is best solved using a traditional relational DBMS  “NoSQL” = “No SQL” = Not using traditional relational DBMS  “No SQL”  Don’t use SQL language  “NoSQL” = “Not Only SQL” NoSQL Systems: Motivation

Not every data management/analysis problem is best solved using a traditional DBMS Database Management System (DBMS) provides…. … efficient, reliable, convenient, and safe multi-user storage of and access to massive amounts of persistent data. Database Management System (DBMS) provides…. … efficient, reliable, convenient, and safe multi-user storage of and access to massive amounts of persistent data. NoSQL Systems: Motivation

NoSQL Systems Alternative to traditional relational DBMS + Flexible schema + Quicker/cheaper to set up + Massive scalability + Relaxed consistency  higher performance & availability – No declarative query language  more programming – Relaxed consistency  fewer guarantees NoSQL Systems: Motivation

NoSQL Systems Several incarnations  MapReduce framework  Key-value stores  Document stores  Graph database systems NoSQL Systems: Overview

MapReduce Framework Originally from Google, open source Hadoop  No data model, data stored in files  User provides specific functions map() reduce()  System provides data processing “glue”, fault-tolerance, scalability NoSQL Systems: Overview

Map and Reduce Functions Map: Divide problem into subproblems Reduce: Do work on subproblems, combine results NoSQL Systems: Overview

MapReduce Architecture NoSQL Systems: Overview

MapReduce Example: Web log analysis Each record: UserID, URL, timestamp, additional-info Task: Count number of accesses for each domain (inside URL) NoSQL Systems: Overview

MapReduce Example (modified #1) Each record: UserID, URL, timestamp, additional-info Task: Total “value” of accesses for each domain based on additional-info NoSQL Systems: Overview

MapReduce Framework  No data model, data stored in files  User provides specific functions  System provides data processing “glue”, fault-tolerance, scalability NoSQL Systems: Overview

MapReduce Framework Schemas and declarative queries are missed Hive – schemas, SQL-like query language Pig – more imperative but with relational operators  Both compile to “workflow” of Hadoop (MapReduce) jobs NoSQL Systems: Overview

Key-Value Stores Extremely simple interface  Data model: (key, value) pairs  Operations: Insert(key,value), Fetch(key), Update(key), Delete(key) Implementation: efficiency, scalability, fault-tolerance  Records distributed to nodes based on key  Replication  Single-record transactions, “eventual consistency” NoSQL Systems: Overview

Key-Value Stores Extremely simple interface  Data model: (key, value) pairs  Operations: Insert(key,value), Fetch(key), Update(key), Delete(key)  Some allow (non-uniform) columns within value  Some allow Fetch on range of keys Example systems  Google BigTable, Amazon Dynamo, Cassandra, Voldemort, HBase, … NoSQL Systems: Overview

Document Stores Like Key-Value Stores except value is document  Data model: (key, document) pairs  Document: JSON, XML, other semistructured formats  Basic operations: Insert(key,document), Fetch(key), Update(key), Delete(key)  Also Fetch based on document contents Example systems  CouchDB, MongoDB, SimpleDB, … NoSQL Systems: Overview

Graph Database Systems  Data model: nodes and edges  Nodes may have properties (including ID)  Edges may have labels or roles NoSQL Systems: Overview

Graph Database Systems  Interfaces and query languages vary  Single-step versus “path expressions” versus full recursion  Example systems Neo4j, FlockDB, Pregel, …  RDF “triple stores” can map to graph databases NoSQL Systems: Overview