AN INTRODUCTION TO NOSQL DATABASES Karol Rástočný, Eduard Kuric.

Slides:



Advertisements
Similar presentations
Introduction to MongoDB
Advertisements

Databases Architectures & Hypertable
Data Management in the Cloud Paul Szerlip. The rise of data Think about this o For the past two decades, the largest generator of data was humans -- now.
BigData Tools Seyyed mohammad Razavi. Outline  Introduction  Hbase  Cassandra  Spark  Acumulo  Blur  MongoDB  Hive  Giraph  Pig.
Jennifer Widom NoSQL Systems Overview (as of November 2011 )
NoSQL Databases: MongoDB vs Cassandra
Reporter: Haiping Wang WAMDM Cloud Group
Regions of Interest.  What’s in a ROI?  Use cases  Requirements  Current Storage System  Problems  Alternative Storage.
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
NoSQL Database.
CS 405G: Introduction to Database Systems 24 NoSQL Reuse some slides of Jennifer Widom Chen Qian University of Kentucky.
Massively Parallel Cloud Data Storage Systems S. Sudarshan IIT Bombay.
1 Yasin N. Silva Arizona State University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Distributed Data Stores and No SQL Databases S. Sudarshan IIT Bombay.
NoSQL for the SQL Server Pro
SQL vs NOSQL Discussion
MONGODB NOSQL SERIES Karol Rástočný 1. Prominent Users 2  AppScale, bit.ly, Business Insider, CERN LHC, craigslist, diaspora, Disney Interactive Media.
NoSQL by Michael Britton, Mark McGregor, and Sam Howard
: what’s all the buzz about?
Distributed Data Stores and No SQL Databases S. Sudarshan Perry Hoekstra (Perficient) with slides pinched from various sources such as Perry Hoekstra (Perficient)
HBase A column-centered database 1. Overview An Apache project Influenced by Google’s BigTable Built on Hadoop ▫A distributed file system ▫Supports Map-Reduce.
Getting Biologists off ACID Ryan Verdon 3/13/12. Outline Thesis Idea Specific database Effects of losing ACID What is a NoSQL database Types of NoSQL.
WTT Workshop de Tendências Tecnológicas 2014
Moohanad Hassan Maedeh Pishvaei. Introduction Open Source Apache foundation project Relational DB: SQL Server CouchDB : JSON document-oriented DB (NoSQL)
NoSQL Databases NoSQL Concepts SoftUni Team Technical Trainers Software University
NoSQL Not Only SQL Edel Sherratt. What is NoSQL? Not Only SQL Large volumes of data No schema Partition tolerance – scale by adding more commodity servers.
Changwon Nati Univ. ISIE 2001 CSCI5708 NoSQL looks to become the database of the Internet By Lawrence Latif Wed Dec Nhu Nguyen and Phai Hoang CSCI.
Cloud Computing Clase 8 - NoSQL Miguel Johnny Matias
© Copyright 2013 STI INNSBRUCK
1 Dennis Kafura – CS5204 – Operating Systems Big Table: Distributed Storage System For Structured Data Sergejs Melderis 1.
Discussion MySQL&Cassandra ZhangGang 2012/11/22. Optimize MySQL.
CS 347Lecture 9B1 CS 347: Parallel and Distributed Data Management Notes 13: BigTable, HBASE, Cassandra Hector Garcia-Molina.
Lecture 8: Databases and Data Infrastructure CS 6071 Big Data Engineering, Architecture, and Security Fall 2015, Dr. Rozier.
NOSQL Implementation and examples Maciej Matuszewski.
Copyright © Curt Hill NoSQL Databases No SQL or Not Only SQL.
NoSQL Or Peles. What is NoSQL A collection of various technologies meant to work around RDBMS limitations (mostly performance) Not much of a definition...
NoSQL Systems Motivation. NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not every data management/analysis.
NOSQL DATABASE Not Only SQL DATABASE
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT IT Monitoring WG Technology for Storage/Analysis 28 November 2011.
NoSQL: Graph Databases. Databases Why NoSQL Databases?
Data and Information Systems Laboratory University of Illinois Urbana-Champaign Data Mining Meeting Mar, From SQL to NoSQL Xiao Yu Mar 2012.
NoSQL databases A brief introduction NoSQL databases1.
Introduction to NoSQL Databases Chyngyz Omurov Osman Tursun Ceng,Middle East Technical University.
CS422 Principles of Database Systems Introduction to NoSQL Chengyu Sun California State University, Los Angeles.
Department of Computer Science, Johns Hopkins University EN Instructor: Randal Burns 24 September 2013 NoSQL Data Models and Systems.
Group members: Phạm Hoàng Long Nguyễn Huy Hùng Lê Minh Hiếu Phan Thị Thanh Thảo Nguyễn Đức Trí 1 BIG DATA & NoSQL Topic 1:
Look Mom! – NoSQL Charles Nurse | DotNetNuke Corp.
1 Analysis on the performance of graph query languages: Comparative study of Cypher, Gremlin and native access in Neo4j Athiq Ahamed, ITIS, TU-Braunschweig.
Dive into NoSQL with Azure Niels Naglé Hylke Peek.
1 Gaurav Kohli Xebia Breaking with DBMS and Dating with Relational Hbase.
NoSQL: Graph Databases
CS 405G: Introduction to Database Systems
NoSQL Know Your Enemy Shelly Noll Learning Care Group, Novi, MI
NO SQL for SQL DBA Dilip Nayak & Dan Hess.
NoSQL: Graph Databases
and Big Data Storage Systems
NoSQL Know Your Enemy Shelly Noll SRT Solutions, Ann Arbor, MI
CS122B: Projects in Databases and Web Applications Winter 2017
NoSQL Know Your Enemy Shelly Noll SRT Solutions, Ann Arbor, MI
NOSQL.
Christian Stark and Odbayar Badamjav
NOSQL databases and Big Data Storage Systems
NoSQL Systems Overview (as of November 2011).
Massively Parallel Cloud Data Storage Systems
NoSQL Databases Antonino Virgillito.
NoSQL Not Only SQL University of Kurdistan Faculty of Engineering
NoSQL Sampath Jayarathna Cal Poly Pomona
Introduction to Data Science
NoSQL Sampath Jayarathna Cal Poly Pomona
NoSQL databases An introduction and comparison between Mongodb and Mysql document store.
Presentation transcript:

AN INTRODUCTION TO NOSQL DATABASES Karol Rástočný, Eduard Kuric

Motivation  Not stable data  No fixed tables  Big data  Horizontal scalability  Distributed computing/querying  Concurrent access  Consistency 2

What is NoSQL?  NoSQL = No + SQL  More-accurately:  Not Only SQL  NoRel – No relational 3

NoSQL Databases - Classification  Sorted Ordered Column-Oriented Stores  Key/Value Stores  Document Databases  Graph Databases 4

Column-Oriented Stores  Contrast with row-oriented RDBMS  Data unit  Set of key(column)/value pairs  Sorted by row-key (primary key)  Nulls are not stored  Columns are organized in column-families  Name: FirstName, LastName,  Location: Address, State, GPS 5

Column-Oriented Stores  Bigtable  Google  HBase  Facebook, Yahoo!, Mahalo  Hypertable  Zvents, Baidu, Redif  Cloudata 6

Key/Value Stores  Idea  HashMap – fast O(1) access  Data unit: Key/Value pair  Key – string  Value Basic types: int, string, … Collections of basic types: set, list, … 7

Key/Value Stores  Membase  Zynga, NHN  Redis  Craigslist, Seznam, ALEF  Dynamo  Amzon  Cassandra  Facebok, Twitter, Digg  Voldemort  LinkedIn 8

Document Databases  Data unite:  Document = Object  Stored as a whole (not fragmented)  JSON (BSON) notation  Allows indexes on attributed 9

Document Databases  CouchDB  Apple, BBC, Cern, PeWeProxy  MongoDB  Github, ForSquare, Shutterfly, Sourceforge 10

Graph Databases  Data unite:  Node with relations to incident nodes  Representation  Set of triples – object, predicate, subject  Set of pointers to incident nodes 11

Graph Databases  AllegroGraph  TwitLogic, Pfizer  FlockDB  Twitter  Neo4j  Box.net 12

Use cases 13  Access to attributes, computation over attributes  Sorted ordered column-oriented stores  Temporal store, frequent add/remove operations  Key/Value stores  Operations over whole objects  Document databases  Relations store, deduction  Graph databases

Main Properties  Data modeling  Querying  Scalability  Consistency 14

Data Modeling  No standardized data model  Sorted ordered column-oriented stores  Structure in class-families level  Key/Value stores  Data structures of collections  Document databases  Rudimentary “class” diagrams  Graph databases  Graph schema definitions 15

Querying  MapReduce  Distributed computing and “views” generation  Custom languages  Mostly based on JavaScript  SQL-like languages  Apache Hive, HQL, CQL, SPARQL, …  Language bindings  Apache Thrift, REST API, Java API, custom Drivers, … 16

Scalability  Multi-master replication  Data partitioning (shards)  Fraud tolerance  Limits  Maximal number of rows, columns, column-families, documents, nodes, replicas, shards, …  Maximal size of index, data unit, … 17

Consistency  Strong Consistency  One master for write, multiple slaves for read  Eventual Consistency  Multiple write masters  Updates are propagated in low load phases  Consistency level  Transaction, row, column, document, … 18

Resources  Tiwari, S.: Professional NoSQL  Comparison of NoSQL databases ( mongodb-vs-couchdb-vs-redis)  Web sites of databases 19

Upcoming Presentations  Cassandra – Eduard Kuric  CouchDB – PeWeProxy team  Graph Databases – Michal Holub  MongoDB – Karol Rástočný  Redis – Alef team  and a lot more (board is opened for everyone ) 20