CS122B: Projects in Databases and Web Applications Winter 2017

Slides:



Advertisements
Similar presentations
CS 440 Database Management Systems
Advertisements

Data Management in the Cloud Paul Szerlip. The rise of data Think about this o For the past two decades, the largest generator of data was humans -- now.
A Survey of Distributed Database Management Systems Brady Kyle CSC
Jennifer Widom NoSQL Systems Overview (as of November 2011 )
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
NoSQL Database.
CS 405G: Introduction to Database Systems 24 NoSQL Reuse some slides of Jennifer Widom Chen Qian University of Kentucky.
Massively Parallel Cloud Data Storage Systems S. Sudarshan IIT Bombay.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
Getting Biologists off ACID Ryan Verdon 3/13/12. Outline Thesis Idea Specific database Effects of losing ACID What is a NoSQL database Types of NoSQL.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
NoSQL Databases Oracle - Berkeley DB. Content A brief intro to NoSQL About Berkeley Db About our application.
Introduction to Hbase. Agenda  What is Hbase  About RDBMS  Overview of Hbase  Why Hbase instead of RDBMS  Architecture of Hbase  Hbase interface.
Lecture 8: Databases and Data Infrastructure CS 6071 Big Data Engineering, Architecture, and Security Fall 2015, Dr. Rozier.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
NoSQL Or Peles. What is NoSQL A collection of various technologies meant to work around RDBMS limitations (mostly performance) Not much of a definition...
Nov 2006 Google released the paper on BigTable.
NoSQL Systems Motivation. NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not every data management/analysis.
Big Data and NoSQL What and Why?. Motivation: Size WWW has spawned a new era of applications that need to store and query very large data sets –Facebook.
Data and Information Systems Laboratory University of Illinois Urbana-Champaign Data Mining Meeting Mar, From SQL to NoSQL Xiao Yu Mar 2012.
NoSQL databases A brief introduction NoSQL databases1.
Introduction to NoSQL Databases Chyngyz Omurov Osman Tursun Ceng,Middle East Technical University.
Department of Computer Science, Johns Hopkins University EN Instructor: Randal Burns 24 September 2013 NoSQL Data Models and Systems.
Group members: Phạm Hoàng Long Nguyễn Huy Hùng Lê Minh Hiếu Phan Thị Thanh Thảo Nguyễn Đức Trí 1 BIG DATA & NoSQL Topic 1:
Abstract MarkLogic Database – Only Enterprise NoSQL DB Aashi Rastogi, Sanket V. Patel Department of Computer Science University of Bridgeport, Bridgeport,
CPSC8985 FA 2015 Team C3 DATA MIGRATION FROM RDBMS TO HADOOP By Naga Sruthi Tiyyagura Monika RallabandiRadhakrishna Nalluri.
Why NO-SQL ?  Three interrelated megatrends  Big Data  Big Users  Cloud Computing are driving the adoption of NoSQL technology.
NoSQL: Graph Databases
Neo4j: GRAPH DATABASE 27 March, 2017
CSCI5570 Large Scale Data Processing Systems
CS 405G: Introduction to Database Systems
NO SQL for SQL DBA Dilip Nayak & Dan Hess.
NoSQL: Graph Databases
and Big Data Storage Systems
NoSQL Databases NoSQL Concepts Databases Telerik Software Academy
Cloud Computing and Architecuture
BigData - NoSQL Hadoop - Couchbase
Hadoop.
CSE 775 – Distributed Objects Bekir Turkkan & Habib Kaya
An Open Source Project Commonly Used for Processing Big Data Sets
Data and Applications Security Developments and Directions
CS422 Principles of Database Systems Course Overview
Introduction In the computing system (web and business applications), there are enormous data that comes out every day from the web. A large section of.
MongoDB Er. Shiva K. Shrestha ME Computer, NCIT
Modern Databases NoSQL and NewSQL
Christian Stark and Odbayar Badamjav
NOSQL databases and Big Data Storage Systems
A Comparison of SQL and NoSQL Databases
NoSQL Systems Overview (as of November 2011).
Storage Systems for Managing Voluminous Data
Massively Parallel Cloud Data Storage Systems
1 Demand of your DB is changing Presented By: Ashwani Kumar
NoSQL Databases An Overview
CS122B: Projects in Databases and Web Applications Winter 2018
NoSQL Databases Antonino Virgillito.
Overview of big data tools
NoSQL W2013 CSCI 2141.
NoSQL Not Only SQL University of Kurdistan Faculty of Engineering
Database Systems Summary and Overview
April 13th – Semi-structured data
CS122B: Projects in Databases and Web Applications Winter 2019
Introduction to NoSQL Database Systems
CS122B: Projects in Databases and Web Applications Spring 2018
CS122B: Projects in Databases and Web Applications Winter 2018
NoSQL & Document Stores
NoSQL databases An introduction and comparison between Mongodb and Mysql document store.
Working with GEOLocation Data
CS122B: Projects in Databases and Web Applications Spring 2018
CS122B: Projects in Databases and Web Applications Winter 2018
Copyright © JanBask Training. All rights reserved Get Started with Hadoop Hive HiveQL Languages.
Presentation transcript:

CS122B: Projects in Databases and Web Applications Winter 2017 Notes 07: NoSQL and Hadoop Professor Chen Li Department of Computer Science UC Irvine CS122B

NoSQL Very hot these days What does NoSQL mean? Mainly with startups, less with enterprises What does NoSQL mean? “Not Only SQL” or “Not Relational”.

NoSQL: Overview “A NoSQL database provides a mechanism for storage and retrieval of data that is modeled in means other than the tabular relations used in relational databases.” Give up some of the ACID constraints to improve performance Simple interface: Write (=Put): needs to write all replicas Read (=Get): may get only one

NoSQL Key features: Scale horizontally “simple operations” Replicate/distribute data over many servers Simple call level interface (contrast w/ SQL) Weaker concurrency model than ACID Efficient use of distributed indexes and RAM Flexible schema

1. Key-value Stores Think “file system” more than “database” Put/get calls with a key Voldemort, Memcached Only primary index: lookup by key No secondary indexes

2. Document Stores Nested values, extensible records (e.g., XML or JSON). Schema-less MongoDB, CouchDB May have secondary indexes

3. Column stores Data organized as columns, not rows Vertica, HBase, Cassandra

4. Scalable Relational Systems Means RDBMS that offer sharding MySQL Cluster, VoltDB

What is Hadoop? Apache top level project, open-source implementation of frameworks for reliable, scalable, distributed computing and data storage. It is a flexible and highly-available architecture for large scale computation and data processing on a network of commodity hardware.

Google Origins 2003 2004 2006

Word Count over a Given Set of Web Pages see 1 bob 1 throw 1 see 1 spot 1 run 1 bob 1 run 1 see 2 spot 1 throw 1 see bob throw see spot run Can we do word count in parallel?

Word Count over a Given Set of Web Pages

“Big Data” Era