MarkLogic Database – Only Enterprise NoSQL DB
Aashi Rastogi, Sanket V. Patel
Department of Computer Science, University of Bridgeport, Bridgeport, CT


Abstract

MarkLogic is an Enterprise NoSQL database that supports a multi-model database design. It is optimized for structured and unstructured data and lets users store, manage, query, and search across JSON, XML, and RDF (triple store) data. It can handle data without a predefined schema, and handling different types of data in one place leads to faster time-to-results. It provides ACID transactions using MVCC (multi-version concurrency control). One key feature of MarkLogic is its bitemporal behavior: it can provide data as it was at every point in time. Due to its shared-nothing architecture, it is highly available and easily and massively scalable, with no single point of failure, and it makes structured data integration easier. It also supports incremental backup, which backs up only the data that has been updated. MarkLogic provides Hadoop integration: Hadoop is designed to store large amounts of data in the Hadoop Distributed File System (HDFS), while MarkLogic is better suited to transactional applications.

How MarkLogic Works?

MarkLogic uses XML documents as its data model and stores the documents within a transactional repository. It indexes the words and values from each loaded document, as well as the document structure. Because of its Universal Index, MarkLogic requires neither advance knowledge of the document structure nor complete adherence to a particular schema. MarkLogic Server clusters on commodity hardware using a shared-nothing architecture and differentiates itself in the market by supporting massive scale and high performance: customer deployments have scaled to hundreds of terabytes of source data while maintaining sub-second query response times.

In addition to XML, MarkLogic can store JSON, text, and binary documents. JSON documents are internally transformed to XML for purposes of indexing.
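The idea of indexing words, values, and document structure without a schema can be sketched in a few lines of Python. This is only an illustration of the general inverted-index technique, not MarkLogic's actual Universal Index: the function names (`build_index`, `search`), the term tuples, and the sample documents are all hypothetical, and text that follows a child element (tail text) is ignored for brevity.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def build_index(docs):
    """Map word, value, and structure terms to the set of document URIs containing them."""
    index = defaultdict(set)
    for uri, xml_text in docs.items():
        root = ET.fromstring(xml_text)
        _index_element(index, uri, root, parent=None)
    return index


def _index_element(index, uri, elem, parent):
    # Structure terms: the element name and its parent/child relationship.
    index[("element", elem.tag)].add(uri)
    if parent is not None:
        index[("child", parent.tag, elem.tag)].add(uri)
    text = (elem.text or "").strip()
    if text:
        # Value term: the exact text content of this element.
        index[("value", elem.tag, text)].add(uri)
        # Word terms: each lowercased whitespace-separated token.
        for word in text.lower().split():
            index[("word", word)].add(uri)
    for child in elem:
        _index_element(index, uri, child, elem)


def search(index, *terms):
    """Return the URIs that match every term by intersecting posting sets."""
    postings = [index.get(term, set()) for term in terms]
    return set.intersection(*postings) if postings else set()


# Hypothetical documents with different shapes; no schema is declared anywhere.
docs = {
    "/a.xml": "<article><title>Enterprise NoSQL</title><author>Rastogi</author></article>",
    "/b.xml": "<memo><title>Backup plan</title><note>nosql cluster</note></memo>",
}
index = build_index(docs)

print(sorted(search(index, ("word", "nosql"))))
print(sorted(search(index, ("word", "nosql"), ("element", "memo"))))
print(sorted(search(index, ("value", "author", "Rastogi"))))
```

Because every term maps to a set of document URIs, a query that mixes text and structure reduces to a set intersection, which is why documents of different shapes can be loaded and searched without declaring a schema first.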
Text documents are indexed as if each were an XML text node without a parent. Binary documents are unindexed by default, with the option to index their metadata and extracted contents.

Introduction

NoSQL ("non-SQL" or "non-relational") databases provide mechanisms to store and retrieve data that differ from those of relational databases. NoSQL databases are in use today because of their simplicity of design, ease of scaling out, and control over availability.

Types of NoSQL databases:
- Key-value based
- Column oriented
- Graph oriented
- Document based
- Multi-model

A multi-model database is designed to support multiple data models for a single application. MarkLogic is a NoSQL database that uses a multi-model design, and it is marketed as the only Enterprise NoSQL database. It is optimized for structured and unstructured data and lets you store, manage, query, and search across JSON, XML, RDF (triple store), geospatial data, text, and large binaries. With MarkLogic, one can handle data in a schema-agnostic fashion, which leads to faster time-to-results. It provides capabilities such as ACID transactions, high availability, disaster recovery, and security. MarkLogic is designed to run on Hadoop and to help you make better use of that technology. It is also easily deployed in the cloud, which removes the need to maintain hardware and provides the benefits of elasticity. It also has built-in application services and text search capabilities. It allows to discover new facts by act.
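The ACID transactions mentioned here are attributed in the abstract to MVCC (multi-version concurrency control). The following is a minimal sketch of the general MVCC idea, assuming a single-process store with a logical clock; the class `MVCCStore` and its methods are hypothetical and do not reflect MarkLogic's internal implementation. It omits conflict detection, deletes, and durability.

```python
class MVCCStore:
    """Toy multi-version store: writers append versions, readers see a snapshot."""

    def __init__(self):
        self._clock = 0       # logical commit timestamp
        self._versions = {}   # key -> list of (commit_ts, value), oldest first

    def snapshot(self):
        """Return the timestamp a reader should use for a consistent view."""
        return self._clock

    def commit(self, writes):
        """Commit a dict of key -> value under one new timestamp."""
        self._clock += 1
        for key, value in writes.items():
            self._versions.setdefault(key, []).append((self._clock, value))
        return self._clock

    def read(self, key, ts):
        """Return the newest value committed at or before ts, or None."""
        result = None
        for commit_ts, value in self._versions.get(key, []):
            if commit_ts > ts:
                break
            result = value
        return result


store = MVCCStore()
store.commit({"/doc.xml": "<v>1</v>"})
reader_ts = store.snapshot()             # a reader starts here
store.commit({"/doc.xml": "<v>2</v>"})   # a later writer does not disturb that reader

print(store.read("/doc.xml", reader_ts))         # version visible to the old snapshot
print(store.read("/doc.xml", store.snapshot()))  # newest committed version
```

Since old versions are never overwritten, a reader holding an earlier timestamp keeps a consistent view while writers continue to commit. Keeping timestamped versions is also the kind of mechanism that makes point-in-time queries possible.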
The following characteristics distinguish MarkLogic:
1) Flexible data model
2) Clear semantics
3) Scalability and elasticity
4) ACID transactions
5) High availability and disaster recovery
6) Hadoop integration
7) Bitemporal
8) Certified security

Conclusion

MarkLogic is designed to handle the volume, variety, and velocity of Big Data like other NoSQL solutions, and it has the enterprise features that made last-generation relational databases so reliable. MarkLogic also provides a way to manage hierarchical content, distributed graph data, XML, and RDF in the same database.

MarkLogic combines database, search, and application services in one component to greatly simplify the delivery platform. It brings the power of Enterprise NoSQL: schema flexibility to easily add new content and data, horizontal scale-out to grow as your business and products grow, and proven enterprise features to operate your product. Eight of the top 10 information providers in the world use MarkLogic to power their information delivery platforms, and we can help you deliver better information to your users, too.

“Standardized search technology not only provides a better experience for our customers, but also gives us a superior platform for developing new content applications and products across the company.”