Big Data Young Lee BUS 550.

Slides:



Advertisements
Similar presentations
1 NETE4631 Cloud deployment models and migration Lecture Notes #4.
Advertisements

 Need for a new processing platform (BigData)  Origin of Hadoop  What is Hadoop & what it is not ?  Hadoop architecture  Hadoop components (Common/HDFS/MapReduce)
Undergraduate Poster Presentation Match 31, 2015 Department of CSE, BUET, Dhaka, Bangladesh Wireless Sensor Network Integretion With Cloud Computing H.M.A.
Big Data A big step towards innovation, competition and productivity.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
© 2011 IBM Corporation Smarter Software for a Smarter Planet The Capabilities of IBM Software Borislav Borissov SWG Manager, IBM.
Tool name : Firebug A URL for more information about the tool, or where to buy or download it : Firebug is.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
W HAT IS H ADOOP ? Hadoop is an open-source software framework for storing and processing big data in a distributed fashion on large clusters of commodity.
Introduction to Apache Hadoop Zibo Wang. Introduction  What is Apache Hadoop?  Apache Hadoop is a software framework which provides open source libraries.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Introduction to Hadoop and HDFS
f ACT s  Data intensive applications with Petabytes of data  Web pages billion web pages x 20KB = 400+ terabytes  One computer can read
SEMINAR ON Guided by: Prof. D.V.Chaudhari Seminar by: Namrata Sakhare Roll No: 65 B.E.Comp.
1 Melanie Alexander. Agenda Define Big Data Trends Business Value Challenges What to consider Supplier Negotiation Contract Negotiation Summary 2.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
HADOOP Carson Gallimore, Chris Zingraf, Jonathan Light.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
Big Data Analytics Platforms. Our Team NameApplication Viborov MichaelApache Spark Bordeynik YanivApache Storm Abu Jabal FerasHPCC Oun JosephGoogle BigQuery.
Microsoft Azure and DataStax: Start Anywhere and Scale to Any Size in the Cloud, On- Premises, or Both with a Leading Distributed Database MICROSOFT AZURE.
{ Tanya Chaturvedi MBA(ISM) Hadoop is a software framework for distributed processing of large datasets across large clusters of computers.
Axis AI Solves Challenges of Complex Data Extraction and Document Classification through Advanced Natural Language Processing and Machine Learning MICROSOFT.
BIG DATA. The information and the ability to store, analyze, and predict based on that information that is delivering a competitive advantage.
BIG DATA. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database.
Abstract MarkLogic Database – Only Enterprise NoSQL DB Aashi Rastogi, Sanket V. Patel Department of Computer Science University of Bridgeport, Bridgeport,
Microsoft Ignite /28/2017 6:07 PM
MIS 3500 Instructor: Bob Travica Trendy Database Topics 2016.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
A Tutorial on Hadoop Cloud Computing : Future Trends.
Big Data-An Analysis. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult.
Data Analytics 1 - THE HISTORY AND CONCEPTS OF DATA ANALYTICS
Connected Infrastructure
Organizations Are Embracing New Opportunities
Data Platform and Analytics Foundational Training
Big Data is a Big Deal!.
Big Data Enterprise Patterns
Connected Living Connected Living What to look for Architecture
Sushant Ahuja, Cassio Cristovao, Sameep Mohta
Hadoop Aakash Kag What Why How 1.
Connected Maintenance Solution
ANOMALY DETECTION FRAMEWORK FOR BIG DATA
CS122B: Projects in Databases and Web Applications Winter 2017
Connected Maintenance Solution
Connected Living Connected Living What to look for Architecture
Connected Infrastructure
© 2016 Global Market Insights, Inc. USA. All Rights Reserved MLaaS Market share research by applications and regions for :
Madrid Software Training Solutions Big Data Hadoop.
© 2016 Global Market Insights, Inc. USA. All Rights Reserved Fuel Cell Market size worth $25.5bn by 2024Low Power Wide Area Network.
Hadoop Market
Built on the Powerful Microsoft Azure Platform, Lievestro Delivers Care Information, Capacity Management Solutions to Hospitals, Medical Field MICROSOFT.
Big Data For Indian SMEs
Big Data - in Performance Engineering
Global Enterprise Search
DeFacto Planning on the Powerful Microsoft Azure Platform Puts the Power of Intelligent and Timely Planning at Any Business Manager’s Fingertips Partner.
David Gillman Collaborative Metrix
Hadoop Technopoints.
Big Data.
XtremeData on the Microsoft Azure Cloud Platform:
TIM TAYLOR AND JOSH NEEDHAM
Zoie Barrett and Brian Lam
Big Data Analysis in Digital Marketing
Big DATA.
Artificial Intelligence Market Report : Trends, Forecast and Competitive Analysis 1.
Customer 360.
Copyright © JanBask Training. All rights reserved Get Started with Hadoop Hive HiveQL Languages.
Microsoft Azure Services Platform
SQL Server 2019 Bringing Apache Spark to SQL Server
Big Data.
Presentation transcript:

Big Data Young Lee BUS 550

Big data

Big Data Explosion of information Iot Analytics Not just SQL (Structured query language)but unstructured data Transformation from a entity based data to transactional databases

Industry https://www.youtube.com/watch?v=eVSfJhssXUA Billion dollar industry Corporate investments

who IBM Google Sears Amazon Social media applications: facebook

Why Insights Predictions Customer value Efficiency Costs savings Product development Makes AI possible Analytics

How Cloud computing Cognitive computing Artificial intelligence Software implementations

Who needs big data Insurance companies Airlines Retail Hospitals Traffic Manufacturers

Concept of big data

DatA Data is like a dam Gartner security High volume, high velocity and high variety (unstructured) Veracity (trustworthy) security

IBM take on Big data

Tools of big data Map reduce Hadoop Big Table Kaggle Tool design by google to functions large amount of data Hadoop Run Map reduce on large cluster Big Table Google developed distributed storage Kaggle

Hadoop The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures. https://hadoop.apache.org/

Kaggle Kaggle is an online community of data scientists and machine learners, owned by Google, Inc. Kaggle allows users to find and publish data sets, explore and build models in a web-based data-science environment, work with other data scientists and machine learning engineers, and enter competitions to solve data science challenges.

issues Security Personal information Constant monitoring Safety Storage Errors (https://www.ft.com/content/21a6e7d8-b479-11e3-a09a-00144feabdc0) Scary (Forbes)

Questions Who made map reduce?