Big DATA.

Slides:



Advertisements
Similar presentations
Nokia Technology Institute Natural Partner for Innovation.
Advertisements

Setting Big Data Capabilities Free How to Make Business on Big Data? Stig Torngaard, Partner Platon.
Big Data Management and Analytics Introduction Spring 2015 Dr. Latifur Khan 1.
Database – Part 3 Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Mr. Sakthi Angappamudali.
Chapter 9 DATA WAREHOUSING Transparencies © Pearson Education Limited 1995, 2005.
Chapter 3 Database Management
Database – Part 2b Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Sakthi Angappamudali at Standard Insurance; BI.
DATA WAREHOUSING.
Business Intelligence Andrew Davis Andria Zippler Jana Krinsky Tiffany Ferris.
Business Intelligence System September 2013 BI.
Chapter 3 Foundations of Business Intelligence: Databases and Information Management.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
Business Intelligence
Copyright © 2014 Pearson Education, Inc. 1 It's what you learn after you know it all that counts. John Wooden Key Terms and Review (Chapter 6) Enhancing.
Large Scale Data Analytics
Operational Data Tools Chapter Eight. Copyright © Houghton Mifflin Company. All rights reserved.8–28–2 Chapter Eight Learning Objectives To learn database.
This presentation was scheduled to be delivered by Brian Mitchell, Lead Architect, Microsoft Big Data COE Follow him Contact him.
What is Business Intelligence? Business intelligence (BI) –Range of applications, practices, and technologies for the extraction, translation, integration,
Big Data. What is Big Data? Big Data Analytics: 11 Case Histories and Success Stories
@ ?!.
Information Explosion. Reality: New Machine-Generated Data Non-relational and relational data outside of the EDW † Source: Analytics Platforms – Beyond.
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
BUSINESS DRIVEN TECHNOLOGY
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
Big Data Analytics Large-Scale Data Management Big Data Analytics Data Science and Analytics How to manage very large amounts of data and extract value.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
1 Melanie Alexander. Agenda Define Big Data Trends Business Value Challenges What to consider Supplier Negotiation Contract Negotiation Summary 2.
Advanced Database Concepts
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
Big Data Analytics Platforms. Our Team NameApplication Viborov MichaelApache Spark Bordeynik YanivApache Storm Abu Jabal FerasHPCC Oun JosephGoogle BigQuery.
BUSINESS INTELLIGENCE & ADVANCED ANALYTICS DISCOVER | PLAN | EXECUTE JANUARY 14, 2016.
ISQS 3358, Business Intelligence Anatomy of Business Intelligence Zhangxi Lin Texas Tech University 1.
Big Data Analytics with Excel Peter Myers Bitwise Solutions.
Smart Grid Big Data: Automating Analysis of Distribution Systems Steve Pascoe Manager Business Development E&O - NISC.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
Harnessing Big Data with Hadoop Dipti Sangani; Madhu Reddy DBI210.
BUSINESS INTELLIGENCE. The new technology for understanding the past & predicting the future … BI is broad category of technologies that allows for gathering,
BIG DATA. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database.
Foundations of Business Intelligence: Databases and Information Management Chapter 6 VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Big Data-An Analysis. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult.
Data Analytics (CS40003) Introduction to Data Lecture #1
Business Intelligence Overview
Large Scale Data Analytics
Big Data & Test Automation
Jaclyn Hansberry MIS2502: Data Analytics The Things You Can Do With Data The Information Architecture of an Organization Jaclyn.
Popular Database Management Systems
CNIT131 Internet Basics & Beginning HTML
Data Analytics 1 - THE HISTORY AND CONCEPTS OF DATA ANALYTICS
Data Mining – Intro.
SNS COLLEGE OF TECHNOLOGY
ANOMALY DETECTION FRAMEWORK FOR BIG DATA
MIS2502: Data Analytics Advanced Analytics - Introduction
BIG DATA IN ENGINEERING APPLICATIONS
Chapter 14 Big Data Analytics and NoSQL
ACS1803 Lecture Outline 2   DATA MANAGEMENT CONCEPTS Text, Ch. 3
Azure's Performance, Scalability, SQL Servers Automate Real Time Data Transfer at Low Cost MINI-CASE STUDY “Azure offers high performance, scalable, and.
Data Warehousing and Data Mining
Big Data Young Lee BUS 550.
INNOvation in TRAINING BUSINESS ANALYSTS HAO HElEN Zhang UniVERSITY of ARIZONA
Data Warehouse.
Big Data Analysis in Digital Marketing
Data Warehousing Concepts
Data Analysis and R : Technology & Opportunity
Analytics, BI & Data Integration
UNIT 6 RECENT TRENDS.
Big Data.
Presentation transcript:

Big DATA

What is Big Data? huge data is generated by different sources such as business people, marketing, education, engineering, medicine, social media, on-line transactions, call centers, sensors, web logs and telecommunication. Challenge :collection, storage, retrieval and analysis of enormous data. Business intelligence (BI) vendors are looking at predictive analytics to understand customer behaviour

Business Analytics Exploring data to find new patterns and relationships (data mining) Explaining why a certain result occurred (statistical analysis, quantitative analysis) Forecasting future results (predictive analytics).

Business intelligence (BI) Business intelligence (BI) is an umbrella term that includes the applications, infrastructure and tools, and best practices that enable access to and analysis of information to improve and optimize decisions and performance says Gartner.

Big Data Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze the nformation As the data is increasing voluminously in size from petabytes (1 petabyte=1024 terabytes), exabytes (1 exabyte=1024 petabytes),zettabytes (1 zettabyte=1024 exabytes), yottabytes (1 exabyte=1024 yottabytes).

Key Characteristics of Big data

different ways in which the data can be processed Operational Database: (OLTP) Online Transaction Processing (DBMSs). Data Warehousing: (OLAP) Online Analytical Processing. Stream Computing: Due to uninterrupted networks data is arriving in continuous stream 24/7 a day. This data need to be captured and processed using RealTime Analytics Processing (RTAP).

Big Data Technology 1 Hadoop: An open source (free) software framework for processing huge datasets on certain kinds of problems on a distributed system. Its development was inspired by Google’s MapReduce and Google File System. It was originally developed at Yahoo! and is now managed as a project of the Apache Software Foundation.

Big Data Technology HBase: An open source (free), distributed, nonrelational database modeled on oogle’s Big Table. It was originally developed by Powerset and is now managed as a project of the Apache Software foundation as part of the Hadoop.

Big Data Technology MapReduce: A software framework introduced by Google for processing huge datasets on certain kinds of problems on a distributed system. Also implemented in Hadoop. Cassandra: An open source (free) database management system designed to handle huge amounts of data on a distributed system. This system was originally developed at Facebook and is now managed as a project of the Apache Software foundation.

Big Data Technology 5. Extract, transform, and load: (ETL)Software tools used to extract data from outside sources, transform them to fit operational needs, and load them into a database or data warehouse. 6 Cloud computing: Cloud computing as a computing paradigm in which highly scalable computing resources, often configured as a distributed system, are provided as a service through a network.

Big Data Technology 7 Data warehouse: Specialized database optimized for reporting, often used for storing large amounts of structured data. Data is uploaded using ETL extract, transform, and load) tools from operational data stores, and reports are often generated using business intelligence tools. Data mart: Subset of a data warehouse, used to provide data to users usually through business intelligence tools. Google File System: Proprietary distributed file system developed by Google; part of the inspiration for Hadoop

Big Data analytics - classification descriptive analytics, predictive analytics, and prescriptive analytics.

Big Data analytics - classification descriptive analytics It exploits historical data to describe what occurred. For instance, a regression may be used to find simple trends in the datasets, visualization presents data in a meaningful fashion, and data modeling is used to collect, store and cut the data in an efficient way

Big Data analytics - classification Predictive Analytics: It focuses on predicting future probabilities and trends. For example, predictive modeling uses statistical techniques such as linear and logistic regression to understand trends and predict future outcomes, and data mining extracts patterns to provide insight and forecasts.

Big Data analytics - classification Prescriptive Analytics: It addresses decision making and efficiency. For example, simulation is used to analyze complex systems to gain insight into system behavior and identify issues and optimization techniques are used to find optimal solutions under given constraints

Benefits of big data in predictive analytics A wide variety of data from different sources can be captured, stored, and processed. More real-time analytics may be conducted from desktops or mobile devices for ad hock decision making. Large volumes of non-transactional data can be included in analytics. The time to solution is significantly shortened. Future possibility of automation of data modeling.