+ Big Data IST210 Class Lecture. + Big Data Summary by EMC Corporation (http://www.emc.com) More videos that.

Slides:



Advertisements
Similar presentations
R and HDInsight in Microsoft Azure
Advertisements

Big Data What is Big Data? Recently much good science, whether physical, biological, or social, has been forced to confront - and has often benefited from.
Big Data and Predictive Analytics in Health Care Presented by: Mehadi Sayed President and CEO, Clinisys EMR Inc.
Big Data Workflows N AME : A SHOK P ADMARAJU C OURSE : T OPICS ON S OFTWARE E NGINEERING I NSTRUCTOR : D R. S ERGIU D ASCALU.
Dunja Mladenić Marko Grobelnik Jožef Stefan Institute, Slovenia.
Chapter 14 The Second Component: The Database.
25 Need-to-Know Facts. Fact 1 Every 2 days we create as much information as we did from the beginning of time until 2003 [Source]Source © 2014 Bernard.
GROUP 1 : DATO’ NABIL ABD KADIR SAYNUL ISLAM MOHAMMAD GHAZALI MOHD DAUD.
Understanding Big Data Introduction. Information has always been a crucial resource for decision making. The lack of information in a subject can lead.
Big Data. What is Big Data? Analog starage vs digital. The FOUR V’s of Big Data. Who’s Generating Big Data The importance of Big Data. Optimalization.
Big Data A big step towards innovation, competition and productivity.
Chapter 2: Business Intelligence Capabilities
Basic Marketing Research Customer Insights and Managerial Action
This presentation was scheduled to be delivered by Brian Mitchell, Lead Architect, Microsoft Big Data COE Follow him Contact him.
Cyber Basics and Big Data. 2 Semantic Extraction Sentiment Analysis Entity Extraction Link Analysis Temporal Analysis Geospatial Analysis Time Event Matrices.
© 2011 IBM Corporation Smarter Software for a Smarter Planet The Capabilities of IBM Software Borislav Borissov SWG Manager, IBM.
© 2013 IBM Corporation Version 1.0 The New Eye Insight through Big Data and Analytics: A Case Study on Citizen Sentiment Analysis Sandipan Sarkar, Executive.
Big Data. What is Big Data? Big Data Analytics: 11 Case Histories and Success Stories
© 2012 IBM Corporation IBM Security Systems 1 © 2013 IBM Corporation 1 Ecommerce Antoine Harfouche.
Big Data: A definition Big data is the realization of greater intelligence by storing, processing, and analyzing data that was previously ignored due to.
Dr. Michael D. Featherstone Summer 2013 Introduction to e-Commerce Web Analytics.
© 2010 IBM Corporation Business Analytics software Business Analytics Editable Text Editable Text Editable Text.
Cyberspace Law Committee Meeting, August 3, 2012 Big Data Lois Mermelstein The Law Office of Lois D. Mermelstein
Cloud Computing & Big Data Group 9 Femme L H Sabaru | Aditya Gisheila N P | Aninda Harapan | Harry | Andrew Khosugih.
Big Data – Big Opportunity Mohammad Khansari ITRC President Jan 2015 ITRC, Tehran, Iran.
© 2012 IBM Corporation Converting Big Data into Big Knowledge.
Big Data: Electronic Gold And why Oreus should invest in Big Data Thomas Snuverink.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
What we know or see What’s actually there Wikipedia : In information technology, big data is a collection of data sets so large and complex that it.
IoT Meets Big Data Standardization Considerations
BUSINESS INTELLIGENCE & ADVANCED ANALYTICS DISCOVER | PLAN | EXECUTE JANUARY 14, 2016.
Big Data Analytics with Excel Peter Myers Bitwise Solutions.
Smart Grid Big Data: Automating Analysis of Distribution Systems Steve Pascoe Manager Business Development E&O - NISC.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
B IG D ATA : S TORAGE, A NALYSIS AND I MPACT Justinas Bisikirskas 1.
BIG DATA. The information and the ability to store, analyze, and predict based on that information that is delivering a competitive advantage.
Big Data Javad Azimi May First of All… Sorry about the language  Feel free to ask any question Please share similar experiences.
BIG DATA. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database.
Unlock your Big Data with Analytics and BI on Office365 Brian Culver ● SharePoint Fest Seattle● BI102 ● August 18-20, 2015.
Big Data-An Analysis. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult.
Data Analytics (CS40003) Introduction to Data Lecture #1
CNIT131 Internet Basics & Beginning HTML
Data Analytics 1 - THE HISTORY AND CONCEPTS OF DATA ANALYTICS
WHY VIDEO SURVELLIANCE
Data Platform Modernization
Big Data is a Big Deal!.
Understanding Big Data
Discovering Computers 2010: Living in a Digital World Chapter 14
April 25, 2012 The Three R’s Are Old School – Now It Is All About Volume, Velocity & Variety Peter Guest Alberta Public Sector Client Technical Advisor.
Big Data.
BIG Data 25 Need-to-Know Facts.
BIG DATA IN ENGINEERING APPLICATIONS
Global Corporate Fast Facts
Mohammad J. Mansourzadeh
Department of Information Systems
April 25, 2012 The Three R’s Are Old School – Now It Is All About Volume, Velocity & Variety Peter Guest Alberta Public Sector Client Technical Advisor.
Data Platform Modernization
Big Data.
Big Data Young Lee BUS 550.
IT Megatrends that shape the Digital Future…
Zoie Barrett and Brian Lam
WHY VIDEO SURVELLIANCE
Big Data: Four Vs Salhuldin Alqarghuli.
AGENDA Buzz word. AGENDA Buzz word What is BIG DATA ? Big Data refers to massive, often unstructured data that is beyond the processing capabilities.
Big DATA.
V. Uddameri Texas Tech University
UNIT 6 RECENT TRENDS.
Big Data.
Presentation transcript:

+ Big Data IST210 Class Lecture

+ Big Data Summary by EMC Corporation ( More videos that pertain to data are found here:

+ What is Big Data? In information technology, big data is a collection of data sets so large and complex that it becomes difficult to process using traditional relational database management systems. Relational Data-Base Management System

+ Challenges: capturing storing searching sharing analyzing visualization

+ Data types with size issues: scientific models/simulations (biology, astrophysics) genetic studies traffic internet searching business information (order management to stock data)

+ What’s making so much data? ubiquitous computing (an area of study for those interested) more people carrying data-generating devices (mobile phones with facebook, gps, cameras, etc.)

+ Just how big are we talking? In 2012 we hit the capability of creating and storing 2.5 quintillion bytes of data PER DAY (2.5 x 10^18) (2.5 billion gigabytes) 90% of the world's data created in last two years Human genome, at the time it was originally mapped, took 10 years to process. It can now be done in a week (as of 2012). Walmart handles 1 million+ transactions per hour and needs to store these for analysis to determine what products sell where, etc.

+ Where is the problem? When trying to get useful information out of the huge volume of data (drinking from a fire-hose), the use of traditional RDBMS queries isn't sufficient. Why? IF you could store all of this data for one example (all tweets in a week, for instance), to search it with traditional tools to find out if a particular topic was trending would take so long that the result would be meaningless by the time it was computed. Big Data solutions, then, consider how to store this data in novel ways in order to make it more accessible, and also to come up with methods of performing analysis on it.

+ Where is the problem? This quite commonly now includes massively parallel software on anywhere from hundreds to thousands of servers, which could be virtual machines themselves on growing server farms. The overall idea of "big data" includes not only storage and analysis, but considering just how to shape the data, what to store, how store, how to search, share and visualize it. There is so much demand, right now, for understanding how to handle the massive amounts of data and make it useful that the industry is now more than $100 billion in size and growing at about 10% per year, about twice as fast as other software technology.

+ Changing how we store data: Big Data analytics, in order to be performed in a practically useful manner, are requiring a redevelopment of data storage. Instead of older SAN storage farms or data warehouses, data is moving into directly connected (Direct-Attached Storage: DAS) of things like solid state disks or large SATA disks attached to parallel processing nodes. This brings the huge amounts of data closer to large processing capabilities in order to perform more timely analytics.

+ Activity/Discussion: ch/technology_and_innovation/big_data_the_n ext_frontier_for_innovation ch/technology_and_innovation/big_data_the_n ext_frontier_for_innovation What do you take away from this reading?

+ Structured Storage A Column (not the same as a column in a relational database) A Super Column A Column Family

+ Getting Information Out of Structured Storage - Map Reduce Map – distribute the task among multiple computers Reduce – take the results from each computer and combine them

+ IBM considers Big Data: Big data spans four dimensions: Volume, Velocity, Variety, and Veracity. Volume: Enterprises are awash with ever- growing data of all types, easily amassing terabytes—even petabytes—of information. Turn 12 terabytes of Tweets created each day into improved product sentiment analysis Convert 350 billion annual meter readings to better predict power consumption

+ IBM considers Big Data: Big data spans four dimensions: Volume, Velocity, Variety, and Veracity. Velocity: Sometimes 2 minutes is too late. For time- sensitive processes such as catching fraud, big data must be used as it streams into your enterprise in order to maximize its value. Scrutinize 5 million trade events created each day to identify potential fraud Analyze 500 million daily call detail records in real- time to predict customer churn faster

+ IBM considers Big Data: Big data spans four dimensions: Volume, Velocity, Variety, and Veracity. Variety: Big data is any type of data - structured and unstructured data such as text, sensor data, audio, video, click streams, log files and more. New insights are found when analyzing these data types together. Monitor 100’s of live video feeds from surveillance cameras to target points of interest Exploit the 80% data growth in images, video and documents to improve customer satisfaction

+ IBM considers Big Data: Big data spans four dimensions: Volume, Velocity, Variety, and Veracity. Veracity: 1 in 3 business leaders don’t trust the information they use to make decisions. How can you act upon information if you don’t trust it? Establishing trust in big data presents a huge challenge as the variety and number of sources grows.

+ Discussion What do you think? Opinions of all this?