Data Mining, Data Science, Big Data

Data Science
- Data Science aims to extract insights from large data
- Less emphasis on algorithms
- More emphasis on ‘outreach’
- The term Data Science is about 10 years old and very popular nowadays
- Many people reinvent themselves as Data Scientists: data miners, statisticians, BI people, analysts, database developers

Data Mining & Data Science
- Data Mining
- Statistics
- Computational methods
- Dealing with large data
- Visualisation
- Involving domain knowledge
- Interpretable and interpreted results

Big Data
- Because you can… (cheap storage)
- Administrative/financial reasons
- Internet and social computing
- Internet of Things, ubiquitous computing
[Figure: cost per gigabyte in dollars, 1980 to 2010; vertical axis from $0.01 to $1,000,000]

Cheap Storage
- 350 million photos uploaded to Facebook per day
- Almost 20 additional racks per day required
- 1956: IBM 350, 5 MB
- 90 TB
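A rough back-of-the-envelope check of these figures can be sketched in Python. The slide does not say what the 90 TB refers to; the sketch assumes it is the capacity of one rack, and it assumes decimal units (1 TB = 10^12 bytes). Both are assumptions, not statements from the slides.

```python
# Figures taken from the slide.
PHOTOS_PER_DAY = 350_000_000
RACKS_PER_DAY = 20            # "almost 20", rounded up
IBM_350_BYTES = 5 * 10**6     # 5 MB (1956)

# ASSUMPTION: 90 TB is the capacity of a single rack (not stated on the slide).
RACK_BYTES = 90 * 10**12

# Implied storage added per day and average storage per photo.
bytes_per_day = RACKS_PER_DAY * RACK_BYTES
mb_per_photo = bytes_per_day / PHOTOS_PER_DAY / 10**6

# How many IBM 350 units would hold as much as one such rack.
ibm_350_per_rack = RACK_BYTES // IBM_350_BYTES

print(f"Implied storage per photo: {mb_per_photo:.2f} MB")
print(f"IBM 350 units per rack: {ibm_350_per_rack:,}")
```

Under these assumptions the numbers are mutually consistent: about 5 MB per photo, which is a plausible size once copies and multiple resolutions are included.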

Big Data
- Many facets; people often focus on only one
- Very, very large data: CERN, Google, Facebook, Twitter, …
- Analytics
- Internet-generated
- Social data
- Heterogeneous, unstructured data
- Large-scale technologies: MapReduce, Hadoop

Size-complexity trade-off
- Technological restrictions produce a trade-off
- Many Big Data projects are algorithmically not so complex
- Embarrassingly parallel
[Figure: complexity plotted against size, with CERN marked]
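To illustrate what "embarrassingly parallel" means, here is a minimal Python sketch of a word count in the map/reduce style. It is an illustration only, not the method used by any of the systems named in the slides; the function names and the sample input are hypothetical. The point is that each chunk is processed without any communication with the other chunks, so the work scales by simply adding workers.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def count_words(chunk):
    """Map step: count words in one chunk, independently of all other chunks."""
    return Counter(word for line in chunk for word in line.lower().split())


def merge_counts(left, right):
    """Reduce step: combine two partial counts into one."""
    return left + right


def parallel_word_count(lines, n_chunks=4):
    # Split the input into independent chunks (some may be empty).
    chunks = [lines[i::n_chunks] for i in range(n_chunks)]
    # Threads keep this sketch portable; a large-scale system would
    # spread the chunks over separate processes or machines instead.
    with ThreadPoolExecutor(max_workers=n_chunks) as pool:
        partial_counts = list(pool.map(count_words, chunks))
    return reduce(merge_counts, partial_counts, Counter())


# Hypothetical sample input.
lines = ["big data is big", "data science uses data", "big science"]
totals = parallel_word_count(lines)
print(totals.most_common(3))
```

The algorithm itself is simple; the difficulty in such projects lies in the size of the data, which is the trade-off the slide describes.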