Presentation is loading. Please wait.

Presentation is loading. Please wait.

Social Computing and Big Data Analytics 社群運算與大數據分析

Similar presentations


Presentation on theme: "Social Computing and Big Data Analytics 社群運算與大數據分析"— Presentation transcript:

1 Social Computing and Big Data Analytics 社群運算與大數據分析
Tamkang University Social Computing and Big Data Analytics 社群運算與大數據分析 Tamkang University Course Orientation for Social Computing and Big Data Analytics (社群運算與大數據分析課程介紹) 1042SCBDA01 MIS MBA (M2226) (8628) Wed, 8,9, (15:10-17:00) (Q201) Min-Yuh Day 戴敏育 Assistant Professor 專任助理教授 Dept. of Information Management, Tamkang University 淡江大學 資訊管理學系 tku.edu.tw/myday/

2 Social Computing and Big Data Analytics (社群運算 與 大數據分析)

3 淡江大學104學年度第2學期 課程教學計畫表 Spring 2016 (2016.02 - 2016.06)
課程名稱:社群運算與大數據分析 (Social Computing and Big Data Analytics) 授課教師:戴敏育 (Min-Yuh Day) 開課系級:資管所碩士班(TLMXM1A) 開課資料:選修 單學期 2 學分 (2 Credits, Elective) 上課時間:週三 8,9 (Wed 15:10-17:00) 上課教室: Q201 (傳播館)

4 課程簡介 本課程介紹社群運算與大數據分析的基本概念及研究議題。 課程內容包括 資料科學與大數據分析:探索、分析、視覺化與呈現資料
大數據基礎:MapReduce典範、Hadoop與Spark生態系統 大數據處理平台SMACK: Spark, Mesos, Akka, Cassandra and Kafka Python Pandas財務大數據分析 文字探勘分析技術與自然語言處理 社群媒體行銷分析 深度學習 (Deep Learning) 社群媒體情感分析 Google TensorFlow 深度學習 (Deep Learning with Google TensorFlow) 社會網絡分析、量測、工具

5 Course Introduction This course introduces the fundamental concepts and research issues of social computing and big data analytics. Topics include Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data Fundamental Big Data: MapReduce Paradigm, Hadoop and Spark Ecosystem Big Data Processing Platforms with SMACK: Spark, Mesos, Akka, Cassandra and Kafka Big Data Analytics with Numpy in Python Finance Big Data Analytics with Pandas in Python Text Mining Techniques and Natural Language Processing Social Media Marketing Analytics Deep Learning with Theano and Keras in Python Deep Learning with Google TensorFlow Sentiment Analysis on Social Media with Deep Learning Social Network Analysis, Measurements, and Tools

6 課程目標 (Objective) 瞭解及應用社群運算與大數據分析基本概念與研究議題。 (Understand and apply the fundamental concepts and research issues of Social Computing and Big Data Analytics.) 進行社群運算與大數據分析相關之資訊管理研究。 (Conduct information systems research in the context of Social Computing and Big Data Analytics.)

7 課程大綱 (Syllabus) 週次 (Week) 日期 (Date) 內容 (Subject/Topics) /02/17 Course Orientation for Social Computing and Big Data Analytics (社群運算與大數據分析課程介紹) /02/24 Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data (資料科學與大數據分析: 探索、分析、視覺化與呈現資料) /03/02 Fundamental Big Data: MapReduce Paradigm, Hadoop and Spark Ecosystem (大數據基礎:MapReduce典範、 Hadoop與Spark生態系統)

8 課程大綱 (Syllabus) 週次 (Week) 日期 (Date) 內容 (Subject/Topics) /03/09 Big Data Processing Platforms with SMACK: Spark, Mesos, Akka, Cassandra and Kafka (大數據處理平台SMACK: Spark, Mesos, Akka, Cassandra, Kafka) /03/16 Big Data Analytics with Numpy in Python (Python Numpy 大數據分析) /03/23 Finance Big Data Analytics with Pandas in Python (Python Pandas 財務大數據分析) /03/30 Text Mining Techniques and Natural Language Processing (文字探勘分析技術與自然語言處理) /04/06 Off-campus study (教學行政觀摩日)

9 課程大綱 (Syllabus) 週次 (Week) 日期 (Date) 內容 (Subject/Topics) /04/13 Social Media Marketing Analytics (社群媒體行銷分析) /04/20 期中報告 (Midterm Project Report) /04/27 Deep Learning with Theano and Keras in Python (Python Theano 和 Keras 深度學習) /05/04 Deep Learning with Google TensorFlow (Google TensorFlow 深度學習) /05/11 Sentiment Analysis on Social Media with Deep Learning (深度學習社群媒體情感分析)

10 課程大綱 (Syllabus) 週次 (Week) 日期 (Date) 內容 (Subject/Topics) /05/18 Social Network Analysis (社會網絡分析) /05/25 Measurements of Social Network (社會網絡量測) /06/01 Tools of Social Network Analysis (社會網絡分析工具) /06/08 Final Project Presentation I (期末報告 I) /06/15 Final Project Presentation II (期末報告 II)

11 2016/02/24 Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data (資料科學與大數據分析: 探索、分析、 視覺化與呈現資料)

12 2016/03/02 Fundamental Big Data: MapReduce Paradigm, Hadoop and Spark Ecosystem (大數據基礎: MapReduce典範、 Hadoop與Spark生態系統)

13 2016/03/09 Big Data Processing Platforms with SMACK: Spark, Mesos, Akka, Cassandra and Kafka (大數據處理平台SMACK: Spark, Mesos, Akka, Cassandra, Kafka)

14 2016/03/23 Finance Big Data Analytics with Pandas in Python (Python Pandas 財務大數據分析)

15 2016/04/27 Deep Learning with Theano and Keras in Python (Python Theano 和 Keras 深度學習)

16 2016/05/04 Deep Learning with Google TensorFlow (Google TensorFlow 深度學習)

17 教學方法與評量方法 教學方法 講述、討論、賞析、模擬、問題解決 評量方法 實作、報告、上課表現

18 教材課本 教材課本 講義 (Slides) 社群運算與大數據分析相關個案與論文 (Cases and Papers related to Social Computing and Big Data Analytics)

19 參考書籍 EMC Education Services, Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data, Wiley, 2015 Mohammed Guller, Big Data Analytics with Spark: A Practitioner's Guide to Using Spark for Large Scale Data Analysis, Apress, 2015 Nick Pentreath, Machine Learning with Spark - Tackle Big Data with Powerful Spark Machine Learning Algorithms, Packt Publishing, 2015 Wes McKinney, Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython, O'Reilly Media, 2012 Michael Heydt , Mastering Pandas for Finance, Packt Publishing, 2015 Michael Heydt, Learning Pandas - Python Data Discovery and Analysis Made Easy, Packt Publishing, 2015 Yves Hilpisch, Derivatives Analytics with Python: Data Analysis, Models, Simulation, Calibration and Hedging, Wiley, 2015 Yves Hilpisch, Python for Finance: Analyze Big Financial Data, O'Reilly Media, 2014 James Ma Weiming, Mastering Python for Finance, Packt Publishing, 2015 Fabio Nelli, Python Data Analytics: Data Analysis and Science using PANDAs, matplotlib and the Python Programming Language, Apress, 2015

20 作業與學期成績計算方式 作業篇數 學期成績計算方式 3篇 期中評量:30 % 期末評量:30 %
期末評量:30 % 其他(課堂參與及報告討論表現): 40 %

21 Team Term Project Term Project Topics 3-4 人為一組 Big Data Analytics
Social Computing Big Data mining Web and Text mining Business Intelligence 3-4 人為一組 分組名單於 2016/02/24 (三) 課程下課時繳交 由班代統一收集協調分組名單

22 Social Computing and Big Data Analytics (社群運算 與 大數據分析)

23 EMC Education Services, Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data, Wiley, 2015 Source:

24 Mohammed Guller, Big Data Analytics with Spark: A Practitioner's Guide to Using Spark for Large Scale Data Analysis, Apress, 2015 Source:

25 Nick Pentreath, Machine Learning with Spark – Tackle Big Data with Powerful Spark Machine Learning Algorithms, Packt Publishing, 2015 Source:

26 Yves Hilpisch, Python for Finance: Analyze Big Financial Data, O'Reilly, 2014
Source:

27 Michael Heydt , Mastering Pandas for Finance, Packt Publishing, 2015
Source:

28 Business Insights with Social Analytics

29 Analyzing the Social Web: Social Network Analysis

30 Jennifer Golbeck (2013), Analyzing the Social Web, Morgan Kaufmann
Source:

31 Mining the Social Web: Analyzing Data from Facebook, Twitter, LinkedIn, and Other Social Media Sites
Source:

32 Web Mining Success Stories
Amazon.com, Ask.com, Scholastic.com, … Website Optimization Ecosystem Source: Turban et al. (2011), Decision Support and Business Intelligence Systems

33 Big Data Analytics and Data Mining

34 Source: http://www.amazon.com/gp/product/1466568704
Stephan Kudyba (2014), Big Data, Mining, and Analytics: Components of Strategic Decision Making, Auerbach Publications Source:

35 Architecture of Big Data Analytics
Big Data Analytics Applications Big Data Sources Big Data Transformation Big Data Platforms & Tools * Internal * External * Multiple formats * Multiple locations * Multiple applications Middleware Hadoop MapReduce Pig Hive Jaql Zookeeper Hbase Cassandra Oozie Avro Mahout Others Queries Transformed Data Big Data Analytics Raw Data Extract Transform Load Reports Data Warehouse OLAP Traditional Format CSV, Tables Data Mining Source: Stephan Kudyba (2014), Big Data, Mining, and Analytics: Components of Strategic Decision Making, Auerbach Publications

36 Architecture of Big Data Analytics
Big Data Analytics Applications Big Data Sources Big Data Transformation Big Data Platforms & Tools * Internal * External * Multiple formats * Multiple locations * Multiple applications Data Mining Big Data Analytics Applications Middleware Hadoop MapReduce Pig Hive Jaql Zookeeper Hbase Cassandra Oozie Avro Mahout Others Queries Transformed Data Big Data Analytics Raw Data Extract Transform Load Reports Data Warehouse OLAP Traditional Format CSV, Tables Data Mining Source: Stephan Kudyba (2014), Big Data, Mining, and Analytics: Components of Strategic Decision Making, Auerbach Publications

37 Social Big Data Mining (Hiroshi Ishikawa, 2015)
Source:

38 Architecture for Social Big Data Mining (Hiroshi Ishikawa, 2015)
Enabling Technologies Analysts Integrated analysis Model Construction Explanation by Model Integrated analysis model Conceptual Layer Construction and confirmation of individual hypothesis Description and execution of application-specific task Natural Language Processing Information Extraction Anomaly Detection Discovery of relationships among heterogeneous data Large-scale visualization Data Mining Application specific task Multivariate analysis Logical Layer Parallel distrusted processing Social Data Software Hardware Physical Layer Source: Hiroshi Ishikawa (2015), Social Big Data Mining, CRC Press

39 Business Intelligence (BI) Infrastructure
Source: Kenneth C. Laudon & Jane P. Laudon (2014), Management Information Systems: Managing the Digital Firm, Thirteenth Edition, Pearson.

40 Data Warehouse Data Mining and Business Intelligence
Increasing potential to support business decisions End User Decision Making Data Presentation Business Analyst Visualization Techniques Data Mining Data Analyst Information Discovery Data Exploration Statistical Summary, Querying, and Reporting Data Preprocessing/Integration, Data Warehouses DBA Data Sources Paper, Files, Web documents, Scientific experiments, Database Systems Source: Jiawei Han and Micheline Kamber (2006), Data Mining: Concepts and Techniques, Second Edition, Elsevier

41 The Evolution of BI Capabilities
Source: Turban et al. (2011), Decision Support and Business Intelligence Systems

42 Source: http://www. amazon

43 Deep Learning Intelligence from Big Data
Source:

44 Source: http://www. amazon

45 Source: http://www. amazon

46 Source: https://www. thalesgroup

47 Big Data with Hadoop Architecture
Source:

48 Big Data with Hadoop Architecture Logical Architecture Processing: MapReduce
Source:

49 Big Data with Hadoop Architecture Logical Architecture Storage: HDFS
Source:

50 Big Data with Hadoop Architecture Process Flow
Source:

51 Big Data with Hadoop Architecture Hadoop Cluster
Source:

52 Traditional ETL Architecture
Source:

53 Offload ETL with Hadoop (Big Data Architecture)
Source:

54 Source: http://www.newera-technologies.com/big-data-solution.html
Big Data Solution Source:

55 HDP A Complete Enterprise Hadoop Data Platform
Source:

56 Spark and Hadoop Source:

57 Spark Ecosystem Source:

58 Python for Big Data Analytics
(The column on the left is the 2015 ranking; the column on the right is the 2014 ranking for comparison 2015 2014 Source:

59 Source: http://www. kdnuggets

60 Business Intelligence Trends
Agile Information Management (IM) Cloud Business Intelligence (BI) Mobile Business Intelligence (BI) Analytics Big Data Source:

61 Business Intelligence Trends: Computing and Service
Cloud Computing and Service Mobile Computing and Service Social Computing and Service

62 Business Intelligence and Analytics
Business Intelligence 2.0 (BI 2.0) Web Intelligence Web Analytics Web 2.0 Social Networking and Microblogging sites Data Trends Big Data Platform Technology Trends Cloud computing platform Source: Lim, E. P., Chen, H., & Chen, G. (2013). Business Intelligence and Analytics: Research Directions.  ACM Transactions on Management Information Systems (TMIS), 3(4), 17

63 Business Intelligence and Analytics: Research Directions
1. Big Data Analytics Data analytics using Hadoop / MapReduce framework 2. Text Analytics From Information Extraction to Question Answering From Sentiment Analysis to Opinion Mining 3. Network Analysis Link mining Community Detection Social Recommendation Source: Lim, E. P., Chen, H., & Chen, G. (2013). Business Intelligence and Analytics: Research Directions.  ACM Transactions on Management Information Systems (TMIS), 3(4), 17

64 Source: Davenport, T. H. , & Patil, D. J. (2012). Data Scientist
Source: Davenport, T. H., & Patil, D. J. (2012). Data Scientist. Harvard business review

65 SAS第五屆大數據資料科學家競賽 文字分析與數位行銷大賽

66 SAS第五屆大數據資料科學家競賽 文字分析與數位行銷大賽

67 Summary This course introduces the fundamental concepts and research issues of social computing and big data analytics. Topics include Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data Fundamental Big Data: MapReduce Paradigm, Hadoop and Spark Ecosystem Big Data Processing Platforms with SMACK: Spark, Mesos, Akka, Cassandra and Kafka Big Data Analytics with Numpy in Python Finance Big Data Analytics with Pandas in Python Text Mining Techniques and Natural Language Processing Social Media Marketing Analytics Deep Learning with Theano and Keras in Python Deep Learning with Google TensorFlow Sentiment Analysis on Social Media with Deep Learning Social Network Analysis, Measurements, and Tools

68 Contact Information 戴敏育 博士 (Min-Yuh Day, Ph.D.) 專任助理教授 淡江大學 資訊管理學系 電話: #2846 傳真: 研究室:B929 地址: 新北市淡水區英專路151號 : 網址:


Download ppt "Social Computing and Big Data Analytics 社群運算與大數據分析"

Similar presentations


Ads by Google