1 Causal Analytics with Social Media Content Lipika Dey Innovation Labs, Delhi.

Slides:



Advertisements
Similar presentations
Taxonomy & Ontology Impact on Search Infrastructure John R. McGrath Sr. Director, Fast Search & Transfer.
Advertisements

Copyright © 2012, SAS Institute Inc. All rights reserved. INTRODUCTION TO DATA AND TEXT MINING ANDREW PEASE, 8 MARCH 2013.
Components of a Product Vision/Strategy
1.Accuracy of Agree/Disagree relation classification. 2.Accuracy of user opinion prediction. 1.Task extraction performance on Bing web search log with.
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Extract from various presentations: Bing Liu, Aditya Joshi, Aster Data … Sentiment Analysis January 2012.
SAS solutions SAS ottawa platform user society nov 20th 2014.
IBM SPSS MODELER Ercan Kaynakca Data Analyst Analysis Express.
CloudSocial Mobility Big data Social connections, mobility, cloud delivery and pervasive information are converging in a powerful way. This convergence.
Advance Analytics Capabilities
Enabling the Social Web Krishna P. Gummadi Networked Systems Group Max Planck Institute for Software Systems.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Data Mining Jessica Jackson Kimberli Klein Kevin Wood.
Data Mining By Archana Ketkar.
Supporting Decision Making Chapter 10 McGraw-Hill/IrwinCopyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved.
02 -1 Lecture 02 Agent Technology Topics –Introduction –Agent Reasoning –Agent Learning –Ontology Engineering –User Modeling –Mobile Agents –Multi-Agent.
Data Mining – Intro.
Advanced Analytics The next wave of Business Intelligence Shankar Radhakrishnan Senior Solutions Architect HCL Technologies.
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
WHT/ HPCC Systems Flavio Villanustre VP, Products and Infrastructure HPCC Systems Risk Solutions.
Forecasting with Twitter data Presented by : Thusitha Chandrapala MARTA ARIAS, ARGIMIRO ARRATIA, and RAMON XURIGUERA.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
Governance, Risk, and Compliance Bill Greene Senior Industry Director.
LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
Copyright R. Weber INFO 629 Concepts in Artificial Intelligence Fall 2004 Professor: Dr. Rosina Weber.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Data Mining Chun-Hung Chou
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
CIS 9002 Kannan Mohan Department of CIS Zicklin School of Business, Baruch College.
Chapter 1 Introduction to Data Mining
Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved. Decision Support Systems Chapter 10.
1 Controversial Issues  Data mining (or simple analysis) on people may come with a profile that would raise controversial issues of  Discrimination 
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Introduction – Addressing Business Challenges Microsoft® Business Intelligence Solutions.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Copyright © 2012, SAS Institute Inc. All rights reserved. ANALYTICS IN BIG DATA ERA ANALYTICS TECHNOLOGY AND ARCHITECTURE TO MANAGE VELOCITY AND VARIETY,
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Computing Ontology Part II. So far, We have seen the history of the ACM computing classification system – What have you observed? – What topics from CS2013.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Business Intelligence - 2 BUS 782. Topics Data warehousing Data Mining.
Pertemuan 16 Materi : Buku Wajib & Sumber Materi :
Information Design Trends Unit Five: Delivery Channels Lecture 2: Portals and Personalization Part 2.
Academic Year 2014 Spring Academic Year 2014 Spring.
Realtime Financial Monitoring and Analysis System May 2010 Lietu Search Engine.
Data Science Interview Questions 1.What do you mean by word Data Science? Data Science is the extraction of knowledge from large.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
MICROSOFT ENTERPRISE SOLUTIONS FOR LEADERS.  Business intelligence and Enterprise Resource Planning With powerful business intelligence and ERP solutions,
Early Warning and Real Time Anomaly Detection System © 2015 Blueocean Market Intelligence1.
Introduction BIM Data Mining.
Data Mining – Intro.
Decision Support Systems
Data Mining Generally, (Sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it.
Siemens Enables Digitalization: Data Analytics & Artificial Intelligence Dr. Mike Roshchin, CT RDA BAM.
Big-Data Fundamentals
Introduction C.Eng 714 Spring 2010.
Future Technologies FTC 2016 Future Technologies Conference December 2016 San Francisco, United States.
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Kbv Research | +1 (646) | Global Text Analytics Market Knowledge Based Value (KBV) Research Full report -
Data Mining Modified from
Data Warehousing and Data Mining
Supporting Decision Making
Course Summary ChengXiang “Cheng” Zhai Department of Computer Science
AGMLAB Information Technologies
Data Warehousing Data Mining Privacy
Presentation transcript:

1 Causal Analytics with Social Media Content Lipika Dey Innovation Labs, Delhi

2 2 Agenda Introduction to Innovation Labs Social media content for business analytics Behavioral analysis Future Directions 2

3 3 TCS R&D – Innovation Labs DELHI MUMBAI PUNE HYDERABAD CHENNAI BANGALORE  ~ 600 people; 60 PhDs. KOLKATA Data Analytics - Text Mining, Enterprise Information Fusion, Big Data Management Software Architecture Graphics & Virtual Reality Multimedia Applications & Computer Vision Natural Language Processing Data Security (PKI, ECDSA) Life Sciences (Bioinformatics) Analytics & Data Mining Large Scale Systems Software Engineering Tools Speech Technology Performance Engineering Embedded Systems (VLSI) Green IT (Power Management) Wireless (WiMAX, 4G, RFID) Signal Processing 3

4 Social media based analytics Content Analysis Sentiment Analysis Causal Analytics Social Network Analysis

5 Social-media Intelligence Social Media Issues and Opportunities Events Context to interpret Business Data Impact on business

6 The Retail Story Shampoo sales are going down Market basket Analysis – Shampoo sells with milk and bread Survey conducted – are you satisfied with quality / price / availability of milk and bread More positive than negative Milk and bread sell as quick-refill – sales showed decline ? ? ?

7 Seek answer in social-media

8 Text Mining Causal Analysis Customer Pain-points and Delights Events Trends Frequent patterns Consumer-generated Text Action Key drivers for satisfaction / dissatisfaction / problems Segment-wise reports Business Analysis Prioritization of issues Business Case Recommendation / Prediction Business Knowledge Business units Business Goals Business Knowledge Business units Business Goals Domain knowledge Knowledge Acquisition Feedback / Impact Goal-driven text analytics Knowledge-driven Analytics Mapping issues to goals Measuring Performance Indicators Expert

9 Text Analytics Process Mining Text elements - terms extracted from Consumer –generated Unstructured Text Mapping Text components to Business Processes GatherMine MapInterpret Improve Value from Text Analytics Analysis KPIs to measure process performance Expect Improvement in performance Involvement of business expert Cleaning and pre-processing Natural Language Processing Statistical Text Processing Cleaning and pre-processing Natural Language Processing Statistical Text Processing Domain Ontology Semi-supervised Fuzzy Clustering Content Classification Domain Ontology Semi-supervised Fuzzy Clustering Content Classification Knowledge-base

10 Reports for the Retail Store Discovering New issues Relevance / representativeness Anomaly detection Novelty of issues Divergence of issues across stores/regions/products/categories Learn from correlations Discovering New issues Relevance / representativeness Anomaly detection Novelty of issues Divergence of issues across stores/regions/products/categories Learn from correlations

11 Content Discovery and Analytics  Semi-supervised fuzzy Clustering –Seeded clustering –Learn domain terms  Topic Discovery (Latent Dirichlet Allocation) –Topic novelty –Topic spread –Topic affinity –Topic relevance  Event detection –Entity and action-oriented (Conditional Random Fields) –Event linking – story building

12 Topic discovery – iPhone related tweets Features TV show Gifts Comparison

13 Influence-driven analysis

14 Learning to identify relevant content Event detection Entity Detection Address resolution Raise alarm Throw alerts Event detection Entity Detection Address resolution Raise alarm Throw alerts WSDM

15 Learning cause-effect relationships Reinforcement learning framework to learn critical events

16 Topic spread – Topic evolution – Topic affinity Event history – India Olympics IJCAI 2011 – From News From Twitter -To be presented at WI

17 Behavioral Analysis on Social Networks Problem Clustering and characterizing users based on their activity patterns Volume, Regularity, Consistency Use Provides insights about different categories of users (i). Trade Promoters (ii). News Agencies (iii). Analysts (iv). Regular users (v). Spammers Predict actions and information flow Methodology A wavelet-based clustering mechanism that groups users according to their temporal activity profiles (To be presented at ICPR 2012, Japan) Irregular, Inconsistent Extreme Irregular, Inconsistent Extreme Regular, Consistent, Low volume Regular, Consistent, Medium volume Periodic, Consistent, Low volume Less regular, consistent, Low volume Regular, consistent, High volume Irregular, inconsistent, Low volume

18 Interested in  Content-based analytics –Supply-chain models revisited –Demand forecasting  Identity resolution across social-media –Social CRM  Fraud detection –Spammers –Fake identities  Information diffusion –Across regions –Interestingness –Effect  Intent mining

19