CISB594 – Business Intelligence Data Mining. CISB594 – Business Intelligence Reference Materials used in this presentation are extracted mainly from the.

Slides:



Advertisements
Similar presentations
Data Warehousing and Data Mining J. G. Zheng May 20 th 2008 MIS Chapter 3.
Advertisements

Web Mining.
Chapter 1 Business Driven Technology
5.1 © 2007 by Prentice Hall 5 Chapter Foundations of Business Intelligence: Databases and Information Management.
Back to Table of Contents
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Copyright 2007 John Wiley & Sons, Inc. Chapter 41 Data and Knowledge Management.
DATA, TEXT, AND WEB MINING
Decision Support Chapter 10. Overview Databases are really information technology Decision Support is a business application that actually uses databases.
Data Mining Knowledge Discovery in Databases Data 31.
Data Mining By Archana Ketkar.
CISB594 – Business Intelligence
Data Mining – Intro.
1 Data and Knowledge Management. 2 Data Management: A Critical Success Factor The difficulties and the process Data sources and collection Data quality.
Business Intelligence Business intelligence (BI) refers to all of the applications and technologies used to, provide access to, and information to efforts.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Business Intelligence
Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305.
Enterprise systems infrastructure and architecture DT211 4
Chapter 4 Data, Text, and Web Mining
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
1.Understand the decision-making process of consumer purchasing online. 2.Describe how companies are building one-to-one relationships with customers.
Data Mining By Jason Baltazar, Phil Cademas, Jillian Latham, Rachel Peeler & Kamila Singh.
Chapter 5: Data Mining for Business Intelligence
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
1 Data Mining DT211 4 Refer to Connolly and Begg 4ed.
Enabling Organization-Decision Making
Chapter 4 Data, Text, and Web Mining
CISB594 – Business Intelligence Data Mining Part I.
Chapter 7 DATA, TEXT, AND WEB MINING Pages , 311, Sections 7.3, 7.5, 7.6.
3 Objects (Views Synonyms Sequences) 4 PL/SQL blocks 5 Procedures Triggers 6 Enhanced SQL programming 7 SQL &.NET applications 8 OEM DB structure 9 DB.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
1 1 Slide Introduction to Data Mining and Business Intelligence.
@ ?!.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Business Plug-In B18 Business Intelligence.
4. Secondary Data.
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Chapter 1 Business Driven Technology MANGT 366 Information Technology for Business Chapter 1: Management Information Systems: Business Driven MIS.
BUSINESS DRIVEN TECHNOLOGY
Data Mining By Dave Maung.
CISB594 – Business Intelligence Data Mining. CISB594 – Business Intelligence Reference Materials used in this presentation are extracted mainly from the.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
CRM - Data mining Perspective. Predicting Who will Buy Here are five primary issues that organizations need to address to satisfy demanding consumers:
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
CISB113 Fundamentals of Information Systems Data Management.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
CHAPTER 4 Data Warehousing, Access, Analysis, Mining, and Visualization 2 1.
Chapter 14 Data Mining Transparencies. 2 Chapter Objectives u The concepts associated with data mining. u The main features of data mining operations,
CISB594 – Business Intelligence Data Mining. CISB594 – Business Intelligence Reference Materials used in this presentation are extracted mainly from the.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Academic Year 2014 Spring Academic Year 2014 Spring.
Chapter 2 Data, Text, and Web Mining. Data Mining Concepts and Applications  Data mining (DM) A process that uses statistical, mathematical, artificial.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
1 Ahmed K. Ezzat, Data, Text, and Web Mining for BI Data Mining and Big Data.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
01-Business intelligence
PRIMARY DATA vs SECONDARY DATA RESEARCH Lesson 23 June 2016
MIS2502: Data Analytics Advanced Analytics - Introduction
MIS5101: Data Analytics Advanced Analytics - Introduction
TEXTAND WEB MINING.
TEXT and WEB MINING.
Presentation transcript:

CISB594 – Business Intelligence Data Mining

CISB594 – Business Intelligence Reference Materials used in this presentation are extracted mainly from the following texts, unless stated otherwise.

CISB594 – Business Intelligence Objectives At the end of this lecture, you should be able to: Describe data mining, its characteristics and objectives in business Identify and explain the common algorithms used in data mining Discuss the use of data mining in different types of business Discuss the importance of data mining in understanding customers’ behaviours Discuss text and web mining CISB594 – Business Intelligence

What is Data Mining A process that uses statistical, mathematical, artificial intelligence and machine learning techniques to extract and identify useful information and subsequent knowledge from large database Uses sophisticated data manipulation technology Identifies useful information Deals with large databases Data Mining

CISB594 – Business Intelligence Data Mining Concepts and Applications Where is Data Mining in Business Intelligence?

CISB594 – Business Intelligence Users today will want to perform statistical and mathematical analysis such as hypothesis testing, prediction and customer scoring models A major step in managerial decision making is forecasting or estimating the results of different alternative courses of actions Such investigation cannot be done with basic OLAP and will require special tools – advanced business analytics – data mining Why do we need Data Mining

CISB594 – Business Intelligence Data are often buried deep within very large databases, which sometimes contain data from several years The data mining environment is usually client/server architecture or Web-based architecture Sophisticated tools are used to clean and synchronize data in order to get the best result Miners are the end users who are empowered with sophisticated tools to ask ad-hoc questions – they need not be technically equipped Miners may find an unexpected result during data mining activities and this will require creative thinking on the users’ decision making Major Characteristics of Data Mining

CISB594 – Business Intelligence Data Mining algorithms Fall into four broad categories: 1.Classification – Also known as supervised induction – Most common of all data mining activities – Used to analyse the historical data stored in the database and to automatically generate a model that can predict future behaviour – Application example : target marketing, quality assurance

CISB594 – Business Intelligence Data Mining algorithms Fall into four broad categories: 2. Clustering – Partitioning a database into segments in which the members of a segment share similar qualitiues – Before the results of clustering techniques are used, it might be necessary for an expert to interpret, modify the information – Clustering technique includes optimization, the goal is to create groups so that members within each group have maximum similarity and the members across groups have minimum similarity – Application example : Market segmentation

CISB594 – Business Intelligence Data Mining algorithms Fall into four broad categories: 3. Association – Establishes relationship about items that occur together in a given record – Determining associations among items that sell together – Often called market basket analysis as the primary applications is the analysis of sales transactions – Application example : Market basket analysis

CISB594 – Business Intelligence Data Mining algorithms Fall into four broad categories: 4. Sequence discovery – The identification of association over time – Some sequence discovery techniques keep track of elapsed time between associated events and the frequency of occurrences – Application example : Market basket analysis over time, customer life cycle analysis

CISB594 – Business Intelligence Types of data mining Two types – Hypothesis-driven data mining Begins with a proposition by the user, who then seeks to validate the truthfulness of the proposition – Discovery-driven data mining Finds patterns, associations, and relationships among the data in order to uncover facts that were previously unknown or not even contemplated by an organization

CISB594 – Business Intelligence Use in business Where data mining is beneficial (the intent in most of these examples is to identify a business opportunity and create a sustainable competitive advantage) BusinessUse MarketingPredicting which customers will respond to Internet banners, or buy a particular product and segmenting customer demographics BankingForecasting levels of bad loans, fraud in credit card usage, credit card spending pattern, new loans Retailing and sales Predicting sales, determining correct inventory levels and distribution schedules Manufacturing and production Predicting when to expect machinery failures, for what resaons, optimizing manufacturing capacity

CISB594 – Business Intelligence Use in business Where data mining is beneficial (the intent in most of these examples is to identify a business opportunity and create a sustainable competitive advantage) BusinessUse Government and defense Forecasting threats to national security, predicting resources consumptions HealthMapping demographics data to critical illnesses, identifying patterns and best approach for treatments AirlinesPredicting sales, determining popular routes, capture lost business, add routes and destinations BroadcastingPredicting what programs are best shown during prime time and where to slot in advertisements

CISB594 – Business Intelligence Data Mining in retail The process of data mining in retail has three different aspects: 1.Web analytics – Gather web statistics that track customer’s online behaviour ; hit, pages, sales, volume, and so on. This helps in adjusting a web site to meet customer needs. 2.Customer analytics – web sites interaction, transaction data from offline purchases, and demographic data. This is critical in CRM and revenue management because a better understanding allows an organizationto cluster customers into groupings 3.Optimization – Patterns can be detected and used to optimize customer interactions. For example in recommending relevant styles and complementary purchases/products to suit customer behaviour

CISB594 – Business Intelligence Text Mining Application of data mining to nonstructured or less structured text files. It entails the generation of meaningful numerical indices from the unstructured text and then processing these indices using various data mining algorithms Data MiningText Mining Takes advantage of the infrastructure of stored data to extract additional useful information. E.g. Applying data mining to customer database, we may discover that everyone who buys product A will also buy products B and C six months later Operates with text documents - less structured information.. E.g. Visualising relationships between documents such as policies, memos, s, minutes of meeting etc. Organizations recognized this as one of the major sources for competitive advantage.

CISB594 – Business Intelligence Text Mining Example incident reports to increase the quality of service Airline industry uses text mining software to focus on key problem areas through pattern identification by accessing incident reports to increase the quality of service. – The most frequently occurring terms are identified in the incident report – Cluster/group the terms e.g the term spillage and associate with other key terms such as coffee, tea, soup, drink – Can identify incidents that might lead to trouble and help management curb the issue

CISB594 – Business Intelligence Text Mining How to mine text 1.Eliminate commonly used words (e.g. the, and, other). These are known as stop-words. 2.Replace words with their stems or roots (e.g. eliminate plurals and various conjugations). The terms phoned, phoning, and phones would be mapped to phone. 3.Consider synonyms and phrases. Synonyms need to be combined, e.g students and pupil need to be grouped together.

CISB594 – Business Intelligence Text Mining How to mine text 4. Calculate the weights of the remaining terms, looking at the frequency with which the words appear 2 common measures are used for this, term frequency factor (the actual number of times the word appears in a document) and inverse document frequency (the number of times the word appears in all document in a set) – If tf factor is large, weight increase, If idf factor is large, weight decrease – Reason: idf indicates that the terms would be a common words to the industry.

CISB594 – Business Intelligence Web Mining The discovery through the analysis of interesting and useful information from the web, about the web and usually using a web based tool.

CISB594 – Business Intelligence Types of Web Mining 1.Web content mining 1.Web content mining - extraction of useful information from Webpages. May be used to enhance search results produced by search engines 2.Web structure mining 2.Web structure mining – generating information from the links included in WebPages. Can be used to structure the display of the page. Can also identify the members of specific communities and and their roles 3.Web usage mining 3.Web usage mining – generated through web page visits, transactions and web server logs – useful for CRM, understanding user behaviour

Web Mining

CISB594 – Business Intelligence Now ask if … You are now be able to: Describe data mining, its characteristics and objectives in business Identify and explain the common algorithms used in data mining Discuss the use of data mining in different types of business Discuss the importance of data mining in understanding customers’ behaviours Discuss text and web mining CISB594 – Business Intelligence