Multivariate Data Analysis Chapter 9 - Cluster Analysis

Slides:



Advertisements
Similar presentations
Discrimination amongst k populations. We want to determine if an observation vector comes from one of the k populations For this purpose we need to partition.
Advertisements

Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
N. Kumar, Asst. Professor of Marketing Database Marketing Cluster Analysis.
Chapter 12: Cluster analysis and segmentation of customers
Metrics, Algorithms & Follow-ups Profile Similarity Measures Cluster combination procedures Hierarchical vs. Non-hierarchical Clustering Statistical follow-up.
AEB 37 / AE 802 Marketing Research Methods Week 7
Cluster Analysis.
Statistics for Marketing & Consumer Research Copyright © Mario Mazzocchi 1 Cluster Analysis (from Chapter 12)
Cluster analysis. Partition Methods Divide data into disjoint clusters Hierarchical Methods Build a hierarchy of the observations and deduce the clusters.
Multivariate Data Analysis Chapter 11 - Structural Equation Modeling.
Multivariate Data Analysis Chapter 10 - Multidimensional Scaling
Multivariate Data Analysis Chapter 4 – Multiple Regression.
19-1 Chapter Nineteen MULTIVARIATE ANALYSIS: An Overview.
Multivariate Data Analysis Chapter 7 - Conjoint Analysis
What Is Multivariate Analysis of Variance (MANOVA)?
CLUSTERING (Segmentation)
Clustering Ram Akella Lecture 6 February 23, & 280I University of California Berkeley Silicon Valley Center/SC.
Dr. Michael R. Hyman Cluster Analysis. 2 Introduction Also called classification analysis and numerical taxonomy Goal: assign objects to groups so that.
Clustering analysis workshop Clustering analysis workshop CITM, Lab 3 18, Oct 2014 Facilitator: Hosam Al-Samarraie, PhD.
Segmentation Analysis
Cluster Analysis Chapter 12.
Multiple Discriminant Analysis and Logistic Regression.
Multivariate Statistical Data Analysis with Its Applications
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multivariate Data Analysis Chapter 8 - Canonical Correlation Analysis.
CLUSTER ANALYSIS.
Data Cleansing for Predictive Models: The Next Level Roosevelt C. Mosley, Jr., FCAS, MAAA CAS Ratemaking & Product Management Seminar Philadelphia, PA.
Chapter 12 – Discriminant Analysis © Galit Shmueli and Peter Bruce 2010 Data Mining for Business Intelligence Shmueli, Patel & Bruce.
© 2007 Prentice Hall20-1 Chapter Twenty Cluster Analysis.
Cluster analysis 포항공과대학교 산업공학과 확률통계연구실 이 재 현. POSTECH IE PASTACLUSTER ANALYSIS Definition Cluster analysis is a technigue used for combining observations.
Chapter XX Cluster Analysis. Chapter Outline Chapter Outline 1) Overview 2) Basic Concept 3) Statistics Associated with Cluster Analysis 4) Conducting.
Multivariate Data Analysis Chapter 5 – Discrimination Analysis and Logistic Regression.
1 Cluster Analysis Objectives ADDRESS HETEROGENEITY Combine observations into groups or clusters such that groups formed are homogeneous (similar) within.
Cluster Analysis Cluster Analysis Cluster analysis is a class of techniques used to classify objects or cases into relatively homogeneous groups.
Multiple Discriminant Analysis
1 Hair, Babin, Money & Samouel, Essentials of Business Research, Wiley, Learning Objectives: 1.Explain the difference between dependence and interdependence.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition Instructor’s Presentation Slides 1.
Multivariate Data Analysis Chapter 1 - Introduction.
CLUSTERING AND SEGMENTATION MIS2502 Data Analytics Adapted from Tan, Steinbach, and Kumar (2004). Introduction to Data Mining.
Data Science and Big Data Analytics Chap 4: Advanced Analytical Theory and Methods: Clustering Charles Tappert Seidenberg School of CSIS, Pace University.
Basic Managerial Accounting Concepts Cost Behavior: Fixed Costs Fixed vs. Variable Relevant range: normal production levels Behavior of fixed costs as.
Multivariate Data Analysis Chapter 2 – Examining Your Data
Copyright © 2010 Pearson Education, Inc., publishing as Prentice-Hall. 9-1 Chapter 9 Cluster Analysis.
Applied Multivariate Statistics Cluster Analysis Fall 2015 Week 9.
Multivariate Data Analysis Chapter 3 – Factor Analysis.
Factor & Cluster Analyses. Factor Analysis Goals Data Process Results.
Copyright © 2010 Pearson Education, Inc Chapter Twenty Cluster Analysis.
Clustering / Scaling. Cluster Analysis Objective: – Partitions observations into meaningful groups with individuals in a group being more “similar” to.
1 Cluster Analysis Prepared by : Prof Neha Yadav.
Multivariate statistical methods Cluster analysis.
CLUSTER ANALYSIS. What is Cluster analysis? Cluster analysis is a techniques for grouping objects, cases, entities on the basis of multiple variables.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Chapter_20 Cluster Analysis Naresh K. Malhotra
CLUSTER ANALYSIS. Cluster Analysis  Cluster analysis is a major technique for classifying a ‘mountain’ of information into manageable meaningful piles.
Unsupervised Learning
Multivariate statistical methods
Chapter 12 – Discriminant Analysis
Clustering based on book chapter Cluster Analysis in Multivariate Analysis by Hair, Anderson, Tatham, and Black.
Lecturing 12 Cluster Analysis
Multiple Discriminant Analysis and Logistic Regression
Clustering and Multidimensional Scaling
Data Mining 資料探勘 分群分析 (Cluster Analysis) Min-Yuh Day 戴敏育
Data Mining – Chapter 4 Cluster Analysis Part 2
Chapter_20 Cluster Analysis
Cluster Analysis.
MANOVA Control of experimentwise error rate (problem of multiple tests). Detection of multivariate vs. univariate differences among groups (multivariate.
Cluster analysis Presented by Dr.Chayada Bhadrakom
Unsupervised Learning
Presentation transcript:

Multivariate Data Analysis Chapter 9 - Cluster Analysis Section 3: Independence Techniques

Chapter 9 What Is Cluster Analysis (Q analysis)? Define groups of homogeneous objects (i.e., individuals, firms, products, or behaviors) Maximize the homogeneity of objects within the clusters while also maximize the heterogeneity between clusters Segmentation and target marketing Compare with Factor Analysis How Does Cluster Analysis Work? Measuring Similarity (Euclidean distance) Forming Clusters (hierarchical procedure vs. agglomerative method) Determining the Number of Clusters in the Final Solution (entropy group)

Cluster Analysis Decision Process Stage One: Objectives of Cluster Analysis Taxonomy description Data simplification Relationship identification Selection of Clustering Variables Characterize the objects being clustered Relate specifically to the objectives of the cluster analysis

Cluster Analysis Decision Process (Cont.) Stage 2: Research Design in Cluster Analysis Detecting Outliers Similarity Measures (Interobject similarity) Correlational Measures Distance Measures Comparison to Correlational Measures Types of Distance Measures (Euclidean distance) Impact of Unstandardized Data Values (Mahalonobis Distance, D2) Association Measures Standardizing the Data Standardizing By Variables (normalized distance function) Standardizing By Observation (within-case vs. row-centering standarlization)

Cluster Analysis Decision Process (Cont.) Stage 3: Assumptions in Cluster Analysis Representativeness of the Sample Impact of Multicollinearity

Cluster Analysis Decision Process (Cont.) Stage 4: Deriving Clusters and Assessing Overall Fit Clustering Algorithms Hierarchical Cluster Procedures Single Linkage Complete Linkage Average Linkage Ward's Method Centroid Method Nonhierarchical Clustering Procedures Sequential Threshold Parallel Threshold Optimization Selecting Seed Points Should Hierarchical or Nonhierarchical Methods Be Used? Pros and Cons of Hierarchical Methods Emergence of Nonhierarchical Methods A Combination of Both Methods How Many Clusters Should Be Formed? Should the Cluster Analysis Be Respecified

Cluster Analysis Decision Process (Cont.) Stage 5: Interpretation of the Clusters Stage 6: Validation and Profiling of the Clusters Validating the Cluster Solution Criterion or predictive validity Profiling the Cluster Solution Summary of the Decision Process

An Illustrative Example Stage 1: Objectives of the Cluster Analysis Segment objects (customers) into groups with similar perceptions of HATCO HATCO can then formulate strategies with different appeals for the separate groups. Stage 2: Research Design of the Cluster Analysis Identify any outliers Similarity measure (multicollinearity: D2) Stage 3: Assumptions in Cluster Analysis

An Illustrative Example (Cont.) Stage 4: Deriving Clusters and Assessing Overall Fit Step 1: Hierarchical Cluster Analysis Step 2: Nonhierarchical Cluster Analysis Stage 5: Interpretation of the Clusters Two-cluster solution Four-cluster solution Stage 6: Validation and Profiling of the Clusters Managerial view