Download presentation

Presentation is loading. Please wait.

Published byGael Oldaker Modified over 2 years ago

1
1 Conjoint and Discrete Choice Experiments (DCEs) Lecture Content Conjoint and Discrete Choice Experiments (DCE)? –Stated preference (SP) and Revealed preference (RP) data –Theories of Individual Choice Behaviour –Utility maximization -Random Utility Framework Experimental Design –Selection of Attributes: physical, beneficial, image –Focus group, best-worst scaling to make a shortlist –Fractional factorial design: main effects + interactions –Unacceptable combinations? Conjoint Analysis –Different Types of Conjoint Analysis Discrete Choice Models –Making Random Utility Theory operational –Discrete Choice Model and Its Assumptions Data Collection and Model Estimation Interpretation of Results

2
2 Conjoint and DCEs Any DECOMPOSITIONAL (as opposed to COMPOSITIONAL method) method that estimates the structure of consumers preferences (such as preference parameter, relative importance, willingness-to- pay etc.), given his or her overall evaluations of a set of alternatives that are pre-specified in terms of levels of different attributes. (see Green and Srinivasan 1990: Journal of Marketing, 54 (4), 3-19) Individual level model Stated Preference (SP) Data ( as opposed to Revealed Preference, RP) Compositional Method – see example in next slide Applications –Consumer goods (59%), industrial goods (18%), financial (9%), other services (9%) [ 1990 data]

3
3 Compositional Method: Multi-Attribute Attitude Model (attitude towards grocery store) Importance: 1 = least, 2,3,4, 5 = most Beliefs 1 = Strongly Disagree 5 = Strongly Agree 72 = 5 x x 5+ 4 x 4 +5 x x 3

4
4 Discrete Choice Experiments Suppose that you are considering buying juice for yourself or your family. Below we describe 32 different juice product options. Each juice option is described by the features you evaluated on the first page of this survey. We want you to evaluate each option and decide if you would consider Buying or Not Buying each option ( one in each row). I would Juice Type Brand Price of 250 ml serving Percentage of real Juice No. of 250 ml servings per pack Calcium Packaging Materials BuyNot Buy Apple Minute Maid $0.3010% 3 – 250 ml Pack Not AddedGlass AppleMcCain$0.5070% 9 – 250 ml Pack Not AddedTetra-Pack AppleTropicana$ % 3 – 250 ml Pack AddedGlass AppleDel Monte$0.3040% 9 – 250 ml Pack AddedTetra-Pack Example: Decompositional Method – sample survey Other Dependent Measures possible?

5
5 Available flights for travel from Boston to Seattle: Scenario 4 Information about flightsOption AOption BOption COption D Round-Trip Air Fare$650$350$450$550 Number of Stops2130 Total Travel Time5hrs7hrs4hrs6hrs Type of AirplaneB717B737 B717 In-Flight food & Beveragehotmeal beverage AirlineNorthwestSouthwestNorthwestSouthwest 1. Which flight do you prefer most ( one) 2. Which flight do you prefer least ( one) 3. Which one of the two remaining flights do you prefer most ( one) 4. Check the box to the right if you actually would choose not to fly if these were your only travel options Example: Decompositional Method

6
6 Features of preference data sources Why design and analysis of SP surveys? Organisations need to estimate demand for new products with new attributes or features. No RP data on which to rely. Explanatory variables have little variability in the marketplace. Often Explanatory variables are highly collinear New variables now explain choices. Categories mature - new product features introduced &/or designs replace old ones. RP data are time consuming and expensive to collect. Features of preference data sources RP Data: depict world as it is now (current market equilibrium), have only existing alternatives as observables, have high reliability and face validity, yield one observation per respondent at each observation point. SP Data: describe hypothetical or virtual decision contexts (flexibility), can include existing and/or proposed and/or generic choice alternatives, reliable if subjects understand, are committed to & can complete tasks, (usually) yield multiple observations per respondent per observation.

7
7 Theories of Individual Choice Behavior Examples of Choices?? Elements of Choice Process Decision maker ~ Individual, Family, Firm etc. Alternatives ~ Choice set ( Consideration set, evoked set) –Choice set need to exhibit 3 characteristics »Mutually Exclusive »Exhaustive – all possible alternatives are included »Nos. of alternatives finite Example: Household Energy Use »Electricity, Gas, Oil, Wood »None option Attributes of alternatives ~ e.g. Transit Choice: cost, time Decision rule ~ a choice from two or more alternatives requires a decision rule. It describes the internal mechanism used by the decision maker to arrive at a unique choice. A few rules are as follows: –Non-compensatory: Lexicographic rule, Elimination by Aspects –Compensatory: Multi-Attribute Decision Rule

8
8 Theories of Individual Choice Behavior Rational Behavior Consistent and transitive preferences Transit choice: car > Car Pool, Car Pool > Bus, then Car > Bus Economic Consumption Theory An individual choosing a consumption bundle Q={ q1,.q2…, qn}, where q1,q2…qn are the quantities of each of the commodities and services. In economic consumption theory these quantities are generally assumed to be continuous variables. The consumer is faced with a budget constraint, i.e. income. In the classical approach to consumer theory there is no explicit treatment of attributes in addition of the quantities. The consumer is assumed to have preferences over alternative consumption bundles, for instance if Q i Q j. It is assumed that that the consumer has the ability to compare all possible alternatives. Under these assumptions there exist an ordinal utility function and the consumer chooses bundle Q i if U(Q i ) U(Q j ). Utility function First Order conditions/ Derivatives Demand function

9
9 Theories of Individual Choice Behavior Discrete Choice Theory Transit Choice: Car Pool, Car, Bus, Train As we are dealing with a set of discrete choices, it is impossible to use the techniques used in Economic Consumption Theory (e.g. use of derivatives to derive demand function). Thus discrete choice problem requires a different analytical approach. We will retain the concept of Rational Consumer and concept of Utility maximization. Difference between Choice Theory and Economic Consumption Theory. –Consumption theory works with demand function derived from utility function but Choice theory work directly with the utility functions. Random Utility Maximization –The approach of Random Utility is more in line with economic consumption theory and is the theoretical behavioral approach for modeling choices. –The individual is always assumed to select the alternative with the highest utility. However, the utilities are not known to the analyst with certainty and are therefore treated by the analyst as random variables. –We are saying Utility is random. There are a number of sources of randomness such as unobserved attributes unobserved taste variations measurement errors and imperfect information –When Consumer will choose one alternative ?

10
10 Discrete Choice Analysis: related Issues Variable Choice sets- in discrete choice experiments, we give respondents same choice sets (or consideration set). No of alternatives fixed. Example: If consumer do not have car, then his/her alternatives are car pool, bus, and train. Behavioral Theory- What if decision rule is Non- compensatory such as Lexicographic or Elimination by Aspects. This will lead to different choice model. Missing Information - What happens when we dont give consumer full information i.e. choices under missing information. May influence brand equity, mean estimates, or variability. Family Decision Making ( or interaction among households) Current models are based on the theory of individual decision making. Many decisions are based on the interactions of household members such as home purchase

11
11 Experimental Design: Selection of Attributes Wide variety of product attribute descriptions – mainly 3 types ( see Lefkoff- Hagius and Mason (1993), 20 (1), , Journal of Consumer Research ): (a) Characteristics / Physical properties / Tangible Attributes that describes the physical properties Physical attributes are objectively measured and meaningful to engineers and managers (b) Beneficial / Functional / Intangible / Instrumental What product will do to user Economic theory suggests that consumers select products for the utility or benefit that they provide. Consumers want products not for physical products themselves but for the benefit derived from using the products (c) Image / Intangible/ abstract Beyond these utilitarian or functional benefits there is a stream of sociological work which consider symbolic aspects of product preferences How product represent user to others or self-image Possibility of correlations among 3 types of attributes Physical characteristics are often causally linked to beneficial attributes. – presence of air bag, antilock brakes determines safety of a car. Correlation between beneficial and image attributes –Fast acceleration, tight cornering projects sporty image

12
12 Experimental Design: Selection of Attributes Designs Science of planning exactly what observations to take and how to take them to permit the best possible inferences to be made from data vis-à-vis ones hypothesis. Designed experiments manipulate attributes and levels to permit rigorous tests of certain hypothesis of interest Experiment manipulates one or more attributes Attribute list –Also called factors, explanatory variables, independent variables, features etc –Make list of attributes – from previous surveys, consumers, experts –Make a short list of attributes – focus group, best- worst scaling –Check for conceptual correlation between the attributes Identify attribute levels –experiment with one attribute each level is called TREATMENT –Experiment with more than one attribute Each unique combination of factor levels is called TREATMENT COMBINATION –Marketers call this treatment combination PROFILES

13
13 Experimental Design: Factorial Designs Designs – Full Factorial Each level of each attribute is combined with every level of all other attributes eg, 3 attributes, each with 2 levels like soup attributes - meat (beef/chicken), noodles (yes/no) & veggies (yes/no). Each combination of these attribute levels describes a unique soup (eg, chicken noodle with vegetables). All soups that can be created from this set of attributes & levels are given by a factorial combination of their levels. ie, there are 2 x 2 x 2, or eight total soups, as below SoupsMeatNoodlesVegetab les 1chickenyes 2chickenyesno 3chickennoyes 4chickenno 5beefyes 6beefyesno 7beefnoyes 8beefno

14
14 ABCA*BA*CB*CA*B*C ABCA*BA*CB*CA*B*C A1 B01 C001 A*B0001 A*C00001 B*C A*B*C Design Matrix: 1 = yes, -1 = No Correlation Matrix Full Factorial –allows estimation of all main effects and interactions –All effects are truly independent

15
15 LevelBrandPriceCrust Type # of Toppings Delivery Time (mins.) Free Delivery Garlic Bread/Bread Sticks Chicken wings Salad 0Pizza Hut$10Regular110No 1Domino's$12Thin2-320Yes 2Pizza $14Thick Gino's$16Pan pizza6 or more40 Example: Delivered Pizza Full Factorial Combinations ? Combinations can be BLOCKED

16
16 SoupsMeatNoodlesVeggies 1(-1) chicken (-1) no 2(-1) chicken(1) yes Fraction 1 3(1) beef(-1) no(1) yes 4(1) beef(1) yes(-1) no 1(-1) chicken(-1) no(1) yes 2(-1) chicken(1) yes(-1) noFraction 2 3(1) beef(-1) no 4(1) beef(1) yes Fractional Factorial Designs

17
17 Fractional Factorial Designs: Main Effects Selective samples from full factorial –Fractions loss statistical efficiency of the design –Must assume that higher order interactions are not significant Current ½ fraction is main effects only design Degrees of Freedom = Nos. of estimated coeff. + 1 ABCA*BA*CB*CA*B*C A1 B01 C001 A*B001 A*C0001 B*C00001 A*B*Cxxxxx.x1 Correlation Matrix

18
18 Fractional Factorial Designs: Interactions Interaction - preferences for levels of one attribute depend on the levels of a second. Economic theory generally silent about interactions, but if utility function is strictly additive, all attributes must be preferentially independent. Travel Time UTILITYUTILITY Low Fare High Fare

19
19 Fractional Factorial Designs Main effects designs can be practical –Two different & often conflicting objectives in choice research: –Understanding process requires maximum information (ie, full factorials) or high resolution designs. –Resolution of a design = highest order of effects that can be independently estimated. –Main effects designs have lowest resolution - science needs highest resolution designs possible. –Understanding may lead to better prediction; but better prediction will not necessarily lead to better understanding. Research findings –Main effects capture about 70-90% of explained variance –Two way interactions capture about 5-15% of explained variance –Higher order interactions account for the rest Practical designs: Main effects only Main effects plus selected interactions

20
20 Correlated treatment combinations/ Unacceptable combinations Re: Moore and Holbrook (1990), Journal of Consumer Research, 16 (4), Many product classes contain objects with correlated attributes. –Car: acceleration, top speed and engine size are all positively correlated and each negatively correlated to gas mileage –Orthogonal designs can produce some unbelievable attribute combination such as low priced luxury car, very powerful, low wattage air conditioner etc. Possible Approaches –Permute attribute levels to generate another fractional design. Feasible when number of attributes and their levels are small –One can remove the unbelievable combination so works as a missing level Possible Approaches –One can construct composite factor for a number of highly correlated attributes. This can be handled by combining levels:short trip ($2.75, $3.75); long trip ($4.00, $5.50) - 4 levels. But will be difficult to infer about contribution of particular attribute in consumers preference –One can construct a design with non-zero correlation but with more realism. Generate designs under constrained condition. –Procedure based on difference designs (Louviere and Woodworth 1988, Goldberg, Green and Wind 1984) Research findings –Impact of unacceptable combinations is minimal –Monitor during pretest and place questionable profiles at the end

21
21 Design Efficiency What is Efficiency and How to Maximise it? General Linear Model: Y = Xβ + ε The significance of the t & F statistics reflect the variance of β and the size of the error. As the variance of β decreases the statistical significance increases. There are two ways to reduce the variance of β: (1) data (2) design matrix. Efficiency is the ability to estimate β, given the design matrix (X) you have specified i.e. it reflects how well your experimental design can answer the question you are interested in. As the efficiency of the model increases, the variance of β decreases (and vice versa). Efficiency can be calculated because the variance of β is proportional to the variance of X

22
22 Design Efficiency A-efficiency ~ trace is sum of the diagonal element of variance-cov matrix i.e. sum of the variances –average variance D –efficiency ~ geometric mean of the variance. Determinant is product eigenvalues. N = number of runs i.e. rows of X matrix P = number of columns of X matrix i.e. number of attributes Burgess, Leonie and Deborah Street (2003), Optimal designs for 2k choice experiments, Communications in Statistics -Theory and Methods, 32, Burgess, Leonie and Deborah Street (2005), Optimal designs for choice experiments with asymmetric attributes, Journal of Statistical Planning and Inference, 134, Street, Deborah, Leonie Burgess and Jordan J. Louviere (2005), Quick and Easy Choice Sets: Constructing Optimal and Nearly Optimal Stated Choice Experiments, International Journal of Research in Marketing, forthcoming.

23
23 Conjoint Analysis: Different Types Preference structure measurement Compositional approach Self-explicated method Decompositional approach Traditional conjoint + Discrete Choice method Compositional and Decompositional Hybrid method Adaptive method Hybrid Conjoint: Self-explicated + subset of profiles ( see Green (1984), Journal of Marketing Research, 21 (May), ) –Attribute level desirability values for the levels of each attribute separately –Attribute importance data –Conjoint responses to a limited set (usually 3 to 9) of full profiles, drawn from a larger master design –The main idea of hybrid model is to develop individual utility functions in which some aspects of the resulting utilities are estimated at individual level and some at total sample (possibly sub-sample) level.

24
24 Designs for Simple SP Problems Attributes of Boston – LA flights Levels of features Return fare$300, $400, $500, $600 Departure time8am, 9am, noon, 2pm Total travel time5, 7 hours Non-stop servicenon-stop, 1 stop Music/audio entertainmentyes, no TV-video clips, newsyes, no Movie(s)yes, no Hot mealyes, no AirlineSouthwest, Northwest

25
25 Designs for Simple SP Problems Effects & df can be decomposed as follows: –Main effects (13 df) –Two-way interactions (36 df) –Other interactions (2, = ?? df) –Typically do not ask respondents to evaluate 2048 tickets - need more parsimonious statistical models or blocking + aggregation. Design Possibilities Design possibilities include (in increasing complexity): –Only main effects (OMEP) –Main effects orthogonal to unobserved 2-way interactions –Main effects + some two-way interaction effects –Main effects + all two-way interaction effects –Designs for orthogonal higher order effects –Blocking + aggregation – dividing large designs into versions/blocks

26
26 Designs for Simple SP Problems: Main Effect Design Design codes for a regular orthogonal main effects FlightFareDepartTimeStopsAudioVideoMealsAirline

27
27 Designs for Simple SP Problems: Main Effect Design Flight # Translating design codes into profiles using find & replace FareDepartTimeStopsAudioVideoMealsAirline 1$3008 am4hrs0no Southwest 2$3009 am5hrs1noyes Southwest 3$300Noon6 hrs1yesnoyesNorthwest 4$3002 pm7 hrs0yes noNorthwest 5$4008 am5 hrs0yesnoyesNorthwest 6$4009 am4 hrs1yes noNorthwest 7$400Noon7 hrs1no Southwest 8$4002 pm6 hrs0noyes Southwest 9$5008 am6 hrs1noyesnoNorthwest 10$5009 am7 hrs0no yesNorthwest 11$500Noon4 hrs0yes Southwest 12$5002 pm5 hrs1yesno Southwest 13$6008 am7 hrs1yes Southwest 14$6009 am6 hrs0yesno Southwest 15$600Noon5 hrs0noyesnoNorthwest 16$6002 pm4 hrs1no yesNorthwest

28
28 Multiple Choice Experiments Two types of alternatives –Unlabelled or Generic –Labelled or alternative specific TreatmentAlternative 1Alternative 2 combination ComfortLow Travel Time10 hours1 hour TreatmentCarPlane combination ComfortLow Travel Time10 hours1 hour

29
29 Discrete Choice Analysis: Models Conceptual Framework : RANDOM UTILITY Theory (RUT) –RUT holds that consumer preferences are latent and unobservable (Manski 1977, Thurstone 1927, McFadden 1974) and latent utility can be expressed as additive function of systematic and random component Latent utility (U i ) of option i evaluated by individual Systematic part Random part Attribute information

30
30 Discrete Choice Analysis: Making Random Utility Theory Operational Specification of systematic part: The first issue is what variables to include? –Attributes ~ in case transit choice: travel cost, time, comfort, convenience, safety –What is the reasonable functional form? –Linear in parameters Specification of the random part –We dont know ?. However if we know the distribution function that describes it, then we can at least tell how likely the inequality will met. The assumption of distribution function gives different choice models. The assumption of Extreme value distribution ( also known as double exponential, Weibul distribution) gives us the LOGIT CHOICE MODEL. –The assumption of normal distribution will give us PROBIT CHOICE MODEL. But PROBIT dont have closed closed form solution and becomes computationally very complicated when we have three or more alternatives.

31
31 Discrete Choice Model and its Assumptions Mathematics of Choice Models Say a consumer will take action when the utility is positive Utility, U i > 0 ( U i = V i + e i ) V i + e i > 0 e i > - V i ( Say V i = ' x = b 1 x 1 +b 2 x 2 +…… ) We are interested about mean choice We need to assume a distribution of e i.e. f(e i ) Mean Probability Logit Model

32
32 All latent dependent variable models confound mean and error variance. –e.g., Choice Models (MNL, Mixed Logit) for MNL, Implication: Estimation software outputs ( / ) not Imposing distribution on makes empirical sense if is constant. Discrete Choice Models

33
33 Binary Choice: P(yes|yes, no) = exp(V yes )/[exp(V yes ) + exp(V no )], V no can be set = 0 to satisfy the identification restriction. Thus, P(yes|yes, no) = exp(V yes )/[exp(V yes ) + 1], The odds of responding yes relative to no are Taking natural logarithms of both sides, we see that: V yes specified as linear-in-parameters: V yes = k k X k + m m Z m, k - vector of K attribute effects (X k ); m - vector of M individual measure effects (Z m ) interacteed with intercept or elements of X k. Key property – can effects of X k be estimated independently? X k contains main effects + (possibly) interactions - so, models that can be estimated depend on design of X k columns. Binary Choice Model

34
34 Multinomial Choice Model Linearise MNL model to motivate the discussion: or,. The odds of choosing a over r (a reference option) is: ; and the log odds are:

35
35 Multinomial Choice Model So, log odds of a/r estimates a utility difference of options a & r. The utility of one option must be set to a constant (typically 0) because only J-1 options are identified. Thus, r can be set = 0; hence the log odds of a/r estimates the utility of a up to a positive linear transformation. Generally r is not constant, but its attributes vary over choice sets. If rs utility effects are generic we would have the following:, ; and. Thus, MNL is a difference-in-attributes model. If rs attributes constant (eg, no choice), can vary abs levels

36
36 Discrete Choice Models : Covariance Heterogeneity Model (CHM),Mixed Logit model: McFadden and Train 2000 and Latent Segment: Kamakura and Russell 1989 CHM Mixed Logit There are other models or extensions of basic models MNL(λ=1) Latent Segment

37
37 Discrete Choice Model and its Assumptions Multinomial Logit: Model Assumptions IID ~ The residuals (errors) of the alternatives are independent and identically distributed. What IID means? Unobserved factors are uncorrelated over alternatives, as well as having the same variance for all the alternatives This is not always TRUE –A person who dislikes travel by bus because of presence of other riders might have a similar reaction to rail travel. If so, then unobserved factors affecting bus and rail are correlated IIA ~ Independence of Irrelevant Alternatives: For a specific individual the ratio of the choice probabilities of any two alternatives is entirely unaffected by systematic utilities of any other alternatives. Examples ???

38
38 Data Collection and Estimation Calculation of Sample size and sampling Design Convert design into Survey Presentation format – mail, interview, online, computer- assisted Generate sample design using SPSS Generate sample design using SAS

39
39 Sample size for Discrete Choice models Sample Size p = choice share of a brand, q = 1-p, r = nos. of replication, a = allowable error margin (percentage of choice share). Example: say we have 3 brands and we are interested to measure the choice shares of these brands. If we assume the population is homogeneous in terms of their preferences, choice share p for a particular brand will be either close to 0 or close to 1. If we assume the population is heterogeneous in terms of their preferences, choice share p for a particular brand will be close to 1/3. Here 3 is number of brands. This is the case of largest variance in the population. Lets assume the heterogeneous population i.e. p = 0.33, q =0.67. We want to be 95% confident on our result i.e. z =1.96. Our allowable margin of error is 10% of p. Our design has 8 replication i.e. r =8. We need to ask 98 respondents and each will make 8 choices.

40
40 Example binary logit regression results ( n=100 individual) EffectEstimateStd ErrT-ValueP(T) Fare Dep 8am Dep 9am Dep Noon Time (hrs) No. Stops Audio Video Meals Airline (SW = -1) Intercept Statistics: -2[L(0)-L( )] = , df=10, p( 2 )<0.000, 2 = 0.168

41
41 SP Models: Interpretation of Results Interpretation of Results Individual Coefficient Market Share (or Choice share) Relative importance of attribute –For each attribute find difference between maximum utility – minimum utility –Add all the differences –Determine relative importance Willingness to Pay (WTP) – Mean centered Price Price elasticity Brand share Price level Price Coeff.

42
42 Ordering a Large (14) Pizza Sample Example: Pizza See sample data Discrete choice sample data.sav FeaturesDescription (levels) of Features Levels0123 BrandPizza HutDominos Eagle Boys Pizza Haven Price$12$14$16$18 Number of toppings 13 Free SaladNoYes Free DessertNoYes

43
43 Estimation using SPSS File – Open - Data »Discrete choice sample data.sav Analyze – Regression – Binary Logistic »Dependent: Choice (consumers answer, 1 = yes, 0 = No) »Covariates: Enter all your attributes »Click: categorical and enter all covariates »Contrast: Choose deviation »Preference: First »Click Change »Click Continue and then click OK

44
44 Interpretation of Sample Output Are the estimates intuitive? Relative importance of the attributes Brand share Willingness-to-pay Own price elasticity

Similar presentations

© 2017 SlidePlayer.com Inc.

All rights reserved.

Ads by Google