Livelihoods analysis using SPSS. Why do we analyze livelihoods?  Food security analysis aims at informing geographical and socio-economic targeting 

Slides:



Advertisements
Similar presentations
SADC Course in Statistics Common complications when analysing survey data Module I3 Sessions 14 to 16.
Advertisements

Applied Econometrics Second edition
©2004, 2006, 2008 UIW Department of Instructional Technology Meat and Potatoes SPSS Presented by Terence Peak.
Cluster Analysis Measuring latent groups. Cluster Analysis - Discussion Definition Vocabulary Simple Procedure SPSS example ICPSR and hands on.
Bivariate Analysis Cross-tabulation and chi-square.
4. FREQUENCY DISTRIBUTION
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
17a.Accessing Data: Manipulating Variables in SPSS ®
Introduction to SPSS Allen Risley Academic Technology Services, CSUSM
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
AEB 37 / AE 802 Marketing Research Methods Week 7
Cluster Analysis.
Welcome to Dave’s Data Demonstration This presentation is designed for users of SPSS with some familiarity with the program and a willingness to experiment.
AEB 37 / AE 802 Marketing Research Methods Week 5
QM Spring 2002 Business Statistics SPSS: A Summary & Review.
A Simple Guide to Using SPSS© for Windows
Chi-square Test of Independence
15a.Accessing Data: Frequencies in SPSS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
MR2300: MARKETING RESEARCH PAUL TILLEY Unit 10: Basic Data Analysis.
Problem 1: Relationship between Two Variables-1 (1)
Introduction to SPSS (For SPSS Version 16.0)
Clustering analysis workshop Clustering analysis workshop CITM, Lab 3 18, Oct 2014 Facilitator: Hosam Al-Samarraie, PhD.
Evaluating Performance for Data Mining Techniques
LINDSEY BREWER CSSCR (CENTER FOR SOCIAL SCIENCE COMPUTATION AND RESEARCH) UNIVERSITY OF WASHINGTON September 17, 2009 Introduction to SPSS (Version 16)
SW388R6 Data Analysis and Computers I Slide 1 Central Tendency and Variability Sample Homework Problem Solving the Problem with SPSS Logic for Central.
Srinivasulu Rajendran Centre for the Study of Regional Development (CSRD) Jawaharlal Nehru University (JNU) New Delhi India
Using SPSS for Windows Part II Jie Chen Ph.D. Phone: /6/20151.
1 Cronbach’s Alpha It is very common in psychological research to collect multiple measures of the same construct. For example, in a questionnaire designed.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Splitting Files PowerPoint Prepared by Alfred.
© 2007 Prentice Hall20-1 Chapter Twenty Cluster Analysis.
Livelihoods activities Food Security Indicators Training Bangkok, January 2009.
© 2011 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Stewart Venit ~ Elizabeth Drake Developing a Program.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Pearson Chi-Square Contingency Table Analysis.
What is SPSS  SPSS is a program software used for statistical analysis.  Statistical Package for Social Sciences.
Recap of data analysis and procedures Food Security Indicators Training Bangkok January 2009.
1 An Introduction to SPSS for Windows Jie Chen Ph.D. 6/4/20161.
Cluster Analysis Cluster Analysis Cluster analysis is a class of techniques used to classify objects or cases into relatively homogeneous groups.
A Simple Guide to Using SPSS ( Statistical Package for the Social Sciences) for Windows.
PROCESSING, ANALYSIS & INTERPRETATION OF DATA
Perform Descriptive Statistics Section 6. Descriptive Statistics Descriptive statistics describe the status of variables. How you describe the status.
Chi-square Test of Independence
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
Prepared by: Mahmoud Rafeek Al-Farra
Smoking Data The investigation was based on examining the effectiveness of smoking cessation programs among heavy smokers who are also recovering alcoholics.
Practice Problem: Lambda (1)
Mr. Magdi Morsi Statistician Department of Research and Studies, MOH
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
Sensitivity Analysis A systematic way of asking “what-if” scenario questions in order to understand what outcomes could possibly occur that would effect.
NSF DUE ; Wen M. Andrews J. Sargeant Reynolds Community College Richmond, Virginia.
SW388R6 Data Analysis and Computers I Slide 1 Comparing Central Tendency and Variability across Groups Impact of Missing Data on Group Comparisons Sample.
16a. Accessing Data: Means in SPSS ®. 16a. Accessing Data: Means in SSPS ® 1 Prerequisites Recommended modules to complete before viewing this module.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Selecting Cases PowerPoint Prepared by Alfred.
Using SPSS Next. An Introduction SPSS (the Statistical Package for the Social Sciences)
1 PEER Session 02/04/15. 2  Multiple good data management software options exist – quantitative (e.g., SPSS), qualitative (e.g, atlas.ti), mixed (e.g.,
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
1 Cluster Analysis Prepared by : Prof Neha Yadav.
Analyzing Data. Learning Objectives You will learn to: – Import from excel – Add, move, recode, label, and compute variables – Perform descriptive analyses.
PROCESSING DATA.
Introduction to SPSS July 28, :00-4:00 pm 112A Stright Hall
Introduction to SPSS.
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
LINDSEY BREWER CSSCR (CENTER FOR SOCIAL SCIENCE COMPUTATION AND RESEARCH) UNIVERSITY OF WASHINGTON September 17, 2009 Introduction to SPSS (Version 16)
Data Entry and Managment
Hypothesis Testing Part 2: Categorical variables
Cluster Analysis.
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
Cluster analysis Presented by Dr.Chayada Bhadrakom
By A.Arul Xavier Department of mathematics
 .
Performing the Runs Test Using SPSS
Presentation transcript:

Livelihoods analysis using SPSS

Why do we analyze livelihoods?  Food security analysis aims at informing geographical and socio-economic targeting  Livelihood analysis allows us to answer one of the key basic questions of food security analysis: “who are the food insecure?”  This analysis also allows us to create a socio-economic profile of the vulnerable households

How do we analyze livelihoods  The standard livelihood (income) module in a CFSVA allows for a few different types of analysis  We can analyze the main income activity followed by the second and third by simply running cross-tabulations with the main activity and other variables  We can also use multiple response analysis to analyze all of the reported income activities (regardless of order) and run cross- tabulations  We can analyze the number of income activities to see if there are significant differences between diversified households and single income households  And we can identify clusters of livelihood activities which offers a more powerful form of analysis

Types of cluster analysis available in SPSS  SPSS offers three methods for cluster analysis  Hierarchial clustering  Two-step clustering  K-means clustering

Types of analysis available in SPSS  Hierarchical clustering  Uses algorithms that are agglomerative (bottom-up) or divisive (top-down)  If agglomerative, each case is a cluster and then an algorithm is performed to either separate successive cases into clusters  Divisive algorithms first put all cases in a single cluster and then sequentially attempt to divide them

Types of analysis available in SPSS  Two-step clustering  As the name implies, clustering is done in two steps  First the cases are pre-clustered into many small sub-clusters  Then the sub-clusters are joined into the a specified number of clusters (SPSS can also find the number of clusters automatically)

Types of analysis available in SPSS  K-means clustering  Cases are placed into a partition and then iteratively relocated into another cluster  Iterations are repeated until the desired number of clusters are reached

Issue with SPSS cluster analysis  Two of the available procedures (hierarchical and k- means) require the user to know a priori the number of clusters desired  Only the two-step cluster option allows for automatic determination, however, from the WFP perspective it does not produce a useful result (too few clusters)  Therefore either another statistical software package needs to be used or a guess needs to be made on the number of clusters to include (and then run several iterations until a logical clustering is achieved)

Performing cluster analysis  As mentioned, there are several options available to perform cluster analysis  The analyst should chose the method that they are most familiar with  To give an example of one method to create the clusters, we will use the k-means method in SPSS

Prepare the dataset  It is imperative that the income activity module data is clean and without errors  The sum of all activities contributions must be 100  The same activity should not be repeated for a household  If an activity exists, the relative contribution must not be missing  Before the clustering can be performed, the contribution of each livelihood activity must be calculated for all households  To do so, syntax such as the following must be executed for all variables:  compute act01 = 0.  if (activity1 =1) act01 = act01+Activity1_Value.  if (activity2 =1) act01 = act01+Activity2_Value.  if (activity3 =1) act01 = act01+Activity3_Value.  The objective of this computation is to find out for every household, what is the relative contribution of each activity to their overall livelihood  After executing the syntax above for every activity, verify that the total for each household is exactly 100

Perform the first iteration of the cluster analysis  In this example, we will use the SPSS k-means method to perform cluster analysis using the contribution of each income activity as our variables of interest  In SPSS select:  Analyze > Classify > K-means cluster  Select all of the newly created income activity variables  The number of clusters is chosen at your discretion keeping in mind the number of activities listed in the survey and the knowledge that you will create a few iterations  Click the ‘save’ button and chose ‘cluster membership’  Click OK or Paste

Interpret the results  SPSS will produce a few outputs (based on the options you gave)  The iteration history will show you the number of iterations the change in the center of each cluster  The final clusters center table is the table we look at closely  Here, each variable is listed as a row and it’s average contribution to each cluster is noted in the columns  Paste this table into Excel

Interpret the results  Use conditional formatting to highlight cells with a value > 10 and examine the way the clusters have attempted to group the activities

Repeat the analysis  Repeat the cluster analysis this time increasing (or decreasing) the number of clusters by 1  Examine the final clusters table again  Continue to repeat this exercise until you have successfully created clusters that are logical  Livelihood clusters should be able to be described in a relatively simple fashion. Usually, there is one predominant income activity defining a group and some supplemental income from other activities  There is no ‘golden rule’ on the right number of clusters and some subjective but informed but decisions must be made

Describe the clusters  Once the clusters have been finalized, further examine the contribution of the activities to each cluster  Write a brief description of the composition of the cluster; for example:  A cluster which has a center of 78 from income from trading, selling and other commercial activity could be simply described as a ‘trader’  A cluster which has a center of 50 from cash crops and 30 from food crops could be summarized as ‘cash and food crops’  Appropriately label the final cluster variable in your dataset with the livelihood descriptions

Explore the clusters  Next, explore the livelihood clusters you’ve created  Look at the frequency of the clusters in the dataset  Some clusters may be combined if reasonable information allows you to do so  For example, people who are ‘remittance receivers’ and ‘pensioners’ may have very similar qualities and could possibly be combined

Analyze the clusters using cross-tabulations  The livelihood clusters can be used to examine ‘who are the food insecure’ and ‘where are they’  Cross-tabulate the livelihood clusters with Food Consumption Groups (you can also compare means of the FCS between clusters)  Cross-tabulate the clusters with all geographic strata  Wealth and livelihood are usually highly related and should be examined  Other indicators of interest: gender of household head, education of household head, etc.