Presentation is loading. Please wait.

Presentation is loading. Please wait.

Estimating Internally Forced Displacement (IFD) Flows in Colombia using Mark and Recapture methods, 1996 – 2004 Soledad Granada Universidad del Rosario.

Similar presentations


Presentation on theme: "Estimating Internally Forced Displacement (IFD) Flows in Colombia using Mark and Recapture methods, 1996 – 2004 Soledad Granada Universidad del Rosario."— Presentation transcript:

1 Estimating Internally Forced Displacement (IFD) Flows in Colombia using Mark and Recapture methods, 1996 – 2004 Soledad Granada Universidad del Rosario and CERAC Mauricio Sadinle García-Ruíz Universidad Nacional and CERAC Jorge A. Restrepo Pontificia Universidad Javeriana and CERAC September 2nd, 2007

2 Content 1.Difficulties with displacement flows in Colombia 2.Objectives 3.IFD according to the available information 4.Sources 5.Methods 6.Results 7.Further planned work

3 Caveats/Acknowledgement Work still in progress –We are waiting for a major third source of data to become available and to integrate it in order to improve on the estimation quality –We are testing further improvements to extract more information from the data Project developed thanks to the support of Foreign Affairs-CANADA (DFAIT)

4 Difficulties with displacement flows in Colombia Multiple complementary sources Measurement: –Government –Catholic Church’s Conference of Bishops –NGO Codhes –ICRC Surveys –Ibáñez (2006) –Household survey (13 cities) –Catholic Church’s Conference of Bishops Estimations –NGO Codhes

5 Difficulties with displacement flows in Colombia In general, these sources offer similar trends, different levels, but they have differential coverage, methods and purposes As a results, there is no information for humanitarian attention and planning Inefficiencies in humanitarian attention Victims attention is also made mostly on a ad- hoc basis, both by government and other assistance agencies Most attention is emergency-based and not long-term oriented

6 Objectives To estimate the level of IFD using several complementary sources, starting with two To estimate dynamics of IFD To provide a measure of the reliability of the estimates To provide a measure of coverage/undercount of sources To provide a “humanitarian” IFD risk map

7 IFD available information

8

9

10

11

12 Sources

13 Secretariado Nacional de Pastoral Social –RUT- The RUT database has 242.565 records, between1996 y 2004; after cleaning it up we had 236.795 usable records, only 2.37% had incomplete fields. Several variables, including, victim information, family relatives, and regarding the displacement event. Voluntary, not compulsory, survey at humanitarian attention and Church attention posts. Low coverage, self-selected, good coverage in isolated areas Potential problematic biases

14 Acción Social –SUR- Acción Social database holds 2.278.978 records covering the 1995 2006 period. Several variables including family, individual, and event of displacement-related information. Low “quality”. After cleaning it up, only 1.616.743 records were being able to be used. Compulsory registry to obtain humanitarian emergency aid from the government and to be able to apply for further assistance Self-selected Large coverage, mostly on urban areas and large cities

15 Methods We use RUT and AS for the 1996-2004 period to estimate a level of displacement at day-municipality pairs, using Capture-recapture (C-R) census correction models. (Pollock, 2002, JASA). (Also known as mark- and recapture, by Wikipedia) C-R permits an estimation of population levels based on incomplete samples of it by different overlapping sources It assumes : –Two samples (at record level) –Equal probability of inclusion for every individual in every sample –Independency between samples –It is possible to identify the intersection –Population is closed

16 Methods Under those assumptions, estimation is unbiased (consistent) and efficient. Applications in natural sciences realms are countless In social sciences, Ball et al have been a pioneer of using these techniques to estimate the level of HR violation in civil war contexts (Guatemala, Perú, Timor Leste) (see Ball et al, 2002).

17 Methods

18 Estimators used Lincoln-Petersen: Chapman: Records at SUR Records at RUT Recaptured

19 Estimators Chapman (1951) improves on Lincoln- Petersen Chapman is unbiased if (Wittes, 1972): Chapman (1951) show its estimator has negative bias if:

20 The most problematic of all assumptions (in terms of the quality of the estimate) is the equi-probability of capture across individuals Sekar and Deming (1949) propose a stratified estimation procedure in order to account for the effect of differential probability of capture. The procedure will yield thus an estimate: It assumes differential probability of capture across strata, same probability within strata, independence between samples and close population. This way the (upward) bias created falls dramatically The quality of stratification determines the quality of the estimate

21 Methods We use then both Lincoln Petersen and Chapman (preferred) stratified estimators When m=0, This is, we use the simple sum of the two observed samples.

22 There are several applications that take the strata as defined exogenously (communities, regions, etc.) We want to obtain k statistically different groups of records that will have a statistically similar probability of being included in the group: We test the null hypothesis: Against Where How to define the strata?

23 We test using: That distributes

24 Procedure for stratification We test for stratification by municipality (admin unit) of arrival, and day of arrival We further jointly test for strata by day and municipality of arrival, creating strata for each day and municipality pairs. We further test within each municipality in consecutive days, in order to discard over-stratification within municipality, that will lead to an upward bias. (We do not do this by municipality as these are more “natural” strata). The rejection level was set at 5% for all tests

25 Matching criteria We define the matching criteria according to date of expulsion and arrival, place of expulsion and arrival and gender. Other more strict applications offered no substantial estimated difference. We create an indicator for each record and compare them across sources using a computer algorithm We do not have access to more specific characteristics (name, id) of the individual that will help us to improve the matching

26 Confidence: Bootstrapping methods In order to obtain confidence intervals, and given that we do not know the theoretical distribution of the estimator, we use bootstrapping methods in order to approach the distribution of the estimator. We follow Buckland y Garthwaite (1991). We perform 10000 replications of sub-samples with replacement in order to obtain CI and an estimate of the variance of the estimator. We perform a correction of the bias of the estimator according to the results of the bootstrap obtaining:

27 Results The following are the point estimates bias-corrected and uncorrected Lincoln-Petersen and Chapman estimates. In graphs and maps we present the Chapman estimate

28

29

30

31

32 Departmental Results

33 Recorded by Secretariado Nacional de Pastoral Social – RUT -

34 Recorded by Acción Social SUR

35 Recaptured

36 Estimated Arrival IFD 1996 – 2004

37 Recorded Exit IFD by the two systems (Projected)

38 Probability of being displaced by municipality (exit)

39 Estimated arrival IFD population per 100.000 inhabitants

40 Comparison with other displacement figures o CODHES: o Estimation (Jan 1985 - June 2006) : 3.832.527* o RUT: o Estimation (1985 – 1994): 720.000 o Records (Jan 1995 - June 2004): 237.614 o SUR: o Records (Jan 1995 - June 2005) : 1.877.328 o CEDE: o Estimation (Jan 1995 - June 2005) : 2.576.610 o CERAC: o Estimation (1996 - 2004) : 2.440.207 * Incluye cifra Conferencia Episcopal periodo 1985 – 1994.

41 Further Planned Work We just received data until mid-2007 and will consequently update the estimation We are devising a method to improve the estimation in order to incorporate the large number of un-matchable observations using a window-based recursive procedure with those observations. We are researching the properties of the estimator using Monte Carlo methods.


Download ppt "Estimating Internally Forced Displacement (IFD) Flows in Colombia using Mark and Recapture methods, 1996 – 2004 Soledad Granada Universidad del Rosario."

Similar presentations


Ads by Google