Download presentation

Presentation is loading. Please wait.

Published byGilbert Kelley Modified over 6 years ago

1
Incomplete data: Indirect estimation of migration flows Modelling approaches

2
Aim: Synthetic data base by effective combination of data from different sources

3
Requirements Data representation: a mathematical model of the ‘complete’ or desired migration data Data types: the different ways of measuring migration –Data on events [relocations] (‘movement data’) Migrations –Data on changes in status [place of residence] Migrants

4
Requirements Typology of missing or incomplete data Related to data types: what is missing? Typology of available data Related to data types: what is available? –Primary data –Auxiliary data (e.g. historical migration matrix) Measure of reliability of available data. Method to infer missing data from available statistical data and ‘soft’ information on migration

5
Existing approaches Net migration: residual method Gross migration flows: spatial interaction models –Gravity model –Entropy maximisation –Information-theoretic approaches –Iterative proportional fitting (bi- and multiproportional adjustment [RAS]) Age profile: model migration schedules

6
The approach Migration is a manifestation of behavioural processes and random processes (choice and chance) Describe the processes and get plausible/accurate parameter estimates based on the (incomplete) data and additional information Apply the model to predict migration flows

7
Data types Micro-data –Migration data (event data) Occurrence of migration in observation period Time at migration –Migrant data (status data; transition data) Current status Status at two or more points in time (panel) –Equal interval –Unequal interval (e.g. place of birth and place of current residence) Grouped data

8
Data types Micro-data Grouped data (aggregate data; tabulations) –Migrations (events) –Migrants (transitions) Observation in continuous time (e.g. population register) Observation in discrete time

9
Types of incompleteness Non-response Net migration vs gross flows Migrants vs migrations (events) Single migration recorded instead of sequence of migrations (e.g. last migration) Partially missing data –e.g. Origin by age or covariates –Some information missing for some persons

10
Solutions to incomplete data Collect missing information Use ancillary data and/or information on comparable population Live with it and minimise distortions caused by missing data Infer missing data from all the information you can get (combine sources)

11
Probability models of migration Migration is a realisation of a Poisson process

13
Log-rate model: rate = events/exposure Gravity model

16
RAS, Biproportional adjustment, etc.

17
Likelihood equations may be written as : Marginal totals are sufficient statistics

18
A different way of writing the spatial interaction model: Link Poisson - Multinomial

19
The gravity model is a log-linear model The entropy model is a log-linear model The RAS model is as log-linear (log-rate) model

20
Parameter estimation Maximise (log) likelihood function: probability that the model predicts the data Expectation: predict E[N rs ] = rs given the model and initial parameter estimates. Maximisation: maximise the ‘complete- data’ log-likelihood.

21
Z ki : Individual k is member of group i

22
When k and 2 are known, then

23
Conclusion A unified approach to the prediction of migration from different types of data and different data sources Approach based on probability theory and theory of statistical inference (not ad hoc) The EM algorithm is studied extensively. Much experience gathered. ‘Soft’data (e.g. expert opinions) can be added

Similar presentations

© 2021 SlidePlayer.com Inc.

All rights reserved.

To make this website work, we log user data and share it with processors. To use this website, you must agree to our Privacy Policy, including cookie policy.

Ads by Google