Measuring Global Migration Flows using Online Data
- URL: http://arxiv.org/abs/2504.11691v1
- Date: Wed, 16 Apr 2025 01:19:26 GMT
- Title: Measuring Global Migration Flows using Online Data
- Authors: Guanghua Chi, Guy J. Abel, Drew Johnston, Eugenia Giraudy, Mike Bailey,
- Abstract summary: Using privacy protected records from three billion Facebook users, we estimate country-to-country migration flows at monthly granularity for 181 countries.<n>We estimate that 39.1 million people migrated internationally in 2022 (0.63% of the population of the countries in our sample)<n>To support research and policy interventions, we will release these estimates publicly through the Humanitarian Data Exchange.
- Score: 0.38836072943850625
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Existing estimates of human migration are limited in their scope, reliability, and timeliness, prompting the United Nations and the Global Compact on Migration to call for improved data collection. Using privacy protected records from three billion Facebook users, we estimate country-to-country migration flows at monthly granularity for 181 countries, accounting for selection into Facebook usage. Our estimates closely match high-quality measures of migration where available but can be produced nearly worldwide and with less delay than alternative methods. We estimate that 39.1 million people migrated internationally in 2022 (0.63% of the population of the countries in our sample). Migration flows significantly changed during the COVID-19 pandemic, decreasing by 64% before rebounding in 2022 to a pace 24% above the pre-crisis rate. We also find that migration from Ukraine increased tenfold in the wake of the Russian invasion. To support research and policy interventions, we will release these estimates publicly through the Humanitarian Data Exchange.
Related papers
- The NetMob25 Dataset: A High-resolution Multi-layered View of Individual Mobility in Greater Paris Region [64.30214722988666]
This paper describes the survey design, collection protocol, processing methodology, and characteristics of the released dataset.<n>The dataset includes three components: (i) an Individuals database describing demographic, socioeconomic, and household characteristics; (ii) a Trips database with over 80,000 annotated displacements including timestamps, transport modes, and trip purposes; and (iii) a Raw GPS Traces database comprising about 500 million high-frequency points.
arXiv Detail & Related papers (2025-06-06T09:22:21Z) - Inferring fine-grained migration patterns across the United States [1.6594124470436404]
We develop a scalable iterative-proportional-fitting based method which reconciles high-resolution but biased proprietary data with low-resolution but more reliable Census data.<n>We apply this method to produce MIGRATE, a dataset of annual migration matrices from 2010 - 2019 which captures flows between 47.4 billion pairs of Census Block Groups.<n>These estimates are highly correlated with external ground-truth datasets, and improve accuracy and reduce bias relative to raw proprietary data.
arXiv Detail & Related papers (2025-03-26T21:07:44Z) - Uchaguzi-2022: A Dataset of Citizen Reports on the 2022 Kenyan Election [49.35115948941981]
We present Uchaguzi-2022, a dataset of 14k categorized and geotagged citizen reports related to the 2022 Kenyan General Election.<n>We use this dataset to investigate whether language models can assist in scalably categorizing and geotagging reports, thus highlighting its potential application in the AI for Social Good space.
arXiv Detail & Related papers (2024-12-17T17:08:35Z) - Combining Twitter and Mobile Phone Data to Observe Border-Rush: The Turkish-European Border Opening [2.5693085674985117]
Following Turkey's 2020 decision to revoke border controls, many individuals journeyed towards the Greek, Bulgarian, and Turkish borders.
However, the lack of verifiable statistics on irregular migration and discrepancies between media reports and actual migration patterns require further exploration.
This study is to bridge this knowledge gap by harnessing novel data sources, specifically mobile phone and Twitter data.
arXiv Detail & Related papers (2024-05-21T09:51:15Z) - The diaspora model for human migration [0.07852714805965527]
Existing models primarily rely on population size and travel distance to explain flow fluctuations.
We propose the diaspora model of migration, incorporating intensity (the number of people moving to a country) and assortativity (the destination within the country)
Our model considers only the existing diaspora sizes in the destination country, influencing the probability of migrants selecting a specific residence.
arXiv Detail & Related papers (2023-09-06T15:17:53Z) - Estimating Latent Population Flows from Aggregated Data via Inversing
Multi-Marginal Optimal Transport [57.16851632525864]
We study the problem of estimating latent population flows from aggregated count data.
This problem arises when individual trajectories are not available due to privacy issues or measurement fidelity.
We propose to estimate the transition flows from aggregated data by learning the cost functions of the MOT framework.
arXiv Detail & Related papers (2022-12-30T03:03:23Z) - Releasing survey microdata with exact cluster locations and additional
privacy safeguards [77.34726150561087]
We propose an alternative microdata dissemination strategy that leverages the utility of the original microdata with additional privacy safeguards.
Our strategy reduces the respondents' re-identification risk for any number of disclosed attributes by 60-80% even under re-identification attempts.
arXiv Detail & Related papers (2022-05-24T19:37:11Z) - Jalisco's multiclass land cover analysis and classification using a
novel lightweight convnet with real-world multispectral and relief data [51.715517570634994]
We present our novel lightweight (only 89k parameters) Convolution Neural Network (ConvNet) to make LC classification and analysis.
In this work, we combine three real-world open data sources to obtain 13 channels.
Our embedded analysis anticipates the limited performance in some classes and gives us the opportunity to group the most similar.
arXiv Detail & Related papers (2022-01-26T14:58:51Z) - Leveraging Mobile Phone Data for Migration Flows [5.0161988361764775]
Statistics on migration flows are often derived from census data, which suffer from intrinsic limitations.
Alternative data sources, such as surveys and field observations, also suffer from reliability, costs, and scale limitations.
The ubiquity of mobile phones enables an accurate and efficient collection of up-to-date data related to migration.
arXiv Detail & Related papers (2021-05-31T13:41:47Z) - Heads in the Clouds: Measuring the Implications of Universities
Migrating to Public Clouds [39.16294179753668]
We study the migration of universities to public clouds among universities in the U.S. and Europe.
We find that cloud adoption differs between countries, with one cluster showing a limited move to clouds.
We then analyze and interpret our results, finding that the implications reach beyond individuals' privacy towards questions of academic independence and integrity.
arXiv Detail & Related papers (2021-04-19T17:25:10Z) - Forecasting asylum-related migration flows with machine learning and
data at scale [0.0]
We show that adaptive machine learning algorithms can effectively forecast asylum-related migration flows.
We exploit three tiers of data - geolocated events and internet searches in countries of origin, detections of irregular crossings at the EU border, and asylum recognition rates in countries of destination.
arXiv Detail & Related papers (2020-11-09T11:31:17Z) - Brain Drain and Brain Gain in Russia: Analyzing International Migration
of Researchers by Discipline using Scopus Bibliometric Data 1996-2020 [77.34726150561087]
We analyze all researchers who have published with a Russian affiliation address in Scopus-indexed sources in 1996-2020.
While Russia was a donor country in the late 1990s and early 2000s, it has experienced a relatively balanced circulation of researchers in more recent years.
Overall, researchers emigrating from Russia outnumbered and outperformed researchers immigrating to Russia.
arXiv Detail & Related papers (2020-08-07T12:47:38Z) - Cross-lingual Transfer Learning for COVID-19 Outbreak Alignment [90.12602012910465]
We train on Italy's early COVID-19 outbreak through Twitter and transfer to several other countries.
Our experiments show strong results with up to 0.85 Spearman correlation in cross-country predictions.
arXiv Detail & Related papers (2020-06-05T02:04:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.