A Scoping Review of Earth Observation and Machine Learning for Causal Inference: Implications for the Geography of Poverty
- URL: http://arxiv.org/abs/2406.02584v3
- Date: Tue, 24 Sep 2024 20:50:21 GMT
- Title: A Scoping Review of Earth Observation and Machine Learning for Causal Inference: Implications for the Geography of Poverty
- Authors: Kazuki Sakamoto, Connor T. Jerzak, Adel Daoud
- Abstract summary: Early research in computer vision used predictive models to estimate living conditions.
Recent work has progressed beyond using EO data to predict such outcomes -- now also using it to conduct causal inference.
- Score: 3.4137115855910762
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Earth observation (EO) data such as satellite imagery can have far-reaching impacts on our understanding of the geography of poverty, especially when coupled with machine learning (ML) and computer vision. Early research in computer vision used predictive models to estimate living conditions, especially in contexts where data availability on poverty was scarce. Recent work has progressed beyond using EO data to predict such outcomes -- now also using it to conduct causal inference. However, how such EO-ML models are used for causality remains incompletely mapped. To address this gap, we conduct a scoping review where we first document the growth of interest in using satellite images and other sources of EO data in causal analysis. We then trace the methodological relationship between spatial statistics and ML methods before discussing five ways in which EO data has been used in scientific workflows -- (1) outcome imputation for downstream causal analysis, (2) EO image deconfounding, (3) EO-based treatment effect heterogeneity, (4) EO-based transportability analysis, and (5) image-informed causal discovery. We consolidate these observations by providing a detailed workflow for how researchers can incorporate EO data in causal analysis going forward -- from data requirements to choice of computer vision model and evaluation metrics. While our discussion focuses on health and living conditions outcomes, our workflow applies to other measures of sustainable development where EO data are informative.
Related papers
- Analyzing Poverty through Intra-Annual Time-Series: A Wavelet Transform Approach [2.3213238782019316]
Using Landsat imagery and nighttime light data, we evaluate EO-ML methods that use intra-annual EO data.
Our results indicate that integrating specific NDVI-derived features with multi-spectral data provides valuable insights for poverty analysis.
arXiv Detail & Related papers (2024-11-05T06:59:05Z)
- Data-Centric AI in the Age of Large Language Models [51.20451986068925]
This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs).
We make the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs.
We identify four specific scenarios centered around data, covering data-centric benchmarks and data curation, data attribution, knowledge transfer, and inference contextualization.
arXiv Detail & Related papers (2024-06-20T16:34:07Z)
- Smoke and Mirrors in Causal Downstream Tasks [59.90654397037007]
This paper looks at the causal inference task of treatment effect estimation, where the outcome of interest is recorded in high-dimensional observations.
We compare 6,480 models fine-tuned from state-of-the-art visual backbones, and find that the sampling and modeling choices significantly affect the accuracy of the causal estimate.
Our results suggest that future benchmarks should carefully consider real downstream scientific questions, especially causal ones.
arXiv Detail & Related papers (2024-05-27T13:26:34Z)
- VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models [57.43276586087863]
Large Vision-Language Models (LVLMs) suffer from hallucination issues, wherein the models generate plausible-sounding but factually incorrect outputs.
Existing benchmarks are often limited in scope, focusing mainly on object hallucinations.
We introduce a multi-dimensional benchmark covering objects, attributes, and relations, with challenging images selected based on associative biases.
arXiv Detail & Related papers (2024-04-22T04:49:22Z)
- Impact Assessment of Missing Data in Model Predictions for Earth Observation Applications [4.388282062290401]
We assess the impact of missing temporal and static EO sources in trained models across four datasets with classification and regression tasks.
We find that some methods are naturally more robust to missing data.
The optical view is the most critical view when it is missing individually.
arXiv Detail & Related papers (2024-03-21T11:03:56Z)
- Bias and Fairness in Large Language Models: A Survey [73.87651986156006]
We present a comprehensive survey of bias evaluation and mitigation techniques for large language models (LLMs).
We first consolidate, formalize, and expand notions of social bias and fairness in natural language processing.
We then unify the literature by proposing three intuitive taxonomies: two for bias evaluation and one for mitigation.
arXiv Detail & Related papers (2023-09-02T00:32:55Z)
- Artificial intelligence to advance Earth observation: A review of models, recent trends, and pathways forward [60.43248801101935]
This article gives a bird's eye view of the essential scientific tools and approaches informing and supporting the transition from raw EO data to usable EO-based information.
We cover the impact of (i) Computer vision; (ii) Machine learning; (iii) Advanced processing and computing; (iv) Knowledge-based AI; (v) Explainable AI and causal inference; (vi) Physics-aware models; (vii) User-centric approaches; and (viii) the much-needed discussion of ethical and societal issues related to the massive use of ML technologies in EO.
arXiv Detail & Related papers (2023-05-15T07:47:24Z)
- Studying Up Machine Learning Data: Why Talk About Bias When We Mean Power? [0.0]
We argue that reducing societal problems to "bias" misses the context-based nature of data.
We highlight the corporate forces and market imperatives involved in the labor of data workers that subsequently shape ML datasets.
arXiv Detail & Related papers (2021-09-16T17:38:26Z)
- Earth Observation and the New African Rural Datascapes: Defining an Agenda for Critical Research [0.0]
Increasing availability of Earth Observation data could transform the use and governance of African rural landscapes.
Recent years have seen a rapid increase in the development of EO data applications targeted at stakeholders in African agricultural systems.
There is still relatively little critical scholarship questioning how EO data are accessed, presented, disseminated and used in different socio-political contexts.
arXiv Detail & Related papers (2021-08-23T06:05:16Z)
- OR-Net: Pointwise Relational Inference for Data Completion under Partial Observation [51.083573770706636]
This work uses relational inference to fill in the incomplete data.
We propose Omni-Relational Network (OR-Net) to model the pointwise relativity in two aspects.
arXiv Detail & Related papers (2021-05-02T06:05:54Z)
This list is automatically generated from the titles and abstracts of the papers on this site.