Related papers: A Scoping Review of Earth Observation and Machine Learning for Causal Inference: Implications for the Geography of Poverty

A Scoping Review of Earth Observation and Machine Learning for Causal Inference: Implications for the Geography of Poverty

URL: http://arxiv.org/abs/2406.02584v3
Date: Tue, 24 Sep 2024 20:50:21 GMT
Title: A Scoping Review of Earth Observation and Machine Learning for Causal Inference: Implications for the Geography of Poverty
Authors: Kazuki Sakamoto, Connor T. Jerzak, Adel Daoud,
Abstract summary: Early research in computer vision used predictive models to estimate living conditions. Recent work has progressed beyond using EO data to predict such outcomes -- now also using it to conduct causal inference.
Score: 3.4137115855910762
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Earth observation (EO) data such as satellite imagery can have far-reaching impacts on our understanding of the geography of poverty, especially when coupled with machine learning (ML) and computer vision. Early research in computer vision used predictive models to estimate living conditions, especially in contexts where data availability on poverty was scarce. Recent work has progressed beyond using EO data to predict such outcomes -- now also using it to conduct causal inference. However, how such EO-ML models are used for causality remains incompletely mapped. To address this gap, we conduct a scoping review where we first document the growth of interest in using satellite images and other sources of EO data in causal analysis. We then trace the methodological relationship between spatial statistics and ML methods before discussing five ways in which EO data has been used in scientific workflows -- (1) outcome imputation for downstream causal analysis, (2) EO image deconfounding, (3) EO-based treatment effect heterogeneity, (4) EO-based transportability analysis, and (5) image-informed causal discovery. We consolidate these observations by providing a detailed workflow for how researchers can incorporate EO data in causal analysis going forward -- from data requirements to choice of computer vision model and evaluation metrics. While our discussion focuses on health and living conditions outcomes, our workflow applies to other measures of sustainable development where EO data are informative.

Related papers

A Survey of AIOps in the Era of Large Language Models [60.59720351854515]
We analyzed 183 research papers published between January 2020 and December 2024 to answer four key research questions (RQs)<n>We discuss the state-of-the-art advancements and trends, identify gaps in existing research, and propose promising directions for future exploration.
arXiv Detail & Related papers (2025-06-23T02:40:16Z)
Can Large Language Models Help Experimental Design for Causal Discovery? [94.66802142727883]
Large Language Model Guided Intervention Targeting (LeGIT) is a robust framework that effectively incorporates LLMs to augment existing numerical approaches for the intervention targeting in causal discovery. LeGIT demonstrates significant improvements and robustness over existing methods and even surpasses humans.
arXiv Detail & Related papers (2025-03-03T03:43:05Z)
Regression in EO: Are VLMs Up to the Challenge? [18.343600857006763]
Vision Language Models (VLMs) have achieved remarkable success in perception and reasoning tasks. This paper systematically examines the challenges and opportunities of adapting VLMs for EO regression tasks.
arXiv Detail & Related papers (2025-02-19T20:27:54Z)
REO-VLM: Transforming VLM to Meet Regression Challenges in Earth Observation [58.91579272882073]
This paper introduces a novel benchmark dataset, called textbfREO-Instruct to unify regression and generation tasks specifically for the Earth Observation domain. We develop textbfREO-VLM, a groundbreaking model that seamlessly integrates regression capabilities with traditional generative functions.
arXiv Detail & Related papers (2024-12-21T11:17:15Z)
Analyzing Poverty through Intra-Annual Time-Series: A Wavelet Transform Approach [2.3213238782019316]
Using Landsat imagery and nighttime light data, we evaluate EO-ML methods that use intra-annual EO data. Our results indicate that integrating specific NDVI-derived features with multi-spectral data provides valuable insights for poverty analysis.
arXiv Detail & Related papers (2024-11-05T06:59:05Z)
Data-Centric AI in the Age of Large Language Models [51.20451986068925]
This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs) We make the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs. We identify four specific scenarios centered around data, covering data-centric benchmarks and data curation, data attribution, knowledge transfer, and inference contextualization.
arXiv Detail & Related papers (2024-06-20T16:34:07Z)
Smoke and Mirrors in Causal Downstream Tasks [59.90654397037007]
This paper looks at the causal inference task of treatment effect estimation, where the outcome of interest is recorded in high-dimensional observations. We compare 6 480 models fine-tuned from state-of-the-art visual backbones, and find that the sampling and modeling choices significantly affect the accuracy of the causal estimate. Our results suggest that future benchmarks should carefully consider real downstream scientific questions, especially causal ones.
arXiv Detail & Related papers (2024-05-27T13:26:34Z)
VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models [57.43276586087863]
Large Vision-Language Models (LVLMs) suffer from hallucination issues, wherein the models generate plausible-sounding but factually incorrect outputs. Existing benchmarks are often limited in scope, focusing mainly on object hallucinations. We introduce a multi-dimensional benchmark covering objects, attributes, and relations, with challenging images selected based on associative biases.
arXiv Detail & Related papers (2024-04-22T04:49:22Z)
Impact Assessment of Missing Data in Model Predictions for Earth Observation Applications [4.388282062290401]
We assess the impact of missing temporal and static EO sources in trained models across four datasets with classification and regression tasks. We find that some methods are naturally more robust to missing data. The optical view is the most critical view when it is missing individually.
arXiv Detail & Related papers (2024-03-21T11:03:56Z)
Bias and Fairness in Large Language Models: A Survey [73.87651986156006]
We present a comprehensive survey of bias evaluation and mitigation techniques for large language models (LLMs) We first consolidate, formalize, and expand notions of social bias and fairness in natural language processing. We then unify the literature by proposing three intuitive, two for bias evaluation, and one for mitigation.
arXiv Detail & Related papers (2023-09-02T00:32:55Z)
Joint Learning of Label and Environment Causal Independence for Graph Out-of-Distribution Generalization [60.4169201192582]
We propose to incorporate label and environment causal independence (LECI) to fully make use of label and environment information. LECI significantly outperforms prior methods on both synthetic and real-world datasets.
arXiv Detail & Related papers (2023-06-01T19:33:30Z)
Artificial intelligence to advance Earth observation: : A review of models, recent trends, and pathways forward [60.43248801101935]
This article gives a bird's eye view of the essential scientific tools and approaches informing and supporting the transition from raw EO data to usable EO-based information. We cover the impact of (i) Computer vision; (ii) Machine learning; (iii) Advanced processing and computing; (iv) Knowledge-based AI; (v) Explainable AI and causal inference; (vi) Physics-aware models; (vii) User-centric approaches; and (viii) the much-needed discussion of ethical and societal issues related to the massive use of ML technologies in EO.
arXiv Detail & Related papers (2023-05-15T07:47:24Z)
Studying Up Machine Learning Data: Why Talk About Bias When We Mean Power? [0.0]
We argue that reducing societal problems to "bias" misses the context-based nature of data. We highlight the corporate forces and market imperatives involved in the labor of data workers that subsequently shape ML datasets.
arXiv Detail & Related papers (2021-09-16T17:38:26Z)
Earth Observation and the New African Rural Datascapes: Defining an Agenda for Critical Research [0.0]
Increasing availability of Earth Observation data could transform the use and governance of African rural landscapes. Recent years have seen a rapid increase in the development of EO data applications targeted at stakeholders in African agricultural systems. There is still relatively little critical scholarship questioning how EO data are accessed, presented, disseminated and used in different socio-political contexts.
arXiv Detail & Related papers (2021-08-23T06:05:16Z)
OR-Net: Pointwise Relational Inference for Data Completion under Partial Observation [51.083573770706636]
This work uses relational inference to fill in the incomplete data. We propose Omni-Relational Network (OR-Net) to model the pointwise relativity in two aspects.
arXiv Detail & Related papers (2021-05-02T06:05:54Z)

This list is automatically generated from the titles and abstracts of the papers in this site.