Geospatial Disparities: A Case Study on Real Estate Prices in Paris
- URL: http://arxiv.org/abs/2401.16197v1
- Date: Mon, 29 Jan 2024 14:53:14 GMT
- Title: Geospatial Disparities: A Case Study on Real Estate Prices in Paris
- Authors: Agathe Fernandes Machado, Fran\c{c}ois Hu, Philipp Ratz, Ewen Gallic,
Arthur Charpentier
- Abstract summary: We propose a toolkit for identifying and mitigating biases arising from geospatial data.
We incorporate an ordinal regression case with spatial attributes, deviating from the binary classification focus.
Illustrating our methodology, we showcase practical applications and scrutinize the implications of choosing geographical aggregation levels for fairness and calibration measures.
- Score: 0.3495246564946556
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Driven by an increasing prevalence of trackers, ever more IoT sensors, and
the declining cost of computing power, geospatial information has come to play
a pivotal role in contemporary predictive models. While enhancing prognostic
performance, geospatial data also has the potential to perpetuate many
historical socio-economic patterns, raising concerns about a resurgence of
biases and exclusionary practices, with their disproportionate impacts on
society. Addressing this, our paper emphasizes the crucial need to identify and
rectify such biases and calibration errors in predictive models, particularly
as algorithms become more intricate and less interpretable. The increasing
granularity of geospatial information further introduces ethical concerns, as
choosing different geographical scales may exacerbate disparities akin to
redlining and exclusionary zoning. To address these issues, we propose a
toolkit for identifying and mitigating biases arising from geospatial data.
Extending classical fairness definitions, we incorporate an ordinal regression
case with spatial attributes, deviating from the binary classification focus.
This extension allows us to gauge disparities stemming from data aggregation
levels and advocates for a less interfering correction approach. Illustrating
our methodology using a Parisian real estate dataset, we showcase practical
applications and scrutinize the implications of choosing geographical
aggregation levels for fairness and calibration measures.
Related papers
- Thinking Racial Bias in Fair Forgery Detection: Models, Datasets and Evaluations [63.52709761339949]
We first contribute a dedicated dataset called the Fair Forgery Detection (FairFD) dataset, where we prove the racial bias of public state-of-the-art (SOTA) methods.
We design novel metrics including Approach Averaged Metric and Utility Regularized Metric, which can avoid deceptive results.
We also present an effective and robust post-processing technique, Bias Pruning with Fair Activations (BPFA), which improves fairness without requiring retraining or weight updates.
arXiv Detail & Related papers (2024-07-19T14:53:18Z) - Fair Spatial Indexing: A paradigm for Group Spatial Fairness [6.640563753223598]
We propose techniques to mitigate location bias in machine learning.
We focus on spatial group fairness and we propose a spatial indexing algorithm that accounts for fairness.
arXiv Detail & Related papers (2023-02-05T05:15:11Z) - TensorAnalyzer: Identification of Urban Patterns in Big Cities using
Non-Negative Tensor Factorization [8.881421521529198]
This paper presents a new approach to detecting the most relevant urban patterns from multiple data sources based on tensor decomposition.
We developed a generic framework namedAnalyzer, where the effectiveness and usefulness of the proposed methodology are tested.
arXiv Detail & Related papers (2022-10-06T01:04:02Z) - Position-aware Structure Learning for Graph Topology-imbalance by
Relieving Under-reaching and Over-squashing [67.83086131278904]
Topology-imbalance is a graph-specific imbalance problem caused by the uneven topology positions of labeled nodes.
We propose a novel position-aware graph structure learning framework named PASTEL.
Our key insight is to enhance the connectivity of nodes within the same class for more supervision information.
arXiv Detail & Related papers (2022-08-17T14:04:21Z) - Deep residential representations: Using unsupervised learning to unlock
elevation data for geo-demographic prediction [0.0]
LiDAR technology can be used to provide detailed three-dimensional elevation maps of urban and rural landscapes.
To date, airborne LiDAR imaging has been predominantly confined to the environmental and archaeological domains.
We consider the suitability of this data not just on its own but also as a source of data in combination with demographic features, thus providing a realistic use case for the embeddings.
arXiv Detail & Related papers (2021-12-02T17:10:52Z) - Fairness without the sensitive attribute via Causal Variational
Autoencoder [17.675997789073907]
Due to privacy purposes and var-ious regulations such as RGPD in EU, many personal sensitive attributes are frequently not collected.
By leveraging recent developments for approximate inference, we propose an approach to fill this gap.
Based on a causal graph, we rely on a new variational auto-encoding based framework named SRCVAE to infer a sensitive information proxy.
arXiv Detail & Related papers (2021-09-10T17:12:52Z) - Geo-Spatiotemporal Features and Shape-Based Prior Knowledge for
Fine-grained Imbalanced Data Classification [63.916371837696396]
Fine-grained classification aims at distinguishing between items with similar global perception and patterns, but that differ by minute details.
Our primary challenges come from both small inter-class variations and large intra-class variations.
We propose to combine several innovations to improve fine-grained classification within the use-case of wildlife.
arXiv Detail & Related papers (2021-03-21T02:01:38Z) - Fundamental Limits and Tradeoffs in Invariant Representation Learning [99.2368462915979]
Many machine learning applications involve learning representations that achieve two competing goals.
Minimax game-theoretic formulation represents a fundamental tradeoff between accuracy and invariance.
We provide an information-theoretic analysis of this general and important problem under both classification and regression settings.
arXiv Detail & Related papers (2020-12-19T15:24:04Z) - Domain Adaptative Causality Encoder [52.779274858332656]
We leverage the characteristics of dependency trees and adversarial learning to address the tasks of adaptive causality identification and localisation.
We present a new causality dataset, namely MedCaus, which integrates all types of causality in the text.
arXiv Detail & Related papers (2020-11-27T04:14:55Z) - Supercharging Imbalanced Data Learning With Energy-based Contrastive
Representation Transfer [72.5190560787569]
In computer vision, learning from long tailed datasets is a recurring theme, especially for natural image datasets.
Our proposal posits a meta-distributional scenario, where the data generating mechanism is invariant across the label-conditional feature distributions.
This allows us to leverage a causal data inflation procedure to enlarge the representation of minority classes.
arXiv Detail & Related papers (2020-11-25T00:13:11Z) - Learning Geo-Contextual Embeddings for Commuting Flow Prediction [20.600183945696863]
Predicting commuting flows based on infrastructure and land-use information is critical for urban planning and public policy development.
Conventional models, such as gravity model, are mainly derived from physics principles and limited by their predictive power in real-world scenarios.
We propose Geo-contextual Multitask Embedding Learner (GMEL), a model that captures the spatial correlations from geographic contextual information for commuting flow prediction.
arXiv Detail & Related papers (2020-05-04T17:45:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.