Imbalance-aware Presence-only Loss Function for Species Distribution
Modeling
- URL: http://arxiv.org/abs/2403.07472v1
- Date: Tue, 12 Mar 2024 10:08:36 GMT
- Title: Imbalance-aware Presence-only Loss Function for Species Distribution
Modeling
- Authors: Robin Zbinden, Nina van Tiel, Marc Ru{\ss}wurm, Devis Tuia
- Abstract summary: This study assesses the effectiveness of training deep learning models using a balanced presence-only loss function on large citizen science-based datasets.
We demonstrate that this imbalance-aware loss function outperforms traditional loss functions across various datasets and tasks.
- Score: 3.4306175858244794
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In the face of significant biodiversity decline, species distribution models
(SDMs) are essential for understanding the impact of climate change on species
habitats by connecting environmental conditions to species occurrences.
Traditionally limited by a scarcity of species observations, these models have
significantly improved in performance through the integration of larger
datasets provided by citizen science initiatives. However, they still suffer
from the strong class imbalance between species within these datasets, often
resulting in the penalization of rare species--those most critical for
conservation efforts. To tackle this issue, this study assesses the
effectiveness of training deep learning models using a balanced presence-only
loss function on large citizen science-based datasets. We demonstrate that this
imbalance-aware loss function outperforms traditional loss functions across
various datasets and tasks, particularly in accurately modeling rare species
with limited observations.
Related papers
- Evolutionary Dispersal of Ecological Species via Multi-Agent Deep Reinforcement Learning [0.0]
This study uses multi-agent reinforcement learning (MARL) with deep Q-networks (DQN) to simulate single species and predator-prey interactions.
Our simulations reveal evolutionary dispersal strategies, providing insights into species dispersal mechanisms and validating traditional mathematical models.
arXiv Detail & Related papers (2024-10-24T10:21:23Z) - Low-rank finetuning for LLMs: A fairness perspective [54.13240282850982]
Low-rank approximation techniques have become the de facto standard for fine-tuning Large Language Models.
This paper investigates the effectiveness of these methods in capturing the shift of fine-tuning datasets from the initial pre-trained data distribution.
We show that low-rank fine-tuning inadvertently preserves undesirable biases and toxic behaviors.
arXiv Detail & Related papers (2024-05-28T20:43:53Z) - Predicting Species Occurrence Patterns from Partial Observations [21.009271008147785]
We introduce the problem of predicting species occurrence patterns given (a) satellite imagery, and (b) known information on the occurrence of other species.
To evaluate algorithms on this task, we introduce SatButterfly, a dataset of satellite images, environmental data and observational data for butterflies.
We propose a general model, R-Tran, for predicting species occurrence patterns that enables the use of partial observational data wherever found.
arXiv Detail & Related papers (2024-03-26T18:29:39Z) - On the selection and effectiveness of pseudo-absences for species
distribution modeling with deep learning [3.8974747170521287]
Species distribution modeling is a versatile tool for understanding the relationship between environmental conditions and species occurrences.
To overcome this limitation, a common approach is to employ pseudo-absences, which are specific geographic locations designated as negative samples.
In this paper, we demonstrate that these challenges can be effectively tackled by integrating pseudo-absences in the training of multi-species neural networks.
arXiv Detail & Related papers (2024-01-03T16:06:30Z) - SatBird: Bird Species Distribution Modeling with Remote Sensing and
Citizen Science Data [68.2366021016172]
We present SatBird, a satellite dataset of locations in the USA with labels derived from presence-absence observation data from the citizen science database eBird.
We also provide a dataset in Kenya representing low-data regimes.
We benchmark a set of baselines on our dataset, including SOTA models for remote sensing tasks.
arXiv Detail & Related papers (2023-11-02T02:00:27Z) - On the Connection between Pre-training Data Diversity and Fine-tuning
Robustness [66.30369048726145]
We find that the primary factor influencing downstream effective robustness is data quantity.
We demonstrate our findings on pre-training distributions drawn from various natural and synthetic data sources.
arXiv Detail & Related papers (2023-07-24T05:36:19Z) - Spatial Implicit Neural Representations for Global-Scale Species Mapping [72.92028508757281]
Given a set of locations where a species has been observed, the goal is to build a model to predict whether the species is present or absent at any location.
Traditional methods struggle to take advantage of emerging large-scale crowdsourced datasets.
We use Spatial Implicit Neural Representations (SINRs) to jointly estimate the geographical range of 47k species simultaneously.
arXiv Detail & Related papers (2023-06-05T03:36:01Z) - Class-Balancing Diffusion Models [57.38599989220613]
Class-Balancing Diffusion Models (CBDM) are trained with a distribution adjustment regularizer as a solution.
Our method benchmarked the generation results on CIFAR100/CIFAR100LT dataset and shows outstanding performance on the downstream recognition task.
arXiv Detail & Related papers (2023-04-30T20:00:14Z) - Ensembles of Vision Transformers as a New Paradigm for Automated
Classification in Ecology [0.0]
We show that ensembles of Data-efficient image Transformers (DeiTs) significantly outperform the previous state of the art (SOTA)
On all the data sets we test, we achieve a new SOTA, with a reduction of the error with respect to the previous SOTA ranging from 18.48% to 87.50%.
arXiv Detail & Related papers (2022-03-03T14:16:22Z) - StatEcoNet: Statistical Ecology Neural Networks for Species Distribution
Modeling [8.534315844706367]
This paper focuses on a core task in computational sustainability and statistical ecology: species distribution modeling (SDM)
In SDM, the occurrence pattern of a species on a landscape is predicted by environmental features based on observations at a set of locations.
To address the unique challenges of SDM, this paper proposes a framework called StatEcoNet.
arXiv Detail & Related papers (2021-02-17T02:19:00Z) - A General Framework for Survival Analysis and Multi-State Modelling [70.31153478610229]
We use neural ordinary differential equations as a flexible and general method for estimating multi-state survival models.
We show that our model exhibits state-of-the-art performance on popular survival data sets and demonstrate its efficacy in a multi-state setting.
arXiv Detail & Related papers (2020-06-08T19:24:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.