SatBird: Bird Species Distribution Modeling with Remote Sensing and
Citizen Science Data
- URL: http://arxiv.org/abs/2311.00936v1
- Date: Thu, 2 Nov 2023 02:00:27 GMT
- Title: SatBird: Bird Species Distribution Modeling with Remote Sensing and
Citizen Science Data
- Authors: M\'elisande Teng, Amna Elmustafa, Benjamin Akera, Yoshua Bengio, Hager
Radi Abdelwahed, Hugo Larochelle, David Rolnick
- Abstract summary: We present SatBird, a satellite dataset of locations in the USA with labels derived from presence-absence observation data from the citizen science database eBird.
We also provide a dataset in Kenya representing low-data regimes.
We benchmark a set of baselines on our dataset, including SOTA models for remote sensing tasks.
- Score: 68.2366021016172
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Biodiversity is declining at an unprecedented rate, impacting ecosystem
services necessary to ensure food, water, and human health and well-being.
Understanding the distribution of species and their habitats is crucial for
conservation policy planning. However, traditional methods in ecology for
species distribution models (SDMs) generally focus either on narrow sets of
species or narrow geographical areas and there remain significant knowledge
gaps about the distribution of species. A major reason for this is the limited
availability of data traditionally used, due to the prohibitive amount of
effort and expertise required for traditional field monitoring. The wide
availability of remote sensing data and the growing adoption of citizen science
tools to collect species observations data at low cost offer an opportunity for
improving biodiversity monitoring and enabling the modelling of complex
ecosystems. We introduce a novel task for mapping bird species to their
habitats by predicting species encounter rates from satellite images, and
present SatBird, a satellite dataset of locations in the USA with labels
derived from presence-absence observation data from the citizen science
database eBird, considering summer (breeding) and winter seasons. We also
provide a dataset in Kenya representing low-data regimes. We additionally
provide environmental data and species range maps for each location. We
benchmark a set of baselines on our dataset, including SOTA models for remote
sensing tasks. SatBird opens up possibilities for scalably modelling properties
of ecosystems worldwide.
Related papers
- Combining Observational Data and Language for Species Range Estimation [63.65684199946094]
We propose a novel approach combining millions of citizen science species observations with textual descriptions from Wikipedia.
Our framework maps locations, species, and text descriptions into a common space, enabling zero-shot range estimation from textual descriptions.
Our approach also acts as a strong prior when combined with observational data, resulting in more accurate range estimation with less data.
arXiv Detail & Related papers (2024-10-14T17:22:55Z) - GeoPlant: Spatial Plant Species Prediction Dataset [4.817737198128259]
Species Distribution Models (SDMs) predict species across space from spatially explicit features.
We have designed and developed a new European-scale dataset for SDMs at high spatial resolution (10-50 m)
The dataset comprises 5M heterogeneous Presence-Only records and 90k exhaustive Presence-Absence survey records.
arXiv Detail & Related papers (2024-08-25T20:09:46Z) - Predicting Species Occurrence Patterns from Partial Observations [21.009271008147785]
We introduce the problem of predicting species occurrence patterns given (a) satellite imagery, and (b) known information on the occurrence of other species.
To evaluate algorithms on this task, we introduce SatButterfly, a dataset of satellite images, environmental data and observational data for butterflies.
We propose a general model, R-Tran, for predicting species occurrence patterns that enables the use of partial observational data wherever found.
arXiv Detail & Related papers (2024-03-26T18:29:39Z) - Spatial Implicit Neural Representations for Global-Scale Species Mapping [72.92028508757281]
Given a set of locations where a species has been observed, the goal is to build a model to predict whether the species is present or absent at any location.
Traditional methods struggle to take advantage of emerging large-scale crowdsourced datasets.
We use Spatial Implicit Neural Representations (SINRs) to jointly estimate the geographical range of 47k species simultaneously.
arXiv Detail & Related papers (2023-06-05T03:36:01Z) - Bird Distribution Modelling using Remote Sensing and Citizen Science
data [31.375576105932442]
Climate change is a major driver of biodiversity loss.
There are significant knowledge gaps about the distribution of species.
We propose an approach leveraging computer vision to improve species distribution modelling.
arXiv Detail & Related papers (2023-05-01T20:27:11Z) - Semantic Segmentation of Vegetation in Remote Sensing Imagery Using Deep
Learning [77.34726150561087]
We propose an approach for creating a multi-modal and large-temporal dataset comprised of publicly available Remote Sensing data.
We use Convolutional Neural Networks (CNN) models that are capable of separating different classes of vegetation.
arXiv Detail & Related papers (2022-09-28T18:51:59Z) - Seeing biodiversity: perspectives in machine learning for wildlife
conservation [49.15793025634011]
We argue that machine learning can meet this analytic challenge to enhance our understanding, monitoring capacity, and conservation of wildlife species.
In essence, by combining new machine learning approaches with ecological domain knowledge, animal ecologists can capitalize on the abundance of data generated by modern sensor technologies.
arXiv Detail & Related papers (2021-10-25T13:40:36Z) - Florida Wildlife Camera Trap Dataset [48.99466876948454]
We introduce a challenging wildlife camera trap classification dataset collected from two different locations in Southwestern Florida.
The dataset consists of 104,495 images featuring visually similar species, varying illumination conditions, skewed class distribution, and including samples of endangered species.
arXiv Detail & Related papers (2021-06-23T18:53:15Z) - The GeoLifeCLEF 2020 Dataset [13.274586385114622]
We present the GeoLifeCLEF 2020 dataset, which consists of 1.9 million species observations paired with high-resolution remote sensing imagery, land cover data, and altitude.
We also discuss the GeoLifeCLEF 2020 competition, which aims to use this dataset to advance the state-of-the-art in location-based species recommendation.
arXiv Detail & Related papers (2020-04-08T18:30:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.