An Ecologically-Informed Deep Learning Framework for Interpretable and Validatable Habitat Mapping
- URL: http://arxiv.org/abs/2511.17627v1
- Date: Tue, 18 Nov 2025 23:38:29 GMT
- Title: An Ecologically-Informed Deep Learning Framework for Interpretable and Validatable Habitat Mapping
- Authors: Iván Felipe Benavides-Martínez, Cristiam Victoriano Portilla-Cabrera, Katherine E. Mills, Claire Enterline, José Garcés-Vargas, Andrew J. Allyn, Auroop R Ganguly
- Abstract summary: ECOSAIC is an Artificial Intelligence framework for automatic classification of benthic habitats. ECOSAIC compresses n-dimensional feature space by optimizing specialization and orthogonality between domain-informed features. We applied the model to the Colombian Pacific Ocean and the results revealed 16 benthic habitats, ranging from mangroves to deep rocky areas up to 1000 m depth.
- Score: 1.4672361353012924
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Benthic habitat mapping is challenging due to the environmental complexity of the seafloor, technological limitations, and elevated operational costs, especially in under-explored regions. This generates knowledge gaps for the sustainable management of hydrobiological resources and their nexus with society. We developed ECOSAIC (Ecological Compression via Orthogonal Specialized Autoencoders for Interpretable Classification), an Artificial Intelligence framework for automatic classification of benthic habitats through interpretable latent representations using a customizable autoencoder. ECOSAIC compresses n-dimensional feature space by optimizing specialization and orthogonality between domain-informed features. We employed two domain-informed categories, biogeochemical and hydrogeomorphological, that together integrate biological, physicochemical, hydrological and geomorphological features, whose constraints on habitats have been recognized in ecology for a century. We applied the model to the Colombian Pacific Ocean and the results revealed 16 benthic habitats, ranging from mangroves to deep rocky areas up to 1000 m depth. The candidate habitats exhibited a strong correspondence between their environmental constraints, represented in latent space, and their expected species composition. This correspondence reflected meaningful ecological associations rather than purely statistical correlations: the habitat's environmental offerings align semantically with the species' requirements. This approach could improve the management and conservation of benthic habitats, facilitating the development of functional maps that support marine planning, biodiversity conservation and fish stock assessment. We also hope it provides new insights into how ecological principles can inform AI frameworks, particularly given the substantial data limitations that characterize ecological research.
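The abstract mentions optimizing "specialization and orthogonality" between the two domain-informed feature groups but gives no formulas. The sketch below is one possible reading, not the authors' implementation: orthogonality is taken to mean low cross-correlation between the two latent blocks, and specialization is taken to mean that each latent unit draws its encoder weights mainly from its own feature group. The function names, the `group_mask` argument, and both penalty definitions are assumptions made for illustration.

```python
import numpy as np


def orthogonality_penalty(z_a, z_b):
    """Mean squared cross-correlation between two latent blocks.

    z_a, z_b: arrays of shape (n_samples, d_a) and (n_samples, d_b),
    e.g. a hypothetical biogeochemical block and a hypothetical
    hydrogeomorphological block. Returns 0 when the blocks are
    linearly uncorrelated. (Assumed definition, not from the paper.)
    """
    # Standardize every latent dimension across the batch.
    za = (z_a - z_a.mean(axis=0)) / (z_a.std(axis=0) + 1e-8)
    zb = (z_b - z_b.mean(axis=0)) / (z_b.std(axis=0) + 1e-8)
    # (d_a, d_b) matrix of Pearson correlations between the blocks.
    cross = za.T @ zb / z_a.shape[0]
    return float(np.mean(cross ** 2))


def specialization_penalty(W, group_mask):
    """Fraction of squared encoder weight that crosses feature groups.

    W: encoder weights of shape (n_features, n_latent).
    group_mask: boolean array of the same shape, True where the input
    feature belongs to the group assigned to that latent unit.
    Returns 0 when every latent unit uses only its own group.
    (Assumed definition, not from the paper.)
    """
    off_group = np.sum((W * ~group_mask) ** 2)
    total = np.sum(W ** 2) + 1e-12
    return float(off_group / total)
```

Under these assumptions, both terms would be added with tunable weights to the usual autoencoder reconstruction loss, so that the compressed representation stays attributable to one feature group per latent block.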
Related papers
- MATANet: A Multi-context Attention and Taxonomy-Aware Network for Fine-Grained Underwater Recognition of Marine Species [6.870403086472032]
MATANet is a novel model designed for fine-grained marine species classification. It mimics expert strategies by using taxonomy and environmental context to interpret ambiguous features of underwater animals. Experiments on the FathomNet2025, FAIR1M, and LifeCLEF2015-Fish datasets demonstrate state-of-the-art performance.
arXiv Detail & Related papers (2026-01-07T09:21:45Z)
- Towards AI-Guided Open-World Ecological Taxonomic Classification [25.577016053193862]
TaxoNet is an embedding-based encoder with a dual-marginization loss that strengthens learning signals from rare underrepresented taxa. Our findings show that general-purpose multimodal foundation models remain constrained in plant-domain applications.
arXiv Detail & Related papers (2025-12-22T03:20:05Z)
- Topography, climate, land cover, and biodiversity: Explaining endemic richness and management implications on a Mediterranean island [0.0]
Island endemism is shaped by complex interactions among environmental, ecological, and evolutionary factors. We investigated the drivers of endemic plant richness across Crete, a Mediterranean biodiversity hotspot. We found total species richness, elevation range, and climatic variability were the strongest predictors of endemic richness.
arXiv Detail & Related papers (2025-11-05T07:09:18Z)
- UWBench: A Comprehensive Vision-Language Benchmark for Underwater Understanding [54.16709436340606]
Large vision-language models (VLMs) have achieved remarkable success in natural scene understanding. Underwater imagery presents unique challenges including severe light attenuation, color distortion, and suspended particle scattering. We introduce UWBench, a benchmark specifically designed for underwater vision-language understanding.
arXiv Detail & Related papers (2025-10-21T03:32:15Z)
- Continental-scale habitat distribution modelling with multimodal earth observation foundation models [0.0]
Habitats integrate the abiotic conditions, vegetation composition and structure that support biodiversity and sustain nature's contributions to people. Current habitat maps often fall short in thematic or spatial resolution. We evaluated how high-resolution remote sensing (RS) data and Artificial Intelligence (AI) tools can improve habitat mapping.
arXiv Detail & Related papers (2025-07-13T18:11:26Z)
- BioAnalyst: A Foundation Model for Biodiversity [0.565395466029518]
We introduce BioAnalyst, the first Foundation Model tailored for biodiversity analysis and conservation planning. BioAnalyst employs a transformer-based architecture, pretrained on extensive multi-modal datasets. We evaluate the model's performance on two downstream use cases, demonstrating its generalisability compared to existing methods.
arXiv Detail & Related papers (2025-07-11T23:56:08Z)
- Multi-scale species richness estimation with deep learning [0.0]
We combine sampling theory and deep learning to predict local species richness within arbitrarily large sampling areas. We show how our deep SAR model can provide fundamental insights on the multi-scale effects of key biodiversity processes.
arXiv Detail & Related papers (2025-07-08T19:42:33Z)
- Image-Based Relocalization and Alignment for Long-Term Monitoring of Dynamic Underwater Environments [57.59857784298534]
We propose an integrated pipeline that combines Visual Place Recognition (VPR), feature matching, and image segmentation on video-derived images. This method enables robust identification of revisited areas, estimation of rigid transformations, and downstream analysis of ecosystem changes.
arXiv Detail & Related papers (2025-03-06T05:13:19Z)
- Mining for Species, Locations, Habitats, and Ecosystems from Scientific Papers in Invasion Biology: A Large-Scale Exploratory Study with Large Language Models [6.364723262453785]
This paper harnesses the capabilities of large language models (LLMs) to mine key ecological entities from invasion biology literature. Specifically, we focus on extracting species names, their locations, associated habitats, and ecosystems, information that is critical for understanding species spread. This study lays the groundwork for more advanced, automated knowledge extraction tools that can aid researchers and practitioners in understanding and managing biological invasions.
arXiv Detail & Related papers (2025-01-30T11:55:44Z)
- Combining Observational Data and Language for Species Range Estimation [63.65684199946094]
We propose a novel approach combining millions of citizen science species observations with textual descriptions from Wikipedia. Our framework maps locations, species, and text descriptions into a common space, enabling zero-shot range estimation from textual descriptions. Our approach also acts as a strong prior when combined with observational data, resulting in more accurate range estimation with less data.
arXiv Detail & Related papers (2024-10-14T17:22:55Z)
- FREE: The Foundational Semantic Recognition for Modeling Environmental Ecosystems [56.0640340392818]
We introduce a framework, FREE, that enables the use of varying features and available information to train a universal model. The core idea is to map available environmental data into a text space and then convert the traditional predictive modeling task in environmental science to a semantic recognition problem. Our evaluation on two societally important real-world applications, stream water temperature prediction and crop yield prediction, demonstrates the superiority of FREE over multiple baselines.
arXiv Detail & Related papers (2023-11-17T00:53:09Z)
- SatBird: Bird Species Distribution Modeling with Remote Sensing and Citizen Science Data [68.2366021016172]
We present SatBird, a satellite dataset of locations in the USA with labels derived from presence-absence observation data from the citizen science database eBird.
We also provide a dataset in Kenya representing low-data regimes.
We benchmark a set of baselines on our dataset, including SOTA models for remote sensing tasks.
arXiv Detail & Related papers (2023-11-02T02:00:27Z)
- Seeing biodiversity: perspectives in machine learning for wildlife conservation [49.15793025634011]
We argue that machine learning can meet this analytic challenge to enhance our understanding, monitoring capacity, and conservation of wildlife species.
In essence, by combining new machine learning approaches with ecological domain knowledge, animal ecologists can capitalize on the abundance of data generated by modern sensor technologies.
arXiv Detail & Related papers (2021-10-25T13:40:36Z)
This list is automatically generated from the titles and abstracts of the papers on this site.