AQUAIR: A High-Resolution Indoor Environmental Quality Dataset for Smart Aquaculture Monitoring
- URL: http://arxiv.org/abs/2509.24069v1
- Date: Sun, 28 Sep 2025 21:07:10 GMT
- Title: AQUAIR: A High-Resolution Indoor Environmental Quality Dataset for Smart Aquaculture Monitoring
- Authors: Youssef Sabiri, Walid Houmaidi, Ouail El Maadi, Yousra Chtouki,
- Abstract summary: AQUAIR is an open-access public dataset that logs six Indoor Environmental Quality variables.<n>A single Awair HOME monitor sampled every five minutes from 14 October 2024 to 9 January 2025.<n>We describe the sensor placement, ISO-compliant mounting height, calibration checks against reference instruments, and an open-source processing pipeline that normalizes timestamps.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Smart aquaculture systems depend on rich environmental data streams to protect fish welfare, optimize feeding, and reduce energy use. Yet public datasets that describe the air surrounding indoor tanks remain scarce, limiting the development of forecasting and anomaly-detection tools that couple head-space conditions with water-quality dynamics. We therefore introduce AQUAIR, an open-access public dataset that logs six Indoor Environmental Quality (IEQ) variables--air temperature, relative humidity, carbon dioxide, total volatile organic compounds, PM2.5 and PM10--inside a fish aquaculture facility in Amghass, Azrou, Morocco. A single Awair HOME monitor sampled every five minutes from 14 October 2024 to 9 January 2025, producing more than 23,000 time-stamped observations that are fully quality-controlled and publicly archived on Figshare. We describe the sensor placement, ISO-compliant mounting height, calibration checks against reference instruments, and an open-source processing pipeline that normalizes timestamps, interpolates short gaps, and exports analysis-ready tables. Exploratory statistics show stable conditions (median CO2 = 758 ppm; PM2.5 = 12 micrograms/m3) with pronounced feeding-time peaks, offering rich structure for short-horizon forecasting, event detection, and sensor drift studies. AQUAIR thus fills a critical gap in smart aquaculture informatics and provides a reproducible benchmark for data-centric machine learning curricula and environmental sensing research focused on head-space dynamics in recirculating aquaculture systems.
Related papers
- AquaFusionNet: Lightweight VisionSensor Fusion Framework for Real-Time Pathogen Detection and Water Quality Anomaly Prediction on Edge Devices [0.0]
This study introduces AquaFusionNet, a lightweight cross-modal framework that unifies both information sources inside a single edge deployable model.<n>The framework is trained on AquaMicro12K, a new dataset comprising 12,846 annotated 1000 micrographs for drinking water contexts.<n>The system processed 1.84 million frames and consistently detected contamination events with 94.8% mAP@0.5 and 96.3% anomaly prediction accuracy, while operating at 4.8 W on a Jetson Nano.
arXiv Detail & Related papers (2025-12-07T14:03:26Z) - 50 Years of Water Body Monitoring: The Case of Qaraaoun Reservoir, Lebanon [0.0]
The sustainable management of the Qaraaoun Reservoir, the largest surface water body in Lebanon, depends on reliable monitoring of its storage volume.<n>This study introduces a sensor-free approach that integrates open-source satellite imagery, advanced water-extent segmentation, and machine learning.
arXiv Detail & Related papers (2025-10-28T13:23:32Z) - Machine Learning for Proactive Groundwater Management: Early Warning and Resource Allocation [1.372066170415575]
We develop a machine learning pipeline that predicts groundwater level categories using climate data, hydro-meteorological records, and physiographic attributes.<n>Our approach integrates geospatial preprocessing, domain-driven feature engineering, and automated model selection to overcome monitoring limitations.
arXiv Detail & Related papers (2025-06-18T00:41:04Z) - Air Quality Prediction with A Meteorology-Guided Modality-Decoupled Spatio-Temporal Network [47.699409089023696]
Air quality prediction plays a crucial role in public health and environmental protection.<n>Existing works underestimate the critical role atmospheric conditions in air quality prediction.<n> MDSTNet is an encoder framework explicitly that captures atmosphere-pollution dependencies for prediction.<n>ChinaAirNet is the first dataset combining air quality records with multi-pressure-level meteorological observations.
arXiv Detail & Related papers (2025-04-14T09:18:11Z) - SOLAQUA: SINTEF Ocean Large Aquaculture Robotics Dataset [1.2289361708127877]
This paper presents a dataset gathered with an underwater robot in a sea-based aquaculture setting.<n>Data was gathered from sensors such as the Waterlinked A50 DVL, the Nortek Nucleus 1000 DVL, Sonardyne Micro Ranger 2 USBL, Sonoptix Mulitbeam Sonar, mono and stereo cameras, and vehicle sensor data such as power usage, IMU, pressure, temperature, and more.<n>It is expected that both the research community and the aquaculture industry will benefit greatly from the utilization of the proposed SOLAQUA dataset.
arXiv Detail & Related papers (2025-04-02T14:58:16Z) - DiTEC-WDN: A Large-Scale Dataset of Hydraulic Scenarios across Multiple Water Distribution Networks [41.94295877935867]
This dataset comprises 36,000 unique scenarios simulated over either short-term (24 hours) or long-term (1 year) periods.<n>DiTEC-WDN can support a variety of machine-learning tasks, including graph-level, node-level, and link-level regression, as well as time-series forecasting.<n>This contribution, released under a public license, encourages open scientific research in the critical water sector, eliminates the risk of exposing sensitive data, and fulfills the need for a large-scale water distribution network benchmark for study comparisons and scenario analysis.
arXiv Detail & Related papers (2025-03-21T14:14:03Z) - Identifying Trustworthiness Challenges in Deep Learning Models for Continental-Scale Water Quality Prediction [69.38041171537573]
Water quality is foundational to environmental sustainability, ecosystem resilience, and public health.<n>Deep learning offers transformative potential for large-scale water quality prediction and scientific insights generation.<n>Their widespread adoption in high-stakes operational decision-making, such as pollution mitigation and equitable resource allocation, is prevented by unresolved trustworthiness challenges.
arXiv Detail & Related papers (2025-03-13T01:50:50Z) - SEN12-WATER: A New Dataset for Hydrological Applications and its Benchmarking [40.996860106131244]
Climate and increasing droughts pose significant challenges to water resource management around the world.
We present a new dataset, SEN12-WATER, along with a benchmark using a end-to-end Deep Learning framework for proactive drought-related analysis.
arXiv Detail & Related papers (2024-09-25T16:50:59Z) - DeepExtremeCubes: Integrating Earth system spatio-temporal data for impact assessment of climate extremes [5.736700805381591]
Machine learning techniques show promise but require well-structured, high-quality, and curated analysis-ready datasets.
Here, we introduce the DeepExtremes database, tailored to map around heatwave and drought extreme impact.
It comprises over 40,000 spatially sampled small data cubes (i.e. minicubes) globally, with a spatial coverage of 2.5 by 2.5 km.
arXiv Detail & Related papers (2024-06-26T08:53:26Z) - Monitoring water contaminants in coastal areas through ML algorithms
leveraging atmospherically corrected Sentinel-2 data [3.155658695525581]
This study pioneers a novel approach to monitor the Turbidity contaminant, integrating CatBoost Machine Learning (ML) with high-resolution data from Sentinel-2 Level-2A.
Traditional methods are labor-intensive while CatBoost offers an efficient solution, excelling in predictive accuracy.
Leveraging atmospherically corrected Sentinel-2 data through the Google Earth Engine (GEE), our study contributes to scalable and precise Turbidity monitoring.
arXiv Detail & Related papers (2024-01-08T10:20:34Z) - Using Multi-Temporal Sentinel-1 and Sentinel-2 data for water bodies
mapping [40.996860106131244]
Climate change is intensifying extreme weather events, causing both water scarcity and severe rainfall unpredictability.
This paper aims to provide valuable insights for comprehensive water resource monitoring under diverse meteorological conditions.
arXiv Detail & Related papers (2024-01-05T18:11:08Z) - Multimodal Dataset from Harsh Sub-Terranean Environment with Aerosol
Particles for Frontier Exploration [55.41644538483948]
This paper introduces a multimodal dataset from the harsh and unstructured underground environment with aerosol particles.
It contains synchronized raw data measurements from all onboard sensors in Robot Operating System (ROS) format.
The focus of this paper is not only to capture both temporal and spatial data diversities but also to present the impact of harsh conditions on captured data.
arXiv Detail & Related papers (2023-04-27T20:21:18Z) - Federated Learning in the Sky: Aerial-Ground Air Quality Sensing
Framework with UAV Swarms [53.38353133198842]
Air quality significantly affects human health, it is increasingly important to accurately and timely predict the Air Quality Index (AQI)
This paper proposes a new federated learning-based aerial-ground air quality sensing framework for fine-grained 3D air quality monitoring and forecasting.
For ground sensing systems, we propose a Graph Convolutional neural network-based Long Short-Term Memory (GC-LSTM) model to achieve accurate, real-time and future AQI inference.
arXiv Detail & Related papers (2020-07-23T13:32:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.