OceanBench: The Sea Surface Height Edition
- URL: http://arxiv.org/abs/2309.15599v1
- Date: Wed, 27 Sep 2023 12:00:40 GMT
- Title: OceanBench: The Sea Surface Height Edition
- Authors: J. Emmanuel Johnson, Quentin Febvre, Anastasia Gorbunova, Sammy
Metref, Maxime Ballarotta, Julien Le Sommer, Ronan Fablet
- Abstract summary: Ocean satellite data presents challenges for information extraction due to their sparsity and irregular sampling, signal complexity, and noise.
Machine learning (ML) techniques have demonstrated their capabilities in dealing with large-scale, complex signals.
OceanBench is a unifying framework that provides standardized processing steps that comply with domain-expert standards.
- Score: 5.307677318971956
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The ocean profoundly influences human activities and plays a critical role in
climate regulation. Our understanding has improved over the last decades with
the advent of satellite remote sensing data, allowing us to capture essential
quantities over the globe, e.g., sea surface height (SSH). However, ocean
satellite data presents challenges for information extraction due to their
sparsity and irregular sampling, signal complexity, and noise. Machine learning
(ML) techniques have demonstrated their capabilities in dealing with
large-scale, complex signals. Therefore we see an opportunity for ML models to
harness the information contained in ocean satellite data. However, data
representation and relevant evaluation metrics can be the defining factors when
determining the success of applied ML. The processing steps from the raw
observation data to a ML-ready state and from model outputs to interpretable
quantities require domain expertise, which can be a significant barrier to
entry for ML researchers. OceanBench is a unifying framework that provides
standardized processing steps that comply with domain-expert standards. It
provides plug-and-play data and pre-configured pipelines for ML researchers to
benchmark their models and a transparent configurable framework for researchers
to customize and extend the pipeline for their tasks. In this work, we
demonstrate the OceanBench framework through a first edition dedicated to SSH
interpolation challenges. We provide datasets and ML-ready benchmarking
pipelines for the long-standing problem of interpolating observations from
simulated ocean satellite data, multi-modal and multi-sensor fusion issues, and
transfer-learning to real ocean satellite observations. The OceanBench
framework is available at github.com/jejjohnson/oceanbench and the dataset
registry is available at github.com/quentinf00/oceanbench-data-registry.
Related papers
- OKG-LLM: Aligning Ocean Knowledge Graph with Observation Data via LLMs for Global Sea Surface Temperature Prediction [70.48962924608033]
This work presents the first systematic effort to construct an Ocean Knowledge Graph (OKG) specifically designed to represent diverse ocean knowledge for SST prediction.<n>We develop a graph embedding network to learn the comprehensive semantic and structural knowledge within the OKG, capturing both the unique characteristics of individual sea regions and the complex correlations between them. Finally, we align the learned knowledge with fine-grained numerical SST data and leverage a pre-trained LLM to model SST patterns for accurate prediction.
arXiv Detail & Related papers (2025-07-31T02:06:03Z) - AIS Data-Driven Maritime Monitoring Based on Transformer: A Comprehensive Review [8.76510054804934]
This paper reviews the research on Transformer-based AIS data-driven maritime monitoring.<n>The focus is on Transformer-based trajectory prediction methods, behavior detection, and prediction techniques.<n>This paper collects and organizes publicly available AIS datasets from the reviewed papers, performing data filtering, cleaning, and statistical analysis.
arXiv Detail & Related papers (2025-05-12T09:17:43Z) - Efficient Self-Supervised Learning for Earth Observation via Dynamic Dataset Curation [67.23953699167274]
Self-supervised learning (SSL) has enabled the development of vision foundation models for Earth Observation (EO)
In EO, this challenge is amplified by the redundancy and heavy-tailed distributions common in satellite imagery.
We propose a dynamic dataset pruning strategy designed to improve SSL pre-training by maximizing dataset diversity and balance.
arXiv Detail & Related papers (2025-04-09T15:13:26Z) - Simulation-informed deep learning for enhanced SWOT observations of fine-scale ocean dynamics [4.575524892161048]
Ocean processes at fine scales are crucial yet difficult to observe accurately due to limitations in satellite and in-situ measurements.
Current methods struggle with noisy data or require extensive supervised training, limiting their effectiveness on real-world observations.
We introduce SIMPGEN, an unsupervised adversarial learning framework combining real SWOT observations with simulated reference data.
arXiv Detail & Related papers (2025-03-27T09:29:33Z) - A Scalable System for Visual Analysis of Ocean Data [3.2195554805277435]
pyParaOcean is a scalable and interactive visualization system designed specifically for ocean data analysis.
pyParaOcean offers specialized modules for common oceanographic analysis tasks, including eddy identification and salinity movement tracking.
We present a case study on the Bay of Bengal to demonstrate the utility of the system and scaling studies to evaluate the efficiency of the system.
arXiv Detail & Related papers (2025-01-09T07:11:51Z) - MID: A Comprehensive Shore-Based Dataset for Multi-Scale Dense Ship Occlusion and Interaction Scenarios [10.748210940033484]
The Maritime Ship Navigation Behavior dataset (MID) is designed to address challenges in ship detection within complex maritime environments.
MID contains 5,673 images with 135,884 finely annotated target instances, supporting both supervised and semi-supervised learning.
MID's images are sourced from high-definition video clips of real-world navigation across 43 water areas, with varied weather and lighting conditions.
arXiv Detail & Related papers (2024-12-08T09:34:23Z) - Underwater Camouflaged Object Tracking Meets Vision-Language SAM2 [60.47622353256502]
We propose the first large-scale multi-modal underwater camouflaged object tracking dataset, namely UW-COT220.
Based on the proposed dataset, this work first evaluates current advanced visual object tracking methods, including SAM- and SAM2-based trackers, in challenging underwater environments.
Our findings highlight the improvements of SAM2 over SAM, demonstrating its enhanced ability to handle the complexities of underwater camouflaged objects.
arXiv Detail & Related papers (2024-09-25T13:10:03Z) - Introducing VaDA: Novel Image Segmentation Model for Maritime Object Segmentation Using New Dataset [3.468621550644668]
The maritime shipping industry is undergoing rapid evolution driven by advancements in computer vision artificial intelligence (AI)
object recognition in maritime environments faces challenges such as light reflection, interference, intense lighting, and various weather conditions.
Existing AI recognition models and datasets have limited suitability for composing autonomous navigation systems.
arXiv Detail & Related papers (2024-07-12T05:48:53Z) - Scale-Translation Equivariant Network for Oceanic Internal Solitary Wave Localization [7.444865250744234]
Internal solitary waves (ISWs) are gravity waves that are often observed in the interior ocean rather than the surface.
Cloud cover in optical remote sensing images variably obscures ground information, leading to blurred or missing surface observations.
This paper aims at altimeter-based machine learning solutions to automatically locate ISWs.
arXiv Detail & Related papers (2024-06-18T21:09:56Z) - SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection [79.23689506129733]
We establish a new benchmark dataset and an open-source method for large-scale SAR object detection.
Our dataset, SARDet-100K, is a result of intense surveying, collecting, and standardizing 10 existing SAR detection datasets.
To the best of our knowledge, SARDet-100K is the first COCO-level large-scale multi-class SAR object detection dataset ever created.
arXiv Detail & Related papers (2024-03-11T09:20:40Z) - LargeST: A Benchmark Dataset for Large-Scale Traffic Forecasting [65.71129509623587]
Road traffic forecasting plays a critical role in smart city initiatives and has experienced significant advancements thanks to the power of deep learning.
However, the promising results achieved on current public datasets may not be applicable to practical scenarios.
We introduce the LargeST benchmark dataset, which includes a total of 8,600 sensors in California with a 5-year time coverage.
arXiv Detail & Related papers (2023-06-14T05:48:36Z) - Multimodal Dataset from Harsh Sub-Terranean Environment with Aerosol
Particles for Frontier Exploration [55.41644538483948]
This paper introduces a multimodal dataset from the harsh and unstructured underground environment with aerosol particles.
It contains synchronized raw data measurements from all onboard sensors in Robot Operating System (ROS) format.
The focus of this paper is not only to capture both temporal and spatial data diversities but also to present the impact of harsh conditions on captured data.
arXiv Detail & Related papers (2023-04-27T20:21:18Z) - TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual
Environments [84.6017003787244]
This work proposes a synthetic data generation pipeline to address the difficulties and domain-gaps present in simulated datasets.
We show that using annotations and visual cues from existing datasets, we can facilitate automated multi-modal data generation.
arXiv Detail & Related papers (2022-08-16T20:46:08Z) - Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision
Datasets from 3D Scans [103.92680099373567]
This paper introduces a pipeline to parametrically sample and render multi-task vision datasets from comprehensive 3D scans from the real world.
Changing the sampling parameters allows one to "steer" the generated datasets to emphasize specific information.
Common architectures trained on a generated starter dataset reached state-of-the-art performance on multiple common vision tasks and benchmarks.
arXiv Detail & Related papers (2021-10-11T04:21:46Z) - Joint calibration and mapping of satellite altimetry data using
trainable variational models [4.506463422216017]
We show how a data-driven variational data assimilation framework could be used to jointly learn a calibration operator and an interpolator from non-calibrated data.
The proposed framework significantly outperforms the operational state-of-the-art mapping pipeline.
arXiv Detail & Related papers (2021-10-07T12:48:27Z) - Bridging observation, theory and numerical simulation of the ocean using
Machine Learning [0.08155575318208629]
The study of the ocean poses a combination of unique challenges that ML can help address.
The observational data available is largely spatially sparse, limited to the surface, and with few time series spanning more than a handful of decades.
This review covers the current scientific insight offered by applying ML and points to where there is imminent potential.
arXiv Detail & Related papers (2021-04-26T12:11:51Z) - RGB-D Salient Object Detection: A Survey [195.83586883670358]
We provide a comprehensive survey of RGB-D based SOD models from various perspectives.
We also review SOD models and popular benchmark datasets from this domain.
We discuss several challenges and open directions of RGB-D based SOD for future research.
arXiv Detail & Related papers (2020-08-01T10:01:32Z) - Filtering Internal Tides From Wide-Swath Altimeter Data Using
Convolutional Neural Networks [9.541153192112194]
We propose the use of convolutional neural networks (ConvNets) to estimate fields free of internal tide signals.
We also investigate the relevance of considering additional data from other sea surface variables such as sea surface temperature (SST)
arXiv Detail & Related papers (2020-05-03T14:02:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.