Related papers: Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results

Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results

URL: http://arxiv.org/abs/2504.02558v1
Date: Thu, 03 Apr 2025 13:14:16 GMT
Title: Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results
Authors: Andrei Dumitriu, Florin Tatui, Florin Miron, Radu Tudor Ionescu, Radu Timofte,
Abstract summary: Rip currents are the leading cause of fatal accidents and injuries on many beaches worldwide.<n>We introduce a comprehensive dataset containing $2,466$ images with newly created polygonal annotations for instance segmentation.<n>We present a novel dataset comprising $17$ drone videos (comprising about $24K$ frames) captured at $30 FPS$, annotated with both polygons for instance segmentation and bounding boxes for object detection.
Score: 60.656120527353096
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Rip currents are the leading cause of fatal accidents and injuries on many beaches worldwide, emphasizing the importance of automatically detecting these hazardous surface water currents. In this paper, we address a novel task: rip current instance segmentation. We introduce a comprehensive dataset containing $2,466$ images with newly created polygonal annotations for instance segmentation, used for training and validation. Additionally, we present a novel dataset comprising $17$ drone videos (comprising about $24K$ frames) captured at $30 FPS$, annotated with both polygons for instance segmentation and bounding boxes for object detection, employed for testing purposes. We train various versions of YOLOv8 for instance segmentation on static images and assess their performance on the test dataset (videos). The best results were achieved by the YOLOv8-nano model (runnable on a portable device), with an mAP50 of $88.94%$ on the validation dataset and $81.21%$ macro average on the test dataset. The results provide a baseline for future research in rip current segmentation. Our work contributes to the existing literature by introducing a detailed, annotated dataset, and training a deep learning model for instance segmentation of rip currents. The code, training details and the annotated dataset are made publicly available at https://github.com/Irikos/rip_currents.

Related papers

RipVIS: Rip Currents Video Instance Segmentation Benchmark for Beach Monitoring and Safety [57.243502132481176]
RipVIS is a large-scale video instance segmentation benchmark designed for rip current segmentation.<n>Our dataset encompasses diverse visual contexts, such as wave-breaking patterns, sediment flows, and water color variations.<n>Results are reported in terms of multiple metrics, with a particular focus on the $F$ score to prioritize recall and reduce false negatives.
arXiv Detail & Related papers (2025-04-01T18:57:15Z)
Robot Instance Segmentation with Few Annotations for Grasping [10.005879464111915]
We propose a novel framework that combines Semi-Supervised Learning (SSL) with Learning Through Interaction (LTI)<n>Our approach exploits partially annotated data through self-supervision and incorporates temporal context using pseudo-sequences generated from unlabeled still images.<n>We validate our method on two common benchmarks, ARMBench mix-object-tote and OCID, where it achieves state-of-the-art performance.
arXiv Detail & Related papers (2024-07-01T13:58:32Z)
Localization Is All You Evaluate: Data Leakage in Online Mapping Datasets and How to Fix It [2.1665407462280446]
State-of-the-art methods are trained predominantly using nuScenes and Argoverse 2 datasets. Over $80$% of nuScenes and $40$% of Argoverse 2 validation and test samples are less than $5$ m from a training sample. We propose geographically disjoint data splits to reveal the true performance in unseen environments.
arXiv Detail & Related papers (2023-12-11T14:43:23Z)
A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture [15.298790238028356]
Instance detection (InsDet) is a long-lasting problem in robotics and computer vision. Current InsDet are too small in scale by today's standards. We introduce a new InsDet dataset and protocol.
arXiv Detail & Related papers (2023-10-30T03:58:41Z)
Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples [61.66967790884943]
Referring video object segmentation (RVOS) relies on sufficient data for a given scene. In more realistic scenarios, only minimal annotations are available for a new scene. We propose a model with a newly designed cross-modal affinity (CMA) module based on a Transformer architecture. CMA module builds multimodal affinity with a few samples, thus quickly learning new semantic information, and enabling the model to adapt to different scenarios.
arXiv Detail & Related papers (2023-09-05T08:34:23Z)
A Benchmark of Long-tailed Instance Segmentation with Noisy Labels [14.977028531774945]
In this paper, we consider the instance segmentation task on a long-tailed dataset, which contains label noise. We propose a new dataset, which is a large vocabulary long-tailed dataset containing label noise for instance segmentation. The results indicate that the noise in the training dataset will hamper the model in learning rare categories and decrease the overall performance.
arXiv Detail & Related papers (2022-11-24T06:34:29Z)
BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations [89.42397034542189]
We synthesize a large labeled dataset via a generative adversarial network (GAN) We take image samples from the class-conditional generative model BigGAN trained on ImageNet, and manually annotate 5 images per class, for all 1k classes. We create a new ImageNet benchmark by labeling an additional set of 8k real images and evaluate segmentation performance in a variety of settings.
arXiv Detail & Related papers (2022-01-12T20:28:34Z)
INSTA-YOLO: Real-Time Instance Segmentation [2.726684740197893]
We propose Insta-YOLO, a novel one-stage end-to-end deep learning model for real-time instance segmentation. The proposed model is inspired by the YOLO one-shot object detector, with the box regression loss is replaced with regression in the localization head. We evaluate our model on three datasets, namely, Carnva, Cityscapes and Airbus.
arXiv Detail & Related papers (2021-02-12T21:17:29Z)
The Devil is in Classification: A Simple Framework for Long-tail Object Detection and Instance Segmentation [93.17367076148348]
We investigate performance drop of the state-of-the-art two-stage instance segmentation model Mask R-CNN on the recent long-tail LVIS dataset. We unveil that a major cause is the inaccurate classification of object proposals. We propose a simple calibration framework to more effectively alleviate classification head bias with a bi-level class balanced sampling approach.
arXiv Detail & Related papers (2020-07-23T12:49:07Z)
UniT: Unified Knowledge Transfer for Any-shot Object Detection and Segmentation [52.487469544343305]
Methods for object detection and segmentation rely on large scale instance-level annotations for training. We propose an intuitive and unified semi-supervised model that is applicable to a range of supervision.
arXiv Detail & Related papers (2020-06-12T22:45:47Z)

This list is automatically generated from the titles and abstracts of the papers in this site.