Utilizing Mask R-CNN for Waterline Detection in Canoe Sprint Video
Analysis
- URL: http://arxiv.org/abs/2004.09573v1
- Date: Mon, 20 Apr 2020 19:00:45 GMT
- Title: Utilizing Mask R-CNN for Waterline Detection in Canoe Sprint Video
Analysis
- Authors: Marie-Sophie von Braun and Patrick Frenzel and Christian K\"ading and
Mirco Fuchs
- Abstract summary: We propose an approach for the automated waterline detection.
We developed a multi-stage approach to estimate a waterline from the outline of the segments.
We conducted a study among several experts to estimate the ground truth waterlines.
- Score: 5.735035463793008
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Determining a waterline in images recorded in canoe sprint training is an
important component for the kinematic parameter analysis to assess an athlete's
performance. Here, we propose an approach for the automated waterline
detection. First, we utilized a pre-trained Mask R-CNN by means of transfer
learning for canoe segmentation. Second, we developed a multi-stage approach to
estimate a waterline from the outline of the segments. It consists of two
linear regression stages and the systematic selection of canoe parts. We then
introduced a parameterization of the waterline as a basis for further
evaluations. Next, we conducted a study among several experts to estimate the
ground truth waterlines. This not only included an average waterline drawn from
the individual experts annotations but, more importantly, a measure for the
uncertainty between individual results. Finally, we assessed our method with
respect to the question whether the predicted waterlines are in accordance with
the experts annotations. Our method demonstrated a high performance and
provides opportunities for new applications in the field of automated video
analysis in canoe sprint.
Related papers
- Task-customized Masked AutoEncoder via Mixture of Cluster-conditional
Experts [104.9871176044644]
Masked Autoencoder(MAE) is a prevailing self-supervised learning method that achieves promising results in model pre-training.
We propose a novel MAE-based pre-training paradigm, Mixture of Cluster-conditional Experts (MoCE)
MoCE trains each expert only with semantically relevant images by using cluster-conditional gates.
arXiv Detail & Related papers (2024-02-08T03:46:32Z) - Spanning Training Progress: Temporal Dual-Depth Scoring (TDDS) for Enhanced Dataset Pruning [50.809769498312434]
We propose a novel dataset pruning method termed as Temporal Dual-Depth Scoring (TDDS)
Our method achieves 54.51% accuracy with only 10% training data, surpassing random selection by 7.83% and other comparison methods by at least 12.69%.
arXiv Detail & Related papers (2023-11-22T03:45:30Z) - Cross-domain Sound Recognition for Efficient Underwater Data Analysis [4.373836150479923]
This paper presents a novel deep learning approach for analyzing massive underwater acoustic data by leveraging a model trained on a broad spectrum of non-underwater (aerial) sounds.
We use PCA and UMAP visualization to cluster the data in a two dimensional space and listen to points within these clusters to understand their defining characteristics.
In the second part, we train a neural network model using both the selected underwater data and the non-underwater dataset.
arXiv Detail & Related papers (2023-09-07T02:26:32Z) - DeepAqua: Self-Supervised Semantic Segmentation of Wetland Surface Water
Extent with SAR Images using Knowledge Distillation [44.99833362998488]
We present DeepAqua, a self-supervised deep learning model that eliminates the need for manual annotations during the training phase.
We exploit cases where optical- and radar-based water masks coincide, enabling the detection of both open and vegetated water surfaces.
Experimental results show that DeepAqua outperforms other unsupervised methods by improving accuracy by 7%, Intersection Over Union by 27%, and F1 score by 14%.
arXiv Detail & Related papers (2023-05-02T18:06:21Z) - On the Viability of Monocular Depth Pre-training for Semantic Segmentation [48.29060171161375]
We study whether pre-training on geometric tasks is viable for downstream transfer to semantic tasks.
We find that monocular depth is a viable form of pre-training for semantic segmentation, validated by improvements over common baselines.
arXiv Detail & Related papers (2022-03-26T04:27:28Z) - Benchmarking Safe Deep Reinforcement Learning in Aquatic Navigation [78.17108227614928]
We propose a benchmark environment for Safe Reinforcement Learning focusing on aquatic navigation.
We consider a value-based and policy-gradient Deep Reinforcement Learning (DRL)
We also propose a verification strategy that checks the behavior of the trained models over a set of desired properties.
arXiv Detail & Related papers (2021-12-16T16:53:56Z) - X-Distill: Improving Self-Supervised Monocular Depth via Cross-Task
Distillation [69.9604394044652]
We propose a novel method to improve the self-supervised training of monocular depth via cross-task knowledge distillation.
During training, we utilize a pretrained semantic segmentation teacher network and transfer its semantic knowledge to the depth network.
We extensively evaluate the efficacy of our proposed approach on the KITTI benchmark and compare it with the latest state of the art.
arXiv Detail & Related papers (2021-10-24T19:47:14Z) - HED-UNet: Combined Segmentation and Edge Detection for Monitoring the
Antarctic Coastline [14.235722825330493]
We devise a new model to unite two approaches to coastline detection in a deep learning model.
Training is made efficient by employing deep supervision on side predictions at multiple resolutions.
An implementation of this approach is available at urlhttps://github.com/kr/HED-UNet.
arXiv Detail & Related papers (2021-03-02T16:35:05Z) - Application of Deep Learning-based Interpolation Methods to Nearshore
Bathymetry [0.82354995224692]
We present several deep learning-based techniques to estimate nearshore bathymetry with sparse, multi-scale measurements.
We propose a Deep Neural Network (DNN) to compute posterior estimates of the nearshore bathymetry, as well as a conditional Generative Adversarial Network (cGAN) that samples from the posterior distribution.
arXiv Detail & Related papers (2020-11-19T08:22:00Z) - Deep Semi-supervised Knowledge Distillation for Overlapping Cervical
Cell Instance Segmentation [54.49894381464853]
We propose to leverage both labeled and unlabeled data for instance segmentation with improved accuracy by knowledge distillation.
We propose a novel Mask-guided Mean Teacher framework with Perturbation-sensitive Sample Mining.
Experiments show that the proposed method improves the performance significantly compared with the supervised method learned from labeled data only.
arXiv Detail & Related papers (2020-07-21T13:27:09Z) - Semantic Segmentation of Underwater Imagery: Dataset and Benchmark [13.456412091502527]
We present the first large-scale dataset for semantic analysis of Underwater IMagery (SUIM)
It contains over 1500 images with pixel annotations for eight object categories: fish (vertebrates), reefs (invertebrates), aquatic plants, wrecks/ruins, human divers, robots, and sea-floor.
We also present a benchmark evaluation of state-of-the-art semantic segmentation approaches based on standard performance metrics.
arXiv Detail & Related papers (2020-04-02T19:53:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.