Marine Video Kit: A New Marine Video Dataset for Content-based Analysis
and Retrieval
- URL: http://arxiv.org/abs/2209.11518v1
- Date: Fri, 23 Sep 2022 10:57:50 GMT
- Title: Marine Video Kit: A New Marine Video Dataset for Content-based Analysis
and Retrieval
- Authors: Quang-Trung Truong and Tuan-Anh Vu and Tan-Sang Ha and Lokoc Jakub and
Yue Him Wong Tim and Ajay Joneja and Sai-Kit Yeung
- Abstract summary: In this paper, we focus on single-shot videos taken from moving cameras in underwater environments.
The first shard of a new Marine Video Kit is presented to serve for video retrieval and other computer vision challenges.
- Score: 10.526705651297146
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Effective analysis of unusual domain specific video collections represents an
important practical problem, where state-of-the-art general purpose models
still face limitations. Hence, it is desirable to design benchmark datasets
that challenge novel powerful models for specific domains with additional
constraints. It is important to remember that domain specific data may be
noisier (e.g., endoscopic or underwater videos) and often require more
experienced users for effective search. In this paper, we focus on single-shot
videos taken from moving cameras in underwater environments which constitute a
nontrivial challenge for research purposes. The first shard of a new Marine
Video Kit dataset is presented to serve for video retrieval and other computer
vision challenges. In addition to basic meta-data statistics, we present
several insights and reference graphs based on low-level features as well as
semantic annotations of selected keyframes. The analysis contains also
experiments showing limitations of respected general purpose models for
retrieval.
Related papers
- VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs [64.60035916955837]
VANE-Bench is a benchmark designed to assess the proficiency of Video-LMMs in detecting anomalies and inconsistencies in videos.
Our dataset comprises an array of videos synthetically generated using existing state-of-the-art text-to-video generation models.
We evaluate nine existing Video-LMMs, both open and closed sources, on this benchmarking task and find that most of the models encounter difficulties in effectively identifying the subtle anomalies.
arXiv Detail & Related papers (2024-06-14T17:59:01Z) - Scaling Up Video Summarization Pretraining with Large Language Models [73.74662411006426]
We introduce an automated and scalable pipeline for generating a large-scale video summarization dataset.
We analyze the limitations of existing approaches and propose a new video summarization model that effectively addresses them.
Our work also presents a new benchmark dataset that contains 1200 long videos each with high-quality summaries annotated by professionals.
arXiv Detail & Related papers (2024-04-04T11:59:06Z) - Appearance-Based Refinement for Object-Centric Motion Segmentation [85.2426540999329]
We introduce an appearance-based refinement method that leverages temporal consistency in video streams to correct inaccurate flow-based proposals.
Our approach involves a sequence-level selection mechanism that identifies accurate flow-predicted masks as exemplars.
Its performance is evaluated on multiple video segmentation benchmarks, including DAVIS, YouTube, SegTrackv2, and FBMS-59.
arXiv Detail & Related papers (2023-12-18T18:59:51Z) - FODVid: Flow-guided Object Discovery in Videos [12.792602427704395]
We focus on building a generalizable solution that avoids overfitting to the individual intricacies.
To solve Video Object (VOS) in an unsupervised setting, we propose a new pipeline (FODVid) based on the idea of guiding segmentation outputs.
arXiv Detail & Related papers (2023-07-10T07:55:42Z) - MarineVRS: Marine Video Retrieval System with Explainability via
Semantic Understanding [11.878077736295863]
MarineVRS is a novel and flexible video retrieval system designed explicitly for the marine domain.
MarineVRS integrates state-of-the-art methods for visual and linguistic object representation to enable efficient and accurate search and analysis of vast volumes of underwater video data.
MarineVRS is a powerful tool for marine researchers and scientists to efficiently and accurately process vast amounts of data and gain deeper insights into the behavior and movements of marine species.
arXiv Detail & Related papers (2023-06-07T16:46:44Z) - A Survey on Deep Learning Technique for Video Segmentation [147.0767454918527]
Video segmentation plays a critical role in a broad range of practical applications.
Deep learning based approaches have been dedicated to video segmentation and delivered compelling performance.
arXiv Detail & Related papers (2021-07-02T15:51:07Z) - Highlight Timestamp Detection Model for Comedy Videos via Multimodal
Sentiment Analysis [1.6181085766811525]
We propose a multimodal structure to obtain state-of-the-art performance in this field.
We select several benchmarks for multimodal video understanding and apply the most suitable model to find the best performance.
arXiv Detail & Related papers (2021-05-28T08:39:19Z) - Coherent Loss: A Generic Framework for Stable Video Segmentation [103.78087255807482]
We investigate how a jittering artifact degrades the visual quality of video segmentation results.
We propose a Coherent Loss with a generic framework to enhance the performance of a neural network against jittering artifacts.
arXiv Detail & Related papers (2020-10-25T10:48:28Z) - Fine-Grained Instance-Level Sketch-Based Video Retrieval [159.12935292432743]
We propose a novel cross-modal retrieval problem of fine-grained instance-level sketch-based video retrieval (FG-SBVR)
Compared with sketch-based still image retrieval, and coarse-grained category-level video retrieval, this is more challenging as both visual appearance and motion need to be simultaneously matched at a fine-grained level.
We show that this model significantly outperforms a number of existing state-of-the-art models designed for video analysis.
arXiv Detail & Related papers (2020-02-21T18:28:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.