Video Segmentation Learning Using Cascade Residual Convolutional Neural
Network
- URL: http://arxiv.org/abs/2212.10570v1
- Date: Tue, 20 Dec 2022 16:56:54 GMT
- Title: Video Segmentation Learning Using Cascade Residual Convolutional Neural
Network
- Authors: Daniel F. S. Santos, Rafael G. Pires, Danilo Colombo, Jo\~ao P. Papa
- Abstract summary: We propose a novel deep learning video segmentation approach that incorporates residual information into the foreground detection learning process.
Experiments conducted on the Change Detection 2014 and on the private dataset PetrobrasROUTES from Petrobras support the effectiveness of the proposed approach.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Video segmentation consists of a frame-by-frame selection process of
meaningful areas related to foreground moving objects. Some applications
include traffic monitoring, human tracking, action recognition, efficient video
surveillance, and anomaly detection. In these applications, it is not rare to
face challenges such as abrupt changes in weather conditions, illumination
issues, shadows, subtle dynamic background motions, and also camouflage
effects. In this work, we address such shortcomings by proposing a novel deep
learning video segmentation approach that incorporates residual information
into the foreground detection learning process. The main goal is to provide a
method capable of generating an accurate foreground detection given a grayscale
video. Experiments conducted on the Change Detection 2014 and on the private
dataset PetrobrasROUTES from Petrobras support the effectiveness of the
proposed approach concerning some state-of-the-art video segmentation
techniques, with overall F-measures of $\mathbf{0.9535}$ and $\mathbf{0.9636}$
in the Change Detection 2014 and PetrobrasROUTES datasets, respectively. Such a
result places the proposed technique amongst the top 3 state-of-the-art video
segmentation methods, besides comprising approximately seven times less
parameters than its top one counterpart.
Related papers
- Scene Change Detection Using Multiscale Cascade Residual Convolutional
Neural Networks [0.0]
Scene change detection is an image processing problem related to partitioning pixels of a digital image into foreground and background regions.
In this work, we propose a novel Multiscale Residual Processing Module, with a Convolutional Neural Network that integrates a Residual Processing Module.
Experiments conducted on two different datasets support the overall effectiveness of the proposed approach, achieving an average overall effectiveness of $boldsymbol0.9622$ and $boldsymbol0.9664$ over Change Detection 2014 and PetrobrasROUTES datasets respectively.
arXiv Detail & Related papers (2022-12-20T16:48:51Z) - Part-level Action Parsing via a Pose-guided Coarse-to-Fine Framework [108.70949305791201]
Part-level Action Parsing (PAP) aims to not only predict the video-level action but also recognize the frame-level fine-grained actions or interactions of body parts for each person in the video.
In particular, our framework first predicts the video-level class of the input video, then localizes the body parts and predicts the part-level action.
Our framework achieves state-of-the-art performance and outperforms existing methods over a 31.10% ROC score.
arXiv Detail & Related papers (2022-03-09T01:30:57Z) - Recent Trends in 2D Object Detection and Applications in Video Event
Recognition [0.76146285961466]
We discuss the pioneering works in object detection, followed by the recent breakthroughs that employ deep learning.
We highlight recent datasets for 2D object detection both in images and videos, and present a comparative performance summary of various state-of-the-art object detection techniques.
arXiv Detail & Related papers (2022-02-07T14:15:11Z) - Video Salient Object Detection via Contrastive Features and Attention
Modules [106.33219760012048]
We propose a network with attention modules to learn contrastive features for video salient object detection.
A co-attention formulation is utilized to combine the low-level and high-level features.
We show that the proposed method requires less computation, and performs favorably against the state-of-the-art approaches.
arXiv Detail & Related papers (2021-11-03T17:40:32Z) - EAN: Event Adaptive Network for Enhanced Action Recognition [66.81780707955852]
We propose a unified action recognition framework to investigate the dynamic nature of video content.
First, when extracting local cues, we generate the spatial-temporal kernels of dynamic-scale to adaptively fit the diverse events.
Second, to accurately aggregate these cues into a global video representation, we propose to mine the interactions only among a few selected foreground objects by a Transformer.
arXiv Detail & Related papers (2021-07-22T15:57:18Z) - CDN-MEDAL: Two-stage Density and Difference Approximation Framework for
Motion Analysis [3.337126420148156]
We propose a novel, two-stage method of change detection with two convolutional neural networks.
Our two-stage framework contains approximately 3.5K parameters in total but still maintains rapid convergence to intricate motion patterns.
arXiv Detail & Related papers (2021-06-07T16:39:42Z) - Few-Shot Learning for Video Object Detection in a Transfer-Learning
Scheme [70.45901040613015]
We study the new problem of few-shot learning for video object detection.
We employ a transfer-learning framework to effectively train the video object detector on a large number of base-class objects and a few video clips of novel-class objects.
arXiv Detail & Related papers (2021-03-26T20:37:55Z) - Learning to Segment Rigid Motions from Two Frames [72.14906744113125]
We propose a modular network, motivated by a geometric analysis of what independent object motions can be recovered from an egomotion field.
It takes two consecutive frames as input and predicts segmentation masks for the background and multiple rigidly moving objects, which are then parameterized by 3D rigid transformations.
Our method achieves state-of-the-art performance for rigid motion segmentation on KITTI and Sintel.
arXiv Detail & Related papers (2021-01-11T04:20:30Z) - Robust Unsupervised Video Anomaly Detection by Multi-Path Frame
Prediction [61.17654438176999]
We propose a novel and robust unsupervised video anomaly detection method by frame prediction with proper design.
Our proposed method obtains the frame-level AUROC score of 88.3% on the CUHK Avenue dataset.
arXiv Detail & Related papers (2020-11-05T11:34:12Z) - Detecting Forged Facial Videos using convolutional neural network [0.0]
We propose to use smaller (fewer parameters to learn) convolutional neural networks (CNN) for a data-driven approach to forged video detection.
To validate our approach, we investigate the FaceForensics public dataset detailing both frame-based and video-based results.
arXiv Detail & Related papers (2020-05-17T19:04:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.