Recurrence-in-Recurrence Networks for Video Deblurring
- URL: http://arxiv.org/abs/2203.06418v1
- Date: Sat, 12 Mar 2022 11:58:13 GMT
- Title: Recurrence-in-Recurrence Networks for Video Deblurring
- Authors: Joonkyu Park, Seungjun Nah, Kyoung Mu Lee
- Abstract summary: State-of-the-art video deblurring methods often adopt recurrent neural networks to model the temporal dependency between the frames.
In this paper, we propose recurrence-in-recurrence network architecture to cope with the limitations of short-ranged memory.
- Score: 58.49075799159015
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: State-of-the-art video deblurring methods often adopt recurrent neural
networks to model the temporal dependency between the frames. While the hidden
states play key role in delivering information to the next frame, abrupt motion
blur tend to weaken the relevance in the neighbor frames. In this paper, we
propose recurrence-in-recurrence network architecture to cope with the
limitations of short-ranged memory. We employ additional recurrent units inside
the RNN cell. First, we employ inner-recurrence module (IRM) to manage the
long-ranged dependency in a sequence. IRM learns to keep track of the cell
memory and provides complementary information to find the deblurred frames.
Second, we adopt an attention-based temporal blending strategy to extract the
necessary part of the information in the local neighborhood. The adpative
temporal blending (ATB) can either attenuate or amplify the features by the
spatial attention. Our extensive experimental results and analysis validate the
effectiveness of IRM and ATB on various RNN architectures.
Related papers
- ISMRNN: An Implicitly Segmented RNN Method with Mamba for Long-Term Time Series Forecasting [6.125620036017928]
Long time series forecasting aims to utilize historical information to forecast future states over extended horizons.
Traditional RNN-based series forecasting methods struggle to effectively address long-term dependencies and gradient issues in long time series problems.
Recently, SegRNN has emerged as a leading RNN-based model tailored for long-term series forecasting.
arXiv Detail & Related papers (2024-07-15T14:50:15Z) - Delayed Memory Unit: Modelling Temporal Dependency Through Delay Gate [16.4160685571157]
Recurrent Neural Networks (RNNs) are widely recognized for their proficiency in modeling temporal dependencies.
This paper proposes a novel Delayed Memory Unit (DMU) for gated RNNs.
The DMU incorporates a delay line structure along with delay gates into vanilla RNN, thereby enhancing temporal interaction and facilitating temporal credit assignment.
arXiv Detail & Related papers (2023-10-23T14:29:48Z) - Message Propagation Through Time: An Algorithm for Sequence Dependency
Retention in Time Series Modeling [14.49997340857179]
This paper proposes the Message Propagation Through Time (MPTT) algorithm for time series modeling.
MPTT incorporates long temporal dependencies while preserving faster training times relative to the stateful solutions.
Experimental results demonstrate that MPTT outperforms seven strategies on four climate datasets.
arXiv Detail & Related papers (2023-09-28T22:38:18Z) - Sliding Window Recurrent Network for Efficient Video Super-Resolution [0.0]
Video super-resolution (VSR) is the task of restoring high-resolution frames from a sequence of low-resolution inputs.
We propose a textitSliding Window based Recurrent Network (SWRN) which can be real-time inference while still achieving superior performance.
Our experiment on REDS dataset shows that the proposed method can be well adapted to mobile devices and produce visually pleasant results.
arXiv Detail & Related papers (2022-08-24T15:23:44Z) - Learning Sequence Representations by Non-local Recurrent Neural Memory [61.65105481899744]
We propose a Non-local Recurrent Neural Memory (NRNM) for supervised sequence representation learning.
Our model is able to capture long-range dependencies and latent high-level features can be distilled by our model.
Our model compares favorably against other state-of-the-art methods specifically designed for each of these sequence applications.
arXiv Detail & Related papers (2022-07-20T07:26:15Z) - Group-based Bi-Directional Recurrent Wavelet Neural Networks for Video
Super-Resolution [4.9136996406481135]
Video super-resolution (VSR) aims to estimate a high-resolution (HR) frame from a low-resolution (LR) frames.
Key challenge for VSR lies in the effective exploitation of spatial correlation in an intra-frame and temporal dependency between consecutive frames.
arXiv Detail & Related papers (2021-06-14T06:36:13Z) - Reconstructive Sequence-Graph Network for Video Summarization [107.0328985865372]
Exploiting the inner-shot and inter-shot dependencies is essential for key-shot based video summarization.
We propose a Reconstructive Sequence-Graph Network (RSGN) to encode the frames and shots as sequence and graph hierarchically.
A reconstructor is developed to reward the summary generator, so that the generator can be optimized in an unsupervised manner.
arXiv Detail & Related papers (2021-05-10T01:47:55Z) - Temporal Memory Relation Network for Workflow Recognition from Surgical
Video [53.20825496640025]
We propose a novel end-to-end temporal memory relation network (TMNet) for relating long-range and multi-scale temporal patterns.
We have extensively validated our approach on two benchmark surgical video datasets.
arXiv Detail & Related papers (2021-03-30T13:20:26Z) - A Prospective Study on Sequence-Driven Temporal Sampling and Ego-Motion
Compensation for Action Recognition in the EPIC-Kitchens Dataset [68.8204255655161]
Action recognition is one of the top-challenging research fields in computer vision.
ego-motion recorded sequences have become of important relevance.
The proposed method aims to cope with it by estimating this ego-motion or camera motion.
arXiv Detail & Related papers (2020-08-26T14:44:45Z) - Co-Saliency Spatio-Temporal Interaction Network for Person
Re-Identification in Videos [85.6430597108455]
We propose a novel Co-Saliency Spatio-Temporal Interaction Network (CSTNet) for person re-identification in videos.
It captures the common salient foreground regions among video frames and explores the spatial-temporal long-range context interdependency from such regions.
Multiple spatialtemporal interaction modules within CSTNet are proposed, which exploit the spatial and temporal long-range context interdependencies on such features and spatial-temporal information correlation.
arXiv Detail & Related papers (2020-04-10T10:23:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.