A Review on Deep Learning Techniques for Video Prediction
- URL: http://arxiv.org/abs/2004.05214v2
- Date: Wed, 15 Apr 2020 00:24:31 GMT
- Title: A Review on Deep Learning Techniques for Video Prediction
- Authors: Sergiu Oprea, Pablo Martinez-Gonzalez, Alberto Garcia-Garcia, John
Alejandro Castro-Vargas, Sergio Orts-Escolano, Jose Garcia-Rodriguez and
Antonis Argyros
- Abstract summary: The ability to predict, anticipate and reason about future outcomes is a key component of intelligent decision-making systems.
Deep learning-based video prediction emerged as a promising research direction.
- Score: 3.203688549673373
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The ability to predict, anticipate and reason about future outcomes is a key
component of intelligent decision-making systems. In light of the success of
deep learning in computer vision, deep-learning-based video prediction emerged
as a promising research direction. Defined as a self-supervised learning task,
video prediction represents a suitable framework for representation learning,
as it demonstrated potential capabilities for extracting meaningful
representations of the underlying patterns in natural videos. Motivated by the
increasing interest in this task, we provide a review on the deep learning
methods for prediction in video sequences. We firstly define the video
prediction fundamentals, as well as mandatory background concepts and the most
used datasets. Next, we carefully analyze existing video prediction models
organized according to a proposed taxonomy, highlighting their contributions
and their significance in the field. The summary of the datasets and methods is
accompanied with experimental results that facilitate the assessment of the
state of the art on a quantitative basis. The paper is summarized by drawing
some general conclusions, identifying open research challenges and by pointing
out future research directions.
Related papers
- Predicting Long-horizon Futures by Conditioning on Geometry and Time [49.86180975196375]
We explore the task of generating future sensor observations conditioned on the past.
We leverage the large-scale pretraining of image diffusion models which can handle multi-modality.
We create a benchmark for video prediction on a diverse set of videos spanning indoor and outdoor scenes.
arXiv Detail & Related papers (2024-04-17T16:56:31Z) - Deep networks for system identification: a Survey [56.34005280792013]
System identification learns mathematical descriptions of dynamic systems from input-output data.
Main aim of the identified model is to predict new data from previous observations.
We discuss architectures commonly adopted in the literature, like feedforward, convolutional, and recurrent networks.
arXiv Detail & Related papers (2023-01-30T12:38:31Z) - Convolutional Motif Kernel Networks [1.104960878651584]
We show that our model is able to robustly learn on small datasets and reaches state-of-the-art performance on relevant healthcare prediction tasks.
Our proposed method can be utilized on DNA and protein sequences.
arXiv Detail & Related papers (2021-11-03T15:06:09Z) - Deep Learning Schema-based Event Extraction: Literature Review and
Current Trends [60.29289298349322]
Event extraction technology based on deep learning has become a research hotspot.
This paper fills the gap by reviewing the state-of-the-art approaches, focusing on deep learning-based models.
arXiv Detail & Related papers (2021-07-05T16:32:45Z) - Self-Supervision by Prediction for Object Discovery in Videos [62.87145010885044]
In this paper, we use the prediction task as self-supervision and build a novel object-centric model for image sequence representation.
Our framework can be trained without the help of any manual annotation or pretrained network.
Initial experiments confirm that the proposed pipeline is a promising step towards object-centric video prediction.
arXiv Detail & Related papers (2021-03-09T19:14:33Z) - Video Summarization Using Deep Neural Networks: A Survey [72.98424352264904]
Video summarization technologies aim to create a concise and complete synopsis by selecting the most informative parts of the video content.
This work focuses on the recent advances in the area and provides a comprehensive survey of the existing deep-learning-based methods for generic video summarization.
arXiv Detail & Related papers (2021-01-15T11:41:29Z) - Video Prediction via Example Guidance [156.08546987158616]
In video prediction tasks, one major challenge is to capture the multi-modal nature of future contents and dynamics.
In this work, we propose a simple yet effective framework that can efficiently predict plausible future states.
arXiv Detail & Related papers (2020-07-03T14:57:24Z) - Deep Learning for Vision-based Prediction: A Survey [6.840474688871695]
Vision-based prediction algorithms have a wide range of applications including autonomous driving, surveillance, human-robot interaction, weather prediction.
This paper provides an overview of the field in the past five years with a particular focus on deep learning approaches.
arXiv Detail & Related papers (2020-06-30T20:26:46Z) - Academic Performance Estimation with Attention-based Graph Convolutional
Networks [17.985752744098267]
Given a student's past data, the task of student's performance prediction is to predict a student's grades in future courses.
Traditional methods for student's performance prediction usually neglect the underlying relationships between multiple courses.
We propose a novel attention-based graph convolutional networks model for student's performance prediction.
arXiv Detail & Related papers (2019-12-26T23:11:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.