Predicting the Future from First Person (Egocentric) Vision: A Survey
- URL: http://arxiv.org/abs/2107.13411v1
- Date: Wed, 28 Jul 2021 14:58:13 GMT
- Title: Predicting the Future from First Person (Egocentric) Vision: A Survey
- Authors: Ivan Rodin, Antonino Furnari, Dimitrios Mavroedis, Giovanni Maria
Farinella
- Abstract summary: This survey summarises the evolution of studies in the context of future prediction from egocentric vision.
It makes an overview of applications, devices, existing problems, commonly used datasets, models and input modalities.
Our analysis highlights that methods for future prediction from egocentric vision can have a significant impact in a range of applications.
- Score: 18.07516837332113
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Egocentric videos can bring a lot of information about how humans perceive
the world and interact with the environment, which can be beneficial for the
analysis of human behaviour. The research in egocentric video analysis is
developing rapidly thanks to the increasing availability of wearable devices
and the opportunities offered by new large-scale egocentric datasets. As
computer vision techniques continue to develop at an increasing pace, the tasks
related to the prediction of future are starting to evolve from the need of
understanding the present. Predicting future human activities, trajectories and
interactions with objects is crucial in applications such as human-robot
interaction, assistive wearable technologies for both industrial and daily
living scenarios, entertainment and virtual or augmented reality. This survey
summarises the evolution of studies in the context of future prediction from
egocentric vision making an overview of applications, devices, existing
problems, commonly used datasets, models and input modalities. Our analysis
highlights that methods for future prediction from egocentric vision can have a
significant impact in a range of applications and that further research efforts
should be devoted to the standardisation of tasks and the proposal of datasets
considering real-world scenarios such as the ones with an industrial vocation.
Related papers
- Human Action Anticipation: A Survey [86.415721659234]
The literature on behavior prediction spans various tasks, including action anticipation, activity forecasting, intent prediction, goal prediction, and so on.
Our survey aims to tie together this fragmented literature, covering recent technical innovations as well as the development of new large-scale datasets for model training and evaluation.
arXiv Detail & Related papers (2024-10-17T21:37:40Z) - Data Augmentation in Human-Centric Vision [54.97327269866757]
This survey presents a comprehensive analysis of data augmentation techniques in human-centric vision tasks.
It delves into a wide range of research areas including person ReID, human parsing, human pose estimation, and pedestrian detection.
Our work categorizes data augmentation methods into two main types: data generation and data perturbation.
arXiv Detail & Related papers (2024-03-13T16:05:18Z) - An Outlook into the Future of Egocentric Vision [35.98763217828443]
This survey focuses on software models for egocentric vision, independent of any specific hardware.
The paper concludes with recommendations for areas of immediate explorations so as to unlock our path to the future always-on, personalised and life-enhancing egocentric vision.
arXiv Detail & Related papers (2023-08-14T13:10:48Z) - World Models and Predictive Coding for Cognitive and Developmental
Robotics: Frontiers and Challenges [51.92834011423463]
We focus on the two concepts of world models and predictive coding.
In neuroscience, predictive coding proposes that the brain continuously predicts its inputs and adapts to model its own dynamics and control behavior in its environment.
arXiv Detail & Related papers (2023-01-14T06:38:14Z) - Learn to Predict How Humans Manipulate Large-sized Objects from
Interactive Motions [82.90906153293585]
We propose a graph neural network, HO-GCN, to fuse motion data and dynamic descriptors for the prediction task.
We show the proposed network that consumes dynamic descriptors can achieve state-of-the-art prediction results and help the network better generalize to unseen objects.
arXiv Detail & Related papers (2022-06-25T09:55:39Z) - GIMO: Gaze-Informed Human Motion Prediction in Context [75.52839760700833]
We propose a large-scale human motion dataset that delivers high-quality body pose sequences, scene scans, and ego-centric views with eye gaze.
Our data collection is not tied to specific scenes, which further boosts the motion dynamics observed from our subjects.
To realize the full potential of gaze, we propose a novel network architecture that enables bidirectional communication between the gaze and motion branches.
arXiv Detail & Related papers (2022-04-20T13:17:39Z) - A-ACT: Action Anticipation through Cycle Transformations [89.83027919085289]
We take a step back to analyze how the human capability to anticipate the future can be transferred to machine learning algorithms.
A recent study on human psychology explains that, in anticipating an occurrence, the human brain counts on both systems.
In this work, we study the impact of each system for the task of action anticipation and introduce a paradigm to integrate them in a learning framework.
arXiv Detail & Related papers (2022-04-02T21:50:45Z) - 3D Human Motion Prediction: A Survey [23.605334184939164]
3D human motion prediction, predicting future poses from a given sequence, is an issue of great significance and challenge in computer vision and machine intelligence.
A comprehensive survey on 3D human motion prediction is conducted for the purpose of retrospecting and analyzing relevant works from existing released literature.
arXiv Detail & Related papers (2022-03-03T09:46:43Z) - Deep Learning for Human Mobility: a Survey on Data and Models [5.9623431392389]
The study of human mobility is crucial due to its impact on several aspects of our society.
The proliferation of digital mobility data, such as phone records, GPS traces, and social media posts, triggered the application of deep learning to human mobility.
Our survey is a guide to the leading deep learning solutions to next-location prediction, crowd flow prediction, and trajectory generation.
arXiv Detail & Related papers (2020-12-04T19:59:39Z) - From Recognition to Prediction: Analysis of Human Action and Trajectory
Prediction in Video [4.163207534602723]
Deciphering human behaviors to predict their future paths/trajectories is important.
Human trajectory prediction still remains a challenging task.
System must be able to detect and analyze human activities as well as scene semantics.
arXiv Detail & Related papers (2020-11-20T22:23:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.