SeekNet: Improved Human Instance Segmentation via Reinforcement Learning
Based Optimized Robot Relocation
- URL: http://arxiv.org/abs/2011.08682v1
- Date: Tue, 17 Nov 2020 15:03:30 GMT
- Title: SeekNet: Improved Human Instance Segmentation via Reinforcement Learning
Based Optimized Robot Relocation
- Authors: Venkatraman Narayanan and Bala Murali Manoghar and Rama Prashanth RV
and Aniket Bera
- Abstract summary: Amodal recognition is the ability of the system to detect occluded objects.
We propose SeekNet, an improved optimization method for amodal recognition through embodied visual recognition.
We also implement SeekNet for social robots, where there are multiple interactions with crowded humans.
- Score: 17.4240390944016
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Amodal recognition is the ability of the system to detect occluded objects.
Most state-of-the-art Visual Recognition systems lack the ability to perform
amodal recognition. Few studies have achieved amodal recognition through
passive prediction or embodied recognition approaches. However, these
approaches suffer from challenges in real-world applications, such as dynamic
objects. We propose SeekNet, an improved optimization method for amodal
recognition through embodied visual recognition. Additionally, we implement
SeekNet for social robots, where there are multiple interactions with crowded
humans. Hence, we focus on occluded human detection & tracking and showcase the
superiority of our algorithm over other baselines. We also experiment with
SeekNet to improve the confidence of COVID-19 symptoms pre-screening algorithms
using our efficient embodied recognition system.
Related papers
- Know Thy Neighbors: A Graph Based Approach for Effective Sensor-Based
Human Activity Recognition in Smart Homes [0.0]
We propose a novel graph-guided neural network approach for Human Activity Recognition (HAR) in smart homes.
We accomplish this by learning a more expressive graph structure representing the sensor network in a smart home.
Our approach maps discrete input sensor measurements to a feature space through the application of attention mechanisms.
arXiv Detail & Related papers (2023-11-16T02:43:13Z) - Human Activity Recognition Using Cascaded Dual Attention CNN and
Bi-Directional GRU Framework [3.3721926640077795]
Vision-based human activity recognition has emerged as one of the essential research areas in video analytics domain.
This paper presents a computationally efficient yet generic spatial-temporal cascaded framework that exploits the deep discriminative spatial and temporal features for human activity recognition.
The proposed framework attains an improvement in execution time up to 167 times in terms of frames per second as compared to most of the contemporary action recognition methods.
arXiv Detail & Related papers (2022-08-09T20:34:42Z) - Multimodal Emotion Recognition using Transfer Learning from Speaker
Recognition and BERT-based models [53.31917090073727]
We propose a neural network-based emotion recognition framework that uses a late fusion of transfer-learned and fine-tuned models from speech and text modalities.
We evaluate the effectiveness of our proposed multimodal approach on the interactive emotional dyadic motion capture dataset.
arXiv Detail & Related papers (2022-02-16T00:23:42Z) - Incremental Learning Techniques for Online Human Activity Recognition [0.0]
We propose a human activity recognition (HAR) approach for the online prediction of physical movements.
We develop a HAR system containing monitoring software and a mobile application that collects accelerometer and gyroscope data.
Six incremental learning algorithms are employed and evaluated in this work and compared with several batch learning algorithms commonly used for developing offline HAR systems.
arXiv Detail & Related papers (2021-09-20T11:33:09Z) - Identity-Aware Attribute Recognition via Real-Time Distributed Inference
in Mobile Edge Clouds [53.07042574352251]
We design novel models for pedestrian attribute recognition with re-ID in an MEC-enabled camera monitoring system.
We propose a novel inference framework with a set of distributed modules, by jointly considering the attribute recognition and person re-ID.
We then devise a learning-based algorithm for the distributions of the modules of the proposed distributed inference framework.
arXiv Detail & Related papers (2020-08-12T12:03:27Z) - AR-Net: Adaptive Frame Resolution for Efficient Action Recognition [70.62587948892633]
Action recognition is an open and challenging problem in computer vision.
We propose a novel approach, called AR-Net, that selects on-the-fly the optimal resolution for each frame conditioned on the input for efficient action recognition.
arXiv Detail & Related papers (2020-07-31T01:36:04Z) - AutoOD: Automated Outlier Detection via Curiosity-guided Search and
Self-imitation Learning [72.99415402575886]
Outlier detection is an important data mining task with numerous practical applications.
We propose AutoOD, an automated outlier detection framework, which aims to search for an optimal neural network model.
Experimental results on various real-world benchmark datasets demonstrate that the deep model identified by AutoOD achieves the best performance.
arXiv Detail & Related papers (2020-06-19T18:57:51Z) - Entropy Decision Fusion for Smartphone Sensor based Human Activity
Recognition [0.0]
We present an approach for fusing convolutional neural network, recurrent convolutional network, and support vector machine by computing.
Experiments are conducted on two benchmark datasets, UCI-HAR and WISDM.
arXiv Detail & Related papers (2020-05-30T21:09:38Z) - Automatic Gesture Recognition in Robot-assisted Surgery with
Reinforcement Learning and Tree Search [63.07088785532908]
We propose a framework based on reinforcement learning and tree search for joint surgical gesture segmentation and classification.
Our framework consistently outperforms the existing methods on the suturing task of JIGSAWS dataset in terms of accuracy, edit score and F1 score.
arXiv Detail & Related papers (2020-02-20T13:12:38Z) - Continuous Emotion Recognition via Deep Convolutional Autoencoder and
Support Vector Regressor [70.2226417364135]
It is crucial that the machine should be able to recognize the emotional state of the user with high accuracy.
Deep neural networks have been used with great success in recognizing emotions.
We present a new model for continuous emotion recognition based on facial expression recognition.
arXiv Detail & Related papers (2020-01-31T17:47:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.