Learning to Segment Human Body Parts with Synthetically Trained Deep
Convolutional Networks
- URL: http://arxiv.org/abs/2102.01460v1
- Date: Tue, 2 Feb 2021 12:26:50 GMT
- Title: Learning to Segment Human Body Parts with Synthetically Trained Deep
Convolutional Networks
- Authors: Alessandro Saviolo, Matteo Bonotto, Daniele Evangelista, Marco
Imperoli, Emanuele Menegatti and Alberto Pretto
- Abstract summary: This paper presents a new framework for human body part segmentation based on Deep Convolutional Neural Networks trained using only synthetic data.
The proposed approach achieves cutting-edge results without the need of training the models with real annotated data of human body parts.
- Score: 58.0240970093372
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper presents a new framework for human body part segmentation based on
Deep Convolutional Neural Networks trained using only synthetic data. The
proposed approach achieves cutting-edge results without the need of training
the models with real annotated data of human body parts. Our contributions
include a data generation pipeline, that exploits a game engine for the
creation of the synthetic data used for training the network, and a novel
pre-processing module, that combines edge response map and adaptive histogram
equalization to guide the network to learn the shape of the human body parts
ensuring robustness to changes in the illumination conditions. For selecting
the best candidate architecture, we performed exhaustive tests on
manually-annotated images of real human body limbs. We further present an
ablation study to validate our pre-processing module. The results show that our
method outperforms several state-of-the-art semantic segmentation networks by a
large margin. We release an implementation of the proposed approach along with
the acquired datasets with this paper.
Related papers
- Ensembling convolutional neural networks for human skin segmentation [2.8391355909797644]
We propose to train a convolutional network using the datasets focused on different features to create an ensemble whose individual outcomes are effectively combined.
We expect that this study will help in developing new ensemble-based techniques that will improve the performance of semantic segmentation systems.
arXiv Detail & Related papers (2024-07-27T17:55:28Z) - Diffusion-based Data Augmentation for Nuclei Image Segmentation [68.28350341833526]
We introduce the first diffusion-based augmentation method for nuclei segmentation.
The idea is to synthesize a large number of labeled images to facilitate training the segmentation model.
The experimental results show that by augmenting 10% labeled real dataset with synthetic samples, one can achieve comparable segmentation results.
arXiv Detail & Related papers (2023-10-22T06:16:16Z) - Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based
Action Recognition [88.34182299496074]
Action labels are only available on a source dataset, but unavailable on a target dataset in the training stage.
We utilize a self-supervision scheme to reduce the domain shift between two skeleton-based action datasets.
By segmenting and permuting temporal segments or human body parts, we design two self-supervised learning classification tasks.
arXiv Detail & Related papers (2022-07-17T07:05:39Z) - End-to-end Neuron Instance Segmentation based on Weakly Supervised
Efficient UNet and Morphological Post-processing [0.0]
We present an end-to-end weakly-supervised framework to automatically detect and segment NeuN stained neuronal cells on histological images.
We integrate the state-of-the-art network, EfficientNet, into our U-Net-like architecture.
arXiv Detail & Related papers (2022-02-17T14:35:45Z) - Self-supervised Audiovisual Representation Learning for Remote Sensing Data [96.23611272637943]
We propose a self-supervised approach for pre-training deep neural networks in remote sensing.
By exploiting the correspondence between geo-tagged audio recordings and remote sensing, this is done in a completely label-free manner.
We show that our approach outperforms existing pre-training strategies for remote sensing imagery.
arXiv Detail & Related papers (2021-08-02T07:50:50Z) - Denoise and Contrast for Category Agnostic Shape Completion [48.66519783934386]
We present a deep learning model that exploits the power of self-supervision to perform 3D point cloud completion.
A denoising pretext task provides the network with the needed local cues, decoupled from the high-level semantics.
contrastive learning maximizes the agreement between variants of the same shape with different missing portions.
arXiv Detail & Related papers (2021-03-30T20:33:24Z) - Vision-Based Layout Detection from Scientific Literature using Recurrent
Convolutional Neural Networks [12.221478896815292]
We present an approach for adapting convolutional neural networks for object recognition and classification to scientific literature layout detection (SLLD)
SLLD is a shared subtask of several information extraction problems.
Our results show good improvement with fine-tuning of a pre-trained base network.
arXiv Detail & Related papers (2020-10-18T23:50:28Z) - Semantic Segmentation With Multi Scale Spatial Attention For Self
Driving Cars [2.7317088388886384]
We present a novel neural network using multi scale feature fusion at various scales for accurate and efficient semantic image segmentation.
We used ResNet based feature extractor, dilated convolutional layers in downsampling part, atrous convolutional layers in the upsampling part and used concat operation to merge them.
A new attention module is proposed to encode more contextual information and enhance the receptive field of the network.
arXiv Detail & Related papers (2020-06-30T20:19:09Z) - Dataset Condensation with Gradient Matching [36.14340188365505]
We propose a training set synthesis technique for data-efficient learning, called dataset Condensation, that learns to condense large dataset into a small set of informative synthetic samples for training deep neural networks from scratch.
We rigorously evaluate its performance in several computer vision benchmarks and demonstrate that it significantly outperforms the state-of-the-art methods.
arXiv Detail & Related papers (2020-06-10T16:30:52Z) - Monocular Human Pose and Shape Reconstruction using Part Differentiable
Rendering [53.16864661460889]
Recent works succeed in regression-based methods which estimate parametric models directly through a deep neural network supervised by 3D ground truth.
In this paper, we introduce body segmentation as critical supervision.
To improve the reconstruction with part segmentation, we propose a part-level differentiable part that enables part-based models to be supervised by part segmentation.
arXiv Detail & Related papers (2020-03-24T14:25:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.