EasyPortrait -- Face Parsing and Portrait Segmentation Dataset
- URL: http://arxiv.org/abs/2304.13509v3
- Date: Thu, 7 Mar 2024 15:34:00 GMT
- Title: EasyPortrait -- Face Parsing and Portrait Segmentation Dataset
- Authors: Karina Kvanchiani, Elizaveta Petrova, Karen Efremyan, Alexander
Sautin, Alexander Kapitanov
- Abstract summary: Video conferencing apps have become functional by accomplishing such computer vision-based features as real-time background removal and face beautification.
We create a new dataset, EasyPortrait, for these tasks simultaneously.
It contains 40,000 primarily indoor photos repeating video meeting scenarios with 13,705 unique users and fine-grained segmentation masks separated into 9 classes.
- Score: 79.16635054977068
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Recently, video conferencing apps have become functional by accomplishing
such computer vision-based features as real-time background removal and face
beautification. Limited variability in existing portrait segmentation and face
parsing datasets, including head poses, ethnicity, scenes, and occlusions
specific to video conferencing, motivated us to create a new dataset,
EasyPortrait, for these tasks simultaneously. It contains 40,000 primarily
indoor photos repeating video meeting scenarios with 13,705 unique users and
fine-grained segmentation masks separated into 9 classes. Inappropriate
annotation masks from other datasets caused a revision of annotator guidelines,
resulting in EasyPortrait's ability to process cases, such as teeth whitening
and skin smoothing. The pipeline for data mining and high-quality mask
annotation via crowdsourcing is also proposed in this paper. In the ablation
study experiments, we proved the importance of data quantity and diversity in
head poses in our dataset for the effective learning of the model. The
cross-dataset evaluation experiments confirmed the best domain generalization
ability among portrait segmentation datasets. Moreover, we demonstrate the
simplicity of training segmentation models on EasyPortrait without extra
training tricks. The proposed dataset and trained models are publicly
available.
Related papers
- Outline-Guided Object Inpainting with Diffusion Models [11.391452115311798]
Instance segmentation datasets play a crucial role in training accurate and robust computer vision models.
We show how this issue can be mitigated by starting with small annotated instance segmentation datasets and augmenting them to obtain a sizeable annotated dataset.
We generate new images using a diffusion-based inpainting model to fill out the masked area with a desired object class by guiding the diffusion through the object outline.
arXiv Detail & Related papers (2024-02-26T09:21:17Z) - Lidar Annotation Is All You Need [0.0]
This paper aims to improve the efficiency of image segmentation using a convolutional neural network in a multi-sensor setup.
The key innovation of our approach is the masked loss, addressing sparse ground-truth masks from point clouds.
Experimental validation of the approach on benchmark datasets shows comparable performance to a high-quality image segmentation model.
arXiv Detail & Related papers (2023-11-08T15:55:18Z) - A Generalist Framework for Panoptic Segmentation of Images and Videos [61.61453194912186]
We formulate panoptic segmentation as a discrete data generation problem, without relying on inductive bias of the task.
A diffusion model is proposed to model panoptic masks, with a simple architecture and generic loss function.
Our method is capable of modeling video (in a streaming setting) and thereby learns to track object instances automatically.
arXiv Detail & Related papers (2022-10-12T16:18:25Z) - Self-attention on Multi-Shifted Windows for Scene Segmentation [14.47974086177051]
We explore the effective use of self-attention within multi-scale image windows to learn descriptive visual features.
We propose three different strategies to aggregate these feature maps to decode the feature representation for dense prediction.
Our models achieve very promising performance on four public scene segmentation datasets.
arXiv Detail & Related papers (2022-07-10T07:36:36Z) - Learning Co-segmentation by Segment Swapping for Retrieval and Discovery [67.6609943904996]
The goal of this work is to efficiently identify visually similar patterns from a pair of images.
We generate synthetic training pairs by selecting object segments in an image and copy-pasting them into another image.
We show our approach provides clear improvements for artwork details retrieval on the Brueghel dataset.
arXiv Detail & Related papers (2021-10-29T16:51:16Z) - Personalized Image Semantic Segmentation [58.980245748434]
We generate more accurate segmentation results on unlabeled personalized images by investigating the data's personalized traits.
We propose a baseline method that incorporates the inter-image context when segmenting certain images.
The code and the PIS dataset will be made publicly available.
arXiv Detail & Related papers (2021-07-24T04:03:11Z) - Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences
for Urban Scene Segmentation [57.68890534164427]
In this work, we ask if we may leverage semi-supervised learning in unlabeled video sequences and extra images to improve the performance on urban scene segmentation.
We simply predict pseudo-labels for the unlabeled data and train subsequent models with both human-annotated and pseudo-labeled data.
Our Naive-Student model, trained with such simple yet effective iterative semi-supervised learning, attains state-of-the-art results at all three Cityscapes benchmarks.
arXiv Detail & Related papers (2020-05-20T18:00:05Z) - Omni-supervised Facial Expression Recognition via Distilled Data [120.11782405714234]
We propose omni-supervised learning to exploit reliable samples in a large amount of unlabeled data for network training.
We experimentally verify that the new dataset can significantly improve the ability of the learned FER model.
To tackle this, we propose to apply a dataset distillation strategy to compress the created dataset into several informative class-wise images.
arXiv Detail & Related papers (2020-05-18T09:36:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.