Learning to Detect Important People in Unlabelled Images for
Semi-supervised Important People Detection
- URL: http://arxiv.org/abs/2004.07568v1
- Date: Thu, 16 Apr 2020 10:09:37 GMT
- Title: Learning to Detect Important People in Unlabelled Images for
Semi-supervised Important People Detection
- Authors: Fa-Ting Hong, Wei-Hong Li, Wei-Shi Zheng
- Abstract summary: We propose learning important people detection on partially annotated images.
Our approach iteratively learns to assign pseudo-labels to individuals in un-annotated images.
We have collected two large-scale datasets for evaluation.
- Score: 85.91577271918783
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Important people detection is to automatically detect the individuals who
play the most important roles in a social event image, which requires the
designed model to understand a high-level pattern. However, existing methods
rely heavily on supervised learning using large quantities of annotated image
samples, which are more costly to collect for important people detection than
for individual entity recognition (eg, object recognition). To overcome this
problem, we propose learning important people detection on partially annotated
images. Our approach iteratively learns to assign pseudo-labels to individuals
in un-annotated images and learns to update the important people detection
model based on data with both labels and pseudo-labels. To alleviate the
pseudo-labelling imbalance problem, we introduce a ranking strategy for
pseudo-label estimation, and also introduce two weighting strategies: one for
weighting the confidence that individuals are important people to strengthen
the learning on important people and the other for neglecting noisy unlabelled
images (ie, images without any important people). We have collected two
large-scale datasets for evaluation. The extensive experimental results clearly
confirm the efficacy of our method attained by leveraging unlabelled images for
improving the performance of important people detection.
Related papers
- Extracting Human Attention through Crowdsourced Patch Labeling [18.947126675569667]
In image classification, a significant problem arises from bias in the datasets.
One approach to mitigate such biases is to direct the model's attention toward the target object's location.
We propose a novel patch-labeling method that integrates AI assistance with crowdsourcing to capture human attention from images.
arXiv Detail & Related papers (2024-03-22T07:57:27Z) - Unsupervised learning based object detection using Contrastive Learning [6.912349403119665]
We introduce a groundbreaking method for training single-stage object detectors through unsupervised/self-supervised learning.
Our state-of-the-art approach has the potential to revolutionize the labeling process, substantially reducing the time and cost associated with manual annotation.
We pioneer the concept of intra-image contrastive learning alongside inter-image counterparts, enabling the acquisition of crucial location information.
arXiv Detail & Related papers (2024-02-21T01:44:15Z) - Self-similarity Driven Scale-invariant Learning for Weakly Supervised
Person Search [66.95134080902717]
We propose a novel one-step framework, named Self-similarity driven Scale-invariant Learning (SSL)
We introduce a Multi-scale Exemplar Branch to guide the network in concentrating on the foreground and learning scale-invariant features.
Experiments on PRW and CUHK-SYSU databases demonstrate the effectiveness of our method.
arXiv Detail & Related papers (2023-02-25T04:48:11Z) - Semi-supervised Ranking for Object Image Blur Assessment [37.778436378659656]
We establish a large-scale realistic face image blur assessment dataset with reliable labels.
We propose a method to obtain the blur scores only with the pairwise rank labels as supervision.
To further improve the performance, we propose a self-supervised method based on quadruplet ranking consistency.
arXiv Detail & Related papers (2022-07-13T09:49:22Z) - STAR: Noisy Semi-Supervised Transfer Learning for Visual Classification [0.8662293148437356]
Semi-supervised learning (SSL) has proven to be effective at leveraging large-scale unlabeled data.
Recent SSL methods rely on unlabeled image data at a scale of billions to work well.
We propose noisy semi-supervised transfer learning, which integrates transfer learning and self-training with noisy student.
arXiv Detail & Related papers (2021-08-18T19:35:05Z) - Automatic Main Character Recognition for Photographic Studies [78.88882860340797]
Main characters in images are the most important humans that catch the viewer's attention upon first look.
Identifying the main character in images plays an important role in traditional photographic studies and media analysis.
We propose a method for identifying the main characters using machine learning based human pose estimation.
arXiv Detail & Related papers (2021-06-16T18:14:45Z) - Towards Good Practices for Efficiently Annotating Large-Scale Image
Classification Datasets [90.61266099147053]
We investigate efficient annotation strategies for collecting multi-class classification labels for a large collection of images.
We propose modifications and best practices aimed at minimizing human labeling effort.
Simulated experiments on a 125k image subset of the ImageNet100 show that it can be annotated to 80% top-1 accuracy with 0.35 annotations per image on average.
arXiv Detail & Related papers (2021-04-26T16:29:32Z) - Few-Cost Salient Object Detection with Adversarial-Paced Learning [95.0220555274653]
This paper proposes to learn the effective salient object detection model based on the manual annotation on a few training images only.
We name this task as the few-cost salient object detection and propose an adversarial-paced learning (APL)-based framework to facilitate the few-cost learning scenario.
arXiv Detail & Related papers (2021-04-05T14:15:49Z) - Attention-Aware Noisy Label Learning for Image Classification [97.26664962498887]
Deep convolutional neural networks (CNNs) learned on large-scale labeled samples have achieved remarkable progress in computer vision.
The cheapest way to obtain a large body of labeled visual data is to crawl from websites with user-supplied labels, such as Flickr.
This paper proposes the attention-aware noisy label learning approach to improve the discriminative capability of the network trained on datasets with potential label noise.
arXiv Detail & Related papers (2020-09-30T15:45:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.