Large-Scale Object Detection in the Wild from Imbalanced Multi-Labels
- URL: http://arxiv.org/abs/2005.08455v1
- Date: Mon, 18 May 2020 04:36:36 GMT
- Title: Large-Scale Object Detection in the Wild from Imbalanced Multi-Labels
- Authors: Junran Peng, Xingyuan Bu, Ming Sun, Zhaoxiang Zhang, Tieniu Tan,
Junjie Yan
- Abstract summary: In this work, we quantitatively analyze label problems that objects may explicitly or implicitly have multiple labels.
We propose a soft-sampling methods with hybrid training scheduler to deal with the label imbalance.
Our method yields a dramatic improvement of 3.34 points, leading to the best single model with 60.90 mAP on the public object detection test set of Open Images.
- Score: 128.77822070156057
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Training with more data has always been the most stable and effective way of
improving performance in deep learning era. As the largest object detection
dataset so far, Open Images brings great opportunities and challenges for
object detection in general and sophisticated scenarios. However, owing to its
semi-automatic collecting and labeling pipeline to deal with the huge data
scale, Open Images dataset suffers from label-related problems that objects may
explicitly or implicitly have multiple labels and the label distribution is
extremely imbalanced. In this work, we quantitatively analyze these label
problems and provide a simple but effective solution. We design a concurrent
softmax to handle the multi-label problems in object detection and propose a
soft-sampling methods with hybrid training scheduler to deal with the label
imbalance. Overall, our method yields a dramatic improvement of 3.34 points,
leading to the best single model with 60.90 mAP on the public object detection
test set of Open Images. And our ensembling result achieves 67.17 mAP, which is
4.29 points higher than the best result of Open Images public test 2018.
Related papers
- Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection [26.486535389258965]
We experimentally find three gaps between general and oriented object detection in semi-supervised learning.
We propose a Multi-clue Consistency Learning (MCL) framework to bridge these gaps.
Our proposed MCL can achieve state-of-the-art performance in the semi-supervised oriented object detection task.
arXiv Detail & Related papers (2024-07-08T13:14:25Z) - SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection [59.868772767818975]
We propose a simple yet effective Semi-supervised Oriented Object Detection method termed SOOD++.
Specifically, we observe that objects from aerial images are usually arbitrary orientations, small scales, and aggregation.
Extensive experiments conducted on various multi-oriented object datasets under various labeled settings demonstrate the effectiveness of our method.
arXiv Detail & Related papers (2024-07-01T07:03:51Z) - Weakly Semi-supervised Tool Detection in Minimally Invasive Surgery
Videos [11.61305113932032]
Surgical tool detection is essential for analyzing and evaluating minimally invasive surgery videos.
Large image datasets with instance-level labels are often limited because of the burden of annotation.
In this work, we propose to strike a balance between the extremely costly annotation burden and detection performance.
arXiv Detail & Related papers (2024-01-05T13:05:02Z) - ALWOD: Active Learning for Weakly-Supervised Object Detection [14.063031246614488]
We propose ALWOD, a new framework that fuses active learning with weakly and semi-supervised object detection paradigms.
We demonstrate that ALWOD significantly narrows the gap between the ODs trained on few partially labeled but strategically selected image instances and those that rely on the fully-labeled data.
arXiv Detail & Related papers (2023-09-14T17:59:05Z) - The Devil is in the Points: Weakly Semi-Supervised Instance Segmentation
via Point-Guided Mask Representation [61.027468209465354]
We introduce a novel learning scheme named weakly semi-supervised instance segmentation (WSSIS) with point labels.
We propose a method for WSSIS that can effectively leverage the budget-friendly point labels as a powerful weak supervision source.
We conduct extensive experiments on COCO and BDD100K datasets, and the proposed method achieves promising results comparable to those of the fully-supervised model.
arXiv Detail & Related papers (2023-03-27T10:11:22Z) - Collaborative Propagation on Multiple Instance Graphs for 3D Instance
Segmentation with Single-point Supervision [63.429704654271475]
We propose a novel weakly supervised method RWSeg that only requires labeling one object with one point.
With these sparse weak labels, we introduce a unified framework with two branches to propagate semantic and instance information.
Specifically, we propose a Cross-graph Competing Random Walks (CRW) algorithm that encourages competition among different instance graphs.
arXiv Detail & Related papers (2022-08-10T02:14:39Z) - Large-Scale Unsupervised Person Re-Identification with Contrastive
Learning [17.04597303816259]
Most existing unsupervised and domain adaptation ReID methods utilize only the public datasets in their experiments.
Inspired by the recent progress of large-scale self-supervised image classification using contrastive learning, we propose to learn ReID representation from large-scale unlabeled surveillance video alone.
arXiv Detail & Related papers (2021-05-17T14:55:08Z) - Towards Good Practices for Efficiently Annotating Large-Scale Image
Classification Datasets [90.61266099147053]
We investigate efficient annotation strategies for collecting multi-class classification labels for a large collection of images.
We propose modifications and best practices aimed at minimizing human labeling effort.
Simulated experiments on a 125k image subset of the ImageNet100 show that it can be annotated to 80% top-1 accuracy with 0.35 annotations per image on average.
arXiv Detail & Related papers (2021-04-26T16:29:32Z) - Data-Uncertainty Guided Multi-Phase Learning for Semi-Supervised Object
Detection [66.10057490293981]
We propose a data-uncertainty guided multi-phase learning method for semi-supervised object detection.
Our method behaves extraordinarily compared to baseline approaches and outperforms them by a large margin.
arXiv Detail & Related papers (2021-03-29T09:27:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.