Group R-CNN for Weakly Semi-supervised Object Detection with Points
- URL: http://arxiv.org/abs/2205.05920v1
- Date: Thu, 12 May 2022 07:17:54 GMT
- Title: Group R-CNN for Weakly Semi-supervised Object Detection with Points
- Authors: Shilong Zhang, Zhuoran Yu, Liyang Liu, Xinjiang Wang, Aojun Zhou and
Kai Chen
- Abstract summary: We propose an effective point-to-box regressor: Group R-CNN.
Group R-CNN first uses instance-level proposal grouping to generate a group of proposals for each point annotation.
We show that Group R-CNN significantly outperforms the prior method Point DETR by 3.9 mAP with 5% well-labeled images.
- Score: 18.720915213798623
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We study the problem of weakly semi-supervised object detection with points
(WSSOD-P), where the training data is combined by a small set of fully
annotated images with bounding boxes and a large set of weakly-labeled images
with only a single point annotated for each instance. The core of this task is
to train a point-to-box regressor on well-labeled images that can be used to
predict credible bounding boxes for each point annotation. We challenge the
prior belief that existing CNN-based detectors are not compatible with this
task. Based on the classic R-CNN architecture, we propose an effective
point-to-box regressor: Group R-CNN. Group R-CNN first uses instance-level
proposal grouping to generate a group of proposals for each point annotation
and thus can obtain a high recall rate. To better distinguish different
instances and improve precision, we propose instance-level proposal assignment
to replace the vanilla assignment strategy adopted in the original R-CNN
methods. As naive instance-level assignment brings converging difficulty, we
propose instance-aware representation learning which consists of instance-aware
feature enhancement and instance-aware parameter generation to overcome this
issue. Comprehensive experiments on the MS-COCO benchmark demonstrate the
effectiveness of our method. Specifically, Group R-CNN significantly
outperforms the prior method Point DETR by 3.9 mAP with 5% well-labeled images,
which is the most challenging scenario. The source code can be found at
https://github.com/jshilong/GroupRCNN
Related papers
- Complete Instances Mining for Weakly Supervised Instance Segmentation [6.177842623752537]
We propose a novel approach for weakly supervised instance segmentation (WSIS) using only image-level labels.
We use MaskIoU heads to predict the integrity scores of proposals and a Complete Instances Mining (CIM) strategy to explicitly model the redundant segmentation problem.
Our approach allows the network to become aware of multiple instances and complete instances, and we further improve its robustness through the incorporation of an Anti-noise strategy.
arXiv Detail & Related papers (2024-02-12T13:16:47Z) - VQ-GNN: A Universal Framework to Scale up Graph Neural Networks using
Vector Quantization [70.8567058758375]
VQ-GNN is a universal framework to scale up any convolution-based GNNs using Vector Quantization (VQ) without compromising the performance.
Our framework avoids the "neighbor explosion" problem of GNNs using quantized representations combined with a low-rank version of the graph convolution matrix.
arXiv Detail & Related papers (2021-10-27T11:48:50Z) - Learning Hierarchical Graph Neural Networks for Image Clustering [81.5841862489509]
We propose a hierarchical graph neural network (GNN) model that learns how to cluster a set of images into an unknown number of identities.
Our hierarchical GNN uses a novel approach to merge connected components predicted at each level of the hierarchy to form a new graph at the next level.
arXiv Detail & Related papers (2021-07-03T01:28:42Z) - Pointly-Supervised Instance Segmentation [81.34136519194602]
We propose point-based instance-level annotation, a new form of weak supervision for instance segmentation.
It combines the standard bounding box annotation with labeled points that are uniformly sampled inside each bounding box.
In our experiments, Mask R-CNN models trained on COCO, PASCAL VOC, Cityscapes, and LVIS with only 10 annotated points per object achieve 94%--98% of their fully-supervised performance.
arXiv Detail & Related papers (2021-04-13T17:59:40Z) - ATRM: Attention-based Task-level Relation Module for GNN-based Few-shot
Learning [14.464964336101028]
We propose a new relation measure method, namely the attention-based task-level relation module (ATRM)
The proposed module captures the relation representations between nodes by considering the sample-to-task instead of sample-to-sample embedding features.
Experimental results demonstrate that the proposed module is effective for GNN-based few-shot learning.
arXiv Detail & Related papers (2021-01-25T00:53:04Z) - Joint Object Contour Points and Semantics for Instance Segmentation [1.2117737635879038]
We propose Mask Point R-CNN aiming at promoting the neural network's attention to the object boundary.
Specifically, we innovatively extend the original human keypoint detection task to the contour point detection of any object.
As a consequence, the model will be more sensitive to the edges of the object and can capture more geometric features.
arXiv Detail & Related papers (2020-08-02T11:11:28Z) - Sequential Graph Convolutional Network for Active Learning [53.99104862192055]
We propose a novel pool-based Active Learning framework constructed on a sequential Graph Convolution Network (GCN)
With a small number of randomly sampled images as seed labelled examples, we learn the parameters of the graph to distinguish labelled vs unlabelled nodes.
We exploit these characteristics of GCN to select the unlabelled examples which are sufficiently different from labelled ones.
arXiv Detail & Related papers (2020-06-18T00:55:10Z) - High-Order Information Matters: Learning Relation and Topology for
Occluded Person Re-Identification [84.43394420267794]
We propose a novel framework by learning high-order relation and topology information for discriminative features and robust alignment.
Our framework significantly outperforms state-of-the-art by6.5%mAP scores on Occluded-Duke dataset.
arXiv Detail & Related papers (2020-03-18T12:18:35Z) - 1st Place Solutions for OpenImage2019 -- Object Detection and Instance
Segmentation [116.25081559037872]
This article introduces the solutions of the two champion teams, MMfruit' for the detection track and MMfruitSeg' for the segmentation track, in OpenImage Challenge 2019.
It is commonly known that for an object detector, the shared feature at the end of the backbone is not appropriate for both classification and regression.
We propose the Decoupling Head (DH) to disentangle the object classification and regression via the self-learned optimal feature extraction.
arXiv Detail & Related papers (2020-03-17T06:45:07Z) - Weakly Supervised Instance Segmentation by Deep Community Learning [39.18749732409763]
We present a weakly supervised instance segmentation algorithm based on deep community learning with multiple tasks.
We address this problem by designing a unified deep neural network architecture.
The proposed algorithm achieves state-of-the-art performance in the weakly supervised setting.
arXiv Detail & Related papers (2020-01-30T08:35:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.