Joint COCO and Mapillary Workshop at ICCV 2019: COCO Instance
Segmentation Challenge Track
- URL: http://arxiv.org/abs/2010.02475v1
- Date: Tue, 6 Oct 2020 04:49:37 GMT
- Title: Joint COCO and Mapillary Workshop at ICCV 2019: COCO Instance
Segmentation Challenge Track
- Authors: Zeming Li, Yuchen Ma, Yukang Chen, Xiangyu Zhang, Jian Sun
- Abstract summary: MegDetV2 works in a two-pass fashion, first to detect instances then to obtain segmentation.
On the COCO-2019 detection/instance-segmentation test-dev dataset, our system achieves 61.0/53.1 mAP, which surpassed our 2018 winning results by 5.0/4.2 respectively.
- Score: 87.90450014797287
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this report, we present our object detection/instance segmentation system,
MegDetV2, which works in a two-pass fashion, first to detect instances then to
obtain segmentation. Our baseline detector is mainly built on a new designed
RPN, called RPN++. On the COCO-2019 detection/instance-segmentation test-dev
dataset, our system achieves 61.0/53.1 mAP, which surpassed our 2018 winning
results by 5.0/4.2 respectively. We achieve the best results in COCO Challenge
2019 and 2020.
Related papers
- Pseudo-label Alignment for Semi-supervised Instance Segmentation [67.9616087910363]
Pseudo-labeling is significant for semi-supervised instance segmentation.
In existing pipelines, pseudo-labels that contain valuable information may be filtered out due to mismatches in class and mask quality.
We propose a novel framework, called pseudo-label aligning instance segmentation (PAIS), in this paper.
arXiv Detail & Related papers (2023-08-10T05:56:53Z) - Ensembling Instance and Semantic Segmentation for Panoptic Segmentation [0.0]
Methods first performs instance segmentation and semantic segmentation separately, then combines the two to generate panoptic segmentation results.
We add several expert models of Mask R-CNN in instance segmentation to tackle the data imbalance problem in the training data.
In semantic segmentation, we trained several models with various backbones and use an ensemble strategy which further boosts the segmentation results.
arXiv Detail & Related papers (2023-04-20T14:02:01Z) - 1st Place Solution of The Robust Vision Challenge (RVC) 2022 Semantic
Segmentation Track [67.56316745239629]
This report describes the winning solution to the semantic segmentation task of the Robust Vision Challenge on ECCV 2022.
Our method adopts the FAN-B-Hybrid model as the encoder and uses Segformer as the segmentation framework.
The proposed method could serve as a strong baseline for the multi-domain segmentation task and benefit future works.
arXiv Detail & Related papers (2022-10-23T20:52:22Z) - Disentangle Your Dense Object Detector [82.22771433419727]
Deep learning-based dense object detectors have achieved great success in the past few years and have been applied to numerous multimedia applications such as video understanding.
However, the current training pipeline for dense detectors is compromised to lots of conjunctions that may not hold.
We propose Disentangled Dense Object Detector (DDOD), in which simple and effective disentanglement mechanisms are designed and integrated into the current state-of-the-art detectors.
arXiv Detail & Related papers (2021-07-07T00:52:16Z) - Two stages for visual object tracking [13.851408246039515]
Siamese-based trackers have achived promising performance on visual object tracking tasks.
In this paper, we propose a novel tracker with two-stages: detection and segmentation.
Our approach achieves state-of-the-art results, with the EAO of 52.6$%$ on VOT2016, 51.3$%$ on VOT2018, and 39.0$%$ on VOT 2019 datasets.
arXiv Detail & Related papers (2021-04-28T09:11:33Z) - 1st Place Solutions for OpenImage2019 -- Object Detection and Instance
Segmentation [116.25081559037872]
This article introduces the solutions of the two champion teams, MMfruit' for the detection track and MMfruitSeg' for the segmentation track, in OpenImage Challenge 2019.
It is commonly known that for an object detector, the shared feature at the end of the backbone is not appropriate for both classification and regression.
We propose the Decoupling Head (DH) to disentangle the object classification and regression via the self-learned optimal feature extraction.
arXiv Detail & Related papers (2020-03-17T06:45:07Z) - Learning Delicate Local Representations for Multi-Person Pose Estimation [77.53144055780423]
We propose a novel method called Residual Steps Network (RSN)
RSN aggregates features with the same spatial size (Intra-level features) efficiently to obtain delicate local representations.
Our approach won the 1st place of COCO Keypoint Challenge 2019.
arXiv Detail & Related papers (2020-03-09T10:40:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.