Related papers: COCONut: Modernizing COCO Segmentation

COCONut: Modernizing COCO Segmentation

URL: http://arxiv.org/abs/2404.08639v1
Date: Fri, 12 Apr 2024 17:59:40 GMT
Title: COCONut: Modernizing COCO Segmentation
Authors: Xueqing Deng, Qihang Yu, Peng Wang, Xiaohui Shen, Liang-Chieh Chen,
Abstract summary: COCO benchmark has propelled the development of modern detection and segmentation systems. COCONut harmonizes segmentation annotations across semantic, instance, and panoptic segmentation. To our knowledge, COCONut stands as the inaugural large-scale universal segmentation dataset, verified by human raters.
Score: 25.706167486289974
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In recent decades, the vision community has witnessed remarkable progress in visual recognition, partially owing to advancements in dataset benchmarks. Notably, the established COCO benchmark has propelled the development of modern detection and segmentation systems. However, the COCO segmentation benchmark has seen comparatively slow improvement over the last decade. Originally equipped with coarse polygon annotations for thing instances, it gradually incorporated coarse superpixel annotations for stuff regions, which were subsequently heuristically amalgamated to yield panoptic segmentation annotations. These annotations, executed by different groups of raters, have resulted not only in coarse segmentation masks but also in inconsistencies between segmentation types. In this study, we undertake a comprehensive reevaluation of the COCO segmentation annotations. By enhancing the annotation quality and expanding the dataset to encompass 383K images with more than 5.18M panoptic masks, we introduce COCONut, the COCO Next Universal segmenTation dataset. COCONut harmonizes segmentation annotations across semantic, instance, and panoptic segmentation with meticulously crafted high-quality masks, and establishes a robust benchmark for all segmentation tasks. To our knowledge, COCONut stands as the inaugural large-scale universal segmentation dataset, verified by human raters. We anticipate that the release of COCONut will significantly contribute to the community's ability to assess the progress of novel neural networks.

Related papers

COCO-Occ: A Benchmark for Occluded Panoptic Segmentation and Image Understanding [8.261771972240778]
This paper proposes a new large-scale dataset, COCO-Occ, which is derived from the COCO dataset by manually labelling the COCO images into three perceived occlusion levels.
arXiv Detail & Related papers (2024-09-19T13:26:28Z)
Image Segmentation in Foundation Model Era: A Survey [99.19456390358211]
Current research in image segmentation lacks a detailed analysis of distinct characteristics, challenges, and solutions associated with these advancements. This survey seeks to fill this gap by providing a thorough review of cutting-edge research centered around FM-driven image segmentation. An exhaustive overview of over 300 segmentation approaches is provided to encapsulate the breadth of current research efforts.
arXiv Detail & Related papers (2024-08-23T10:07:59Z)
Concealed Object Segmentation with Hierarchical Coherence Modeling [9.185195569812667]
We propose a Hierarchical Coherence Modeling (HCM) segmenter for concealed object segmentation (COS) HCM promotes feature coherence by leveraging the intra-stage coherence and cross-stage coherence modules. We also introduce the reversible re-calibration decoder to detect previously undetected parts in low-confidence regions.
arXiv Detail & Related papers (2024-01-22T09:02:52Z)
A Lightweight Clustering Framework for Unsupervised Semantic Segmentation [28.907274978550493]
Unsupervised semantic segmentation aims to categorize each pixel in an image into a corresponding class without the use of annotated data. We propose a lightweight clustering framework for unsupervised semantic segmentation. Our framework achieves state-of-the-art results on PASCAL VOC and MS COCO datasets.
arXiv Detail & Related papers (2023-11-30T15:33:42Z)
Exploring Open-Vocabulary Semantic Segmentation without Human Labels [76.15862573035565]
We present ZeroSeg, a novel method that leverages the existing pretrained vision-language model (VL) to train semantic segmentation models. ZeroSeg overcomes this by distilling the visual concepts learned by VL models into a set of segment tokens, each summarizing a localized region of the target image. Our approach achieves state-of-the-art performance when compared to other zero-shot segmentation methods under the same training data.
arXiv Detail & Related papers (2023-06-01T08:47:06Z)
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation [80.48979302400868]
We focus on open vocabulary instance segmentation to expand a segmentation model to classify and segment instance-level novel categories. Previous approaches have relied on massive caption datasets and complex pipelines to establish one-to-one mappings between image regions and captions in nouns. We devise a joint textbfCaption Grounding and Generation (CGG) framework, which incorporates a novel grounding loss that only focuses on matching object to improve learning efficiency.
arXiv Detail & Related papers (2023-01-02T18:52:12Z)
CoMFormer: Continual Learning in Semantic and Panoptic Segmentation [45.66711231393775]
We present the first continual learning model capable of operating on both semantic and panoptic segmentation. Our method carefully exploits the properties of transformer architectures to learn new classes over time. Our CoMFormer outperforms all the existing baselines by forgetting less old classes but also learning more effectively new classes.
arXiv Detail & Related papers (2022-11-25T10:15:06Z)
A Survey on Label-efficient Deep Segmentation: Bridging the Gap between Weak Supervision and Dense Prediction [115.9169213834476]
This paper offers a comprehensive review on label-efficient segmentation methods. We first develop a taxonomy to organize these methods according to the supervision provided by different types of weak labels. Next, we summarize the existing label-efficient segmentation methods from a unified perspective.
arXiv Detail & Related papers (2022-07-04T06:21:01Z)
Inconsistency-aware Uncertainty Estimation for Semi-supervised Medical Image Segmentation [92.9634065964963]
We present a new semi-supervised segmentation model, namely, conservative-radical network (CoraNet) based on our uncertainty estimation and separate self-training strategy. Compared with the current state of the art, our CoraNet has demonstrated superior performance.
arXiv Detail & Related papers (2021-10-17T08:49:33Z)
Bootstrapping Semantic Segmentation with Regional Contrast [27.494579304204226]
ReCo is a contrastive learning framework designed at a regional level to assist learning in semantic segmentation. We achieve 50% mIoU in the CityScapes dataset, whilst requiring only 20 labelled images, improving by 10% relative to the previous state-of-the-art.
arXiv Detail & Related papers (2021-04-09T16:26:29Z)
A Few Guidelines for Incremental Few-Shot Segmentation [57.34237650765928]
Given a pretrained segmentation model and few images containing novel classes, our goal is to learn to segment novel classes while retaining the ability to segment previously seen ones. We show how the main problems of end-to-end training in this scenario are. i) the drift of the batch-normalization statistics toward novel classes that we can fix with batch renormalization and. ii) the forgetting of old classes, that we can fix with regularization strategies.
arXiv Detail & Related papers (2020-11-30T20:45:56Z)
Class-wise Dynamic Graph Convolution for Semantic Segmentation [63.08061813253613]
We propose a class-wise dynamic graph convolution (CDGC) module to adaptively propagate information. We also introduce the Class-wise Dynamic Graph Convolution Network(CDGCNet), which consists of two main parts including the CDGC module and a basic segmentation network. We conduct extensive experiments on three popular semantic segmentation benchmarks including Cityscapes, PASCAL VOC 2012 and COCO Stuff.
arXiv Detail & Related papers (2020-07-19T15:26:50Z)

This list is automatically generated from the titles and abstracts of the papers in this site.