Related papers: Object Detection with a Unified Label Space from Multiple Datasets

Object Detection with a Unified Label Space from Multiple Datasets

URL: http://arxiv.org/abs/2008.06614v1
Date: Sat, 15 Aug 2020 00:51:27 GMT
Title: Object Detection with a Unified Label Space from Multiple Datasets
Authors: Xiangyun Zhao, Samuel Schulter, Gaurav Sharma, Yi-Hsuan Tsai, Manmohan Chandraker, Ying Wu
Abstract summary: Given multiple datasets with different label spaces, the goal of this work is to train a single object detector predicting over the union of all the label spaces. Consider an object category like faces that is annotated in one dataset, but is not annotated in another dataset. Some categories, like face here, would thus be considered foreground in one dataset, but background in another. We propose loss functions that carefully integrate partial but correct annotations with complementary but noisy pseudo labels.
Score: 94.33205773893151
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Given multiple datasets with different label spaces, the goal of this work is to train a single object detector predicting over the union of all the label spaces. The practical benefits of such an object detector are obvious and significant application-relevant categories can be picked and merged form arbitrary existing datasets. However, naive merging of datasets is not possible in this case, due to inconsistent object annotations. Consider an object category like faces that is annotated in one dataset, but is not annotated in another dataset, although the object itself appears in the latter images. Some categories, like face here, would thus be considered foreground in one dataset, but background in another. To address this challenge, we design a framework which works with such partial annotations, and we exploit a pseudo labeling approach that we adapt for our specific case. We propose loss functions that carefully integrate partial but correct annotations with complementary but noisy pseudo labels. Evaluation in the proposed novel setting requires full annotation on the test set. We collect the required annotations and define a new challenging experimental setup for this task based one existing public datasets. We show improved performances compared to competitive baselines and appropriate adaptations of existing work.

Related papers

Bridging Annotation Gaps: Transferring Labels to Align Object Detection Datasets [26.566426911250296]
Label-Aligned Transfer Proposal (LAT) systematically projects annotations from diverse source datasets into a target label space.<n>LAT achieves consistent improvements in target-domain detection performance, achieving gains of up to +4.8AP over semi-supervised baselines.
arXiv Detail & Related papers (2025-06-05T08:16:15Z)
FMG-Det: Foundation Model Guided Robust Object Detection [7.489718044485341]
Training on noisy annotations significantly degrades detector performance.<n>We propose -Det, a simple, efficient methodology for training models with noisy annotations.
arXiv Detail & Related papers (2025-05-29T17:55:41Z)
An Attribute-Enriched Dataset and Auto-Annotated Pipeline for Open Detection [7.531866919805308]
We introduce the Objects365-Attr dataset, an extension of the existing Objects365 dataset, distinguished by its attribute annotations. This dataset reduces inconsistencies in object detection by integrating a broad spectrum of attributes, including color, material, state, texture and tone. It contains an extensive collection of 5.6M object-level attribute descriptions, meticulously annotated across 1.4M bounding boxes.
arXiv Detail & Related papers (2024-09-10T07:53:32Z)
Anno-incomplete Multi-dataset Detection [67.69438032767613]
We propose a novel problem as "-incomplete Multi-dataset Detection" We develop an end-to-end multi-task learning architecture which can accurately detect all the object categories with multiple partially annotated datasets.
arXiv Detail & Related papers (2024-08-29T03:58:21Z)
Object-Centric Multiple Object Tracking [124.30650395969126]
This paper proposes a video object-centric model for multiple-object tracking pipelines. It consists of an index-merge module that adapts the object-centric slots into detection outputs and an object memory module. Benefited from object-centric learning, we only require sparse detection labels for object localization and feature binding.
arXiv Detail & Related papers (2023-09-01T03:34:12Z)
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation [33.25304533086283]
Open-vocabulary semantic segmentation is a challenging task that requires segmenting novel object categories at inference time. Recent studies have explored vision-language pre-training to handle this task, but suffer from unrealistic assumptions in practical scenarios. This work proposes a novel attribute decomposition-aggregation framework, AttrSeg, inspired by human cognition in understanding new concepts.
arXiv Detail & Related papers (2023-08-31T19:34:09Z)
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding [137.3719377780593]
A new design (named Detection Hub) is dataset-aware and category-aligned. It mitigates the dataset inconsistency and provides coherent guidance for the detector to learn across multiple datasets. The categories across datasets are semantically aligned into a unified space by replacing one-hot category representations with word embedding.
arXiv Detail & Related papers (2022-06-07T17:59:44Z)
Simple multi-dataset detection [83.9604523643406]
We present a simple method for training a unified detector on multiple large-scale datasets. We show how to automatically integrate dataset-specific outputs into a common semantic taxonomy. Our approach does not require manual taxonomy reconciliation.
arXiv Detail & Related papers (2021-02-25T18:55:58Z)
Self-supervised Robust Object Detectors from Partially Labelled Datasets [3.1669406516464007]
merging datasets allows us to train one integrated object detector, instead of training several ones. We propose a training framework to overcome missing-label challenge of the merged datasets. We evaluate our proposed framework for training Yolo on a simulated merged dataset with missing rate $approx!48%$ using VOC2012 and VOC2007.
arXiv Detail & Related papers (2020-05-23T15:18:20Z)
Weakly-Supervised Salient Object Detection via Scribble Annotations [54.40518383782725]
We propose a weakly-supervised salient object detection model to learn saliency from scribble labels. We present a new metric, termed saliency structure measure, to measure the structure alignment of the predicted saliency maps. Our method not only outperforms existing weakly-supervised/unsupervised methods, but also is on par with several fully-supervised state-of-the-art models.
arXiv Detail & Related papers (2020-03-17T12:59:50Z)

This list is automatically generated from the titles and abstracts of the papers in this site.

This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.