Feedback-driven object detection and iterative model improvement
- URL: http://arxiv.org/abs/2411.19835v3
- Date: Thu, 27 Mar 2025 08:34:04 GMT
- Title: Feedback-driven object detection and iterative model improvement
- Authors: Sönke Tenckhoff, Mario Koddenbrock, Erik Rodner,
- Abstract summary: We present the development and evaluation of a platform designed to interactively improve object detection models.<n>The platform allows uploading and annotating images as well as fine-tuning object detection models.<n>We show evidence for a significant time reduction of up to 53% for semi-automatic compared to manual annotation.
- Score: 2.3700911865675187
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automated object detection has become increasingly valuable across diverse applications, yet efficient, high-quality annotation remains a persistent challenge. In this paper, we present the development and evaluation of a platform designed to interactively improve object detection models. The platform allows uploading and annotating images as well as fine-tuning object detection models. Users can then manually review and refine annotations, further creating improved snapshots that are used for automatic object detection on subsequent image uploads - a process we refer to as semi-automatic annotation resulting in a significant gain in annotation efficiency. Whereas iterative refinement of model results to speed up annotation has become common practice, we are the first to quantitatively evaluate its benefits with respect to time, effort, and interaction savings. Our experimental results show clear evidence for a significant time reduction of up to 53% for semi-automatic compared to manual annotation. Importantly, these efficiency gains did not compromise annotation quality, while matching or occasionally even exceeding the accuracy of manual annotations. These findings demonstrate the potential of our lightweight annotation platform for creating high-quality object detection datasets and provide best practices to guide future development of annotation platforms. The platform is open-source, with the frontend and backend repositories available on GitHub. To support the understanding of our labeling process, we have created an explanatory video demonstrating the methodology using microscopy images of E. coli bacteria as an example.
Related papers
- Automatic Image Annotation for Mapped Features Detection [6.300346102366891]
Road features are a key enabler for autonomous driving and localization.
Modern deep learning-based perception systems need a significant amount of annotated data.
In this paper, we consider the fusion of three automatic annotation methods in images.
arXiv Detail & Related papers (2024-12-11T09:06:52Z) - AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving [68.73885845181242]
We propose an Automatic Data Engine (AIDE) that automatically identifies issues, efficiently curates data, improves the model through auto-labeling, and verifies the model through generation of diverse scenarios.
We further establish a benchmark for open-world detection on AV datasets to comprehensively evaluate various learning paradigms, demonstrating our method's superior performance at a reduced cost.
arXiv Detail & Related papers (2024-03-26T04:27:56Z) - Boosting Gesture Recognition with an Automatic Gesture Annotation Framework [10.158684480548242]
We propose a framework that can automatically annotate gesture classes and identify their temporal ranges.
Our framework consists of two key components: (1) a novel annotation model that leverages the Connectionist Temporal Classification (CTC) loss, and (2) a semi-supervised learning pipeline.
These high-quality pseudo labels can also be used to enhance the accuracy of other downstream gesture recognition models.
arXiv Detail & Related papers (2024-01-20T07:11:03Z) - LabelFormer: Object Trajectory Refinement for Offboard Perception from
LiDAR Point Clouds [37.87496475959941]
"Auto-labelling" offboard perception models are trained to automatically generate annotations from raw LiDAR point clouds.
We propose LabelFormer, a simple, efficient, and effective trajectory-level refinement approach.
Our approach first encodes each frame's observations separately, then exploits self-attention to reason about the trajectory with full temporal context.
arXiv Detail & Related papers (2023-11-02T17:56:06Z) - Deep Active Learning with Noisy Oracle in Object Detection [5.5165579223151795]
We propose a composite active learning framework including a label review module for deep object detection.
We show that utilizing part of the annotation budget to correct the noisy annotations partially in the active dataset leads to early improvements in model performance.
In our experiments we achieve improvements of up to 4.5 mAP points of object detection performance by incorporating label reviews at equal annotation budget.
arXiv Detail & Related papers (2023-09-30T13:28:35Z) - Helping Hands: An Object-Aware Ego-Centric Video Recognition Model [60.350851196619296]
We introduce an object-aware decoder for improving the performance of ego-centric representations on ego-centric videos.
We show that the model can act as a drop-in replacement for an ego-awareness video model to improve performance through visual-text grounding.
arXiv Detail & Related papers (2023-08-15T17:58:11Z) - Quality and Efficiency of Manual Annotation: Pre-annotation Bias [1.949293198748152]
The aim of the experiment is to judge the final annotation quality when pre-annotation is used.
The experiment confirmed that the pre-annotation is an efficient tool for faster manual syntactic annotation.
arXiv Detail & Related papers (2023-06-15T17:41:14Z) - ComplETR: Reducing the cost of annotations for object detection in dense
scenes with vision transformers [73.29057814695459]
ComplETR is designed to explicitly complete missing annotations in partially annotated dense scene datasets.
This reduces the need to annotate every object instance in the scene thereby reducing annotation cost.
We show performance improvement for several popular detectors such as Faster R-CNN, Cascade R-CNN, CenterNet2, and Deformable DETR.
arXiv Detail & Related papers (2022-09-13T00:11:16Z) - Annotation Error Detection: Analyzing the Past and Present for a More
Coherent Future [63.99570204416711]
We reimplement 18 methods for detecting potential annotation errors and evaluate them on 9 English datasets.
We define a uniform evaluation setup including a new formalization of the annotation error detection task.
We release our datasets and implementations in an easy-to-use and open source software package.
arXiv Detail & Related papers (2022-06-05T22:31:45Z) - FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality
Assessment [93.09267863425492]
We argue that understanding both high-level semantics and internal temporal structures of actions in competitive sports videos is the key to making predictions accurate and interpretable.
We construct a new fine-grained dataset, called FineDiving, developed on diverse diving events with detailed annotations on action procedures.
arXiv Detail & Related papers (2022-04-07T17:59:32Z) - Dynamic Supervisor for Cross-dataset Object Detection [52.95818230087297]
Cross-dataset training in object detection tasks is complicated because the inconsistency in the category range across datasets transforms fully supervised learning into semi-supervised learning.
We propose a dynamic supervisor framework that updates the annotations multiple times through multiple-updated submodels trained using hard and soft labels.
In the final generated annotations, both recall and precision improve significantly through the integration of hard-label training with soft-label training.
arXiv Detail & Related papers (2022-04-01T03:18:46Z) - Weakly Supervised Video Salient Object Detection [79.51227350937721]
We present the first weakly supervised video salient object detection model based on relabeled "fixation guided scribble annotations"
An "Appearance-motion fusion module" and bidirectional ConvLSTM based framework are proposed to achieve effective multi-modal learning and long-term temporal context modeling.
arXiv Detail & Related papers (2021-04-06T09:48:38Z) - Cross-Model Image Annotation Platform with Active Learning [0.0]
This work presents an End-to-End pipeline tool for object annotation and recognition.
We have developed a modular image annotation platform which seamlessly incorporates assisted image annotation, active learning and model training and evaluation.
The highest accuracy achieved is 74%.
arXiv Detail & Related papers (2020-08-06T01:42:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.