Unbalanced Optimal Transport: A Unified Framework for Object Detection
- URL: http://arxiv.org/abs/2307.02402v1
- Date: Wed, 5 Jul 2023 16:21:52 GMT
- Title: Unbalanced Optimal Transport: A Unified Framework for Object Detection
- Authors: Henri De Plaen, Pierre-Fran\c{c}ois De Plaen, Johan A. K. Suykens,
Marc Proesmans, Tinne Tuytelaars and Luc Van Gool
- Abstract summary: We show how Unbalanced Optimal Transport unifies different approaches to object detection.
We show that training an object detection model with Unbalanced Optimal Transport is able to reach the state-of-the-art.
The approach is well suited for GPU implementation, which proves to be an advantage for large-scale models.
- Score: 97.74382560746987
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: During training, supervised object detection tries to correctly match the
predicted bounding boxes and associated classification scores to the ground
truth. This is essential to determine which predictions are to be pushed
towards which solutions, or to be discarded. Popular matching strategies
include matching to the closest ground truth box (mostly used in combination
with anchors), or matching via the Hungarian algorithm (mostly used in
anchor-free methods). Each of these strategies comes with its own properties,
underlying losses, and heuristics. We show how Unbalanced Optimal Transport
unifies these different approaches and opens a whole continuum of methods in
between. This allows for a finer selection of the desired properties.
Experimentally, we show that training an object detection model with Unbalanced
Optimal Transport is able to reach the state-of-the-art both in terms of
Average Precision and Average Recall as well as to provide a faster initial
convergence. The approach is well suited for GPU implementation, which proves
to be an advantage for large-scale models.
Related papers
- A Best-of-Both Approach to Improve Match Predictions and Reciprocal Recommendations for Job Search [15.585641615174623]
This paper introduces and demonstrates a novel and practical solution to improve reciprocal recommendations in production by leveraging pseudo-match scores.
Specifically, our approach generates dense and more directly relevant pseudo-match scores by combining the true match labels with relatively inaccurate but dense match predictions.
Our method can be seen as a best-of-both (BoB) approach, as it combines the high-level ideas of both direct match prediction and the two separate models approach.
arXiv Detail & Related papers (2024-09-17T08:51:02Z) - Model Ensembling for Constrained Optimization [7.4351710906830375]
We consider a setting in which we wish to ensemble models for multidimensional output predictions that are in turn used for downstream optimization.
More precisely, we imagine we are given a number of models mapping a state space to multidimensional real-valued predictions.
These predictions form the coefficients of a linear objective that we would like to optimize under specified constraints.
We apply multicalibration techniques that lead to two provably efficient and convergent algorithms.
arXiv Detail & Related papers (2024-05-27T01:48:07Z) - A Minimaximalist Approach to Reinforcement Learning from Human Feedback [49.45285664482369]
We present Self-Play Preference Optimization (SPO), an algorithm for reinforcement learning from human feedback.
Our approach is minimalist in that it does not require training a reward model nor unstable adversarial training.
We demonstrate that on a suite of continuous control tasks, we are able to learn significantly more efficiently than reward-model based approaches.
arXiv Detail & Related papers (2024-01-08T17:55:02Z) - Scale Optimization Using Evolutionary Reinforcement Learning for Object
Detection on Drone Imagery [17.26524675722299]
This paper proposes an evolutionary reinforcement learning agent, integrated within a coarse-to-fine object detection framework, to optimize the scale for more effective detection of objects in such images.
A set of rewards measuring the localization accuracy, the accuracy of predicted labels, and the scale consistency among nearby patches are designed in the agent to guide the scale optimization.
arXiv Detail & Related papers (2023-12-23T10:49:55Z) - Semi-DETR: Semi-Supervised Object Detection with Detection Transformers [105.45018934087076]
We analyze the DETR-based framework on semi-supervised object detection (SSOD)
We present Semi-DETR, the first transformer-based end-to-end semi-supervised object detector.
Our method outperforms all state-of-the-art methods by clear margins.
arXiv Detail & Related papers (2023-07-16T16:32:14Z) - Towards Feature-Based Performance Regression Using Trajectory Data [0.9281671380673306]
Black-box optimization is a very active area of research, with many new algorithms being developed every year.
The variety of algorithms poses a meta-problem: which algorithm to choose for a given problem at hand?
Past research has shown that per-instance algorithm selection based on exploratory landscape analysis can be an efficient mean to tackle this meta-problem.
arXiv Detail & Related papers (2021-02-10T10:19:13Z) - Provably Efficient Algorithms for Multi-Objective Competitive RL [54.22598924633369]
We study multi-objective reinforcement learning (RL) where an agent's reward is represented as a vector.
In settings where an agent competes against opponents, its performance is measured by the distance of its average return vector to a target set.
We develop statistically and computationally efficient algorithms to approach the associated target set.
arXiv Detail & Related papers (2021-02-05T14:26:00Z) - Deep Shells: Unsupervised Shape Correspondence with Optimal Transport [52.646396621449]
We propose a novel unsupervised learning approach to 3D shape correspondence.
We show that the proposed method significantly improves over the state-of-the-art on multiple datasets.
arXiv Detail & Related papers (2020-10-28T22:24:07Z) - Pairwise Similarity Knowledge Transfer for Weakly Supervised Object
Localization [53.99850033746663]
We study the problem of learning localization model on target classes with weakly supervised image labels.
In this work, we argue that learning only an objectness function is a weak form of knowledge transfer.
Experiments on the COCO and ILSVRC 2013 detection datasets show that the performance of the localization model improves significantly with the inclusion of pairwise similarity function.
arXiv Detail & Related papers (2020-03-18T17:53:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.