What Are Expected Queries in End-to-End Object Detection?
- URL: http://arxiv.org/abs/2206.01232v1
- Date: Thu, 2 Jun 2022 18:15:44 GMT
- Title: What Are Expected Queries in End-to-End Object Detection?
- Authors: Shilong Zhang, Xinjiang Wang, Jiaqi Wang, Jiangmiao Pang and Kai Chen
- Abstract summary: This paper shows that the expected queries should be COCO Distinct Queries (DDQ)
DDQ is stronger, more robust, and converges faster than previous methods.
It obtains 44.5 AP on the MSarity detection dataset with only 12 epochs.
- Score: 28.393693394478724
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: End-to-end object detection is rapidly progressed after the emergence of
DETR. DETRs use a set of sparse queries that replace the dense candidate boxes
in most traditional detectors. In comparison, the sparse queries cannot
guarantee a high recall as dense priors. However, making queries dense is not
trivial in current frameworks. It not only suffers from heavy computational
cost but also difficult optimization. As both sparse and dense queries are
imperfect, then \emph{what are expected queries in end-to-end object
detection}? This paper shows that the expected queries should be Dense Distinct
Queries (DDQ). Concretely, we introduce dense priors back to the framework to
generate dense queries. A duplicate query removal pre-process is applied to
these queries so that they are distinguishable from each other. The dense
distinct queries are then iteratively processed to obtain final sparse outputs.
We show that DDQ is stronger, more robust, and converges faster. It obtains
44.5 AP on the MS COCO detection dataset with only 12 epochs. DDQ is also
robust as it outperforms previous methods on both object detection and instance
segmentation tasks on various datasets. DDQ blends advantages from traditional
dense priors and recent end-to-end detectors. We hope it can serve as a new
baseline and inspires researchers to revisit the complementarity between
traditional methods and end-to-end detectors. The source code is publicly
available at \url{https://github.com/jshilong/DDQ}.
Related papers
- Group Testing for Accurate and Efficient Range-Based Near Neighbor Search for Plagiarism Detection [2.3814052021083354]
This work presents an adaptive group testing framework for the range-based high dimensional near neighbor search problem.
Our method efficiently marks each item in a database as neighbor or non-neighbor of a query point, based on a cosine distance threshold without exhaustive search.
We show that, using softmax-based features, our method achieves a more than ten-fold speed-up over exhaustive search with no loss of accuracy.
arXiv Detail & Related papers (2023-11-05T06:12:03Z) - Allies: Prompting Large Language Model with Beam Search [107.38790111856761]
In this work, we propose a novel method called ALLIES.
Given an input query, ALLIES leverages LLMs to iteratively generate new queries related to the original query.
By iteratively refining and expanding the scope of the original query, ALLIES captures and utilizes hidden knowledge that may not be directly through retrieval.
arXiv Detail & Related papers (2023-05-24T06:16:44Z) - Dense Distinct Query for End-to-End Object Detection [39.32011383066249]
One-to-one assignment in object detection has successfully obviated the need for non-maximum suppression.
This paper shows that the solution should be Dense Distinct Queries (DDQ)
DDQ blends the advantages of traditional and recent end-to-end detectors and significantly improves the performance of various detectors.
arXiv Detail & Related papers (2023-03-22T17:42:22Z) - CQE in OWL 2 QL: A "Longest Honeymoon" Approach (extended version) [13.169982133542266]
We study a dynamic CQE method, namely, we propose to alter the answer to the current query based on the evaluation of previous ones.
We aim at a system that, besides being able to protect confidential data, is maximally cooperative.
arXiv Detail & Related papers (2022-07-22T15:51:15Z) - Progressive End-to-End Object Detection in Crowded Scenes [96.92416613336096]
Previous query-based detectors suffer from two drawbacks: first, multiple predictions will be inferred for a single object, typically in crowded scenes; second, the performance saturates as the depth of the decoding stage increases.
We propose a progressive predicting method to address the above issues. Specifically, we first select accepted queries to generate true positive predictions, then refine the rest noisy queries according to the previously accepted predictions.
Experiments show that our method can significantly boost the performance of query-based detectors in crowded scenes.
arXiv Detail & Related papers (2022-03-15T06:12:00Z) - Anchor DETR: Query Design for Transformer-Based Detector [24.925317590675203]
We propose a novel query design for the transformer-based detectors.
Object queries are based on anchor points, which are widely used in CNN-based detectors.
Our design can predict multiple objects at one position to solve the difficulty: "one region, multiple objects"
arXiv Detail & Related papers (2021-09-15T06:31:55Z) - Open Question Answering over Tables and Text [55.8412170633547]
In open question answering (QA), the answer to a question is produced by retrieving and then analyzing documents that might contain answers to the question.
Most open QA systems have considered only retrieving information from unstructured text.
We present a new large-scale dataset Open Table-and-Text Question Answering (OTT-QA) to evaluate performance on this task.
arXiv Detail & Related papers (2020-10-20T16:48:14Z) - The Sparse Vector Technique, Revisited [67.57692396665915]
We revisit one of the most basic and widely applicable techniques in the literature of differential privacy.
This simple algorithm privately tests whether the value of a given query on a database is close to what we expect it to be.
We suggest an alternative, equally simple, algorithm that can continue testing queries as long as any single individual does not contribute to the answer of too many queries.
arXiv Detail & Related papers (2020-10-02T10:50:52Z) - Progressively Pretrained Dense Corpus Index for Open-Domain Question
Answering [87.32442219333046]
We propose a simple and resource-efficient method to pretrain the paragraph encoder.
Our method outperforms an existing dense retrieval method that uses 7 times more computational resources for pretraining.
arXiv Detail & Related papers (2020-04-30T18:09:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.