Focus on Local Regions for Query-based Object Detection
- URL: http://arxiv.org/abs/2310.06470v3
- Date: Thu, 14 Dec 2023 03:22:48 GMT
- Title: Focus on Local Regions for Query-based Object Detection
- Authors: Hongbin Xu, Yamei Xia, Shuai Zhao, Bo Cheng
- Abstract summary: We propose FoLR, a transformer-like architecture with only decoders.
We improve the self-attention by isolating connections between irrelevant objects.
We also design the adaptive sampling method to extract effective features based on queries' local regions.
- Score: 14.982147587695652
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Query-based methods have garnered significant attention in object detection
since the advent of DETR, the pioneering query-based detector. However, these
methods face challenges like slow convergence and suboptimal performance.
Notably, self-attention in object detection often hampers convergence due to
its global focus. To address these issues, we propose FoLR, a transformer-like
architecture with only decoders. We improve the self-attention by isolating
connections between irrelevant objects that makes it focus on local regions but
not global regions. We also design the adaptive sampling method to extract
effective features based on queries' local regions from feature maps.
Additionally, we employ a look-back strategy for decoders to retain previous
information, followed by the Feature Mixer module to fuse features and queries.
Experimental results demonstrate FoLR's state-of-the-art performance in
query-based detectors, excelling in convergence speed and computational
efficiency.
Index Terms: Local regions, Attention mechanism, Object detection
Related papers
- Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - DQnet: Cross-Model Detail Querying for Camouflaged Object Detection [54.82390534024954]
A convolutional neural network (CNN) for camouflaged object detection tends to activate local discriminative regions while ignoring complete object extent.
In this paper, we argue that partial activation is caused by the intrinsic characteristics of CNN.
In order to obtain feature maps that could activate full object extent, a novel framework termed Cross-Model Detail Querying network (DQnet) is proposed.
arXiv Detail & Related papers (2022-12-16T06:23:58Z) - Bagging Regional Classification Activation Maps for Weakly Supervised
Object Localization [11.25759292976175]
BagCAMs is a plug-and-play mechanism to better project a well-trained classifier for the localization task.
Our BagCAMs adopts a proposed regional localizer generation strategy to define a set of regional localizers.
Experiments indicate that adopting our proposed BagCAMs can improve the performance of baseline WSOL methods.
arXiv Detail & Related papers (2022-07-16T03:03:01Z) - Delving into Sequential Patches for Deepfake Detection [64.19468088546743]
Recent advances in face forgery techniques produce nearly untraceable deepfake videos, which could be leveraged with malicious intentions.
Previous studies has identified the importance of local low-level cues and temporal information in pursuit to generalize well across deepfake methods.
We propose the Local- & Temporal-aware Transformer-based Deepfake Detection framework, which adopts a local-to-global learning protocol.
arXiv Detail & Related papers (2022-07-06T16:46:30Z) - Robust Object Detection via Instance-Level Temporal Cycle Confusion [89.1027433760578]
We study the effectiveness of auxiliary self-supervised tasks to improve the out-of-distribution generalization of object detectors.
Inspired by the principle of maximum entropy, we introduce a novel self-supervised task, instance-level temporal cycle confusion (CycConf)
For each object, the task is to find the most different object proposals in the adjacent frame in a video and then cycle back to itself for self-supervision.
arXiv Detail & Related papers (2021-04-16T21:35:08Z) - Slender Object Detection: Diagnoses and Improvements [74.40792217534]
In this paper, we are concerned with the detection of a particular type of objects with extreme aspect ratios, namely textbfslender objects.
For a classical object detection method, a drastic drop of $18.9%$ mAP on COCO is observed, if solely evaluated on slender objects.
arXiv Detail & Related papers (2020-11-17T09:39:42Z) - Local Context Attention for Salient Object Segmentation [5.542044768017415]
We propose a novel Local Context Attention Network (LCANet) to generate locally reinforcement feature maps in a uniform representational architecture.
The proposed network introduces an Attentional Correlation Filter (ACF) module to generate explicit local attention by calculating the correlation feature map between coarse prediction and global context.
Comprehensive experiments are conducted on several salient object segmentation datasets, demonstrating the superior performance of the proposed LCANet against the state-of-the-art methods.
arXiv Detail & Related papers (2020-09-24T09:20:06Z) - Hierarchical Context Embedding for Region-based Object Detection [40.9463003508027]
Hierarchical Context Embedding (HCE) framework can be applied as a plug-and-play component.
To advance the recognition of context-dependent object categories, we propose an image-level categorical embedding module.
Novel RoI features are generated by exploiting hierarchically embedded context information beneath both whole images and interested regions.
arXiv Detail & Related papers (2020-08-04T05:33:22Z) - Scope Head for Accurate Localization in Object Detection [135.9979405835606]
We propose a novel detector coined as ScopeNet, which models anchors of each location as a mutually dependent relationship.
With our concise and effective design, the proposed ScopeNet achieves state-of-the-art results on COCO.
arXiv Detail & Related papers (2020-05-11T04:00:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.