iffDetector: Inference-aware Feature Filtering for Object Detection
- URL: http://arxiv.org/abs/2006.12708v1
- Date: Tue, 23 Jun 2020 02:57:29 GMT
- Title: iffDetector: Inference-aware Feature Filtering for Object Detection
- Authors: Mingyuan Mao, Yuxin Tian, Baochang Zhang, Qixiang Ye, Wanquan Liu,
Guodong Guo, David Doermann
- Abstract summary: We introduce a generic Inference-aware Feature Filtering (IFF) module that can easily be combined with modern detectors.
IFF performs closed-loop optimization by leveraging high-level semantics to enhance the convolutional features.
IFF can be fused with CNN-based object detectors in a plug-and-play manner with negligible computational cost overhead.
- Score: 70.8678270164057
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Modern CNN-based object detectors focus on feature configuration during
training but often ignore feature optimization during inference. In this paper,
we propose a new feature optimization approach to enhance features and suppress
background noise in both the training and inference stages. We introduce a
generic Inference-aware Feature Filtering (IFF) module that can easily be
combined with modern detectors, resulting in our iffDetector. Unlike
conventional open-loop feature calculation approaches without feedback, the IFF
module performs closed-loop optimization by leveraging high-level semantics to
enhance the convolutional features. By applying Fourier transform analysis, we
demonstrate that the IFF module acts as a negative feedback that theoretically
guarantees the stability of feature learning. IFF can be fused with CNN-based
object detectors in a plug-and-play manner with negligible computational cost
overhead. Experiments on the PASCAL VOC and MS COCO datasets demonstrate that
our iffDetector consistently outperforms state-of-the-art methods by
significant margins\footnote{The test code and model are anonymously available
in https://github.com/anonymous2020new/iffDetector }.
Related papers
- $\textit{X}^2$-DFD: A framework for e${X}$plainable and e${X}$tendable Deepfake Detection [52.14468236527728]
We propose a novel framework called $X2$-DFD, consisting of three core modules.
The first module, Model Feature Assessment (MFA), measures the detection capabilities of forgery features intrinsic to MLLMs, and gives a descending ranking of these features.
The second module, Strong Feature Strengthening (SFS), enhances the detection and explanation capabilities by fine-tuning the MLLM on a dataset constructed based on the top-ranked features.
The third module, Weak Feature Supplementing (WFS), improves the fine-tuned MLLM's capabilities on lower-ranked features by integrating external dedicated
arXiv Detail & Related papers (2024-10-08T15:28:33Z) - Single image super-resolution based on trainable feature matching attention network [0.0]
Convolutional Neural Networks (CNNs) have been widely employed for image Super-Resolution (SR)
We introduce Trainable Feature Matching (TFM) to amalgamate explicit feature learning into CNNs, augmenting their representation capabilities.
We also propose a streamlined variant called Same-size-divided Region-level Non-Local (SRNL) to alleviate the computational demands of non-local operations.
arXiv Detail & Related papers (2024-05-29T08:31:54Z) - Frequency-Aware Deepfake Detection: Improving Generalizability through
Frequency Space Learning [81.98675881423131]
This research addresses the challenge of developing a universal deepfake detector that can effectively identify unseen deepfake images.
Existing frequency-based paradigms have relied on frequency-level artifacts introduced during the up-sampling in GAN pipelines to detect forgeries.
We introduce a novel frequency-aware approach called FreqNet, centered around frequency domain learning, specifically designed to enhance the generalizability of deepfake detectors.
arXiv Detail & Related papers (2024-03-12T01:28:00Z) - Deep Frequency Filtering for Domain Generalization [55.66498461438285]
Deep Neural Networks (DNNs) have preferences for some frequency components in the learning process.
We propose Deep Frequency Filtering (DFF) for learning domain-generalizable features.
We show that applying our proposed DFF on a plain baseline outperforms the state-of-the-art methods on different domain generalization tasks.
arXiv Detail & Related papers (2022-03-23T05:19:06Z) - TRACER: Extreme Attention Guided Salient Object Tracing Network [3.2434811678562676]
We propose TRACER, which detects salient objects with explicit edges by incorporating attention guided tracing modules.
A comparison with 13 existing methods reveals that TRACER achieves state-of-the-art performance on five benchmark datasets.
arXiv Detail & Related papers (2021-12-14T13:20:07Z) - Towards Better Object Detection in Scale Variation with Adaptive Feature
Selection [3.5352273012717044]
We propose a novel adaptive feature selection module (AFSM) to automatically learn the way to fuse multi-level representations in the channel dimension.
It significantly improves the performance of the detectors that have a feature pyramid structure.
A class-aware sampling mechanism (CASM) is proposed to tackle the class imbalance problem.
arXiv Detail & Related papers (2020-12-06T13:41:20Z) - Feature Flow: In-network Feature Flow Estimation for Video Object
Detection [56.80974623192569]
Optical flow is widely used in computer vision tasks to provide pixel-level motion information.
A common approach is to:forward optical flow to a neural network and fine-tune this network on the task dataset.
We propose a novel network (IFF-Net) with an textbfIn-network textbfFeature textbfFlow estimation module for video object detection.
arXiv Detail & Related papers (2020-09-21T07:55:50Z) - Unsupervised Feedforward Feature (UFF) Learning for Point Cloud
Classification and Segmentation [57.62713515497585]
Unsupervised feedforward feature learning is proposed for joint classification and segmentation of 3D point clouds.
The UFF method exploits statistical correlations of points in a point cloud set to learn shape and point features in a one-pass feedforward manner.
It learns global shape features through the encoder and local point features through the encoder-decoder architecture.
arXiv Detail & Related papers (2020-09-02T18:25:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.