Related papers: Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features

Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features

URL: http://arxiv.org/abs/2206.01202v1
Date: Thu, 2 Jun 2022 17:59:57 GMT
Title: Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features
Authors: Chieh Hubert Lin, Hsin-Ying Lee, Hung-Yu Tseng, Maneesh Singh, Ming-Hsuan Yang
Abstract summary: Recent studies show that paddings in convolutional neural networks encode absolute position information. Existing metrics for quantifying the strength of positional information remain unreliable. We propose novel metrics for measuring (and visualizing) the encoded positional information.
Score: 75.62755703738696
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent studies show that paddings in convolutional neural networks encode absolute position information which can negatively affect the model performance for certain tasks. However, existing metrics for quantifying the strength of positional information remain unreliable and frequently lead to erroneous results. To address this issue, we propose novel metrics for measuring (and visualizing) the encoded positional information. We formally define the encoded information as PPP (Position-information Pattern from Padding) and conduct a series of experiments to study its properties as well as its formation. The proposed metrics measure the presence of positional information more reliably than the existing metrics based on PosENet and a test in F-Conv. We also demonstrate that for any extant (and proposed) padding schemes, PPP is primarily a learning artifact and is less dependent on the characteristics of the underlying padding schemes.

Related papers

Multi-Point Positional Insertion Tuning for Small Object Detection [10.852047082856487]
Small object detection aims to localize and classify small objects within images. Finetuning pretrained object detection models is computationally and memory expensive. This paper introduces multi-point positional insertion (MPI) tuning, a parameter-efficient finetuning (PEFT) method for small object detection.
arXiv Detail & Related papers (2024-12-24T02:04:47Z)
SSPNet: Scale and Spatial Priors Guided Generalizable and Interpretable Pedestrian Attribute Recognition [23.55622798950833]
A novel Scale and Spatial Priors Guided Network (SSPNet) is proposed for Pedestrian Attribute Recognition (PAR) models. SSPNet learns to provide reasonable scale prior information for different attribute groups, allowing the model to focus on different levels of feature maps. A novel IoU based attribute localization metric is proposed for Weakly-supervised Pedestrian Attribute localization (WPAL) based on the improved Grad-CAM for attribute response mask.
arXiv Detail & Related papers (2023-12-11T00:41:40Z)
CPPF++: Uncertainty-Aware Sim2Real Object Pose Estimation by Vote Aggregation [67.12857074801731]
We introduce a novel method, CPPF++, designed for sim-to-real pose estimation. To address the challenge posed by vote collision, we propose a novel approach that involves modeling the voting uncertainty. We incorporate several innovative modules, including noisy pair filtering, online alignment optimization, and a feature ensemble.
arXiv Detail & Related papers (2022-11-24T03:27:00Z)
PNI : Industrial Anomaly Detection using Position and Neighborhood Information [6.316693022958221]
We propose a new algorithm, textbfPNI, which estimates the normal distribution using conditional probability given neighborhood features. We conducted experiments on the MVTec AD benchmark dataset and achieved state-of-the-art performance, with textbf99.56% and textbf98.98% AUROC scores in anomaly detection and localization.
arXiv Detail & Related papers (2022-11-22T23:45:27Z)
Toward Reliable Neural Specifications [3.2722498341029653]
Existing specifications for neural networks are in the paradigm of data as specification. We propose a new family of specifications called neural representation as specification. We show that by using NAP, we can verify the prediction of the entire input space, while still recalling 84% of the data.
arXiv Detail & Related papers (2022-10-28T13:21:28Z)
SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object Detection [78.90102636266276]
We propose a novel set abstraction method named Semantics-Augmented Set Abstraction (SASA) Based on the estimated point-wise foreground scores, we then propose a semantics-guided point sampling algorithm to help retain more important foreground points during down-sampling. In practice, SASA shows to be effective in identifying valuable points related to foreground objects and improving feature learning for point-based 3D detection.
arXiv Detail & Related papers (2022-01-06T08:54:47Z)
Transformers Can Do Bayesian Inference [56.99390658880008]
We present Prior-Data Fitted Networks (PFNs) PFNs leverage in-context learning in large-scale machine learning techniques to approximate a large set of posteriors. We demonstrate that PFNs can near-perfectly mimic Gaussian processes and also enable efficient Bayesian inference for intractable problems.
arXiv Detail & Related papers (2021-12-20T13:07:39Z)
CertainNet: Sampling-free Uncertainty Estimation for Object Detection [65.28989536741658]
Estimating the uncertainty of a neural network plays a fundamental role in safety-critical settings. In this work, we propose a novel sampling-free uncertainty estimation method for object detection. We call it CertainNet, and it is the first to provide separate uncertainties for each output signal: objectness, class, location and size.
arXiv Detail & Related papers (2021-10-04T17:59:31Z)
Quantifying point cloud realism through adversarially learned latent representations [0.38233569758620056]
This paper presents a novel approach to quantify the realism of local regions in LiDAR point clouds. The resulting metric can assign a quality score to samples without requiring any task specific annotations. As one important application, we demonstrate how the local realism score can be used for anomaly detection in point clouds.
arXiv Detail & Related papers (2021-09-24T07:17:27Z)
Layer-wise Characterization of Latent Information Leakage in Federated Learning [9.397152006395174]
Training deep neural networks via federated learning allows clients to share, instead of the original data, only the model trained on their data. Prior work has demonstrated that in practice a client's private information, unrelated to the main learning task, can be discovered from the model's gradients. There is still no formal approach for quantifying the leakage of private information via the shared updated model or gradients.
arXiv Detail & Related papers (2020-10-17T10:49:14Z)
Representation Learning for Sequence Data with Deep Autoencoding Predictive Components [96.42805872177067]
We propose a self-supervised representation learning method for sequence data, based on the intuition that useful representations of sequence data should exhibit a simple structure in the latent space. We encourage this latent structure by maximizing an estimate of predictive information of latent feature sequences, which is the mutual information between past and future windows at each time step. We demonstrate that our method recovers the latent space of noisy dynamical systems, extracts predictive features for forecasting tasks, and improves automatic speech recognition when used to pretrain the encoder on large amounts of unlabeled data.
arXiv Detail & Related papers (2020-10-07T03:34:01Z)

This list is automatically generated from the titles and abstracts of the papers in this site.