Low Pass Filter for Anti-aliasing in Temporal Action Localization
- URL: http://arxiv.org/abs/2104.11403v1
- Date: Fri, 23 Apr 2021 03:57:34 GMT
- Title: Low Pass Filter for Anti-aliasing in Temporal Action Localization
- Authors: Cece Jin, Yuanqi Chen, Ge Li, Tao Zhang, Thomas Li
- Abstract summary: This paper aims to verify the existence of aliasing in temporal action localization methods.
It investigates utilizing low pass filters to solve this problem by inhibiting the high-frequency band.
Experiments demonstrate that anti-aliasing with low pass filters in TAL is advantageous and efficient.
- Score: 15.139834271977913
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In temporal action localization methods, temporal downsampling operations are
widely used to extract proposal features, but they often lead to the aliasing
problem, due to lacking consideration of sampling rates. This paper aims to
verify the existence of aliasing in TAL methods and investigate utilizing low
pass filters to solve this problem by inhibiting the high-frequency band.
However, the high-frequency band usually contains large amounts of specific
information, which is important for model inference. Therefore, it is necessary
to make a tradeoff between anti-aliasing and reserving high-frequency
information. To acquire optimal performance, this paper learns different cutoff
frequencies for different instances dynamically. This design can be plugged
into most existing temporal modeling programs requiring only one additional
cutoff frequency parameter. Integrating low pass filters to the downsampling
operations significantly improves the detection performance and achieves
comparable results on THUMOS'14, ActivityNet~1.3, and Charades datasets.
Experiments demonstrate that anti-aliasing with low pass filters in TAL is
advantageous and efficient.
Related papers
- FilterNet: Harnessing Frequency Filters for Time Series Forecasting [34.83702192033196]
FilterNet is built upon our proposed learnable frequency filters to extract key informative temporal patterns by selectively passing or attenuating certain components of time series signals.
equipped with the two filters, FilterNet can approximately surrogate the linear and attention mappings widely adopted in time series literature.
arXiv Detail & Related papers (2024-11-03T16:20:41Z) - Frequency-aware Feature Fusion for Dense Image Prediction [99.85757278772262]
We propose Frequency-Aware Feature Fusion (FreqFusion) for dense image prediction tasks.
FreqFusion integrates an Adaptive Low-Pass Filter (ALPF) generator, an offset generator, and an Adaptive High-Pass Filter (AHPF) generator.
Comprehensive visualization and quantitative analysis demonstrate that FreqFusion effectively improves feature consistency and sharpens object boundaries.
arXiv Detail & Related papers (2024-08-23T07:30:34Z) - Freq-Mip-AA : Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields [3.796287987989994]
Mip-NeRF proposed using frustums to render a pixel and suggested integrated positional encoding (IPE)
While effective, this approach requires long training times due to its reliance on volumetric architecture.
We propose a novel anti-aliasing technique that utilizes grid-based representations, usually showing significantly faster training time.
arXiv Detail & Related papers (2024-06-19T06:33:56Z) - Frequency-Aware Deepfake Detection: Improving Generalizability through
Frequency Space Learning [81.98675881423131]
This research addresses the challenge of developing a universal deepfake detector that can effectively identify unseen deepfake images.
Existing frequency-based paradigms have relied on frequency-level artifacts introduced during the up-sampling in GAN pipelines to detect forgeries.
We introduce a novel frequency-aware approach called FreqNet, centered around frequency domain learning, specifically designed to enhance the generalizability of deepfake detectors.
arXiv Detail & Related papers (2024-03-12T01:28:00Z) - Post-Processing Temporal Action Detection [134.26292288193298]
Temporal Action Detection (TAD) methods typically take a pre-processing step in converting an input varying-length video into a fixed-length snippet representation sequence.
This pre-processing step would temporally downsample the video, reducing the inference resolution and hampering the detection performance in the original temporal resolution.
We introduce a novel model-agnostic post-processing method without model redesign and retraining.
arXiv Detail & Related papers (2022-11-27T19:50:37Z) - Optimally Band-Limited Noise Filtering for Single Qubit Gates [0.0]
We introduce a quantum control protocol that produces smooth, experimentally implementable control sequences optimized to combat temporally correlated noise for single qubit systems.
In particular, we identify regimes of optimal noise suppression and in turn, optimal control bandwidth directly proportional to the size of the frequency bands where the noise power is large.
arXiv Detail & Related papers (2022-06-07T18:00:01Z) - Adaptive Low-Pass Filtering using Sliding Window Gaussian Processes [71.23286211775084]
We propose an adaptive low-pass filter based on Gaussian process regression.
We show that the estimation error of the proposed method is uniformly bounded.
arXiv Detail & Related papers (2021-11-05T17:06:59Z) - Optimal Sequential Detection of Signals with Unknown Appearance and
Disappearance Points in Time [64.26593350748401]
The paper addresses a sequential changepoint detection problem, assuming that the duration of change may be finite and unknown.
We focus on a reliable maximin change detection criterion of maximizing the minimal probability of detection in a given time (or space) window.
The FMA algorithm is applied to detecting faint streaks of satellites in optical images.
arXiv Detail & Related papers (2021-02-02T04:58:57Z) - Delving Deeper into Anti-aliasing in ConvNets [42.82751522973616]
Aliasing refers to the phenomenon that high frequency signals degenerate into completely different ones after sampling.
We propose an adaptive content-aware low-pass filtering layer, which predicts separate filter weights for each spatial location and channel group.
arXiv Detail & Related papers (2020-08-21T17:56:04Z) - Change Point Detection in Time Series Data using Autoencoders with a
Time-Invariant Representation [69.34035527763916]
Change point detection (CPD) aims to locate abrupt property changes in time series data.
Recent CPD methods demonstrated the potential of using deep learning techniques, but often lack the ability to identify more subtle changes in the autocorrelation statistics of the signal.
We employ an autoencoder-based methodology with a novel loss function, through which the used autoencoders learn a partially time-invariant representation that is tailored for CPD.
arXiv Detail & Related papers (2020-08-21T15:03:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.