Sparsity-guided Network Design for Frame Interpolation
- URL: http://arxiv.org/abs/2209.04551v1
- Date: Fri, 9 Sep 2022 23:13:25 GMT
- Title: Sparsity-guided Network Design for Frame Interpolation
- Authors: Tianyu Ding, Luming Liang, Zhihui Zhu, Tianyi Chen, Ilya Zharkov
- Abstract summary: We present a compression-driven network design for frame-based algorithms.
We leverage model pruning through sparsity-inducing optimization to greatly reduce the model size.
We achieve a considerable performance gain with a quarter of the size of the original AdaCoF.
- Score: 39.828644638174225
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: DNN-based frame interpolation, which generates intermediate frames from two
consecutive frames, is often dependent on model architectures with a large
number of features, preventing their deployment on systems with limited
resources, such as mobile devices. We present a compression-driven network
design for frame interpolation that leverages model pruning through
sparsity-inducing optimization to greatly reduce the model size while attaining
higher performance. Concretely, we begin by compressing the recently proposed
AdaCoF model and demonstrating that a 10 times compressed AdaCoF performs
similarly to its original counterpart, where different strategies for using
layerwise sparsity information as a guide are comprehensively investigated
under a variety of hyperparameter settings. We then enhance this compressed
model by introducing a multi-resolution warping module, which improves visual
consistency with multi-level details. As a result, we achieve a considerable
performance gain with a quarter of the size of the original AdaCoF. In
addition, our model performs favorably against other state-of-the-art
approaches on a wide variety of datasets. We note that the suggested
compression-driven framework is generic and can be easily transferred to other
DNN-based frame interpolation algorithms. The source code is available at
https://github.com/tding1/CDFI.
Related papers
- A Lightweight Feature Fusion Architecture For Resource-Constrained Crowd
Counting [3.5066463427087777]
We introduce two lightweight models to enhance the versatility of crowd-counting models.
These models maintain the same downstream architecture while incorporating two distinct backbones: MobileNet and MobileViT.
We leverage Adjacent Feature Fusion to extract diverse scale features from a Pre-Trained Model (PTM) and subsequently combine these features seamlessly.
arXiv Detail & Related papers (2024-01-11T15:13:31Z) - AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation [80.33846577924363]
We present All-Pairs Multi-Field Transforms (AMT), a new network architecture for video framegithub.
It is based on two essential designs. First, we build bidirectional volumes for all pairs of pixels, and use the predicted bilateral flows to retrieve correlations.
Second, we derive multiple groups of fine-grained flow fields from one pair of updated coarse flows for performing backward warping on the input frames separately.
arXiv Detail & Related papers (2023-04-19T16:18:47Z) - Slimmable Domain Adaptation [112.19652651687402]
We introduce a simple framework, Slimmable Domain Adaptation, to improve cross-domain generalization with a weight-sharing model bank.
Our framework surpasses other competing approaches by a very large margin on multiple benchmarks.
arXiv Detail & Related papers (2022-06-14T06:28:04Z) - Triple-level Model Inferred Collaborative Network Architecture for Video
Deraining [43.06607185181434]
We develop a model-guided triple-level optimization framework to deduce network architecture with cooperating optimization and auto-searching mechanism.
Our model shows significant improvements in fidelity and temporal consistency over the state-of-the-art works.
arXiv Detail & Related papers (2021-11-08T13:09:00Z) - Multi-Scale Semantics-Guided Neural Networks for Efficient
Skeleton-Based Human Action Recognition [140.18376685167857]
A simple yet effective multi-scale semantics-guided neural network is proposed for skeleton-based action recognition.
MS-SGN achieves the state-of-the-art performance on the NTU60, NTU120, and SYSU datasets.
arXiv Detail & Related papers (2021-11-07T03:50:50Z) - CDFI: Compression-Driven Network Design for Frame Interpolation [24.205673014182356]
We propose a compression-driven network design for frame size.
We show that a 10X compressed AdaCoF model performs similarly as its original counterpart.
Our model performs favorably against other state-of-the-arts in a broad range of datasets.
arXiv Detail & Related papers (2021-03-18T22:59:42Z) - Collegial Ensembles [11.64359837358763]
We show that collegial ensembles can be efficiently implemented in practical architectures using group convolutions and block diagonal layers.
We also show how our framework can be used to analytically derive optimal group convolution modules without having to train a single model.
arXiv Detail & Related papers (2020-06-13T16:40:26Z) - A Generic Network Compression Framework for Sequential Recommender
Systems [71.81962915192022]
Sequential recommender systems (SRS) have become the key technology in capturing user's dynamic interests and generating high-quality recommendations.
We propose a compressed sequential recommendation framework, termed as CpRec, where two generic model shrinking techniques are employed.
By the extensive ablation studies, we demonstrate that the proposed CpRec can achieve up to 4$sim$8 times compression rates in real-world SRS datasets.
arXiv Detail & Related papers (2020-04-21T08:40:55Z) - Normalizing Flows with Multi-Scale Autoregressive Priors [131.895570212956]
We introduce channel-wise dependencies in their latent space through multi-scale autoregressive priors (mAR)
Our mAR prior for models with split coupling flow layers (mAR-SCF) can better capture dependencies in complex multimodal data.
We show that mAR-SCF allows for improved image generation quality, with gains in FID and Inception scores compared to state-of-the-art flow-based models.
arXiv Detail & Related papers (2020-04-08T09:07:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.