Cinematic-L1 Video Stabilization with a Log-Homography Model
- URL: http://arxiv.org/abs/2011.08144v2
- Date: Fri, 20 Nov 2020 19:14:09 GMT
- Title: Cinematic-L1 Video Stabilization with a Log-Homography Model
- Authors: Arwen Bradley, Jason Klivington, Joseph Triscari, Rudolph van der
Merwe
- Abstract summary: We present a method for stabilizing handheld video that simulates the camera motions cinematographers achieve with equipment like tripods, dollies, and Steadicams.
Our method is computationally efficient, running at 300 fps on an iPhone XS, and yields high-quality results.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a method for stabilizing handheld video that simulates the camera
motions cinematographers achieve with equipment like tripods, dollies, and
Steadicams. We formulate a constrained convex optimization problem minimizing
the $\ell_1$-norm of the first three derivatives of the stabilized motion. Our
approach extends the work of Grundmann et al. [9] by solving with full
homographies (rather than affinities) in order to correct perspective,
preserving linearity by working in log-homography space. We also construct crop
constraints that preserve field-of-view; model the problem as a quadratic
(rather than linear) program to allow for an $\ell_2$ term encouraging fidelity
to the original trajectory; and add constraints and objectives to reduce
distortion. Furthermore, we propose new methods for handling salient objects
via both inclusion constraints and centering objectives. Finally, we describe a
windowing strategy to approximate the solution in linear time and bounded
memory. Our method is computationally efficient, running at 300fps on an iPhone
XS, and yields high-quality results, as we demonstrate with a collection of
stabilized videos, quantitative and qualitative comparisons to [9] and other
methods, and an ablation study.
Related papers
- Trust-Region Sequential Quadratic Programming for Stochastic Optimization with Random Models [57.52124921268249]
We propose a Trust Sequential Quadratic Programming method to find both first and second-order stationary points.
To converge to first-order stationary points, our method computes a gradient step in each iteration defined by minimizing a approximation of the objective subject.
To converge to second-order stationary points, our method additionally computes an eigen step to explore the negative curvature the reduced Hessian matrix.
arXiv Detail & Related papers (2024-09-24T04:39:47Z) - Vanishing Point Estimation in Uncalibrated Images with Prior Gravity
Direction [82.72686460985297]
We tackle the problem of estimating a Manhattan frame.
We derive two new 2-line solvers, one of which does not suffer from singularities affecting existing solvers.
We also design a new non-minimal method, running on an arbitrary number of lines, to boost the performance in local optimization.
arXiv Detail & Related papers (2023-08-21T13:03:25Z) - Fast Full-frame Video Stabilization with Iterative Optimization [21.962533235492625]
We propose an iterative optimization-based learning approach using synthetic datasets for video stabilization.
We develop a two-level (coarse-to-fine) stabilizing algorithm based on the probabilistic flow field.
We take a divide-and-conquer approach and propose a novel multiframe fusion strategy to render full-frame stabilized views.
arXiv Detail & Related papers (2023-07-24T13:24:19Z) - Towards Nonlinear-Motion-Aware and Occlusion-Robust Rolling Shutter
Correction [54.00007868515432]
Existing methods face challenges in estimating the accurate correction field due to the uniform velocity assumption.
We propose a geometry-based Quadratic Rolling Shutter (QRS) motion solver, which precisely estimates the high-order correction field of individual pixels.
Our method surpasses the state-of-the-art by +4.98, +0.77, and +4.33 of PSNR on Carla-RS, Fastec-RS, and BS-RSC datasets, respectively.
arXiv Detail & Related papers (2023-03-31T15:09:18Z) - Robust Camera Pose Refinement for Multi-Resolution Hash Encoding [19.655282240361835]
Multi-resolution hash encoding has been proposed to reduce the computational cost of neural renderings.
We propose a joint optimization algorithm to calibrate the camera pose and learn a geometric representation using efficient multi-resolution hash encoding.
arXiv Detail & Related papers (2023-02-03T06:49:27Z) - Minimum Latency Deep Online Video Stabilization [77.68990069996939]
We present a novel camera path optimization framework for the task of online video stabilization.
In this work, we adopt recent off-the-shelf high-quality deep motion models for motion estimation to recover the camera trajectory.
Our approach significantly outperforms state-of-the-art online methods both qualitatively and quantitatively.
arXiv Detail & Related papers (2022-12-05T07:37:32Z) - Towards Interpretable Video Super-Resolution via Alternating
Optimization [115.85296325037565]
We study a practical space-time video super-resolution (STVSR) problem which aims at generating a high-framerate high-resolution sharp video from a low-framerate blurry video.
We propose an interpretable STVSR framework by leveraging both model-based and learning-based methods.
arXiv Detail & Related papers (2022-07-21T21:34:05Z) - Neural Re-rendering for Full-frame Video Stabilization [144.9918806873405]
We present an algorithm for full-frame video stabilization by first estimating dense warp fields.
Full-frame stabilized frames can then be synthesized by fusing warped contents from neighboring frames.
arXiv Detail & Related papers (2021-02-11T18:59:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.