A survey on Kornia: an Open Source Differentiable Computer Vision
Library for PyTorch
- URL: http://arxiv.org/abs/2009.10521v1
- Date: Mon, 21 Sep 2020 08:48:28 GMT
- Title: A survey on Kornia: an Open Source Differentiable Computer Vision
Library for PyTorch
- Authors: E. Riba, D. Mishkin, J. Shi, D. Ponsa, F. Moreno-Noguer and G. Bradski
- Abstract summary: This work presents Kornia, an open source computer vision library built upon a set of differentiable routines and modules that aims to solve generic computer vision problems.
The package uses PyTorch as its main backend, not only for efficiency but also to take advantage of the reverse auto-differentiation engine to define and compute the gradient of complex functions.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This work presents Kornia, an open source computer vision library built upon
a set of differentiable routines and modules that aims to solve generic
computer vision problems. The package uses PyTorch as its main backend, not
only for efficiency but also to take advantage of the reverse
auto-differentiation engine to define and compute the gradient of complex
functions. Inspired by OpenCV, Kornia is composed of a set of modules
containing operators that can be integrated into neural networks to train
models to perform a wide range of operations including image
transformations,camera calibration, epipolar geometry, and low level image
processing techniques, such as filtering and edge detection that operate
directly on high dimensional tensor representations on graphical processing
units, generating faster systems. Examples of classical vision problems
implemented using our framework are provided including a benchmark comparing to
existing vision libraries.
Related papers
- DiffCSG: Differentiable CSG via Rasterization [42.48127201787373]
Differentiable rendering is a key ingredient for inverse rendering and machine learning.
We present an algorithm, DiffCSG, to render ConstructiveSolidGeometry (CSG) in a differentiable manner.
Our algorithm is simple and fast, can be easily incorporated into modern machine learning setups, and enables a range of applications.
arXiv Detail & Related papers (2024-09-02T18:57:07Z) - Parameter-Inverted Image Pyramid Networks [49.35689698870247]
We propose a novel network architecture known as the Inverted Image Pyramid Networks (PIIP)
Our core idea is to use models with different parameter sizes to process different resolution levels of the image pyramid.
PIIP achieves superior performance in tasks such as object detection, segmentation, and image classification.
arXiv Detail & Related papers (2024-06-06T17:59:10Z) - Vision Transformer with Convolutions Architecture Search [72.70461709267497]
We propose an architecture search method-Vision Transformer with Convolutions Architecture Search (VTCAS)
The high-performance backbone network searched by VTCAS introduces the desirable features of convolutional neural networks into the Transformer architecture.
It enhances the robustness of the neural network for object recognition, especially in the low illumination indoor scene.
arXiv Detail & Related papers (2022-03-20T02:59:51Z) - PnP-DETR: Towards Efficient Visual Analysis with Transformers [146.55679348493587]
Recently, DETR pioneered the solution vision tasks with transformers, it directly translates the image feature map into the object result.
Recent transformer-based image recognition model andTT show consistent efficiency gain.
arXiv Detail & Related papers (2021-09-15T01:10:30Z) - Deep Features for training Support Vector Machine [16.795405355504077]
This paper develops a generic computer vision system based on features extracted from trained CNNs.
Multiple learned features are combined into a single structure to work on different image classification tasks.
arXiv Detail & Related papers (2021-04-08T03:13:09Z) - Deep Learning Framework From Scratch Using Numpy [0.0]
This work is a rigorous development of a complete and general-purpose deep learning framework from the ground up.
The fundamental components of deep learning are developed from elementary calculus and implemented in a sensible object-oriented approach using only Python and the Numpy library.
Demonstrations of solved problems using the framework, named ArrayFlow, include a computer vision classification task, solving for the shape of a catenary, and a 2nd order differential equation.
arXiv Detail & Related papers (2020-11-17T06:28:05Z) - Captum: A unified and generic model interpretability library for PyTorch [49.72749684393332]
We introduce a novel, unified, open-source model interpretability library for PyTorch.
The library contains generic implementations of a number of gradient and perturbation-based attribution algorithms.
It can be used for both classification and non-classification models.
arXiv Detail & Related papers (2020-09-16T18:57:57Z) - A Flexible Framework for Designing Trainable Priors with Adaptive
Smoothing and Game Encoding [57.1077544780653]
We introduce a general framework for designing and training neural network layers whose forward passes can be interpreted as solving non-smooth convex optimization problems.
We focus on convex games, solved by local agents represented by the nodes of a graph and interacting through regularization functions.
This approach is appealing for solving imaging problems, as it allows the use of classical image priors within deep models that are trainable end to end.
arXiv Detail & Related papers (2020-06-26T08:34:54Z) - Image Matching across Wide Baselines: From Paper to Practice [80.9424750998559]
We introduce a comprehensive benchmark for local features and robust estimation algorithms.
Our pipeline's modular structure allows easy integration, configuration, and combination of different methods.
We show that with proper settings, classical solutions may still outperform the perceived state of the art.
arXiv Detail & Related papers (2020-03-03T15:20:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.