DeepLab2: A TensorFlow Library for Deep Labeling
- URL: http://arxiv.org/abs/2106.09748v1
- Date: Thu, 17 Jun 2021 18:04:53 GMT
- Title: DeepLab2: A TensorFlow Library for Deep Labeling
- Authors: Mark Weber, Huiyu Wang, Siyuan Qiao, Jun Xie, Maxwell D. Collins,
Yukun Zhu, Liangzhe Yuan, Dahun Kim, Qihang Yu, Daniel Cremers, Laura
Leal-Taixe, Alan L. Yuille, Florian Schroff, Hartwig Adam, Liang-Chieh Chen
- Abstract summary: DeepLab2 is a library for deep labeling for general dense pixel prediction problems in computer vision.
DeepLab2 includes all our recently developed DeepLab model variants with pretrained checkpoints as well as model training and evaluation code.
To showcase the effectiveness of DeepLab2, our Panoptic-DeepLab employing Axial-SWideRNet as network backbone achieves 68.0% PQ or 83.5% mIoU on Cityscaspes validation set.
- Score: 118.95446843615049
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a
state-of-the-art and easy-to-use TensorFlow codebase for general dense pixel
prediction problems in computer vision. DeepLab2 includes all our recently
developed DeepLab model variants with pretrained checkpoints as well as model
training and evaluation code, allowing the community to reproduce and further
improve upon the state-of-art systems. To showcase the effectiveness of
DeepLab2, our Panoptic-DeepLab employing Axial-SWideRNet as network backbone
achieves 68.0% PQ or 83.5% mIoU on Cityscaspes validation set, with only
single-scale inference and ImageNet-1K pretrained checkpoints. We hope that
publicly sharing our library could facilitate future research on dense pixel
labeling tasks and envision new applications of this technology. Code is made
publicly available at \url{https://github.com/google-research/deeplab2}.
Related papers
- Comgra: A Tool for Analyzing and Debugging Neural Networks [35.89730807984949]
We introduce comgra, an open source python library for use with PyTorch.
Comgra extracts data about the internal activations of a model and organizes it in a GUI.
It can show both summary statistics and individual data points, compare early and late stages of training, focus on individual samples of interest, and visualize the flow of the gradient through the network.
arXiv Detail & Related papers (2024-07-31T14:57:23Z) - Exploring Green AI for Audio Deepfake Detection [21.17957700009653]
State-of-the-art audio deepfake detectors leveraging deep neural networks exhibit impressive recognition performance.
Deep NLP models produce around 626k lbs of COtextsubscript2 which is equivalent to five times of average US car emission at its lifetime.
This study presents a novel framework for audio deepfake detection that can be seamlessly trained using standard CPU resources.
arXiv Detail & Related papers (2024-03-21T10:54:21Z) - NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth
Supervision for Indoor Multi-View 3D Detection [72.0098999512727]
NeRF-Det has achieved impressive performance in indoor multi-view 3D detection by utilizing NeRF to enhance representation learning.
We present three corresponding solutions, including semantic enhancement, perspective-aware sampling, and ordinal depth supervision.
The resulting algorithm, NeRF-Det++, has exhibited appealing performance in the ScanNetV2 and AR KITScenes datasets.
arXiv Detail & Related papers (2024-02-22T11:48:06Z) - BayesDLL: Bayesian Deep Learning Library [29.624531252627484]
We release a new Bayesian neural network library for PyTorch for large-scale deep networks.
Our library implements mainstream inference algorithms: variational inference, MC-dropout, approximate-gradient MCMC, and Laplace approximation.
arXiv Detail & Related papers (2023-09-22T15:27:54Z) - torchgfn: A PyTorch GFlowNet library [56.071033896777784]
torchgfn is a PyTorch library that aims to address this need.
It provides users with a simple API for environments and useful abstractions for samplers and losses.
arXiv Detail & Related papers (2023-05-24T00:20:59Z) - Sparse Auxiliary Networks for Unified Monocular Depth Prediction and
Completion [56.85837052421469]
Estimating scene geometry from data obtained with cost-effective sensors is key for robots and self-driving cars.
In this paper, we study the problem of predicting dense depth from a single RGB image with optional sparse measurements from low-cost active depth sensors.
We introduce Sparse Networks (SANs), a new module enabling monodepth networks to perform both the tasks of depth prediction and completion.
arXiv Detail & Related papers (2021-03-30T21:22:26Z) - TensorX: Extensible API for Neural Network Model Design and Deployment [0.0]
TensorFlowX is a Python library for prototyping, design, and deployment of complex neural network models in computation.
A special emphasis is put on ease of use, performance, and API consistency.
arXiv Detail & Related papers (2020-12-29T00:15:38Z) - Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and
Benchmarking [4.721069729610892]
Deep reinforcement learning has been one of the fastest growing fields of machine learning over the past years and numerous libraries have been open sourced to support research.
This paper introduces Tonic, a Python library allowing researchers to quickly implement new ideas and measure their importance.
arXiv Detail & Related papers (2020-11-15T14:10:37Z) - Semi-supervised deep learning based on label propagation in a 2D
embedded space [117.9296191012968]
Proposed solutions propagate labels from a small set of supervised images to a large set of unsupervised ones to train a deep neural network model.
We present a loop in which a deep neural network (VGG-16) is trained from a set with more correctly labeled samples along iterations.
As the labeled set improves along iterations, it improves the features of the neural network.
arXiv Detail & Related papers (2020-08-02T20:08:54Z) - Accurate RGB-D Salient Object Detection via Collaborative Learning [101.82654054191443]
RGB-D saliency detection shows impressive ability on some challenge scenarios.
We propose a novel collaborative learning framework where edge, depth and saliency are leveraged in a more efficient way.
arXiv Detail & Related papers (2020-07-23T04:33:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.