TransMatting: Enhancing Transparent Objects Matting with Transformers
- URL: http://arxiv.org/abs/2208.03007v1
- Date: Fri, 5 Aug 2022 06:44:14 GMT
- Title: TransMatting: Enhancing Transparent Objects Matting with Transformers
- Authors: Huanqia Cai, Fanglei Xue, Lele Xu, Lili Guo
- Abstract summary: We propose a Transformer-based network, TransMatting, to model transparent objects with a big receptive field.
A small convolutional network is proposed to utilize the global feature and non-background mask to guide the multi-scale feature propagation from encoder to decoder.
We create a high-resolution matting dataset of transparent objects with small known foreground areas.
- Score: 4.012340049240327
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Image matting refers to predicting the alpha values of unknown foreground
areas from natural images. Prior methods have focused on propagating alpha
values from known to unknown regions. However, not all natural images have a
specifically known foreground. Images of transparent objects, like glass,
smoke, web, etc., have less or no known foreground. In this paper, we propose a
Transformer-based network, TransMatting, to model transparent objects with a
big receptive field. Specifically, we redesign the trimap as three learnable
tri-tokens for introducing advanced semantic features into the self-attention
mechanism. A small convolutional network is proposed to utilize the global
feature and non-background mask to guide the multi-scale feature propagation
from encoder to decoder for maintaining the contexture of transparent objects.
In addition, we create a high-resolution matting dataset of transparent objects
with small known foreground areas. Experiments on several matting benchmarks
demonstrate the superiority of our proposed method over the current
state-of-the-art methods.
Related papers
- DiffusionMat: Alpha Matting as Sequential Refinement Learning [87.76572845943929]
DiffusionMat is an image matting framework that employs a diffusion model for the transition from coarse to refined alpha mattes.
A correction module adjusts the output at each denoising step, ensuring that the final result is consistent with the input image's structures.
We evaluate our model across several image matting benchmarks, and the results indicate that DiffusionMat consistently outperforms existing methods.
arXiv Detail & Related papers (2023-11-22T17:16:44Z) - TransMatting: Tri-token Equipped Transformer Model for Image Matting [4.012340049240327]
We propose a Transformer-based network (TransMatting) to model transparent objects with long-range features.
We also redesign the trimap as three learnable tokens, named tri-token.
Our proposed TransMatting outperforms current state-of-the-art methods on several popular matting benchmarks and our newly collected Transparent-460.
arXiv Detail & Related papers (2023-03-11T18:21:25Z) - Location-Free Camouflage Generation Network [82.74353843283407]
Camouflage is a common visual phenomenon, which refers to hiding the foreground objects into the background images, making them briefly invisible to the human eye.
This paper proposes a novel Location-free Camouflage Generation Network (LCG-Net) that fuse high-level features of foreground and background image, and generate result by one inference.
Experiments show that our method has results as satisfactory as state-of-the-art in the single-appearance regions and are less likely to be completely invisible, but far exceed the quality of the state-of-the-art in the multi-appearance regions.
arXiv Detail & Related papers (2022-03-18T10:33:40Z) - TransCG: A Large-Scale Real-World Dataset for Transparent Object Depth
Completion and Grasping [46.6058840385155]
We contribute a large-scale real-world dataset for transparent object depth completion.
Our dataset contains 57,715 RGB-D images from 130 different scenes.
We propose an end-to-end depth completion network, which takes the RGB image and the inaccurate depth map as inputs and outputs a refined depth map.
arXiv Detail & Related papers (2022-02-17T06:50:20Z) - Long-Range Feature Propagating for Natural Image Matting [93.20589403997505]
Natural image matting estimates alpha values of unknown regions in the trimap.
Recently, deep learning based methods propagate the alpha values from the known regions to unknown regions according to the similarity between them.
We propose Long-Range Feature Propagating Network (LFPNet), which learns the long-range context features outside the reception fields for alpha matte estimation.
arXiv Detail & Related papers (2021-09-25T01:17:17Z) - Deep Automatic Natural Image Matting [82.56853587380168]
Automatic image matting (AIM) refers to estimating the soft foreground from an arbitrary natural image without any auxiliary input like trimap.
We propose a novel end-to-end matting network, which can predict a generalized trimap for any image of the above types as a unified semantic representation.
Our network trained on available composite matting datasets outperforms existing methods both objectively and subjectively.
arXiv Detail & Related papers (2021-07-15T10:29:01Z) - Semantic Image Matting [75.21022252141474]
We show how to obtain better alpha mattes by incorporating into our framework semantic classification of matting regions.
Specifically, we consider and learn 20 classes of matting patterns, and propose to extend the conventional trimap to semantic trimap.
Experiments on multiple benchmarks show that our method outperforms other methods and has achieved the most competitive state-of-the-art performance.
arXiv Detail & Related papers (2021-04-16T16:21:02Z) - Human Perception Modeling for Automatic Natural Image Matting [2.179313476241343]
Natural image matting aims to precisely separate foreground objects from background using alpha matte.
We propose an intuitively-designed trimap-free two-stage matting approach without additional annotations.
Our matting algorithm has competitive performance with current state-of-the-art methods in both trimap-free and trimap-needed aspects.
arXiv Detail & Related papers (2021-03-31T12:08:28Z) - Salient Image Matting [0.0]
We propose an image matting framework called Salient Image Matting to estimate the per-pixel opacity value of the most salient foreground in an image.
Our framework simultaneously deals with the challenge of learning a wide range of semantics and salient object types.
Our framework requires only a fraction of expensive matting data as compared to other automatic methods.
arXiv Detail & Related papers (2021-03-23T06:22:33Z) - Through the Looking Glass: Neural 3D Reconstruction of Transparent
Shapes [75.63464905190061]
Complex light paths induced by refraction and reflection have prevented both traditional and deep multiview stereo from solving this problem.
We propose a physically-based network to recover 3D shape of transparent objects using a few images acquired with a mobile phone camera.
Our experiments show successful recovery of high-quality 3D geometry for complex transparent shapes using as few as 5-12 natural images.
arXiv Detail & Related papers (2020-04-22T23:51:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.