Related papers: TransMatting: Enhancing Transparent Objects Matting with Transformers

TransMatting: Enhancing Transparent Objects Matting with Transformers

URL: http://arxiv.org/abs/2208.03007v1
Date: Fri, 5 Aug 2022 06:44:14 GMT
Title: TransMatting: Enhancing Transparent Objects Matting with Transformers
Authors: Huanqia Cai, Fanglei Xue, Lele Xu, Lili Guo
Abstract summary: We propose a Transformer-based network, TransMatting, to model transparent objects with a big receptive field. A small convolutional network is proposed to utilize the global feature and non-background mask to guide the multi-scale feature propagation from encoder to decoder. We create a high-resolution matting dataset of transparent objects with small known foreground areas.
Score: 4.012340049240327
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Image matting refers to predicting the alpha values of unknown foreground areas from natural images. Prior methods have focused on propagating alpha values from known to unknown regions. However, not all natural images have a specifically known foreground. Images of transparent objects, like glass, smoke, web, etc., have less or no known foreground. In this paper, we propose a Transformer-based network, TransMatting, to model transparent objects with a big receptive field. Specifically, we redesign the trimap as three learnable tri-tokens for introducing advanced semantic features into the self-attention mechanism. A small convolutional network is proposed to utilize the global feature and non-background mask to guide the multi-scale feature propagation from encoder to decoder for maintaining the contexture of transparent objects. In addition, we create a high-resolution matting dataset of transparent objects with small known foreground areas. Experiments on several matting benchmarks demonstrate the superiority of our proposed method over the current state-of-the-art methods.

Related papers

ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation [108.69315278353932]
We introduce the Anonymous Region Transformer (ART), which facilitates the direct generation of variable multi-layer transparent images. By enabling precise control and scalable layer generation, ART establishes a new paradigm for interactive content creation.
arXiv Detail & Related papers (2025-02-25T16:57:04Z)
From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$-NeuS [41.38491898098891]
This paper introduces $alpha$-NeuS, a new method for reconstructing thin transparent objects and opaque objects based on neural implicit surfaces (NeuS) Traditional iso-surfacing algorithms such as marching cubes, which rely on fixed iso-values, are ill-suited for this data. We prove that the reconstructed surfaces are unbiased for both transparent and opaque objects.
arXiv Detail & Related papers (2024-11-08T06:36:31Z)
DiffusionMat: Alpha Matting as Sequential Refinement Learning [87.76572845943929]
DiffusionMat is an image matting framework that employs a diffusion model for the transition from coarse to refined alpha mattes. A correction module adjusts the output at each denoising step, ensuring that the final result is consistent with the input image's structures. We evaluate our model across several image matting benchmarks, and the results indicate that DiffusionMat consistently outperforms existing methods.
arXiv Detail & Related papers (2023-11-22T17:16:44Z)
TransMatting: Tri-token Equipped Transformer Model for Image Matting [4.012340049240327]
We propose a Transformer-based network (TransMatting) to model transparent objects with long-range features. We also redesign the trimap as three learnable tokens, named tri-token. Our proposed TransMatting outperforms current state-of-the-art methods on several popular matting benchmarks and our newly collected Transparent-460.
arXiv Detail & Related papers (2023-03-11T18:21:25Z)
Location-Free Camouflage Generation Network [82.74353843283407]
Camouflage is a common visual phenomenon, which refers to hiding the foreground objects into the background images, making them briefly invisible to the human eye. This paper proposes a novel Location-free Camouflage Generation Network (LCG-Net) that fuse high-level features of foreground and background image, and generate result by one inference. Experiments show that our method has results as satisfactory as state-of-the-art in the single-appearance regions and are less likely to be completely invisible, but far exceed the quality of the state-of-the-art in the multi-appearance regions.
arXiv Detail & Related papers (2022-03-18T10:33:40Z)
Long-Range Feature Propagating for Natural Image Matting [93.20589403997505]
Natural image matting estimates alpha values of unknown regions in the trimap. Recently, deep learning based methods propagate the alpha values from the known regions to unknown regions according to the similarity between them. We propose Long-Range Feature Propagating Network (LFPNet), which learns the long-range context features outside the reception fields for alpha matte estimation.
arXiv Detail & Related papers (2021-09-25T01:17:17Z)
Deep Automatic Natural Image Matting [82.56853587380168]
Automatic image matting (AIM) refers to estimating the soft foreground from an arbitrary natural image without any auxiliary input like trimap. We propose a novel end-to-end matting network, which can predict a generalized trimap for any image of the above types as a unified semantic representation. Our network trained on available composite matting datasets outperforms existing methods both objectively and subjectively.
arXiv Detail & Related papers (2021-07-15T10:29:01Z)
Semantic Image Matting [75.21022252141474]
We show how to obtain better alpha mattes by incorporating into our framework semantic classification of matting regions. Specifically, we consider and learn 20 classes of matting patterns, and propose to extend the conventional trimap to semantic trimap. Experiments on multiple benchmarks show that our method outperforms other methods and has achieved the most competitive state-of-the-art performance.
arXiv Detail & Related papers (2021-04-16T16:21:02Z)
Human Perception Modeling for Automatic Natural Image Matting [2.179313476241343]
Natural image matting aims to precisely separate foreground objects from background using alpha matte. We propose an intuitively-designed trimap-free two-stage matting approach without additional annotations. Our matting algorithm has competitive performance with current state-of-the-art methods in both trimap-free and trimap-needed aspects.
arXiv Detail & Related papers (2021-03-31T12:08:28Z)
Salient Image Matting [0.0]
We propose an image matting framework called Salient Image Matting to estimate the per-pixel opacity value of the most salient foreground in an image. Our framework simultaneously deals with the challenge of learning a wide range of semantics and salient object types. Our framework requires only a fraction of expensive matting data as compared to other automatic methods.
arXiv Detail & Related papers (2021-03-23T06:22:33Z)
Through the Looking Glass: Neural 3D Reconstruction of Transparent Shapes [75.63464905190061]
Complex light paths induced by refraction and reflection have prevented both traditional and deep multiview stereo from solving this problem. We propose a physically-based network to recover 3D shape of transparent objects using a few images acquired with a mobile phone camera. Our experiments show successful recovery of high-quality 3D geometry for complex transparent shapes using as few as 5-12 natural images.
arXiv Detail & Related papers (2020-04-22T23:51:30Z)

This list is automatically generated from the titles and abstracts of the papers in this site.