Model-based inexact graph matching on top of CNNs for semantic scene
understanding
- URL: http://arxiv.org/abs/2301.07468v2
- Date: Tue, 1 Aug 2023 07:33:15 GMT
- Title: Model-based inexact graph matching on top of CNNs for semantic scene
understanding
- Authors: J\'er\'emy Chopin and Jean-Baptiste Fasquel and Harold Mouch\`ere and
Rozenn Dahyot and Isabelle Bloch
- Abstract summary: Deep learning pipelines for semantic segmentation often ignore structural information available on annotated images used for training.
We propose a novel post-processing module enforcing structural knowledge about the objects of interest to improve segmentation results.
Our approach is shown to be resilient to small training datasets that often limit the performance of deep learning methods.
- Score: 6.106023882846558
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Deep learning based pipelines for semantic segmentation often ignore
structural information available on annotated images used for training. We
propose a novel post-processing module enforcing structural knowledge about the
objects of interest to improve segmentation results provided by deep learning.
This module corresponds to a "many-to-one-or-none" inexact graph matching
approach, and is formulated as a quadratic assignment problem. Our approach is
compared to a CNN-based segmentation (for various CNN backbones) on two public
datasets, one for face segmentation from 2D RGB images (FASSEG), and the other
for brain segmentation from 3D MRIs (IBSR). Evaluations are performed using two
types of structural information (distances and directional relations, , this
choice being a hyper-parameter of our generic framework). On FASSEG data,
results show that our module improves accuracy of the CNN by about 6.3% (the
Hausdorff distance decreases from 22.11 to 20.71). On IBSR data, the
improvement is of 51% (the Hausdorff distance decreases from 11.01 to 5.4). In
addition, our approach is shown to be resilient to small training datasets that
often limit the performance of deep learning methods: the improvement increases
as the size of the training dataset decreases.
Related papers
- Prompt Tuning for Parameter-efficient Medical Image Segmentation [79.09285179181225]
We propose and investigate several contributions to achieve a parameter-efficient but effective adaptation for semantic segmentation on two medical imaging datasets.
We pre-train this architecture with a dedicated dense self-supervision scheme based on assignments to online generated prototypes.
We demonstrate that the resulting neural network model is able to attenuate the gap between fully fine-tuned and parameter-efficiently adapted models.
arXiv Detail & Related papers (2022-11-16T21:55:05Z) - Large-Margin Representation Learning for Texture Classification [67.94823375350433]
This paper presents a novel approach combining convolutional layers (CLs) and large-margin metric learning for training supervised models on small datasets for texture classification.
The experimental results on texture and histopathologic image datasets have shown that the proposed approach achieves competitive accuracy with lower computational cost and faster convergence when compared to equivalent CNNs.
arXiv Detail & Related papers (2022-06-17T04:07:45Z) - Automated airway segmentation by learning graphical structure [0.76146285961466]
We put forward an advanced method for airway segmentation based on the existent convolutional neural network (CNN) and graph neural network (GNN)
The proposed model shows that compared with the CNN-only method, the combination of CNN and GNN has a better performance in that the bronchi in the chest CT scans can be detected in most cases.
arXiv Detail & Related papers (2021-09-30T01:37:31Z) - Residual Moment Loss for Medical Image Segmentation [56.72261489147506]
Location information is proven to benefit the deep learning models on capturing the manifold structure of target objects.
Most existing methods encode the location information in an implicit way, for the network to learn.
We propose a novel loss function, namely residual moment (RM) loss, to explicitly embed the location information of segmentation targets.
arXiv Detail & Related papers (2021-06-27T09:31:49Z) - Scaling Semantic Segmentation Beyond 1K Classes on a Single GPU [87.48110331544885]
We propose a novel training methodology to train and scale the existing semantic segmentation models.
We demonstrate a clear benefit of our approach on a dataset with 1284 classes, bootstrapped from LVIS and COCO annotations, with three times better mIoU than the DeeplabV3+ model.
arXiv Detail & Related papers (2020-12-14T13:12:38Z) - Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation [49.90178055521207]
This work addresses weakly supervised semantic segmentation (WSSS), with the goal of bridging the gap between image-level annotations and pixel-level segmentation.
We formulate WSSS as a novel group-wise learning task that explicitly models semantic dependencies in a group of images to estimate more reliable pseudo ground-truths.
In particular, we devise a graph neural network (GNN) for group-wise semantic mining, wherein input images are represented as graph nodes.
arXiv Detail & Related papers (2020-12-09T12:40:13Z) - Unsupervised Learning Consensus Model for Dynamic Texture Videos
Segmentation [12.462608802359936]
We present an effective unsupervised learning consensus model for the segmentation of dynamic texture (ULCM)
In the proposed model, the set of values of the requantized local binary patterns (LBP) histogram around the pixel to be classified are used as features.
Experiments conducted on the challenging SynthDB dataset show that ULCM is significantly faster, easier to code, simple and has limited parameters.
arXiv Detail & Related papers (2020-06-29T16:40:59Z) - 3D medical image segmentation with labeled and unlabeled data using
autoencoders at the example of liver segmentation in CT images [58.720142291102135]
This work investigates the potential of autoencoder-extracted features to improve segmentation with a convolutional neural network.
A convolutional autoencoder was used to extract features from unlabeled data and a multi-scale, fully convolutional CNN was used to perform the target task of 3D liver segmentation in CT images.
arXiv Detail & Related papers (2020-03-17T20:20:43Z) - On the Texture Bias for Few-Shot CNN Segmentation [21.349705243254423]
Convolutional Neural Networks (CNNs) are driven by shapes to perform visual recognition tasks.
Recent evidence suggests texture bias in CNNs provides higher performing models when learning on large labeled training datasets.
We propose a novel architecture that integrates a set of Difference of Gaussians (DoG) to attenuate high-frequency local components in the feature space.
arXiv Detail & Related papers (2020-03-09T11:55:47Z) - Convolutional Embedding for Edit Distance [24.65097766064397]
CNN-ED embeds edit distance into Euclidean distance for fast approximate similarity search.
CNN-ED outperforms data-independent CGK embedding and RNN-based GRU embedding in terms of both accuracy and efficiency.
arXiv Detail & Related papers (2020-01-31T07:53:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.