CG-fusion CAM: Online segmentation of laser-induced damage on
large-aperture optics
- URL: http://arxiv.org/abs/2307.09161v1
- Date: Tue, 18 Jul 2023 11:38:20 GMT
- Title: CG-fusion CAM: Online segmentation of laser-induced damage on
large-aperture optics
- Authors: Yueyue Han, Yingyan Huang, Hangcheng Dong, Fengdong Chen, Fa Zeng,
Zhitao Peng, Qihua Zhu, Guodong Liu
- Abstract summary: We propose a weakly supervised semantic segmentation method with Continuous Gradient CAM and its nonlinear multi-scale fusion (CG-fusion CAM)
The proposed method can achieve segmentation performance comparable to that of fully supervised algorithms.
- Score: 1.4658400971135652
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Online segmentation of laser-induced damage on large-aperture optics in
high-power laser facilities is challenged by complicated damage morphology,
uneven illumination and stray light interference. Fully supervised semantic
segmentation algorithms have achieved state-of-the-art performance, but rely on
plenty of pixel-level labels, which are time-consuming and labor-consuming to
produce. LayerCAM, an advanced weakly supervised semantic segmentation
algorithm, can generate pixel-accurate results using only image-level labels,
but its scattered and partially under-activated class activation regions
degrade segmentation performance. In this paper, we propose a weakly supervised
semantic segmentation method with Continuous Gradient CAM and its nonlinear
multi-scale fusion (CG-fusion CAM). The method redesigns the way of
back-propagating gradients and non-linearly activates the multi-scale fused
heatmaps to generate more fine-grained class activation maps with appropriate
activation degree for different sizes of damage sites. Experiments on our
dataset show that the proposed method can achieve segmentation performance
comparable to that of fully supervised algorithms.
Related papers
- Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields [49.66011190843893]
We propose a method that leverages CLIP feature distillation, achieving efficient 3D segmentation through language guidance.
To achieve this, we introduce an adapter module and mitigate the noise issue in the dense CLIP feature distillation process.
Our method surpasses current state-of-the-art technologies in both training speed and performance.
arXiv Detail & Related papers (2025-01-31T12:19:14Z) - Superpixel Boundary Correction for Weakly-Supervised Semantic Segmentation on Histopathology Images [12.002538365135642]
Weakly supervised semantic segmentation (WSSS) reduces the annotation requirement by using image-level labels instead of pixel-level ones.
Class Activation Map (CAM)-based methods still suffer from low spatial resolution and unclear boundaries.
We propose a multi-level superpixel correction algorithm that refines CAM boundaries using superpixel clustering and floodfill.
arXiv Detail & Related papers (2025-01-07T15:54:03Z) - HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization [15.13875300007579]
HisynSeg is a weakly-supervised semantic segmentation framework based on image-mixing synthesis and consistency regularization.
HisynSeg achieves a state-of-the-art performance on three datasets.
arXiv Detail & Related papers (2024-12-30T13:10:48Z) - BreakNet: Discontinuity-Resilient Multi-Scale Transformer Segmentation of Retinal Layers [0.8953337264557399]
BreakNet is a Transformer-based segmentation model designed to address boundary discontinuities caused by shadow artifacts.
Our findings indicate that BreakNet has the potential to significantly improve retinal quantification and analysis.
arXiv Detail & Related papers (2024-08-26T19:59:20Z) - Light-weight Retinal Layer Segmentation with Global Reasoning [14.558920359236572]
We propose LightReSeg for retinal layer segmentation which can be applied to OCT images.
Our approach achieves a better segmentation performance compared to the current state-of-the-art method TransUnet.
arXiv Detail & Related papers (2024-04-25T05:42:41Z) - ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning [54.68180752416519]
Panoptic segmentation is a cutting-edge computer vision task.
We introduce a novel and efficient method for continual panoptic segmentation based on Visual Prompt Tuning, dubbed ECLIPSE.
Our approach involves freezing the base model parameters and fine-tuning only a small set of prompt embeddings, addressing both catastrophic forgetting and plasticity.
arXiv Detail & Related papers (2024-03-29T11:31:12Z) - Multitask AET with Orthogonal Tangent Regularity for Dark Object
Detection [84.52197307286681]
We propose a novel multitask auto encoding transformation (MAET) model to enhance object detection in a dark environment.
In a self-supervision manner, the MAET learns the intrinsic visual structure by encoding and decoding the realistic illumination-degrading transformation.
We have achieved the state-of-the-art performance using synthetic and real-world datasets.
arXiv Detail & Related papers (2022-05-06T16:27:14Z) - Mixed-UNet: Refined Class Activation Mapping for Weakly-Supervised
Semantic Segmentation with Multi-scale Inference [28.409679398886304]
We develop a novel model named Mixed-UNet, which has two parallel branches in the decoding phase.
We evaluate the designed Mixed-UNet against several prevalent deep learning-based segmentation approaches on our dataset collected from the local hospital and public datasets.
arXiv Detail & Related papers (2022-05-06T08:37:02Z) - Weakly-supervised fire segmentation by visualizing intermediate CNN
layers [82.75113406937194]
Fire localization in images and videos is an important step for an autonomous system to combat fire incidents.
We consider weakly supervised segmentation of fire in images, in which only image labels are used to train the network.
We show that in the case of fire segmentation, which is a binary segmentation problem, the mean value of features in a mid-layer of classification CNN can perform better than conventional Class Activation Mapping (CAM) method.
arXiv Detail & Related papers (2021-11-16T11:56:28Z) - Orthogonal Projection Loss [59.61277381836491]
We develop a novel loss function termed Orthogonal Projection Loss' (OPL)
OPL directly enforces inter-class separation alongside intra-class clustering in the feature space.
OPL offers unique advantages as it does not require careful negative mining and is not sensitive to the batch size.
arXiv Detail & Related papers (2021-03-25T17:58:00Z) - Self-supervised Equivariant Attention Mechanism for Weakly Supervised
Semantic Segmentation [93.83369981759996]
We propose a self-supervised equivariant attention mechanism (SEAM) to discover additional supervision and narrow the gap.
Our method is based on the observation that equivariance is an implicit constraint in fully supervised semantic segmentation.
We propose consistency regularization on predicted CAMs from various transformed images to provide self-supervision for network learning.
arXiv Detail & Related papers (2020-04-09T14:57:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.