Related papers: CG-fusion CAM: Online segmentation of laser-induced damage on large-aperture optics

CG-fusion CAM: Online segmentation of laser-induced damage on large-aperture optics

URL: http://arxiv.org/abs/2307.09161v1
Date: Tue, 18 Jul 2023 11:38:20 GMT
Title: CG-fusion CAM: Online segmentation of laser-induced damage on large-aperture optics
Authors: Yueyue Han, Yingyan Huang, Hangcheng Dong, Fengdong Chen, Fa Zeng, Zhitao Peng, Qihua Zhu, Guodong Liu
Abstract summary: We propose a weakly supervised semantic segmentation method with Continuous Gradient CAM and its nonlinear multi-scale fusion (CG-fusion CAM) The proposed method can achieve segmentation performance comparable to that of fully supervised algorithms.
Score: 1.4658400971135652
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Online segmentation of laser-induced damage on large-aperture optics in high-power laser facilities is challenged by complicated damage morphology, uneven illumination and stray light interference. Fully supervised semantic segmentation algorithms have achieved state-of-the-art performance, but rely on plenty of pixel-level labels, which are time-consuming and labor-consuming to produce. LayerCAM, an advanced weakly supervised semantic segmentation algorithm, can generate pixel-accurate results using only image-level labels, but its scattered and partially under-activated class activation regions degrade segmentation performance. In this paper, we propose a weakly supervised semantic segmentation method with Continuous Gradient CAM and its nonlinear multi-scale fusion (CG-fusion CAM). The method redesigns the way of back-propagating gradients and non-linearly activates the multi-scale fused heatmaps to generate more fine-grained class activation maps with appropriate activation degree for different sizes of damage sites. Experiments on our dataset show that the proposed method can achieve segmentation performance comparable to that of fully supervised algorithms.

Related papers

Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion [52.315729095824906]
MLLM Semantic-Corrected Ping-Pong-Ahead Diffusion (PPAD) is a novel framework that introduces a Multimodal Large Language Model (MLLM) as a semantic observer during inference.<n>It performs real-time analysis on intermediate generations, identifies latent semantic inconsistencies, and translates feedback into controllable signals that actively guide the remaining denoising steps.<n>Extensive experiments demonstrate PPAD's significant improvements.
arXiv Detail & Related papers (2025-05-26T14:42:35Z)
Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation [2.034732821736745]
In autonomous driving, thermal image semantic segmentation has emerged as a critical research area.<n>In this paper, we present a comprehensive study on cross-spectral UDA for thermal image semantic segmentation.<n>We introduce a novel self-supervised loss designed to enhance the performance of the thermal segmentation model in nighttime scenarios.
arXiv Detail & Related papers (2025-05-11T11:45:44Z)
Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields [49.66011190843893]
We propose a method that leverages CLIP feature distillation, achieving efficient 3D segmentation through language guidance. To achieve this, we introduce an adapter module and mitigate the noise issue in the dense CLIP feature distillation process. Our method surpasses current state-of-the-art technologies in both training speed and performance.
arXiv Detail & Related papers (2025-01-31T12:19:14Z)
Superpixel Boundary Correction for Weakly-Supervised Semantic Segmentation on Histopathology Images [12.002538365135642]
Weakly supervised semantic segmentation (WSSS) reduces the annotation requirement by using image-level labels instead of pixel-level ones. Class Activation Map (CAM)-based methods still suffer from low spatial resolution and unclear boundaries. We propose a multi-level superpixel correction algorithm that refines CAM boundaries using superpixel clustering and floodfill.
arXiv Detail & Related papers (2025-01-07T15:54:03Z)
HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization [15.13875300007579]
HisynSeg is a weakly-supervised semantic segmentation framework based on image-mixing synthesis and consistency regularization. HisynSeg achieves a state-of-the-art performance on three datasets.
arXiv Detail & Related papers (2024-12-30T13:10:48Z)
BreakNet: Discontinuity-Resilient Multi-Scale Transformer Segmentation of Retinal Layers [0.8953337264557399]
BreakNet is a Transformer-based segmentation model designed to address boundary discontinuities caused by shadow artifacts. Our findings indicate that BreakNet has the potential to significantly improve retinal quantification and analysis.
arXiv Detail & Related papers (2024-08-26T19:59:20Z)
Light-weight Retinal Layer Segmentation with Global Reasoning [14.558920359236572]
We propose LightReSeg for retinal layer segmentation which can be applied to OCT images. Our approach achieves a better segmentation performance compared to the current state-of-the-art method TransUnet.
arXiv Detail & Related papers (2024-04-25T05:42:41Z)
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning [54.68180752416519]
Panoptic segmentation is a cutting-edge computer vision task. We introduce a novel and efficient method for continual panoptic segmentation based on Visual Prompt Tuning, dubbed ECLIPSE. Our approach involves freezing the base model parameters and fine-tuning only a small set of prompt embeddings, addressing both catastrophic forgetting and plasticity.
arXiv Detail & Related papers (2024-03-29T11:31:12Z)
UM-CAM: Uncertainty-weighted Multi-resolution Class Activation Maps for Weakly-supervised Fetal Brain Segmentation [15.333308330432176]
We propose a novel weakly-supervised method with image-level labels based on semantic features and context information exploration. Our proposed method outperforms state-of-the-art weakly-supervised methods with image-level labels.
arXiv Detail & Related papers (2023-06-20T12:21:13Z)
Calibrating Undisciplined Over-Smoothing in Transformer for Weakly Supervised Semantic Segmentation [51.14107156747967]
Weakly supervised semantic segmentation (WSSS) has attracted considerable attention because it requires fewer annotations than fully supervised approaches.<n>We propose an Adaptive Re-Activation Mechanism (AReAM) to control deep-level attention to undisciplined over-smoothing.<n>AReAM substantially improves segmentation performance compared with existing WSSS methods, reducing noise while sharpening focus on relevant semantic regions.
arXiv Detail & Related papers (2023-05-04T19:11:33Z)
GDIP: Gated Differentiable Image Processing for Object-Detection in Adverse Conditions [15.327704761260131]
We present a Gated Differentiable Image Processing (GDIP) block, a domain-agnostic network architecture. Our proposed GDIP block learns to enhance images directly through the downstream object detection loss. We demonstrate significant improvement in detection performance over several state-of-the-art methods.
arXiv Detail & Related papers (2022-09-29T16:43:13Z)
Multitask AET with Orthogonal Tangent Regularity for Dark Object Detection [84.52197307286681]
We propose a novel multitask auto encoding transformation (MAET) model to enhance object detection in a dark environment. In a self-supervision manner, the MAET learns the intrinsic visual structure by encoding and decoding the realistic illumination-degrading transformation. We have achieved the state-of-the-art performance using synthetic and real-world datasets.
arXiv Detail & Related papers (2022-05-06T16:27:14Z)
Mixed-UNet: Refined Class Activation Mapping for Weakly-Supervised Semantic Segmentation with Multi-scale Inference [28.409679398886304]
We develop a novel model named Mixed-UNet, which has two parallel branches in the decoding phase. We evaluate the designed Mixed-UNet against several prevalent deep learning-based segmentation approaches on our dataset collected from the local hospital and public datasets.
arXiv Detail & Related papers (2022-05-06T08:37:02Z)
Multi-Channel Convolutional Analysis Operator Learning for Dual-Energy CT Reconstruction [108.06731611196291]
We develop a multi-channel convolutional analysis operator learning (MCAOL) method to exploit common spatial features within attenuation images at different energies. We propose an optimization method which jointly reconstructs the attenuation images at low and high energies with a mixed norm regularization on the sparse features.
arXiv Detail & Related papers (2022-03-10T14:22:54Z)
Weakly-supervised fire segmentation by visualizing intermediate CNN layers [82.75113406937194]
Fire localization in images and videos is an important step for an autonomous system to combat fire incidents. We consider weakly supervised segmentation of fire in images, in which only image labels are used to train the network. We show that in the case of fire segmentation, which is a binary segmentation problem, the mean value of features in a mid-layer of classification CNN can perform better than conventional Class Activation Mapping (CAM) method.
arXiv Detail & Related papers (2021-11-16T11:56:28Z)
Orthogonal Projection Loss [59.61277381836491]
We develop a novel loss function termed Orthogonal Projection Loss' (OPL) OPL directly enforces inter-class separation alongside intra-class clustering in the feature space. OPL offers unique advantages as it does not require careful negative mining and is not sensitive to the batch size.
arXiv Detail & Related papers (2021-03-25T17:58:00Z)
Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation [93.83369981759996]
We propose a self-supervised equivariant attention mechanism (SEAM) to discover additional supervision and narrow the gap. Our method is based on the observation that equivariance is an implicit constraint in fully supervised semantic segmentation. We propose consistency regularization on predicted CAMs from various transformed images to provide self-supervision for network learning.
arXiv Detail & Related papers (2020-04-09T14:57:57Z)

This list is automatically generated from the titles and abstracts of the papers in this site.