Anomaly-Aware Semantic Segmentation via Style-Aligned OoD Augmentation
- URL: http://arxiv.org/abs/2308.09965v1
- Date: Sat, 19 Aug 2023 09:45:39 GMT
- Title: Anomaly-Aware Semantic Segmentation via Style-Aligned OoD Augmentation
- Authors: Dan Zhang, Kaspar Sakmann, William Beluch, Robin Hutmacher, Yumeng Li
- Abstract summary: We advance the OoD synthesis process by reducing the domain gap between the OoD data and driving scenes.
We propose a simple fine-tuning loss that effectively induces a pre-trained semantic segmentation model to generate a none of the given classes" prediction.
- Score: 4.4886641337581885
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Within the context of autonomous driving, encountering unknown objects
becomes inevitable during deployment in the open world. Therefore, it is
crucial to equip standard semantic segmentation models with anomaly awareness.
Many previous approaches have utilized synthetic out-of-distribution (OoD) data
augmentation to tackle this problem. In this work, we advance the OoD synthesis
process by reducing the domain gap between the OoD data and driving scenes,
effectively mitigating the style difference that might otherwise act as an
obvious shortcut during training. Additionally, we propose a simple fine-tuning
loss that effectively induces a pre-trained semantic segmentation model to
generate a ``none of the given classes" prediction, leveraging per-pixel OoD
scores for anomaly segmentation. With minimal fine-tuning effort, our pipeline
enables the use of pre-trained models for anomaly segmentation while
maintaining the performance on the original task.
Related papers
- Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors [27.848233831749216]
textbfWUDI-Merging (textbfWhoever started the interference shotextbfUld entextbfD textbfIt) is a model merging method that eliminates interference without any additional data or rescaling coefficients.
Comprehensive empirical evaluations across vision and language benchmarks demonstrate our method's superiority.
arXiv Detail & Related papers (2025-03-11T07:01:35Z) - Effort: Efficient Orthogonal Modeling for Generalizable AI-Generated Image Detection [66.16595174895802]
Existing AI-generated image (AIGI) detection methods often suffer from limited generalization performance.
In this paper, we identify a crucial yet previously overlooked asymmetry phenomenon in AIGI detection.
arXiv Detail & Related papers (2024-11-23T19:10:32Z) - Physically Feasible Semantic Segmentation [58.17907376475596]
State-of-the-art semantic segmentation models are typically optimized in a data-driven fashion, minimizing solely per-pixel or per-segment classification objectives on their training data.
This purely data-driven paradigm often leads to absurd segmentations, especially when the domain of input images is shifted from the one encountered during training.
Our method, Physically Feasible Semantic (PhyFea), first extracts explicit constraints that govern spatial class relations from the semantic segmentation training set at hand in an offline data-driven fashion, and then enforces a morphological yet differentiable loss that penalizes violations of these constraints during
arXiv Detail & Related papers (2024-08-26T22:39:08Z) - Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration [64.84134880709625]
We show that it is possible to perform domain adaptation via the noise space using diffusion models.
In particular, by leveraging the unique property of how auxiliary conditional inputs influence the multi-step denoising process, we derive a meaningful diffusion loss.
We present crucial strategies such as channel-shuffling layer and residual-swapping contrastive learning in the diffusion model.
arXiv Detail & Related papers (2024-06-26T17:40:30Z) - Differentially Private Learning with Per-Sample Adaptive Clipping [8.401653565794353]
We propose a Differentially Private Per-Sample Adaptive Clipping (DP-PSAC) algorithm based on a non-monotonic adaptive weight function.
We show that DP-PSAC outperforms or matches the state-of-the-art methods on multiple main-stream vision and language tasks.
arXiv Detail & Related papers (2022-12-01T07:26:49Z) - Anomaly-Aware Semantic Segmentation by Leveraging Synthetic-Unknown Data [19.80173687261055]
Anomaly awareness is essential for safety-critical applications such as autonomous driving.
We propose a novel Synthetic-Unknown Data Generation to tackle the anomaly-aware semantic segmentation task.
We reach the state-of-the-art performance on two anomaly segmentation datasets.
arXiv Detail & Related papers (2021-11-29T06:24:50Z) - Dense Unsupervised Learning for Video Segmentation [49.46930315961636]
We present a novel approach to unsupervised learning for video object segmentation (VOS)
Unlike previous work, our formulation allows to learn dense feature representations directly in a fully convolutional regime.
Our approach exceeds the segmentation accuracy of previous work despite using significantly less training data and compute power.
arXiv Detail & Related papers (2021-11-11T15:15:11Z) - A Simple Baseline for Semi-supervised Semantic Segmentation with Strong
Data Augmentation [74.8791451327354]
We propose a simple yet effective semi-supervised learning framework for semantic segmentation.
A set of simple design and training techniques can collectively improve the performance of semi-supervised semantic segmentation significantly.
Our method achieves state-of-the-art results in the semi-supervised settings on the Cityscapes and Pascal VOC datasets.
arXiv Detail & Related papers (2021-04-15T06:01:39Z) - Unsupervised and self-adaptative techniques for cross-domain person
re-identification [82.54691433502335]
Person Re-Identification (ReID) across non-overlapping cameras is a challenging task.
Unsupervised Domain Adaptation (UDA) is a promising alternative, as it performs feature-learning adaptation from a model trained on a source to a target domain without identity-label annotation.
In this paper, we propose a novel UDA-based ReID method that takes advantage of triplets of samples created by a new offline strategy.
arXiv Detail & Related papers (2021-03-21T23:58:39Z) - Unsupervised Intra-domain Adaptation for Semantic Segmentation through
Self-Supervision [73.76277367528657]
Convolutional neural network-based approaches have achieved remarkable progress in semantic segmentation.
To cope with this limitation, automatically annotated data generated from graphic engines are used to train segmentation models.
We propose a two-step self-supervised domain adaptation approach to minimize the inter-domain and intra-domain gap together.
arXiv Detail & Related papers (2020-04-16T15:24:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.