Related papers: Appearance Shock Grammar for Fast Medial Axis Extraction from Real Images

Appearance Shock Grammar for Fast Medial Axis Extraction from Real Images

URL: http://arxiv.org/abs/2004.02677v1
Date: Mon, 6 Apr 2020 13:57:27 GMT
Title: Appearance Shock Grammar for Fast Medial Axis Extraction from Real Images
Authors: Charles-Olivier Dufresne Camaro, Morteza Rezanejad, Stavros Tsogkas, Kaleem Siddiqi, Sven Dickinson
Abstract summary: We combine ideas from shock graph theory with more recent appearance-based methods for medial axis extraction from complex natural scenes. Our experiments on the BMAX500 and SK-LARGE datasets demonstrate the effectiveness of our approach.
Score: 10.943417197085882
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We combine ideas from shock graph theory with more recent appearance-based methods for medial axis extraction from complex natural scenes, improving upon the present best unsupervised method, in terms of efficiency and performance. We make the following specific contributions: i) we extend the shock graph representation to the domain of real images, by generalizing the shock type definitions using local, appearance-based criteria; ii) we then use the rules of a Shock Grammar to guide our search for medial points, drastically reducing run time when compared to other methods, which exhaustively consider all points in the input image;iii) we remove the need for typical post-processing steps including thinning, non-maximum suppression, and grouping, by adhering to the Shock Grammar rules while deriving the medial axis solution; iv) finally, we raise some fundamental concerns with the evaluation scheme used in previous work and propose a more appropriate alternative for assessing the performance of medial axis extraction from scenes. Our experiments on the BMAX500 and SK-LARGE datasets demonstrate the effectiveness of our approach. We outperform the present state-of-the-art, excelling particularly in the high-precision regime, while running an order of magnitude faster and requiring no post-processing.

Related papers

Navigating with Annealing Guidance Scale in Diffusion Space [50.53780111249146]
The choice of the guidance scale has a critical impact on the convergence toward a visually appealing and prompt-adherent image.<n>In this work, we propose an annealing guidance scheduler which dynamically adjusts the guidance scale over time.<n> Empirical results demonstrate that our guidance scheduler significantly enhances image quality and alignment with the text prompt.
arXiv Detail & Related papers (2025-06-30T17:55:00Z)
Score-Based Turbo Message Passing for Plug-and-Play Compressive Image Recovery [24.60447255507278]
Off-the-shelf image denoisers mostly rely on some generic or hand-crafted priors for denoising. We devise a message passing framework that integrates a score-based minimum mean squared error (MMSE) denoiser for compressive image recovery.
arXiv Detail & Related papers (2025-03-28T04:30:58Z)
Semi-Supervised 360 Layout Estimation with Panoramic Collaborative Perturbations [56.84921040837699]
We propose a novel semi-supervised method named Semi360, which incorporates the priors of the panoramic layout and distortion through collaborative perturbations. Our experimental results on three mainstream benchmarks demonstrate that the proposed method offers significant advantages over existing state-of-the-art (SoTA) solutions.
arXiv Detail & Related papers (2025-03-03T02:49:20Z)
Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation [10.678727237318503]
Real-world visual relationships often exhibit a long-tailed distribution, causing existing methods to produce biased scene graphs. We propose Impar, a novel training framework that leverages loss masking and curriculum learning to mitigate bias generation. Our curriculum-driven mask generation strategy further empowers the model to adaptively adjust its bias mitigation strategy over time, enabling more balanced and robust estimations.
arXiv Detail & Related papers (2024-11-20T06:15:28Z)
UniForensics: Face Forgery Detection via General Facial Representation [60.5421627990707]
High-level semantic features are less susceptible to perturbations and not limited to forgery-specific artifacts, thus having stronger generalization. We introduce UniForensics, a novel deepfake detection framework that leverages a transformer-based video network, with a meta-functional face classification for enriched facial representation.
arXiv Detail & Related papers (2024-07-26T20:51:54Z)
Semi-Supervised Unconstrained Head Pose Estimation in the Wild [60.08319512840091]
We propose the first semi-supervised unconstrained head pose estimation method SemiUHPE. Our method is based on the observation that the aspect-ratio invariant cropping of wild heads is superior to the previous landmark-based affine alignment. Experiments and ablation studies show that SemiUHPE outperforms existing methods greatly on public benchmarks.
arXiv Detail & Related papers (2024-04-03T08:01:00Z)
Improving Adversarial Transferability via Intermediate-level Perturbation Decay [79.07074710460012]
We develop a novel intermediate-level method that crafts adversarial examples within a single stage of optimization. Experimental results show that it outperforms state-of-the-arts by large margins in attacking various victim models.
arXiv Detail & Related papers (2023-04-26T09:49:55Z)
Efficient Human Vision Inspired Action Recognition using Adaptive Spatiotemporal Sampling [13.427887784558168]
We introduce a novel adaptive vision system for efficient action recognition processing. Our system pre-scans the global context sampling scheme at low-resolution and decides to skip or request high-resolution features at salient regions for further processing. We validate the system on EPIC-KENS and UCF-101 datasets for action recognition, and show that our proposed approach can greatly speed up inference with a tolerable loss of accuracy compared with those from state-the-art baselines.
arXiv Detail & Related papers (2022-07-12T01:18:58Z)
Sparse Graph Learning from Spatiotemporal Time Series [16.427698929775023]
We propose a graph learning framework that learns the relational dependencies as distributions over graphs. We show that the proposed solution can be used as a stand-alone graph identification procedure as well as a graph learning component of an end-to-end forecasting architecture.
arXiv Detail & Related papers (2022-05-26T17:02:43Z)
Fast Hybrid Image Retargeting [0.0]
We propose a method that quantifies and limits warping distortions with the use of content-aware cropping. Our method outperforms recent approaches, while running in a fraction of their execution time.
arXiv Detail & Related papers (2022-03-25T11:46:06Z)
Learning Discriminative Shrinkage Deep Networks for Image Deconvolution [122.79108159874426]
We propose an effective non-blind deconvolution approach by learning discriminative shrinkage functions to implicitly model these terms. Experimental results show that the proposed method performs favorably against the state-of-the-art ones in terms of efficiency and accuracy.
arXiv Detail & Related papers (2021-11-27T12:12:57Z)
Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model [168.04947140367258]
We propose a novel framework, i.e., Predict, Prevent, and Evaluate (PPE), for disentangled text-driven image manipulation. Our method approaches the targets by exploiting the power of the large scale pre-trained vision-language model CLIP. Extensive experiments show that the proposed PPE framework achieves much better quantitative and qualitative results than the up-to-date StyleCLIP baseline.
arXiv Detail & Related papers (2021-11-26T06:49:26Z)
Transfer Learning Gaussian Anomaly Detection by Fine-Tuning Representations [3.5031508291335625]
catastrophic forgetting prevents the successful fine-tuning of pre-trained representations on new datasets. We propose a new method to fine-tune learned representations for AD in a transfer learning setting. We additionally propose to use augmentations commonly employed for vicinal risk in a validation scheme to detect onset of catastrophic forgetting.
arXiv Detail & Related papers (2021-08-09T15:29:04Z)
Towards Unsupervised Sketch-based Image Retrieval [126.77787336692802]
We introduce a novel framework that simultaneously performs unsupervised representation learning and sketch-photo domain alignment. Our framework achieves excellent performance in the new unsupervised setting, and performs comparably or better than state-of-the-art in the zero-shot setting.
arXiv Detail & Related papers (2021-05-18T02:38:22Z)
A Flatter Loss for Bias Mitigation in Cross-dataset Facial Age Estimation [37.107335288543624]
We advocate a cross-dataset protocol for age estimation benchmarking. We propose a novel loss function that is more effective for neural network training.
arXiv Detail & Related papers (2020-10-20T15:22:29Z)

This list is automatically generated from the titles and abstracts of the papers in this site.