Appearance Shock Grammar for Fast Medial Axis Extraction from Real
Images
- URL: http://arxiv.org/abs/2004.02677v1
- Date: Mon, 6 Apr 2020 13:57:27 GMT
- Title: Appearance Shock Grammar for Fast Medial Axis Extraction from Real
Images
- Authors: Charles-Olivier Dufresne Camaro, Morteza Rezanejad, Stavros Tsogkas,
Kaleem Siddiqi, Sven Dickinson
- Abstract summary: We combine ideas from shock graph theory with more recent appearance-based methods for medial axis extraction from complex natural scenes.
Our experiments on the BMAX500 and SK-LARGE datasets demonstrate the effectiveness of our approach.
- Score: 10.943417197085882
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We combine ideas from shock graph theory with more recent appearance-based
methods for medial axis extraction from complex natural scenes, improving upon
the present best unsupervised method, in terms of efficiency and performance.
We make the following specific contributions: i) we extend the shock graph
representation to the domain of real images, by generalizing the shock type
definitions using local, appearance-based criteria; ii) we then use the rules
of a Shock Grammar to guide our search for medial points, drastically reducing
run time when compared to other methods, which exhaustively consider all points
in the input image;iii) we remove the need for typical post-processing steps
including thinning, non-maximum suppression, and grouping, by adhering to the
Shock Grammar rules while deriving the medial axis solution; iv) finally, we
raise some fundamental concerns with the evaluation scheme used in previous
work and propose a more appropriate alternative for assessing the performance
of medial axis extraction from scenes. Our experiments on the BMAX500 and
SK-LARGE datasets demonstrate the effectiveness of our approach. We outperform
the present state-of-the-art, excelling particularly in the high-precision
regime, while running an order of magnitude faster and requiring no
post-processing.
Related papers
- Learning to Discover Generalized Facial Expressions [16.44358221618312]
We introduce Facial Expression Category Discovery (FECD)
FECD is a novel task in the domain of open-world facial expression recognition (O-FER)
arXiv Detail & Related papers (2024-09-30T08:50:22Z) - UniForensics: Face Forgery Detection via General Facial Representation [60.5421627990707]
High-level semantic features are less susceptible to perturbations and not limited to forgery-specific artifacts, thus having stronger generalization.
We introduce UniForensics, a novel deepfake detection framework that leverages a transformer-based video network, with a meta-functional face classification for enriched facial representation.
arXiv Detail & Related papers (2024-07-26T20:51:54Z) - Semi-Supervised Unconstrained Head Pose Estimation in the Wild [60.08319512840091]
We propose the first semi-supervised unconstrained head pose estimation method SemiUHPE.
Our method is based on the observation that the aspect-ratio invariant cropping of wild heads is superior to the previous landmark-based affine alignment.
Experiments and ablation studies show that SemiUHPE outperforms existing methods greatly on public benchmarks.
arXiv Detail & Related papers (2024-04-03T08:01:00Z) - Improving Adversarial Transferability via Intermediate-level
Perturbation Decay [79.07074710460012]
We develop a novel intermediate-level method that crafts adversarial examples within a single stage of optimization.
Experimental results show that it outperforms state-of-the-arts by large margins in attacking various victim models.
arXiv Detail & Related papers (2023-04-26T09:49:55Z) - Efficient Human Vision Inspired Action Recognition using Adaptive
Spatiotemporal Sampling [13.427887784558168]
We introduce a novel adaptive vision system for efficient action recognition processing.
Our system pre-scans the global context sampling scheme at low-resolution and decides to skip or request high-resolution features at salient regions for further processing.
We validate the system on EPIC-KENS and UCF-101 datasets for action recognition, and show that our proposed approach can greatly speed up inference with a tolerable loss of accuracy compared with those from state-the-art baselines.
arXiv Detail & Related papers (2022-07-12T01:18:58Z) - Sparse Graph Learning from Spatiotemporal Time Series [16.427698929775023]
We propose a graph learning framework that learns the relational dependencies as distributions over graphs.
We show that the proposed solution can be used as a stand-alone graph identification procedure as well as a graph learning component of an end-to-end forecasting architecture.
arXiv Detail & Related papers (2022-05-26T17:02:43Z) - Learning Discriminative Shrinkage Deep Networks for Image Deconvolution [122.79108159874426]
We propose an effective non-blind deconvolution approach by learning discriminative shrinkage functions to implicitly model these terms.
Experimental results show that the proposed method performs favorably against the state-of-the-art ones in terms of efficiency and accuracy.
arXiv Detail & Related papers (2021-11-27T12:12:57Z) - Predict, Prevent, and Evaluate: Disentangled Text-Driven Image
Manipulation Empowered by Pre-Trained Vision-Language Model [168.04947140367258]
We propose a novel framework, i.e., Predict, Prevent, and Evaluate (PPE), for disentangled text-driven image manipulation.
Our method approaches the targets by exploiting the power of the large scale pre-trained vision-language model CLIP.
Extensive experiments show that the proposed PPE framework achieves much better quantitative and qualitative results than the up-to-date StyleCLIP baseline.
arXiv Detail & Related papers (2021-11-26T06:49:26Z) - Transfer Learning Gaussian Anomaly Detection by Fine-Tuning
Representations [3.5031508291335625]
catastrophic forgetting prevents the successful fine-tuning of pre-trained representations on new datasets.
We propose a new method to fine-tune learned representations for AD in a transfer learning setting.
We additionally propose to use augmentations commonly employed for vicinal risk in a validation scheme to detect onset of catastrophic forgetting.
arXiv Detail & Related papers (2021-08-09T15:29:04Z) - Towards Unsupervised Sketch-based Image Retrieval [126.77787336692802]
We introduce a novel framework that simultaneously performs unsupervised representation learning and sketch-photo domain alignment.
Our framework achieves excellent performance in the new unsupervised setting, and performs comparably or better than state-of-the-art in the zero-shot setting.
arXiv Detail & Related papers (2021-05-18T02:38:22Z) - A Flatter Loss for Bias Mitigation in Cross-dataset Facial Age
Estimation [37.107335288543624]
We advocate a cross-dataset protocol for age estimation benchmarking.
We propose a novel loss function that is more effective for neural network training.
arXiv Detail & Related papers (2020-10-20T15:22:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.