Related papers: Attention Maps in 3D Shape Classification for Dental Stage Estimation with Class Node Graph Attention Networks

Attention Maps in 3D Shape Classification for Dental Stage Estimation with Class Node Graph Attention Networks

URL: http://arxiv.org/abs/2509.07581v1
Date: Tue, 09 Sep 2025 10:44:25 GMT
Title: Attention Maps in 3D Shape Classification for Dental Stage Estimation with Class Node Graph Attention Networks
Authors: Barkin Buyukcakir, Rocharles Cavalcante Fontenele, Reinhilde Jacobs, Jannick De Tobel, Patrick Thevissen, Dirk Vandermeulen, Peter Claes,
Abstract summary: This paper introduces the Class Node Graph Attention Network (CGAT) architecture for 3D shape recognition tasks.<n>CGAT is applied to 3D meshes of third molars derived from CBCT images, for Demirjian stage allocation.<n>The architecture's ability to generate human-understandable attention maps can enhance trust and facilitate expert validation of model decisions.
Score: 4.289327498989562
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Deep learning offers a promising avenue for automating many recognition tasks in fields such as medicine and forensics. However, the black-box nature of these models hinders their adoption in high-stakes applications where trust and accountability are required. For 3D shape recognition tasks in particular, this paper introduces the Class Node Graph Attention Network (CGAT) architecture to address this need. Applied to 3D meshes of third molars derived from CBCT images, for Demirjian stage allocation, CGAT utilizes graph attention convolutions and an inherent attention mechanism, visualized via attention rollout, to explain its decision-making process. We evaluated the local mean curvature and distance to centroid node features, both individually and in combination, as well as model depth, finding that models incorporating directed edges to a global CLS node produced more intuitive attention maps, while also yielding desirable classification performance. We analyzed the attention-based explanations of the models, and their predictive performances to propose optimal settings for the CGAT. The combination of local mean curvature and distance to centroid as node features yielded a slight performance increase with 0.76 weighted F1 score, and more comprehensive attention visualizations. The CGAT architecture's ability to generate human-understandable attention maps can enhance trust and facilitate expert validation of model decisions. While demonstrated on dental data, CGAT is broadly applicable to graph-based classification and regression tasks, promoting wider adoption of transparent and competitive deep learning models in high-stakes environments.

Related papers

Exploiting Inter-Sample Information for Long-tailed Out-of-Distribution Detection [7.0229899259286945]
We show that exploiting inter-sample relationships can significantly improve OOD detection in long-tailed recognition of vision datasets.<n>Our method outperforms the state-of-the-art approaches by a large margin in terms of FPR and tail-class ID classification accuracy.
arXiv Detail & Related papers (2025-11-20T03:31:37Z)
Adaptive graph Kolmogorov-Arnold network for 3D human pose estimation [3.3946853660795884]
Graph convolutional network (GCN)-based methods have shown strong performance in 3D human pose estimation.<n>We introduce PoseKAN, a framework that extends KANs to graph-based learning for 2D-to-3D pose lifting from a single image.<n>Our model employs multi-hop feature aggregation, ensuring the body joints can leverage information from both local and distant neighbors.
arXiv Detail & Related papers (2025-11-11T22:23:24Z)
AI-CNet3D: An Anatomically-Informed Cross-Attention Network with Multi-Task Consistency Fine-tuning for 3D Glaucoma Classification [0.4999814847776097]
Glaucoma is a progressive eye disease that leads to optic nerve damage, causing irreversible vision loss if left untreated.<n>We propose a novel hybrid deep learning model that integrates cross-attention mechanisms into a 3D convolutional neural network.<n>We have named this model AI-CNet3D (AI-See'-Net3D) to reflect its design as an Anatomically-Informed Cross-attention Network operating on 3D data.
arXiv Detail & Related papers (2025-10-01T13:30:55Z)
Improving Open-Set Semantic Segmentation in 3D Point Clouds by Conditional Channel Capacity Maximization: Preliminary Results [1.1328543389752008]
We propose a plug and play framework for Open-Set Semantic (O3S)<n>By modeling the segmentation pipeline as a conditional Markov chain, we derive a novel regularizer term dubbed Conditional Channel Capacity Maximization (3CM)<n>We show that 3CM encourages the encoder to retain richer, label-dependent features, thereby enhancing the network's ability to distinguish and segment previously unseen categories.
arXiv Detail & Related papers (2025-05-09T04:12:26Z)
Wide & Deep Learning for Node Classification [0.7373617024876725]
Graph convolutional networks (GCNs) remain dominant in node classification tasks.<n>We propose a flexible framework GCNIII, which incorporates three techniques: Intersect memory, Initial residual and Identity mapping.<n>We provide empirical evidence showing that GCNIII can more effectively balance the trade-off between over-fitting and over-generalization.
arXiv Detail & Related papers (2025-05-04T07:53:16Z)
FORCE: Feature-Oriented Representation with Clustering and Explanation [0.0]
We propose a SHAP based supervised deep learning framework FORCE.<n>It relies on two-stage usage of SHAP values in the neural network architecture.<n>We show that FORCE led to dramatic improvements in overall performance as compared to networks that did not incorporate the latent feature and attention framework.
arXiv Detail & Related papers (2025-04-07T22:05:50Z)
Deep Contrastive Graph Learning with Clustering-Oriented Guidance [61.103996105756394]
Graph Convolutional Network (GCN) has exhibited remarkable potential in improving graph-based clustering. Models estimate an initial graph beforehand to apply GCN. Deep Contrastive Graph Learning (DCGL) model is proposed for general data clustering.
arXiv Detail & Related papers (2024-02-25T07:03:37Z)
Bayesian Layer Graph Convolutioanl Network for Hyperspetral Image Classification [24.91896527342631]
Graph convolutional network (GCN) based models have shown impressive performance. Deep learning frameworks based on point estimation suffer from low generalization and inability to quantify the classification results uncertainty. In this paper, we propose a Bayesian layer with Bayesian idea as an insertion layer into point estimation based neural networks. A Generative Adversarial Network (GAN) is built to solve the sample imbalance problem of HSI dataset.
arXiv Detail & Related papers (2022-11-14T12:56:56Z)
Guided Point Contrastive Learning for Semi-supervised Point Cloud Semantic Segmentation [90.2445084743881]
We present a method for semi-supervised point cloud semantic segmentation to adopt unlabeled point clouds in training to boost the model performance. Inspired by the recent contrastive loss in self-supervised tasks, we propose the guided point contrastive loss to enhance the feature representation and model generalization ability.
arXiv Detail & Related papers (2021-10-15T16:38:54Z)
Calibrating Class Activation Maps for Long-Tailed Visual Recognition [60.77124328049557]
We present two effective modifications of CNNs to improve network learning from long-tailed distribution. First, we present a Class Activation Map (CAMC) module to improve the learning and prediction of network classifiers. Second, we investigate the use of normalized classifiers for representation learning in long-tailed problems.
arXiv Detail & Related papers (2021-08-29T05:45:03Z)
Adversarial Feature Augmentation and Normalization for Visual Recognition [109.6834687220478]
Recent advances in computer vision take advantage of adversarial data augmentation to ameliorate the generalization ability of classification models. Here, we present an effective and efficient alternative that advocates adversarial augmentation on intermediate feature embeddings. We validate the proposed approach across diverse visual recognition tasks with representative backbone networks.
arXiv Detail & Related papers (2021-03-22T20:36:34Z)
PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object Detection [57.49788100647103]
LiDAR-based 3D object detection is an important task for autonomous driving. Current approaches suffer from sparse and partial point clouds of distant and occluded objects. In this paper, we propose a novel two-stage approach, namely PC-RGNN, dealing with such challenges by two specific solutions.
arXiv Detail & Related papers (2020-12-18T18:06:43Z)
Multi-Lead ECG Classification via an Information-Based Attention Convolutional Neural Network [1.1720399305661802]
One-dimensional convolutional neural networks (CNN) have proven to be effective in pervasive classification tasks. We implement the Residual connection and design a structure which can learn the weights from the information contained in different channels in the input feature map. An indicator named mean square deviation is introduced to monitor the performance of a particular model segment in the classification task.
arXiv Detail & Related papers (2020-03-25T02:28:04Z)
Weakly Supervised Attention Pyramid Convolutional Neural Network for Fine-Grained Visual Classification [71.96618723152487]
We introduce Attention Pyramid Convolutional Neural Network (AP-CNN) AP-CNN learns both high-level semantic and low-level detailed feature representation. It can be trained end-to-end, without the need of additional bounding box/part annotations.
arXiv Detail & Related papers (2020-02-09T12:33:23Z)

This list is automatically generated from the titles and abstracts of the papers in this site.