ViTASD: Robust Vision Transformer Baselines for Autism Spectrum Disorder
Facial Diagnosis
- URL: http://arxiv.org/abs/2210.16943v1
- Date: Sun, 30 Oct 2022 20:38:56 GMT
- Title: ViTASD: Robust Vision Transformer Baselines for Autism Spectrum Disorder
Facial Diagnosis
- Authors: Xu Cao, Wenqian Ye, Elena Sizikova, Xue Bai, Megan Coffee, Hongwu
Zeng, Jianguo Cao
- Abstract summary: Autism spectrum disorder (ASD) is a lifelong neurodevelopmental disorder with very high prevalence around the world.
We propose the use of the Vision Transformer (ViT) for the computational analysis of pediatric ASD.
The presented model, known as ViTASD, distills knowledge from large facial expression datasets and offers model structure transferability.
- Score: 6.695640702099725
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Autism spectrum disorder (ASD) is a lifelong neurodevelopmental disorder with
very high prevalence around the world. Research progress in the field of ASD
facial analysis in pediatric patients has been hindered due to a lack of
well-established baselines. In this paper, we propose the use of the Vision
Transformer (ViT) for the computational analysis of pediatric ASD. The
presented model, known as ViTASD, distills knowledge from large facial
expression datasets and offers model structure transferability. Specifically,
ViTASD employs a vanilla ViT to extract features from patients' face images and
adopts a lightweight decoder with a Gaussian Process layer to enhance the
robustness for ASD analysis. Extensive experiments conducted on standard ASD
facial analysis benchmarks show that our method outperforms all of the
representative approaches in ASD facial analysis, while the ViTASD-L achieves a
new state-of-the-art. Our code and pretrained models are available at
https://github.com/IrohXu/ViTASD.
Related papers
- Hugging Rain Man: A Novel Facial Action Units Dataset for Analyzing Atypical Facial Expressions in Children with Autism Spectrum Disorder [2.3001245059699014]
We introduce a novel dataset, Hugging Rain Man, which includes facial action units (AUs) manually annotated by FACS experts for both children with ASD and typical development (TD)
The dataset comprises a rich collection of posed and spontaneous facial expressions, totaling approximately 130,000 frames, along with 22 AUs, 10 Action Descriptors (ADs) and atypicality ratings.
arXiv Detail & Related papers (2024-11-21T02:51:52Z) - Explainable AI for Autism Diagnosis: Identifying Critical Brain Regions Using fMRI Data [0.29687381456163997]
Early diagnosis and intervention for Autism Spectrum Disorder (ASD) has been shown to significantly improve the quality of life of autistic individuals.
There is a need for objective biomarkers of ASD which can help improve diagnostic accuracy.
Deep learning (DL) has achieved outstanding performance in diagnosing diseases and conditions from medical imaging data.
This research aims to improve the accuracy and interpretability of ASD diagnosis by creating a DL model that can not only accurately classify ASD but also provide explainable insights into its working.
arXiv Detail & Related papers (2024-09-19T23:08:09Z) - SkinGEN: an Explainable Dermatology Diagnosis-to-Generation Framework with Interactive Vision-Language Models [52.90397538472582]
SkinGEN is a diagnosis-to-generation framework that generates reference demonstrations from diagnosis results provided by VLM.
We conduct a user study with 32 participants evaluating both the system performance and explainability.
Results demonstrate that SkinGEN significantly improves users' comprehension of VLM predictions and fosters increased trust in the diagnostic process.
arXiv Detail & Related papers (2024-04-23T05:36:33Z) - Early Autism Diagnosis based on Path Signature and Siamese Unsupervised Feature Compressor [15.39635888144281]
We resort to a novel deep learning-based method to extract key features from the inherently scarce, class-imbalanced, and heterogeneous structural MR images for early autism diagnosis.
Specifically, we propose a Siamese verification framework to extend the scarce data, and an unsupervised compressor to alleviate data imbalance.
arXiv Detail & Related papers (2023-07-12T22:08:22Z) - On Sensitivity and Robustness of Normalization Schemes to Input
Distribution Shifts in Automatic MR Image Diagnosis [58.634791552376235]
Deep Learning (DL) models have achieved state-of-the-art performance in diagnosing multiple diseases using reconstructed images as input.
DL models are sensitive to varying artifacts as it leads to changes in the input data distribution between the training and testing phases.
We propose to use other normalization techniques, such as Group Normalization and Layer Normalization, to inject robustness into model performance against varying image artifacts.
arXiv Detail & Related papers (2023-06-23T03:09:03Z) - Orientation-Shared Convolution Representation for CT Metal Artifact
Learning [63.67718355820655]
During X-ray computed tomography (CT) scanning, metallic implants carrying with patients often lead to adverse artifacts.
Existing deep-learning-based methods have gained promising reconstruction performance.
We propose an orientation-shared convolution representation strategy to adapt the physical prior structures of artifacts.
arXiv Detail & Related papers (2022-12-26T13:56:12Z) - Improving Deep Facial Phenotyping for Ultra-rare Disorder Verification
Using Model Ensembles [52.77024349608834]
We analyze the influence of replacing a DCNN with a state-of-the-art face recognition approach, iResNet with ArcFace.
Our proposed ensemble model achieves state-of-the-art performance on both seen and unseen disorders.
arXiv Detail & Related papers (2022-11-12T23:28:54Z) - Proposing a System Level Machine Learning Hybrid Architecture and
Approach for a Comprehensive Autism Spectrum Disorder Diagnosis [1.2691047660244335]
Autism Spectrum Disorder (ASD) is a severe neuropsychiatric disorder that affects intellectual development, social behavior, and facial features.
We propose to develop a hybrid architecture fully utilizing both social behavior and facial feature data to improve the accuracy of diagnosing ASD.
arXiv Detail & Related papers (2021-09-18T04:33:09Z) - The FaceChannel: A Fast & Furious Deep Neural Network for Facial
Expression Recognition [71.24825724518847]
Current state-of-the-art models for automatic Facial Expression Recognition (FER) are based on very deep neural networks that are effective but rather expensive to train.
We formalize the FaceChannel, a light-weight neural network that has much fewer parameters than common deep neural networks.
We demonstrate how our model achieves a comparable, if not better, performance to the current state-of-the-art in FER.
arXiv Detail & Related papers (2020-09-15T09:25:37Z) - A Convolutional Neural Network for gaze preference detection: A
potential tool for diagnostics of autism spectrum disorder in children [0.0]
We propose a convolutional neural network (CNN) algorithm for gaze prediction using images extracted from a one-minute stimulus video.
Our model achieved a high accuracy rate and robustness for prediction of gaze direction with independent persons.
arXiv Detail & Related papers (2020-07-28T18:47:21Z) - Retinopathy of Prematurity Stage Diagnosis Using Object Segmentation and
Convolutional Neural Networks [68.96150598294072]
Retinopathy of Prematurity (ROP) is an eye disorder primarily affecting premature infants with lower weights.
It causes proliferation of vessels in the retina and could result in vision loss and, eventually, retinal detachment, leading to blindness.
In recent years, there has been a significant effort to automate the diagnosis using deep learning.
This paper builds upon the success of previous models and develops a novel architecture, which combines object segmentation and convolutional neural networks (CNN)
Our proposed system first trains an object segmentation model to identify the demarcation line at a pixel level and adds the resulting mask as an additional "color" channel in
arXiv Detail & Related papers (2020-04-03T14:07:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.