Multi-Transfer Learning Techniques for Detecting Auditory Brainstem
Response
- URL: http://arxiv.org/abs/2308.16203v1
- Date: Tue, 29 Aug 2023 10:40:12 GMT
- Title: Multi-Transfer Learning Techniques for Detecting Auditory Brainstem
Response
- Authors: Fatih Ozyurt, Jafar Majidpour, Tarik A. Rashid, Amir Majidpour, Canan
Koc
- Abstract summary: Inaccurate assessment of auditory brainstem response (ABR) tests may lead to incorrect judgments regarding the integrity of the auditory nerve system.
This study proposed deep-learning models that use the transfer-learning (TL) approach to extract features from ABR tests and diagnose hearing loss using support vector machines (SVM).
Six measures are used to evaluate the proposed model: accuracy, precision, recall, geometric mean (GM), standard deviation (SD), and area under the ROC curve.
- Score: 4.023511716339818
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The assessment of the well-being of the peripheral auditory nerve system in
individuals experiencing hearing impairment is conducted through auditory
brainstem response (ABR) testing. Audiologists assess and document the results
of the ABR test. They interpret the findings and assign labels to them using
reference-based markers like peak latency, waveform morphology, amplitude, and
other relevant factors. Inaccurate assessment of ABR tests may lead to
incorrect judgments regarding the integrity of the auditory nerve system;
therefore, proper Hearing Loss (HL) diagnosis and analysis are essential.
Machine learning methods, notably deep learning, may be an appropriate option
for automating the identification and assessment of ABR while reducing the
possibility of human error. To this end, this study proposed deep-learning
models that use the transfer-learning (TL) approach to extract features from
ABR tests and diagnose HL using support vector machines (SVM). Pre-trained convolutional
neural network (CNN) architectures like AlexNet, DenseNet, GoogleNet,
InceptionResNetV2, InceptionV3, MobileNetV2, NASNetMobile, ResNet18, ResNet50,
ResNet101, ShuffleNet, and SqueezeNet are used to extract features from the
collected dataset of ABR report images in the proposed model. Six measures are
used to evaluate the proposed model: accuracy, precision, recall, geometric
mean (GM), standard deviation (SD), and area under the ROC curve. According to
the experimental findings, transfer learning with the ShuffleNet and ResNet50
models is effective for diagnosing HL from ABR with an SVM classifier, reaching
a high accuracy rate of 95% under 5-fold cross-validation.
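The evaluation protocol named in the abstract (5-fold cross-validation scored with accuracy, precision, recall, GM, SD, and AUC) can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the authors' code: all function names are hypothetical, GM is assumed to be the square root of recall times specificity, and SD is assumed to be the spread of a metric across folds, since the abstract defines neither. CNN feature extraction and SVM training are deliberately left out.

```python
import math
import statistics


def binary_metrics(y_true, y_pred):
    """Accuracy, precision, recall and geometric mean for labels in {0, 1}."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        # Assumption: GM = sqrt(recall * specificity); not defined in the abstract.
        "gm": math.sqrt(recall * specificity),
    }


def roc_auc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def k_fold_indices(n_samples, k=5):
    """Yield (train_idx, test_idx) for k contiguous, near-equal folds.

    No shuffling is done here; shuffle the data beforehand if needed.
    """
    start = 0
    for i in range(k):
        size = n_samples // k + (1 if i < n_samples % k else 0)
        test_idx = list(range(start, start + size))
        train_idx = list(range(start)) + list(range(start + size, n_samples))
        yield train_idx, test_idx
        start += size


def summarize_folds(values):
    """Mean and sample standard deviation of one metric across folds."""
    return statistics.mean(values), statistics.stdev(values)
```

In use, a classifier would be fitted on the features at `train_idx` and scored on `test_idx` for each fold, with `summarize_folds` applied to the per-fold values of each metric.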
Related papers
- Reproducible Machine Learning-based Voice Pathology Detection: Introducing the Pitch Difference Feature [1.1455937444848385]
We propose a robust set of features derived from a thorough review of contemporary practices in voice pathology detection.
We combine this feature set, containing data from the publicly available Saarbrücken Voice Database (SVD), with preprocessing using the K-Means Synthetic Minority Over-Sampling Technique algorithm.
Our approach achieves state-of-the-art performance, measured by unweighted average recall, in voice pathology detection.
arXiv Detail & Related papers (2024-10-14T14:17:52Z)
- Denoising Variational Autoencoder as a Feature Reduction Pipeline for the diagnosis of Autism based on Resting-state fMRI [11.871709357017416]
Autism spectrum disorders (ASDs) are developmental conditions characterized by restricted interests and difficulties in communication.
We propose an ASD feature reduction pipeline using resting-state fMRI (rs-fMRI).
We used Ncuts parcellations and Power atlas to extract functional connectivity data, resulting in over 30 thousand features.
arXiv Detail & Related papers (2024-09-30T09:38:47Z)
- Explainable Cost-Sensitive Deep Neural Networks for Brain Tumor Detection from Brain MRI Images considering Data Imbalance [0.0]
An automated pipeline is proposed, which encompasses five models: CNN, ResNet50, InceptionV3, EfficientNetB0 and NASNetMobile.
The performance of the proposed architecture is evaluated on a balanced dataset and found to yield an accuracy of 99.33% for the fine-tuned InceptionV3 model.
To further optimize the training process, a cost-sensitive neural network approach has been proposed in order to work with imbalanced datasets.
arXiv Detail & Related papers (2023-08-01T15:35:06Z)
- Brain Imaging-to-Graph Generation using Adversarial Hierarchical Diffusion Models for MCI Causality Analysis [44.45598796591008]
A brain imaging-to-graph generation (BIGG) framework is proposed to map functional magnetic resonance imaging (fMRI) into effective connectivity for mild cognitive impairment analysis.
The hierarchical transformers in the generator are designed to estimate the noise at multiple scales.
Evaluations on the ADNI dataset demonstrate the feasibility and efficacy of the proposed model.
arXiv Detail & Related papers (2023-05-18T06:54:56Z)
- Classification and Self-Supervised Regression of Arrhythmic ECG Signals Using Convolutional Neural Networks [13.025714736073489]
We propose a deep neural network model capable of solving regression and classification tasks.
We tested the model on the MIT-BIH Arrhythmia database.
arXiv Detail & Related papers (2022-10-25T18:11:13Z)
- Objective hearing threshold identification from auditory brainstem response measurements using supervised and self-supervised approaches [1.0627340704073347]
We develop and compare two methods for automated hearing threshold identification from averaged ABR raw data.
We show that both models work well, outperform human threshold detection, and are suitable for fast, reliable, and unbiased hearing threshold detection and quality control.
arXiv Detail & Related papers (2021-12-16T15:24:31Z)
- HASA-net: A non-intrusive hearing-aid speech assessment network [52.83357278948373]
We propose a DNN-based hearing aid speech assessment network (HASA-Net) to predict speech quality and intelligibility scores simultaneously.
To the best of our knowledge, HASA-Net is the first work to incorporate quality and intelligibility assessments utilizing a unified DNN-based non-intrusive model for hearing aids.
Experimental results show that the predicted speech quality and intelligibility scores of HASA-Net are highly correlated to two well-known intrusive hearing-aid evaluation metrics.
arXiv Detail & Related papers (2021-11-10T14:10:13Z)
- ALT-MAS: A Data-Efficient Framework for Active Testing of Machine Learning Algorithms [58.684954492439424]
We propose a novel framework to efficiently test a machine learning model using only a small amount of labeled test data.
The idea is to estimate the metrics of interest for a model-under-test using a Bayesian neural network (BNN).
arXiv Detail & Related papers (2021-04-11T12:14:04Z)
- Multi-Sample Online Learning for Spiking Neural Networks based on Generalized Expectation Maximization [42.125394498649015]
Spiking Neural Networks (SNNs) capture some of the efficiency of biological brains by processing through binary neural dynamic activations.
This paper proposes to leverage multiple compartments that sample independent spiking signals while sharing synaptic weights.
The key idea is to use these signals to obtain more accurate statistical estimates of the log-likelihood training criterion, as well as of its gradient.
arXiv Detail & Related papers (2021-02-05T16:39:42Z)
- Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification [71.45033077934723]
We incorporate Bayesian neural networks (BNNs) into the deep neural network (DNN) x-vector speaker verification system.
With the weight uncertainty modeling provided by BNNs, we expect the system could generalize better on the evaluation data.
Results show that the system could benefit from BNNs by a relative EER decrease of 2.66% and 2.32% respectively for short- and long-utterance in-domain evaluations.
arXiv Detail & Related papers (2020-04-08T14:35:12Z)
- Pairwise Discriminative Neural PLDA for Speaker Verification [41.76303371621405]
We propose a pairwise neural discriminative model for the task of speaker verification.
We construct a differentiable cost function which approximates speaker verification loss.
Experiments are performed on the NIST SRE 2018 development and evaluation datasets.
arXiv Detail & Related papers (2020-01-20T09:52:52Z)