Automated pharyngeal phase detection and bolus localization in
videofluoroscopic swallowing study: Killing two birds with one stone?
- URL: http://arxiv.org/abs/2111.04699v1
- Date: Mon, 8 Nov 2021 18:25:01 GMT
- Title: Automated pharyngeal phase detection and bolus localization in
videofluoroscopic swallowing study: Killing two birds with one stone?
- Authors: Andrea Bandini, Sana Smaoui, Catriona M. Steele
- Abstract summary: The videofluoroscopic swallowing study (VFSS) is a gold-standard imaging technique for assessing swallowing.
Researchers have demonstrated that it is possible to automatically detect the pharyngeal phase of swallowing.
We propose a deep-learning framework that tackles pharyngeal phase detection and bolus localization in a weakly-supervised manner.
- Score: 1.4337588659482519
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The videofluoroscopic swallowing study (VFSS) is a gold-standard imaging
technique for assessing swallowing, but analysis and rating of VFSS recordings
is time consuming and requires specialized training and expertise. Researchers
have demonstrated that it is possible to automatically detect the pharyngeal
phase of swallowing and to localize the bolus in VFSS recordings via computer
vision approaches, fostering the development of novel techniques for automatic
VFSS analysis. However, training of algorithms to perform these tasks requires
large amounts of annotated data that are seldom available. We demonstrate that
the challenges of pharyngeal phase detection and bolus localization can be
solved together using a single approach. We propose a deep-learning framework
that jointly tackles pharyngeal phase detection and bolus localization in a
weakly-supervised manner, requiring only the initial and final frames of the
pharyngeal phase as ground truth annotations for the training. Our approach
stems from the observation that bolus presence in the pharynx is the most
prominent visual feature upon which to infer whether individual VFSS frames
belong to the pharyngeal phase. We conducted extensive experiments with
multiple convolutional neural networks (CNNs) on a dataset of 1245 VFSS clips
from 59 healthy subjects. We demonstrated that the pharyngeal phase can be
detected with an F1-score higher than 0.9. Moreover, by processing the class
activation maps of the CNNs, we were able to localize the bolus with promising
results, obtaining correlations with ground truth trajectories higher than 0.9,
without any manual annotations of bolus location used for training purposes.
Once validated on a larger sample of participants with swallowing disorders,
our framework will pave the way for the development of intelligent tools for
VFSS analysis to support clinicians in swallowing assessment.
Related papers
- Vascular Segmentation of Functional Ultrasound Images using Deep Learning [0.0]
We introduce the first deep learning-based segmentation tool for functional ultrasound (fUS) images.
We achieve competitive segmentation performance, with 90% accuracy, with 71% robustness and an IU of 0.59, using only 100 temporal frames from a fUS stack.
This work offers a non-invasive, cost-effective alternative to localization microscopy, enhancing fUS data interpretation and improving understanding of vessel function.
arXiv Detail & Related papers (2024-10-28T09:00:28Z) - Bifurcation Identification for Ultrasound-driven Robotic Cannulation [11.50984693836901]
In trauma and critical care settings, rapid and precise intravascular access is key to patients' survival.
Vessel bifurcations are anatomical landmarks that can guide the safe placement of catheters or needles during medical procedures.
No existing algorithm can autonomously extract vessel bifurcations using ultrasound images.
We introduce BIFURC, a novel algorithm that identifies vessel bifurcations and provides optimal needle insertion sites for an autonomous robotic cannulation system.
arXiv Detail & Related papers (2024-09-10T18:53:52Z) - Dr-LLaVA: Visual Instruction Tuning with Symbolic Clinical Grounding [53.629132242389716]
Vision-Language Models (VLM) can support clinicians by analyzing medical images and engaging in natural language interactions.
VLMs often exhibit "hallucinogenic" behavior, generating textual outputs not grounded in contextual multimodal information.
We propose a new alignment algorithm that uses symbolic representations of clinical reasoning to ground VLMs in medical knowledge.
arXiv Detail & Related papers (2024-05-29T23:19:28Z) - CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers [66.15847237150909]
We introduce a self-supervised deep learning architecture to segment catheters in longitudinal ultrasound images.
The network architecture builds upon AiAReSeg, a segmentation transformer built with the Attention in Attention mechanism.
We validated our model on a test dataset, consisting of unseen synthetic data and images collected from silicon aorta phantoms.
arXiv Detail & Related papers (2024-03-21T15:13:36Z) - Stethoscope-guided Supervised Contrastive Learning for Cross-domain
Adaptation on Respiratory Sound Classification [1.690115983364313]
We introduce cross-domain adaptation techniques, which transfer the knowledge from a source domain to a distinct target domain.
In particular, by considering different stethoscope types as individual domains, we propose a novel stethoscope-guided supervised contrastive learning approach.
The experimental results on the ICBHI dataset demonstrate that the proposed methods are effective in reducing the domain dependency and achieving the ICBHI Score of 61.71%, which is a significant improvement of 2.16% over the baseline.
arXiv Detail & Related papers (2023-12-15T08:34:31Z) - Domain Adaptive Synapse Detection with Weak Point Annotations [63.97144211520869]
We present AdaSyn, a framework for domain adaptive synapse detection with weak point annotations.
In the WASPSYN challenge at I SBI 2023, our method ranks the 1st place.
arXiv Detail & Related papers (2023-08-31T05:05:53Z) - Deep denoising autoencoder-based non-invasive blood flow detection for
arteriovenous fistula [10.030431512848239]
We propose an approach based on deep denoising autoencoders (DAEs) that perform dimensionality reduction and reconstruction tasks.
Our results demonstrate that the latent representation generated by the DAE surpasses expectations with an accuracy of 0.93.
The incorporation of noise-mixing and the utilization of a noise-to-clean scheme effectively enhance the discriminative capabilities of the latent representation.
arXiv Detail & Related papers (2023-06-12T04:46:01Z) - Tissue Classification During Needle Insertion Using Self-Supervised
Contrastive Learning and Optical Coherence Tomography [53.38589633687604]
We propose a deep neural network that classifies the tissues from the phase and intensity data of complex OCT signals acquired at the needle tip.
We show that with 10% of the training set, our proposed pretraining strategy helps the model achieve an F1 score of 0.84 whereas the model achieves an F1 score of 0.60 without it.
arXiv Detail & Related papers (2023-04-26T14:11:04Z) - Attend Who is Weak: Pruning-assisted Medical Image Localization under
Sophisticated and Implicit Imbalances [102.68466217374655]
Deep neural networks (DNNs) have rapidly become a textitde facto choice for medical image understanding tasks.
In this paper, we propose to use pruning to automatically and adaptively identify textithard-to-learn (HTL) training samples.
We also present an interesting demographic analysis which illustrates HTLs ability to capture complex demographic imbalances.
arXiv Detail & Related papers (2022-12-06T00:32:03Z) - CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors
and Efficient Neural Networks [51.589769497681175]
The novel coronavirus (SARS-CoV-2) has led to a pandemic.
The current testing regime based on Reverse Transcription-Polymerase Chain Reaction for SARS-CoV-2 has been unable to keep up with testing demands.
We propose a framework called CovidDeep that combines efficient DNNs with commercially available WMSs for pervasive testing of the virus.
arXiv Detail & Related papers (2020-07-20T21:47:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.