Related papers: Ultrasound Video Summarization using Deep Reinforcement Learning

Ultrasound Video Summarization using Deep Reinforcement Learning

URL: http://arxiv.org/abs/2005.09531v1
Date: Tue, 19 May 2020 15:44:18 GMT
Title: Ultrasound Video Summarization using Deep Reinforcement Learning
Authors: Tianrui Liu, Qingjie Meng, Athanasios Vlontzos, Jeremy Tan, Daniel Rueckert and Bernhard Kainz
Abstract summary: We introduce a fully automatic video summarization method tailored to the needs of medical video data. We show that our method is superior to alternative video summarization methods and that it preserves essential information required by clinical diagnostic standards.
Score: 12.320114045092291
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Video is an essential imaging modality for diagnostics, e.g. in ultrasound imaging, for endoscopy, or movement assessment. However, video hasn't received a lot of attention in the medical image analysis community. In the clinical practice, it is challenging to utilise raw diagnostic video data efficiently as video data takes a long time to process, annotate or audit. In this paper we introduce a novel, fully automatic video summarization method that is tailored to the needs of medical video data. Our approach is framed as reinforcement learning problem and produces agents focusing on the preservation of important diagnostic information. We evaluate our method on videos from fetal ultrasound screening, where commonly only a small amount of the recorded data is used diagnostically. We show that our method is superior to alternative video summarization methods and that it preserves essential information required by clinical diagnostic standards.

Related papers

VideoPath-LLaVA: Pathology Diagnostic Reasoning Through Video Instruction Tuning [2.6954348706500766]
We present VideoPath-LLaVA, the first large multimodal model (LMM) in computational pathology.<n>It integrates three distinct image scenarios, single patch images, automatically-extracted clips, and manually segmented video pathology images.<n>By generating detailed histological descriptions and culminating in a definitive sign-out diagnosis, VideoPath-LLaVA bridges visual narratives with diagnostic reasoning.
arXiv Detail & Related papers (2025-05-07T07:41:19Z)
YouTube Video Analytics for Patient Engagement: Evidence from Colonoscopy Preparation Videos [3.7941428390253193]
This study demonstrates a data analysis pipeline that utilizes methods to retrieve medical information from YouTube videos. We first use the YouTube Data API to collect metadata of desired videos on select search keywords. Then we annotate the YouTube video materials on medical information, video understandability and overall recommendation.
arXiv Detail & Related papers (2024-10-01T19:38:46Z)
Breast Ultrasound Report Generation using LangChain [58.07183284468881]
We propose the integration of multiple image analysis tools through a LangChain using Large Language Models (LLM) into the breast reporting process. Our method can accurately extract relevant features from ultrasound images, interpret them in a clinical context, and produce comprehensive and standardized reports.
arXiv Detail & Related papers (2023-12-05T00:28:26Z)
Open video data sharing in developmental and behavioural science [1.9312167442699324]
Video recording is a widely used method for documenting infant and child behaviours. The need of shared large-scaled datasets remains increasing. To share data while abiding by privacy protection rules, a critical question arises whether efforts at data de-identification reduce data utility.
arXiv Detail & Related papers (2022-07-22T11:47:47Z)
Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature Decoupling [13.161739586288704]
In clinical practice, analysis and diagnosis often rely on US sequences rather than a single image to obtain dynamic anatomical information. This is challenging for novices to learn because practicing with adequate videos from patients is clinically unpractical. We propose a novel framework to synthesize high-fidelity US videos.
arXiv Detail & Related papers (2022-07-01T14:53:22Z)
A New Dataset and A Baseline Model for Breast Lesion Detection in Ultrasound Videos [43.42513012531214]
We first collect and annotate an ultrasound video dataset (188 videos) for breast lesion detection. We propose a clip-level and video-level feature aggregated network (CVA-Net) for addressing breast lesion detection in ultrasound videos.
arXiv Detail & Related papers (2022-07-01T01:37:50Z)
Voice-assisted Image Labelling for Endoscopic Ultrasound Classification using Neural Networks [48.732863591145964]
We propose a multi-modal convolutional neural network architecture that labels endoscopic ultrasound (EUS) images from raw verbal comments provided by a clinician during the procedure. Our results show a prediction accuracy of 76% at image level on a dataset with 5 different labels.
arXiv Detail & Related papers (2021-10-12T21:22:24Z)
Machine Learning Methods for Histopathological Image Analysis: A Review [62.14548392474976]
Histopathological images (HIs) are the gold standard for evaluating some types of tumors for cancer diagnosis. One of the ways of accelerating such an analysis is to use computer-aided diagnosis (CAD) systems.
arXiv Detail & Related papers (2021-02-07T19:12:32Z)
LRTD: Long-Range Temporal Dependency based Active Learning for Surgical Workflow Recognition [67.86810761677403]
We propose a novel active learning method for cost-effective surgical video analysis. Specifically, we propose a non-local recurrent convolutional network (NL-RCNet), which introduces non-local block to capture the long-range temporal dependency. We validate our approach on a large surgical video dataset (Cholec80) by performing surgical workflow recognition task.
arXiv Detail & Related papers (2020-04-21T09:21:22Z)
Confident Coreset for Active Learning in Medical Image Analysis [57.436224561482966]
We propose a novel active learning method, confident coreset, which considers both uncertainty and distribution for effectively selecting informative samples. By comparative experiments on two medical image analysis tasks, we show that our method outperforms other active learning methods.
arXiv Detail & Related papers (2020-04-05T13:46:16Z)
Self-trained Deep Ordinal Regression for End-to-End Video Anomaly Detection [114.9714355807607]
We show that applying self-trained deep ordinal regression to video anomaly detection overcomes two key limitations of existing methods. We devise an end-to-end trainable video anomaly detection approach that enables joint representation learning and anomaly scoring without manually labeled normal/abnormal data.
arXiv Detail & Related papers (2020-03-15T08:44:55Z)
Self-supervised Representation Learning for Ultrasound Video [18.515314344284445]
We propose a self-supervised learning approach to learn meaningful and transferable representations from medical imaging video. We force the model to address anatomy-aware tasks with free supervision from the data itself. Experiments on fetal ultrasound video show that the proposed approach can effectively learn meaningful and strong representations.
arXiv Detail & Related papers (2020-02-28T23:00:26Z)

This list is automatically generated from the titles and abstracts of the papers in this site.