Structure-aware World Model for Probe Guidance via Large-scale Self-supervised Pre-train
- URL: http://arxiv.org/abs/2406.19756v2
- Date: Fri, 19 Jul 2024 07:15:07 GMT
- Title: Structure-aware World Model for Probe Guidance via Large-scale Self-supervised Pre-train
- Authors: Haojun Jiang, Meng Li, Zhenguo Sun, Ning Jia, Yu Sun, Shaqi Luo, Shiji Song, Gao Huang,
- Abstract summary: Successful echocardiography requires a thorough understanding of the structures on the two-dimensional plane and the spatial relationships between planes in three-dimensional space.
We propose a large-scale self-supervised pre-training method to acquire a cardiac structure-aware world model.
- Score: 66.35766658717205
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The complex structure of the heart leads to significant challenges in echocardiography, especially in acquisition cardiac ultrasound images. Successful echocardiography requires a thorough understanding of the structures on the two-dimensional plane and the spatial relationships between planes in three-dimensional space. In this paper, we innovatively propose a large-scale self-supervised pre-training method to acquire a cardiac structure-aware world model. The core innovation lies in constructing a self-supervised task that requires structural inference by predicting masked structures on a 2D plane and imagining another plane based on pose transformation in 3D space. To support large-scale pre-training, we collected over 1.36 million echocardiograms from ten standard views, along with their 3D spatial poses. In the downstream probe guidance task, we demonstrate that our pre-trained model consistently reduces guidance errors across the ten most common standard views on the test set with 0.29 million samples from 74 routine clinical scans, indicating that structure-aware pre-training benefits the scanning.
Related papers
- Sequence-aware Pre-training for Echocardiography Probe Guidance [66.35766658717205]
Cardiac ultrasound faces two major challenges: (1) the inherently complex structure of the heart, and (2) significant individual variations.
Previous works have only learned the population-averaged 2D and 3D structures of the heart rather than personalized cardiac structural features.
We propose a sequence-aware self-supervised pre-training method to learn personalized 2D and 3D cardiac structural features.
arXiv Detail & Related papers (2024-08-27T12:55:54Z) - Cardiac Copilot: Automatic Probe Guidance for Echocardiography with World Model [66.35766658717205]
There is a severe shortage of experienced cardiac sonographers, due to the heart's complex structure and significant operational challenges.
We present a Cardiac Copilot system capable of providing real-time probe movement guidance.
The core innovation lies in proposing a data-driven world model, named Cardiac Dreamer, for representing cardiac spatial structures.
We train our model with real-world ultrasound data and corresponding probe motion from 110 routine clinical scans with 151K sample pairs by three certified sonographers.
arXiv Detail & Related papers (2024-06-19T02:42:29Z) - Multi-class point cloud completion networks for 3D cardiac anatomy
reconstruction from cine magnetic resonance images [4.1448595037512925]
We propose a novel fully automatic surface reconstruction pipeline capable of reconstructing multi-class 3D cardiac anatomy meshes.
Its key component is a multi-class point cloud completion network (PCCN) capable of correcting both the sparsity and misalignment issues of the 3D reconstruction task.
arXiv Detail & Related papers (2023-07-17T14:52:52Z) - Ultrasound Plane Pose Regression: Assessing Generalized Pose Coordinates
in the Fetal Brain [9.465965149145559]
We aim to build a US plane localization system for 3D visualization, training, and guidance without integrating additional sensors.
This work builds on our previous work, which predicts the six-dimensional (6D) pose of arbitrarily oriented US planes slicing the fetal brain.
We investigate the impact of registration quality in the training and testing data and its subsequent effect on trained models.
arXiv Detail & Related papers (2023-01-19T21:16:36Z) - Agent with Tangent-based Formulation and Anatomical Perception for
Standard Plane Localization in 3D Ultrasound [56.7645826576439]
We introduce a novel reinforcement learning framework for automatic SP localization in 3D US.
First, we formulate SP localization in 3D US as a tangent-point-based problem in RL to restructure the action space.
Second, we design an auxiliary task learning strategy to enhance the model's ability to recognize subtle differences crossing Non-SPs and SPs in plane search.
arXiv Detail & Related papers (2022-07-01T14:53:27Z) - DeepRecon: Joint 2D Cardiac Segmentation and 3D Volume Reconstruction
via A Structure-Specific Generative Method [12.26150675728958]
We propose an end-to-end latent-space-based framework, DeepRecon, that generates multiple clinically essential outcomes.
Our method identifies the optimal latent representation of the cine image that contains accurate semantic information for cardiac structures.
In particular, our model jointly generates synthetic images with accurate semantic information and segmentation of the cardiac structures.
arXiv Detail & Related papers (2022-06-14T20:46:11Z) - 3D Reconstruction of Curvilinear Structures with Stereo Matching
DeepConvolutional Neural Networks [52.710012864395246]
We propose a fully automated pipeline for both detection and matching of curvilinear structures in stereo pairs.
We mainly focus on 3D reconstruction of dislocations from stereo pairs of TEM images.
arXiv Detail & Related papers (2021-10-14T23:05:47Z) - Appearance Learning for Image-based Motion Estimation in Tomography [60.980769164955454]
In tomographic imaging, anatomical structures are reconstructed by applying a pseudo-inverse forward model to acquired signals.
Patient motion corrupts the geometry alignment in the reconstruction process resulting in motion artifacts.
We propose an appearance learning approach recognizing the structures of rigid motion independently from the scanned object.
arXiv Detail & Related papers (2020-06-18T09:49:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.