Agent with Warm Start and Active Termination for Plane Localization in
3D Ultrasound
- URL: http://arxiv.org/abs/1910.04331v2
- Date: Sun, 3 Mar 2024 12:01:18 GMT
- Title: Agent with Warm Start and Active Termination for Plane Localization in
3D Ultrasound
- Authors: Haoran Dou, Xin Yang, Jikuan Qian, Wufeng Xue, Hao Qin, Xu Wang,
Lequan Yu, Shujun Wang, Yi Xiong, Pheng-Ann Heng, Dong Ni
- Abstract summary: Standard plane localization is crucial for ultrasound (US) diagnosis.
In prenatal US, dozens of standard planes are manually acquired with a 2D probe.
We propose a novel reinforcement learning framework to automatically localize fetal brain standard planes in 3D US.
- Score: 56.14006424500334
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Standard plane localization is crucial for ultrasound (US) diagnosis. In
prenatal US, dozens of standard planes are manually acquired with a 2D probe.
It is time-consuming and operator-dependent. In comparison, 3D US containing
multiple standard planes in one shot has the inherent advantages of less
user-dependency and more efficiency. However, manual plane localization in US
volume is challenging due to the huge search space and large fetal posture
variation. In this study, we propose a novel reinforcement learning (RL)
framework to automatically localize fetal brain standard planes in 3D US. Our
contribution is two-fold. First, we equip the RL framework with a
landmark-aware alignment module to provide warm start and strong spatial bounds
for the agent actions, thus ensuring its effectiveness. Second, instead of
passively and empirically terminating the agent inference, we propose a
recurrent neural network based strategy for active termination of the agent's
interaction procedure. This improves both the accuracy and efficiency of the
localization system. Extensively validated on our in-house large dataset, our
approach achieves the accuracy of 3.4mm/9.6{\deg} and 2.7mm/9.1{\deg} for the
transcerebellar and transthalamic plane localization, respectively. Ourproposed
RL framework is general and has the potential to improve the efficiency and
standardization of US scanning.
Related papers
- GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane [53.388937705785025]
3D open-vocabulary scene understanding is crucial for advancing augmented reality and robotic applications.
We introduce GOI, a framework that integrates semantic features from 2D vision-language foundation models into 3D Gaussian Splatting (3DGS)
Our method treats the feature selection process as a hyperplane division within the feature space, retaining only features that are highly relevant to the query.
arXiv Detail & Related papers (2024-05-27T18:57:18Z) - Language-Guided 3D Object Detection in Point Cloud for Autonomous
Driving [91.91552963872596]
We propose a new multi-modal visual grounding task, termed LiDAR Grounding.
It jointly learns the LiDAR-based object detector with the language features and predicts the targeted region directly from the detector.
Our work offers a deeper insight into the LiDAR-based grounding task and we expect it presents a promising direction for the autonomous driving community.
arXiv Detail & Related papers (2023-05-25T06:22:10Z) - Ultrasound Plane Pose Regression: Assessing Generalized Pose Coordinates
in the Fetal Brain [9.465965149145559]
We aim to build a US plane localization system for 3D visualization, training, and guidance without integrating additional sensors.
This work builds on our previous work, which predicts the six-dimensional (6D) pose of arbitrarily oriented US planes slicing the fetal brain.
We investigate the impact of registration quality in the training and testing data and its subsequent effect on trained models.
arXiv Detail & Related papers (2023-01-19T21:16:36Z) - Agent with Tangent-based Formulation and Anatomical Perception for
Standard Plane Localization in 3D Ultrasound [56.7645826576439]
We introduce a novel reinforcement learning framework for automatic SP localization in 3D US.
First, we formulate SP localization in 3D US as a tangent-point-based problem in RL to restructure the action space.
Second, we design an auxiliary task learning strategy to enhance the model's ability to recognize subtle differences crossing Non-SPs and SPs in plane search.
arXiv Detail & Related papers (2022-07-01T14:53:27Z) - Progressive Coordinate Transforms for Monocular 3D Object Detection [52.00071336733109]
We propose a novel and lightweight approach, dubbed em Progressive Coordinate Transforms (PCT) to facilitate learning coordinate representations.
In this paper, we propose a novel and lightweight approach, dubbed em Progressive Coordinate Transforms (PCT) to facilitate learning coordinate representations.
arXiv Detail & Related papers (2021-08-12T15:22:33Z) - Searching Collaborative Agents for Multi-plane Localization in 3D
Ultrasound [15.573821037925143]
3D ultrasound can contain multiple standard planes (SPs) in one shot.
Manual SP localization in 3D US is challenging because of the low image quality, huge search space and large anatomical variability.
We propose a novel multi-agent reinforcement learning framework to simultaneously localize multiple SPs in 3D US.
arXiv Detail & Related papers (2021-05-22T02:48:23Z) - Agent with Warm Start and Adaptive Dynamic Termination for Plane
Localization in 3D Ultrasound [14.256624552635786]
This study enhances our previous RL framework with a newly designed adaptive dynamic termination to enable an early stop for the agent searching.
Our approach achieves localization error of 2.52mm/10.26 degrees, 2.48mm/10.39 degrees, 2.02mm/10.48 degrees, 2.00mm/14.57 degrees, 2.61mm/9.71 degrees, 3.09mm/9.58 degrees, 1.49mm/7.54 degrees for the transcerebellar, transventricular, transthalamic planes in fetal brain, abdominal plane in fetal abdomen, and mid-sagittal, transverse and coronal planes in uterus.
arXiv Detail & Related papers (2021-03-26T14:57:26Z) - Searching Collaborative Agents for Multi-plane Localization in 3D
Ultrasound [59.97366727654676]
3D ultrasound (US) is widely used due to its rich diagnostic information, portability and low cost.
Standard plane (SP) localization in US volume not only improves efficiency and reduces user-dependence, but also boosts 3D US interpretation.
We propose a novel Multi-Agent Reinforcement Learning framework to localize multiple uterine SPs in 3D US simultaneously.
arXiv Detail & Related papers (2020-07-30T07:23:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.