Related papers: ConVibNet: Needle Detection during Continuous Insertion via Frequency-Inspired Features

URL: http://arxiv.org/abs/2603.01147v1
Date: Sun, 01 Mar 2026 15:16:25 GMT
Title: ConVibNet: Needle Detection during Continuous Insertion via Frequency-Inspired Features
Authors: Jiamei Guo, Zhehao Duan, Maria Neiiendam, Dianye Huang, Nassir Navab, Zhongliang Jiang
Abstract summary: We present ConVibNet, an extension of VibNet for detecting needles with significantly reduced visibility. We introduce a novel intersection-and-difference loss that explicitly leverages motion correlations across consecutive frames.
Score: 36.97601609064981
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Purpose: Ultrasound-guided needle interventions are widely used in clinical practice, but their success critically depends on accurate needle placement, which is frequently hindered by the poor and intermittent visibility of needles in ultrasound images. Existing approaches remain limited by artifacts, occlusions, and low contrast, and often fail to support real-time continuous insertion. To overcome these challenges, this study introduces a robust real-time framework for continuous needle detection. Methods: We present ConVibNet, an extension of VibNet for detecting needles with significantly reduced visibility, addressing real-time, continuous needle tracking during insertion. ConVibNet leverages temporal dependencies across successive ultrasound frames to enable continuous estimation of both needle tip position and shaft angle in dynamic scenarios. To strengthen temporal awareness of needle-tip motion, we introduce a novel intersection-and-difference loss that explicitly leverages motion correlations across consecutive frames. In addition, we curated a dedicated dataset for model development and evaluation. Results: The performance of the proposed ConVibNet model was evaluated on our dataset, demonstrating superior accuracy compared to the baseline VibNet and UNet-LSTM models. Specifically, ConVibNet achieved a tip error of 2.80 ± 2.42 mm and an angle error of 1.69 ± 2.00 deg. These results represent a 0.75 mm improvement in tip localization accuracy over the best-performing baseline, while preserving real-time inference capability. Conclusion: ConVibNet advances real-time needle detection in ultrasound-guided interventions by integrating temporal correlation modeling with a novel intersection-and-difference loss, thereby improving accuracy and robustness and demonstrating high potential for integration into autonomous insertion systems.
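The abstract reports tip error in millimetres and shaft angle error in degrees, each as a mean with a spread. The sketch below shows one common way such metrics could be computed. It is an illustration under stated assumptions, not the authors' evaluation code: the function names, the `mm_per_pixel` spacing parameter, the modulo-180 treatment of the shaft angle, and the use of the population standard deviation are all hypothetical choices not stated in the listing.

```python
import math


def tip_error_mm(pred_tip, true_tip, mm_per_pixel):
    """Euclidean distance between predicted and annotated tip, in mm.

    Assumes tips are (x, y) pixel coordinates and that pixel spacing is
    isotropic (the same mm_per_pixel in both directions).
    """
    dx = (pred_tip[0] - true_tip[0]) * mm_per_pixel
    dy = (pred_tip[1] - true_tip[1]) * mm_per_pixel
    return math.hypot(dx, dy)


def angle_error_deg(pred_angle_deg, true_angle_deg):
    """Smallest absolute difference between two shaft orientations, in degrees.

    Assumes the shaft is treated as an undirected line, so orientations
    are compared modulo 180 degrees.
    """
    diff = abs(pred_angle_deg - true_angle_deg) % 180.0
    return min(diff, 180.0 - diff)


def mean_std(values):
    """Mean and population standard deviation of a non-empty list."""
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / n
    return mean, math.sqrt(variance)


# Hypothetical per-frame predictions, for illustration only.
tip_errors = [
    tip_error_mm((13, 24), (10, 20), mm_per_pixel=0.5),
    tip_error_mm((10, 20), (10, 20), mm_per_pixel=0.5),
]
tip_mean, tip_std = mean_std(tip_errors)
```

Aggregating per-frame errors with `mean_std` yields a "mean ± spread" summary of the kind quoted in the abstract; whether the paper reports a population or sample standard deviation is not specified here.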

Related papers

A Self-Adaptive Frequency Domain Network for Continuous Intraoperative Hypotension Prediction [9.841996321633298]
Intraoperative hypotension (IOH) is strongly associated with postoperative complications, including delirium and increased mortality. Existing methods face limitations in incorporating both time and frequency domain information. We propose a novel Self-Adaptive Frequency Domain Network (SAFDNet).
arXiv Detail & Related papers (2025-09-28T08:02:28Z)
OccluNet: Spatio-Temporal Deep Learning for Occlusion Detection on DSA [1.3635341861371646]
Interpretation of digital subtraction angiography poses challenges due to anatomical complexity and time constraints. This work proposes OccluNet, a spatio-temporal deep learning model that integrates YOLOX, a single-stage object detector. Evaluation on DSA images from the MR CLEAN Registry revealed the model's capability to capture temporally consistent features.
arXiv Detail & Related papers (2025-08-19T21:59:59Z)
Unstable Prompts, Unreliable Segmentations: A Challenge for Longitudinal Lesion Analysis [0.5537760992845262]
This paper investigates the performance of the ULS23 segmentation model in a longitudinal context. We identify two critical, interconnected failure modes: a sharp degradation in segmentation quality in follow-up cases due to inter-scan registration errors, and a subsequent breakdown of the lesion correspondence process.
arXiv Detail & Related papers (2025-07-25T12:55:48Z)
Imputation of Missing Data in Smooth Pursuit Eye Movements Using a Self-Attention-based Deep Learning Approach [0.0]
We propose a novel imputation framework using Self-Attention-based Imputation networks for time series. We refine the imputed data using a custom-made autoencoder, tailored to represent smooth pursuit eye movement sequences. Results show a significant improvement in the accuracy of reconstructed eye movement sequences.
arXiv Detail & Related papers (2025-05-31T13:10:30Z)
Rethinking Contrastive Learning in Graph Anomaly Detection: A Clean-View Perspective [54.605073936695575]
Graph anomaly detection aims to identify unusual patterns in graph-based data, with wide applications in fields such as web security and financial fraud detection. Existing methods rely on contrastive learning, assuming that a lower similarity between a node and its local subgraph indicates abnormality. The presence of interfering edges invalidates this assumption, since it introduces disruptive noise that compromises the contrastive learning process. We propose a Clean-View Enhanced Graph Anomaly Detection framework (CVGAD), which includes a multi-scale anomaly awareness module to identify key sources of interference in the contrastive learning process.
arXiv Detail & Related papers (2025-05-23T15:05:56Z)
Real-time guidewire tracking and segmentation in intraoperative x-ray [52.51797358201872]
We propose a two-stage deep learning framework for real-time guidewire segmentation and tracking. In the first stage, a Yolov5 detector is trained, using the original X-ray images as well as synthetic ones, to output the bounding boxes of possible target guidewires. In the second stage, a novel and efficient network is proposed to segment the guidewire in each detected bounding box.
arXiv Detail & Related papers (2024-04-12T20:39:19Z)
VibNet: Vibration-Boosted Needle Detection in Ultrasound Images [40.64433529217187]
VibNet is a learning-based framework designed to enhance the visibility and accuracy of needle detection in US images. VibNet integrates neural Short-Time Fourier Transform and Hough Transform modules to achieve successive sub-goals, including motion feature extraction and needle detection.
arXiv Detail & Related papers (2024-03-21T16:23:25Z)
Real-time landmark detection for precise endoscopic submucosal dissection via shape-aware relation network [51.44506007844284]
We propose a shape-aware relation network for accurate and real-time landmark detection in endoscopic submucosal dissection surgery. We first devise an algorithm to automatically generate relation keypoint heatmaps, which intuitively represent the prior knowledge of spatial relations among landmarks. We then develop two complementary regularization schemes to progressively incorporate the prior knowledge into the training process.
arXiv Detail & Related papers (2021-11-08T07:57:30Z)
Markerless Suture Needle 6D Pose Tracking with Robust Uncertainty Estimation for Autonomous Minimally Invasive Robotic Surgery [11.530352384883361]
We present a novel approach for markerless suture needle pose tracking using Bayesian filters. A data-efficient feature point detector is trained to extract the feature points on the needle. A novel observation model measures the overlap between the detections and the expected projection of the needle.
arXiv Detail & Related papers (2021-09-26T23:30:14Z)
Continuity-Discrimination Convolutional Neural Network for Visual Object Tracking [150.51667609413312]
This paper proposes a novel model, named Continuity-Discrimination Convolutional Neural Network (CD-CNN) for visual object tracking. To address this problem, CD-CNN models temporal appearance continuity based on the idea of temporal slowness. In order to alleviate inaccurate target localization and drifting, we propose a novel notion, object-centroid.
arXiv Detail & Related papers (2021-04-18T06:35:03Z)
Towards Streaming Perception [70.68520310095155]
We present an approach that coherently integrates latency and accuracy into a single metric for real-time online perception. The key insight behind this metric is to jointly evaluate the output of the entire perception stack at every time instant. We focus on the illustrative tasks of object detection and instance segmentation in urban video streams, and contribute a novel dataset with high-quality and temporally-dense annotations.
arXiv Detail & Related papers (2020-05-21T01:51:35Z)

This list is automatically generated from the titles and abstracts of the papers on this site.