Patient-Specific Domain Adaptation for Fast Optical Flow Based on
Teacher-Student Knowledge Transfer
- URL: http://arxiv.org/abs/2007.04928v1
- Date: Thu, 9 Jul 2020 17:01:08 GMT
- Title: Patient-Specific Domain Adaptation for Fast Optical Flow Based on
Teacher-Student Knowledge Transfer
- Authors: Sontje Ihler and Max-Heinrich Laves and Tobias Ortmaier
- Abstract summary: Fast motion feedback is crucial in computer-aided surgery (CAS) on moving tissue.
Current deep learning OF models show the common speed vs. accuracy trade-off.
We propose patient-specific fine-tuning of a fast model to achieve high accuracy at high processing rates.
- Score: 2.0303656145222857
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Fast motion feedback is crucial in computer-aided surgery (CAS) on moving
tissue. Image-assistance in safety-critical vision applications requires a
dense tracking of tissue motion. This can be done using optical flow (OF).
Accurate motion predictions at high processing rates lead to higher patient
safety. Current deep learning OF models show the common speed vs. accuracy
trade-off. To achieve high accuracy at high processing rates, we propose
patient-specific fine-tuning of a fast model. This minimizes the domain gap
between training and application data, while reducing the target domain to the
capability of the lower complex, fast model. We propose to obtain training
sequences pre-operatively in the operation room. We handle missing ground
truth, by employing teacher-student learning. Using flow estimations from
teacher model FlowNet2 we specialize a fast student model FlowNet2S on the
patient-specific domain. Evaluation is performed on sequences from the Hamlyn
dataset. Our student model shows very good performance after fine-tuning.
Tracking accuracy is comparable to the teacher model at a speed up of factor
six. Fine-tuning can be performed within minutes, making it feasible for the
operation room. Our method allows to use a real-time capable model that was
previously not suited for this task. This method is laying the path for
improved patient-specific motion estimation in CAS.
Related papers
- FastSTI: A Fast Conditional Pseudo Numerical Diffusion Model for Spatio-temporal Traffic Data Imputation [4.932317347331121]
High-temporal traffic data is crucial for intelligent transportation systems (ITS) and their data-driven applications.
Recent studies of diffusion probability models have demonstrated the superiority of deep generative models in imputation.
Fast on two types of real-world traffic datasets proves its ability to impute higher-quality samples in only six steps.
arXiv Detail & Related papers (2024-10-20T01:45:51Z) - A-SDM: Accelerating Stable Diffusion through Redundancy Removal and
Performance Optimization [54.113083217869516]
In this work, we first explore the computational redundancy part of the network.
We then prune the redundancy blocks of the model and maintain the network performance.
Thirdly, we propose a global-regional interactive (GRI) attention to speed up the computationally intensive attention part.
arXiv Detail & Related papers (2023-12-24T15:37:47Z) - Efficient Deformable Tissue Reconstruction via Orthogonal Neural Plane [58.871015937204255]
We introduce Fast Orthogonal Plane (plane) for the reconstruction of deformable tissues.
We conceptualize surgical procedures as 4D volumes, and break them down into static and dynamic fields comprised of neural planes.
This factorization iscretizes four-dimensional space, leading to a decreased memory usage and faster optimization.
arXiv Detail & Related papers (2023-12-23T13:27:50Z) - StreamYOLO: Real-time Object Detection for Streaming Perception [84.2559631820007]
We endow the models with the capacity of predicting the future, significantly improving the results for streaming perception.
We consider multiple velocities driving scene and propose Velocity-awared streaming AP (VsAP) to jointly evaluate the accuracy.
Our simple method achieves the state-of-the-art performance on Argoverse-HD dataset and improves the sAP and VsAP by 4.7% and 8.2% respectively.
arXiv Detail & Related papers (2022-07-21T12:03:02Z) - Embedding Recycling for Language Models [38.11465250435789]
We study how to decrease computational cost in such settings through embedding recycling (ER)
We propose caching an intermediate layer's output from a pretrained model and finetuning the remaining layers for new tasks.
We show that our method provides a 100% speedup during training and a 55-86% speedup for inference, and has negligible impacts on accuracy for text classification and entity recognition tasks in the scientific domain.
arXiv Detail & Related papers (2022-07-11T16:36:14Z) - EM-driven unsupervised learning for efficient motion segmentation [3.5232234532568376]
This paper presents a CNN-based fully unsupervised method for motion segmentation from optical flow.
We use the Expectation-Maximization (EM) framework to leverage the loss function and the training procedure of our motion segmentation neural network.
Our method outperforms comparable unsupervised methods and is very efficient.
arXiv Detail & Related papers (2022-01-06T14:35:45Z) - Training Aware Sigmoidal Optimizer [2.99368851209995]
Training Aware Sigmoidal functions present landscapes with much more saddle loss than local minima.
We proposed the Training Aware Sigmoidal functions (TASO), which consists of a two-phases automated learning rate schedule.
We compared the proposed approach with commonly used adaptive learning rate schedules such as Adam, RMS, and Adagrad.
arXiv Detail & Related papers (2021-02-17T12:00:46Z) - FastIF: Scalable Influence Functions for Efficient Model Interpretation
and Debugging [112.19994766375231]
Influence functions approximate the 'influences' of training data-points for test predictions.
We present FastIF, a set of simple modifications to influence functions that significantly improves their run-time.
Our experiments demonstrate the potential of influence functions in model interpretation and correcting model errors.
arXiv Detail & Related papers (2020-12-31T18:02:34Z) - Motion Pyramid Networks for Accurate and Efficient Cardiac Motion
Estimation [51.72616167073565]
We propose Motion Pyramid Networks, a novel deep learning-based approach for accurate and efficient cardiac motion estimation.
We predict and fuse a pyramid of motion fields from multiple scales of feature representations to generate a more refined motion field.
We then use a novel cyclic teacher-student training strategy to make the inference end-to-end and further improve the tracking performance.
arXiv Detail & Related papers (2020-06-28T21:03:19Z) - Fast Modeling and Understanding Fluid Dynamics Systems with
Encoder-Decoder Networks [0.0]
We show that an accurate deep-learning-based proxy model can be taught efficiently by a finite-volume-based simulator.
Compared to traditional simulation, the proposed deep learning approach enables much faster forward computation.
We quantify the sensitivity of the deep learning model to key physical parameters and hence demonstrate that the inversion problems can be solved with great acceleration.
arXiv Detail & Related papers (2020-06-09T17:14:08Z) - Parameter-Efficient Transfer from Sequential Behaviors for User Modeling
and Recommendation [111.44445634272235]
In this paper, we develop a parameter efficient transfer learning architecture, termed as PeterRec.
PeterRec allows the pre-trained parameters to remain unaltered during fine-tuning by injecting a series of re-learned neural networks.
We perform extensive experimental ablation to show the effectiveness of the learned user representation in five downstream tasks.
arXiv Detail & Related papers (2020-01-13T14:09:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.