AutoML Segmentation for 3D Medical Image Data: Contribution to the MSD
Challenge 2018
- URL: http://arxiv.org/abs/2005.09978v1
- Date: Wed, 20 May 2020 11:47:02 GMT
- Title: AutoML Segmentation for 3D Medical Image Data: Contribution to the MSD
Challenge 2018
- Authors: Oliver Rippel, Leon Weninger, Dorit Merhof
- Abstract summary: A 3D convolutional neural network with encoder-decoder architecture was developed and is presented in this paper.
It works on anisotropic voxel-geometries and has anisotropic depth, i.e., the number of down steps is a task-specific parameter.
- Score: 2.9864637081333085
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Fueled by recent advances in machine learning, there has been tremendous
progress in the field of semantic segmentation for the medical image computing
community. However, developed algorithms are often optimized and validated by
hand based on one task only. In combination with small datasets, interpreting
the generalizability of the results is often difficult. The Medical
Segmentation Decathlon challenge addresses this problem, and aims to facilitate
development of generalizable 3D semantic segmentation algorithms that require
no manual parametrization. Such an algorithm was developed and is presented in
this paper. It consists of a 3D convolutional neural network with
encoder-decoder architecture employing residual-connections, skip-connections
and multi-level generation of predictions. It works on anisotropic
voxel-geometries and has anisotropic depth, i.e., the number of downsampling
steps is a task-specific parameter. These depths are automatically inferred for
each task prior to training. By combining this flexible architecture with
on-the-fly data augmentation and little-to-no pre-- or postprocessing,
promising results could be achieved. The code developed for this challenge will
be available online after the final deadline at:
https://github.com/ORippler/MSD_2018
Related papers
- MeshConv3D: Efficient convolution and pooling operators for triangular 3D meshes [0.0]
MeshConv3D is a 3D mesh-dedicated methodology integrating specialized convolution and face collapse-based pooling operators.
The experimental results obtained on three distinct benchmark datasets show that the proposed approach makes it possible to achieve equivalent or superior classification results.
arXiv Detail & Related papers (2025-01-07T14:41:26Z) - Bayesian Self-Training for Semi-Supervised 3D Segmentation [59.544558398992386]
3D segmentation is a core problem in computer vision.
densely labeling 3D point clouds to employ fully-supervised training remains too labor intensive and expensive.
Semi-supervised training provides a more practical alternative, where only a small set of labeled data is given, accompanied by a larger unlabeled set.
arXiv Detail & Related papers (2024-09-12T14:54:31Z) - E2ENet: Dynamic Sparse Feature Fusion for Accurate and Efficient 3D Medical Image Segmentation [34.865695471451886]
We propose a 3D medical image segmentation model, named Efficient to Efficient Network (E2ENet)
It incorporates two parametrically and computationally efficient designs.
It consistently achieves a superior trade-off between accuracy and efficiency across various resource constraints.
arXiv Detail & Related papers (2023-12-07T22:13:37Z) - Automatic Aorta Segmentation with Heavily Augmented, High-Resolution 3-D
ResUNet: Contribution to the SEG.A Challenge [0.1633301148398433]
This work presents a contribution by the MedGIFT team to the SEG.A challenge organized during the MICCAI 2023 conference.
We propose a fully automated algorithm based on deep encoder-decoder architecture.
We freely release the source code, pretrained model, and provide access to the algorithm on the Grand-Challenge platform.
arXiv Detail & Related papers (2023-10-24T13:28:46Z) - EVC-Net: Multi-scale V-Net with Conditional Random Fields for Brain
Extraction [3.4376560669160394]
EVC-Net adds lower scale inputs on each encoder block.
Conditional Random Fields are re-introduced here as an additional step for refining the network's output.
Results show that even with limited training resources, EVC-Net achieves higher Dice Coefficient and Jaccard Index.
arXiv Detail & Related papers (2022-06-06T18:21:21Z) - MISSU: 3D Medical Image Segmentation via Self-distilling TransUNet [55.16833099336073]
We propose to self-distill a Transformer-based UNet for medical image segmentation.
It simultaneously learns global semantic information and local spatial-detailed features.
Our MISSU achieves the best performance over previous state-of-the-art methods.
arXiv Detail & Related papers (2022-06-02T07:38:53Z) - Dynamic Linear Transformer for 3D Biomedical Image Segmentation [2.440109381823186]
Transformer-based neural networks have surpassed promising performance on many biomedical image segmentation tasks.
Main challenge for 3D transformer-based segmentation methods is the quadratic complexity introduced by the self-attention mechanism.
We propose a novel transformer architecture for 3D medical image segmentation using an encoder-decoder style architecture with linear complexity.
arXiv Detail & Related papers (2022-06-01T21:15:01Z) - Recovering 3D Human Mesh from Monocular Images: A Survey [49.00136388529404]
Estimating human pose and shape from monocular images is a long-standing problem in computer vision.
This survey focuses on the task of monocular 3D human mesh recovery.
arXiv Detail & Related papers (2022-03-03T18:56:08Z) - CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image
Segmentation [95.51455777713092]
Convolutional neural networks (CNNs) have been the de facto standard for nowadays 3D medical image segmentation.
We propose a novel framework that efficiently bridges a bf Convolutional neural network and a bf Transformer bf (CoTr) for accurate 3D medical image segmentation.
arXiv Detail & Related papers (2021-03-04T13:34:22Z) - TSGCNet: Discriminative Geometric Feature Learning with Two-Stream
GraphConvolutional Network for 3D Dental Model Segmentation [141.2690520327948]
We propose a two-stream graph convolutional network (TSGCNet) to learn multi-view information from different geometric attributes.
We evaluate our proposed TSGCNet on a real-patient dataset of dental models acquired by 3D intraoral scanners.
arXiv Detail & Related papers (2020-12-26T08:02:56Z) - 2nd Place Scheme on Action Recognition Track of ECCV 2020 VIPriors
Challenges: An Efficient Optical Flow Stream Guided Framework [57.847010327319964]
We propose a data-efficient framework that can train the model from scratch on small datasets.
Specifically, by introducing a 3D central difference convolution operation, we proposed a novel C3D neural network-based two-stream framework.
It is proved that our method can achieve a promising result even without a pre-trained model on large scale datasets.
arXiv Detail & Related papers (2020-08-10T09:50:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.