Weighted Monte Carlo augmented spherical Fourier-Bessel convolutional
layers for 3D abdominal organ segmentation
- URL: http://arxiv.org/abs/2402.16825v3
- Date: Sat, 9 Mar 2024 22:06:18 GMT
- Title: Weighted Monte Carlo augmented spherical Fourier-Bessel convolutional
layers for 3D abdominal organ segmentation
- Authors: Wenzhao Zhao, Steffen Albert, Barbara D. Wichtmann, Angelika Maurer,
Ulrike Attenberger, Frank G. Z\"ollner, and J\"urgen Hesser
- Abstract summary: Filter-decomposition-based 3D group equivariant neural networks show promising stability and data efficiency for 3D image feature extraction.
This paper describes a non- parameter-sharing affine group equivariant neural network for 3D medical image segmentation.
The efficiency and flexibility of the adopted non- parameter-sharing strategy enable for the first time an efficient implementation of 3D affine group equivariant convolutional neural networks for volumetric data.
- Score: 0.31410859223862103
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Filter-decomposition-based group equivariant convolutional neural networks
show promising stability and data efficiency for 3D image feature extraction.
However, the existing filter-decomposition-based 3D group equivariant neural
networks rely on parameter-sharing designs and are mostly limited to rotation
transformation groups, where the chosen spherical harmonic filter bases
consider only angular orthogonality. These limitations hamper its application
to deep neural network architectures for medical image segmentation. To address
these issues, this paper describes a non-parameter-sharing affine group
equivariant neural network for 3D medical image segmentation based on an
adaptive aggregation of Monte Carlo augmented spherical Fourier Bessel filter
bases. The efficiency and flexibility of the adopted non-parameter-sharing
strategy enable for the first time an efficient implementation of 3D affine
group equivariant convolutional neural networks for volumetric data. The
introduced spherical Bessel Fourier filter basis combines both angular and
radial orthogonality for better feature extraction. The 3D image segmentation
experiments on two abdominal medical image sets, BTCV and the NIH Pancreas
datasets, show that the proposed methods excel the state-of-the-art 3D neural
networks with high training stability and data efficiency. The code will be
available at https://github.com/ZhaoWenzhao/WMCSFB.
Related papers
- On the Fourier analysis in the SO(3) space : EquiLoPO Network [2.7624021966289605]
Existing deep-learning approaches utilize either group convolutional networks limited to discrete rotations or steerable convolutional networks with constrained filter structures.
This work proposes a novel equivariant neural network architecture that achieves analytical Equivariance to Local Pattern Orientation on the continuous SO(3) group.
By integrating these operations into a ResNet-style architecture, we propose a model that overcomes the limitations of prior methods.
arXiv Detail & Related papers (2024-04-24T16:54:39Z) - NeuRBF: A Neural Fields Representation with Adaptive Radial Basis
Functions [93.02515761070201]
We present a novel type of neural fields that uses general radial bases for signal representation.
Our method builds upon general radial bases with flexible kernel position and shape, which have higher spatial adaptivity and can more closely fit target signals.
When applied to neural radiance field reconstruction, our method achieves state-of-the-art rendering quality, with small model size and comparable training speed.
arXiv Detail & Related papers (2023-09-27T06:32:05Z) - NeRF-GAN Distillation for Efficient 3D-Aware Generation with
Convolutions [97.27105725738016]
integration of Neural Radiance Fields (NeRFs) and generative models, such as Generative Adversarial Networks (GANs) has transformed 3D-aware generation from single-view images.
We propose a simple and effective method, based on re-using the well-disentangled latent space of a pre-trained NeRF-GAN in a pose-conditioned convolutional network to directly generate 3D-consistent images corresponding to the underlying 3D representations.
arXiv Detail & Related papers (2023-03-22T18:59:48Z) - Leveraging SO(3)-steerable convolutions for pose-robust semantic segmentation in 3D medical data [2.207533492015563]
We present a new family of segmentation networks that use equivariant voxel convolutions based on spherical harmonics.
These networks are robust to data poses not seen during training, and do not require rotation-based data augmentation during training.
We demonstrate improved segmentation performance in MRI brain tumor and healthy brain structure segmentation tasks.
arXiv Detail & Related papers (2023-03-01T09:27:08Z) - Moving Frame Net: SE(3)-Equivariant Network for Volumes [0.0]
A rotation and translation equivariant neural network for image data was proposed based on the moving frames approach.
We significantly improve that approach by reducing the computation of moving frames to only one, at the input stage.
Our trained model overperforms the benchmarks in the medical volume classification of most of the tested datasets from MedMNIST3D.
arXiv Detail & Related papers (2022-11-07T10:25:38Z) - Two-Stream Graph Convolutional Network for Intra-oral Scanner Image
Segmentation [133.02190910009384]
We propose a two-stream graph convolutional network (i.e., TSGCN) to handle inter-view confusion between different raw attributes.
Our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.
arXiv Detail & Related papers (2022-04-19T10:41:09Z) - CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image
Segmentation [95.51455777713092]
Convolutional neural networks (CNNs) have been the de facto standard for nowadays 3D medical image segmentation.
We propose a novel framework that efficiently bridges a bf Convolutional neural network and a bf Transformer bf (CoTr) for accurate 3D medical image segmentation.
arXiv Detail & Related papers (2021-03-04T13:34:22Z) - TSGCNet: Discriminative Geometric Feature Learning with Two-Stream
GraphConvolutional Network for 3D Dental Model Segmentation [141.2690520327948]
We propose a two-stream graph convolutional network (TSGCNet) to learn multi-view information from different geometric attributes.
We evaluate our proposed TSGCNet on a real-patient dataset of dental models acquired by 3D intraoral scanners.
arXiv Detail & Related papers (2020-12-26T08:02:56Z) - 3D Solid Spherical Bispectrum CNNs for Biomedical Texture Analysis [3.579867431007686]
Locally Rotation Invariant (LRI) operators have shown great potential in biomedical texture analysis.
We investigate the benefits of using the bispectrum over the spectrum in the design of a LRI layer embedded in a shallow Convolutional Neural Network (CNN) for 3D image analysis.
arXiv Detail & Related papers (2020-04-28T09:01:13Z) - Learning Local Neighboring Structure for Robust 3D Shape Representation [143.15904669246697]
Representation learning for 3D meshes is important in many computer vision and graphics applications.
We propose a local structure-aware anisotropic convolutional operation (LSA-Conv)
Our model produces significant improvement in 3D shape reconstruction compared to state-of-the-art methods.
arXiv Detail & Related papers (2020-04-21T13:40:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.