A Hybrid Convolutional Neural Network with Meta Feature Learning for
Abnormality Detection in Wireless Capsule Endoscopy Images
- URL: http://arxiv.org/abs/2207.09769v1
- Date: Wed, 20 Jul 2022 09:25:57 GMT
- Title: A Hybrid Convolutional Neural Network with Meta Feature Learning for
Abnormality Detection in Wireless Capsule Endoscopy Images
- Authors: Samir Jain, Ayan Seal, Aparajita Ojha
- Abstract summary: A hybrid convolutional neural network is proposed for abnormality detection in wireless capsule endoscopy images.
It consists of three parallel convolutional neural networks, each with a distinctive feature learning capability.
The network trio effectively handles intra-class variance and efficiently detects gastrointestinal abnormalities.
- Score: 8.744537620217674
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Wireless Capsule Endoscopy is one of the most advanced non-invasive methods
for examining the gastrointestinal tract. An intelligent computer-aided
diagnostic system for detecting gastrointestinal abnormalities such as polyps,
bleeding, and inflammation is urgently needed in wireless capsule endoscopy
image analysis. Abnormalities greatly differ in their shape, size, color, and
texture, and some appear to be visually similar to normal regions. This poses a
challenge in designing a binary classifier due to intra-class variations. In
this study, a hybrid convolutional neural network is proposed for abnormality
detection that extracts a rich pool of meaningful features from wireless
capsule endoscopy images using a variety of convolution operations. It consists
of three parallel convolutional neural networks, each with a distinctive
feature learning capability. The first network utilizes depthwise separable
convolution, while the second employs a cosine normalized convolution operation.
A novel meta-feature extraction mechanism is introduced in the third network,
to extract patterns from the statistical information drawn over the features
generated from the first and second networks and its own previous layer. The
network trio effectively handles intra-class variance and efficiently detects
gastrointestinal abnormalities. The proposed hybrid convolutional neural
network model is trained and tested on two widely used publicly available
datasets. The test results demonstrate that the proposed model outperforms six
state-of-the-art methods with 97% and 98% classification accuracy on KID and
Kvasir-Capsule datasets, respectively. Cross-dataset evaluation results also
demonstrate the generalization performance of the proposed model.
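The three building blocks named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: all function names are hypothetical, and the statistics chosen for the meta-features (per-channel mean, standard deviation, and maximum) are an assumption, since the abstract does not say which statistics the third network uses. The sketch compares the weight count of a depthwise separable convolution with that of a standard convolution, computes a cosine normalized response for a single patch, and summarizes a stack of feature maps by per-channel statistics.

```python
import numpy as np


def standard_conv_weights(c_in, c_out, k):
    # A standard k x k convolution mixes space and channels in one step.
    return c_in * c_out * k * k


def depthwise_separable_weights(c_in, c_out, k):
    # One k x k filter per input channel (depthwise), then a 1 x 1
    # convolution that mixes channels (pointwise). Biases are ignored.
    return c_in * k * k + c_in * c_out


def cosine_normalized_response(patch, kernel, eps=1e-8):
    # Cosine similarity between a flattened input patch and a kernel,
    # used in place of the raw dot product; the result lies in [-1, 1].
    p = np.asarray(patch, dtype=float).ravel()
    w = np.asarray(kernel, dtype=float).ravel()
    return float(p @ w / (np.linalg.norm(p) * np.linalg.norm(w) + eps))


def channel_statistics(feature_maps):
    # ASSUMPTION: mean, standard deviation, and maximum per channel are
    # illustrative choices; the abstract does not name the statistics.
    fm = np.asarray(feature_maps, dtype=float)
    flat = fm.reshape(fm.shape[0], -1)
    return np.stack([flat.mean(axis=1), flat.std(axis=1), flat.max(axis=1)], axis=1)


if __name__ == "__main__":
    print(standard_conv_weights(32, 64, 3), depthwise_separable_weights(32, 64, 3))
    rng = np.random.default_rng(0)
    patch, kernel = rng.normal(size=(3, 3)), rng.normal(size=(3, 3))
    # Rescaling the patch leaves the cosine response (almost) unchanged.
    print(cosine_normalized_response(patch, kernel),
          cosine_normalized_response(10.0 * patch, kernel))
    print(channel_statistics(rng.normal(size=(4, 8, 8))).shape)
```

The cosine normalized response is bounded and insensitive to the overall intensity of the patch, which is one plausible reason to pair it with an ordinary convolutional branch on images whose brightness varies.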
Related papers
- Brain Tumor Classification on MRI in Light of Molecular Markers [61.77272414423481]
Co-deletion of the 1p/19q gene is associated with clinical outcomes in low-grade gliomas.
This study aims to utilize a specially designed MRI-based convolutional neural network for brain cancer detection.
arXiv Detail & Related papers (2024-09-29T07:04:26Z)
- Single-Shared Network with Prior-Inspired Loss for Parameter-Efficient Multi-Modal Imaging Skin Lesion Classification [6.195015783344803]
We introduce a multi-modal approach that efficiently integrates multi-scale clinical and dermoscopy features within a single network.
Our method exhibits superiority in both accuracy and model parameters compared to currently advanced methods.
arXiv Detail & Related papers (2024-03-28T08:00:14Z)
- Deception Detection from Linguistic and Physiological Data Streams Using Bimodal Convolutional Neural Networks [19.639533220155965]
This paper explores the application of convolutional neural networks for the purpose of multimodal deception detection.
We use a dataset built by interviewing 104 subjects about two topics, with one truthful and one falsified response from each subject about each topic.
arXiv Detail & Related papers (2023-11-18T02:44:33Z)
- Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection [76.11864242047074]
We propose a novel Affine-Consistent Transformer (AC-Former), which directly yields a sequence of nucleus positions.
We introduce an Adaptive Affine Transformer (AAT) module, which can automatically learn the key spatial transformations to warp original images for local network training.
Experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art algorithms on various benchmarks.
arXiv Detail & Related papers (2023-10-22T02:27:02Z)
- A Prototype-Based Neural Network for Image Anomaly Detection and Localization [10.830337829732915]
This paper proposes ProtoAD, a prototype-based neural network for image anomaly detection and localization.
First, the patch features of normal images are extracted by a deep network pre-trained on natural images.
ProtoAD achieves competitive performance compared to the state-of-the-art methods with a higher inference speed.
arXiv Detail & Related papers (2023-10-04T04:27:16Z)
- Multilayer Multiset Neuronal Networks -- MMNNs [55.2480439325792]
The present work describes multilayer multiset neuronal networks incorporating two or more layers of coincidence similarity neurons.
The work also explores the utilization of counter-prototype points, which are assigned to the image regions to be avoided.
arXiv Detail & Related papers (2023-08-28T12:55:13Z)
- Two-Stream Graph Convolutional Network for Intra-oral Scanner Image Segmentation [133.02190910009384]
We propose a two-stream graph convolutional network (i.e., TSGCN) to handle inter-view confusion between different raw attributes.
Our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.
arXiv Detail & Related papers (2022-04-19T10:41:09Z)
- Anomaly Detection using Capsule Networks for High-dimensional Datasets [0.0]
This study uses a capsule network for the anomaly detection task.
To the best of our knowledge, this is the first instance where a capsule network is analyzed for the anomaly detection task in a high-dimensional complex data setting.
arXiv Detail & Related papers (2021-12-27T05:07:02Z)
- TSGCNet: Discriminative Geometric Feature Learning with Two-Stream Graph Convolutional Network for 3D Dental Model Segmentation [141.2690520327948]
We propose a two-stream graph convolutional network (TSGCNet) to learn multi-view information from different geometric attributes.
We evaluate our proposed TSGCNet on a real-patient dataset of dental models acquired by 3D intraoral scanners.
arXiv Detail & Related papers (2020-12-26T08:02:56Z)
- Comparisons among different stochastic selection of activation layers for convolutional neural networks for healthcare [77.99636165307996]
We classify biomedical images using ensembles of neural networks.
We select our activations among the following ones: ReLU, leaky ReLU, Parametric ReLU, ELU, Adaptive Piecewise Linear Unit, S-Shaped ReLU, Swish, Mish, Mexican Linear Unit, Parametric Deformable Linear Unit, Soft Root Sign.
arXiv Detail & Related papers (2020-11-24T01:53:39Z)
- A Deep Convolutional Neural Network for the Detection of Polyps in Colonoscopy Images [12.618653234201089]
We propose a deep convolutional neural network based model for the computerized detection of polyps within colonoscopy images.
Data augmentation techniques such as photometric and geometric distortions are adapted to overcome the obstacles faced in polyp detection.
arXiv Detail & Related papers (2020-08-15T13:55:44Z)
This list is automatically generated from the titles and abstracts of the papers on this site.