Applying a random projection algorithm to optimize machine learning
model for breast lesion classification
- URL: http://arxiv.org/abs/2009.09937v1
- Date: Wed, 9 Sep 2020 21:27:27 GMT
- Title: Applying a random projection algorithm to optimize machine learning
model for breast lesion classification
- Authors: Morteza Heidari (1), Sivaramakrishnan Lakshmivarahan (2),
Seyedehnafiseh Mirniaharikandehei (1), Gopichandh Danala (1), Sai Kiran R.
Maryada (2), Hong Liu (1), Bin Zheng (1), ((1) School of Electrical and
Computer Engineering, University of Oklahoma, Norman, OK, USA, (2) School of
Computer Sciences, University of Oklahoma, Norman, OK, USA)
- Abstract summary: We build a retrospective dataset involving 1,487 cases of mammograms in which 644 cases have confirmed malignant mass lesions and 843 have benign lesions.
Support vector machine (SVM) models embedded with several feature dimensionality reduction methods are built to predict likelihood of lesions being malignant.
SVM generates a likelihood score of each segmented mass region depicting on one-view mammogram.
By fusion of two scores of the same mass depicting on two-view mammogram, a case-based likelihood score is also evaluated.
- Score: 0.2970239953900422
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Machine learning is widely used in developing computer-aided diagnosis (CAD)
schemes of medical images. However, CAD usually computes large number of image
features from the targeted regions, which creates a challenge of how to
identify a small and optimal feature vector to build robust machine learning
models. In this study, we investigate feasibility of applying a random
projection algorithm to build an optimal feature vector from the initially
CAD-generated large feature pool and improve performance of machine learning
model. We assemble a retrospective dataset involving 1,487 cases of mammograms
in which 644 cases have confirmed malignant mass lesions and 843 have benign
lesions. A CAD scheme is first applied to segment mass regions and initially
compute 181 features. Then, support vector machine (SVM) models embedded with
several feature dimensionality reduction methods are built to predict
likelihood of lesions being malignant. All SVM models are trained and tested
using a leave-one-case-out cross-validation method. SVM generates a likelihood
score of each segmented mass region depicting on one-view mammogram. By fusion
of two scores of the same mass depicting on two-view mammograms, a case-based
likelihood score is also evaluated. Comparing with the principle component
analyses, nonnegative matrix factorization, and Chi-squared methods, SVM
embedded with the random projection algorithm yielded a significantly higher
case-based lesion classification performance with the area under ROC curve of
0.84+0.01 (p<0.02). The study demonstrates that the random project algorithm is
a promising method to generate optimal feature vectors to help improve
performance of machine learning models of medical images.
Related papers
- Dimensionality Reduction and Nearest Neighbors for Improving Out-of-Distribution Detection in Medical Image Segmentation [1.2873975765521795]
This work applied the Mahalanobis distance (MD) post hoc to the bottleneck features of four Swin UNETR and nnU-net models that segmented the liver.
Images the models failed on were detected with high performance and minimal computational load.
arXiv Detail & Related papers (2024-08-05T18:24:48Z) - Histopathological Image Classification with Cell Morphology Aware Deep Neural Networks [11.749248917866915]
We propose a novel DeepCMorph model pre-trained to learn cell morphology and identify a large number of different cancer types.
We pretrained this module on the Pan-Cancer TCGA dataset consisting of over 270K tissue patches extracted from 8736 diagnostic slides from 7175 patients.
The proposed solution achieved a new state-of-the-art performance on the dataset under consideration, detecting 32 cancer types with over 82% accuracy and outperforming all previously proposed solutions by more than 4%.
arXiv Detail & Related papers (2024-07-11T16:03:59Z) - Deep Fusion Model for Brain Tumor Classification Using Fine-Grained Gradient Preservation [0.0807707808613597]
The research presents a novel architecture for precise brain tumor classification fusing pretrained ResNet152V2 and modified VGG16 models.
The proposed solution incorporates various image processing techniques to improve image quality and achieves an astounding accuracy of 98.36% and 98.04% in Figshare and Kaggle datasets respectively.
arXiv Detail & Related papers (2024-06-28T07:06:02Z) - EMT-NET: Efficient multitask network for computer-aided diagnosis of
breast cancer [58.720142291102135]
We propose an efficient and light-weighted learning architecture to classify and segment breast tumors simultaneously.
We incorporate a segmentation task into a tumor classification network, which makes the backbone network learn representations focused on tumor regions.
The accuracy, sensitivity, and specificity of tumor classification is 88.6%, 94.1%, and 85.3%, respectively.
arXiv Detail & Related papers (2022-01-13T05:24:40Z) - Lung Cancer Lesion Detection in Histopathology Images Using Graph-Based
Sparse PCA Network [93.22587316229954]
We propose a graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E)
We evaluate the performance of the proposed algorithm on H&E slides obtained from an SVM K-rasG12D lung cancer mouse model using precision/recall rates, F-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC)
arXiv Detail & Related papers (2021-10-27T19:28:36Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Leveraging SLIC Superpixel Segmentation and Cascaded Ensemble SVM for
Fully Automated Mass Detection In Mammograms [1.7205106391379026]
This paper proposes a rigorous segmentation method, supported by morphological enhancement using grayscale linear filters.
A novel cascaded ensemble of support vector machines (SVM) is used to effectively tackle the class imbalance and provide significant predictions.
arXiv Detail & Related papers (2020-10-20T15:02:25Z) - Applying a random projection algorithm to optimize machine learning
model for predicting peritoneal metastasis in gastric cancer patients using
CT images [0.3120960917423201]
Non-invasively predicting the risk of cancer metastasis before surgery plays an essential role in determining optimal treatment methods.
In this study, we explore a new approach to build an optimal machine learning model using small and imbalanced image datasets.
arXiv Detail & Related papers (2020-09-01T19:53:09Z) - Improved Slice-wise Tumour Detection in Brain MRIs by Computing
Dissimilarities between Latent Representations [68.8204255655161]
Anomaly detection for Magnetic Resonance Images (MRIs) can be solved with unsupervised methods.
We have proposed a slice-wise semi-supervised method for tumour detection based on the computation of a dissimilarity function in the latent space of a Variational AutoEncoder.
We show that by training the models on higher resolution images and by improving the quality of the reconstructions, we obtain results which are comparable with different baselines.
arXiv Detail & Related papers (2020-07-24T14:02:09Z) - Modelling the Distribution of 3D Brain MRI using a 2D Slice VAE [66.63629641650572]
We propose a method to model 3D MR brain volumes distribution by combining a 2D slice VAE with a Gaussian model that captures the relationships between slices.
We also introduce a novel evaluation method for generated volumes that quantifies how well their segmentations match those of true brain anatomy.
arXiv Detail & Related papers (2020-07-09T13:23:15Z) - Y-Net for Chest X-Ray Preprocessing: Simultaneous Classification of
Geometry and Segmentation of Annotations [70.0118756144807]
This work introduces a general pre-processing step for chest x-ray input into machine learning algorithms.
A modified Y-Net architecture based on the VGG11 encoder is used to simultaneously learn geometric orientation and segmentation of radiographs.
Results were evaluated by expert clinicians, with acceptable geometry in 95.8% and annotation mask in 96.2%, compared to 27.0% and 34.9% respectively in control images.
arXiv Detail & Related papers (2020-05-08T02:16:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.