Virus-MNIST: Machine Learning Baseline Calculations for Image
Classification
- URL: http://arxiv.org/abs/2111.02375v1
- Date: Wed, 3 Nov 2021 17:44:23 GMT
- Title: Virus-MNIST: Machine Learning Baseline Calculations for Image
Classification
- Authors: Erik Larsen, Korey MacVittie, and John Lilly
- Abstract summary: The Virus-MNIST data set is a collection of thumbnail images that is similar in style to the ubiquitous MNIST hand-written digits.
It is poised to take on a role in benchmarking progress of virus model training.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The Virus-MNIST data set is a collection of thumbnail images that is similar
in style to the ubiquitous MNIST hand-written digits. These, however, are cast
by reshaping possible malware code into an image array. Naturally, it is poised
to take on a role in benchmarking progress of virus classifier model training.
Ten types are present: nine classified as malware and one benign. Cursory
examination reveals unequal class populations and other key aspects that must
be considered when selecting classification and pre-processing methods.
Exploratory analyses show possible identifiable characteristics from aggregate
metrics (e.g., the pixel median values), and ways to reduce the number of
features by identifying strong correlations. A model comparison shows that
Light Gradient Boosting Machine, Gradient Boosting Classifier, and Random
Forest algorithms produced the highest accuracy scores, thus showing promise
for deeper scrutiny.
Related papers
- Knee or ROC [0.0]
Self-attention transformers have demonstrated accuracy for image classification with smaller data sets.
We consider calculating accuracy using the knee method to determine threshold values on an ad-hoc basis.
arXiv Detail & Related papers (2024-01-14T23:25:44Z) - High-resolution Image-based Malware Classification using Multiple
Instance Learning [0.0]
This paper proposes a novel method of classifying malware into families using high-resolution greyscale images and multiple instance learning.
The implementation is evaluated on the Microsoft Malware Classification dataset and achieves accuracies of up to $96.6%$ on adversarially enlarged samples.
arXiv Detail & Related papers (2023-11-21T18:11:26Z) - Optimizations of Autoencoders for Analysis and Classification of
Microscopic In Situ Hybridization Images [68.8204255655161]
We propose a deep-learning framework to detect and classify areas of microscopic images with similar levels of gene expression.
The data we analyze requires an unsupervised learning model for which we employ a type of Artificial Neural Network - Deep Learning Autoencoders.
arXiv Detail & Related papers (2023-04-19T13:45:28Z) - Vision-Language Modelling For Radiological Imaging and Reports In The
Low Data Regime [70.04389979779195]
This paper explores training medical vision-language models (VLMs) where the visual and language inputs are embedded into a common space.
We explore several candidate methods to improve low-data performance, including adapting generic pre-trained models to novel image and text domains.
Using text-to-image retrieval as a benchmark, we evaluate the performance of these methods with variable sized training datasets of paired chest X-rays and radiological reports.
arXiv Detail & Related papers (2023-03-30T18:20:00Z) - Zero-shot Model Diagnosis [80.36063332820568]
A common approach to evaluate deep learning models is to build a labeled test set with attributes of interest and assess how well it performs.
This paper argues the case that Zero-shot Model Diagnosis (ZOOM) is possible without the need for a test set nor labeling.
arXiv Detail & Related papers (2023-03-27T17:59:33Z) - Traditional Classification Neural Networks are Good Generators: They are
Competitive with DDPMs and GANs [104.72108627191041]
We show that conventional neural network classifiers can generate high-quality images comparable to state-of-the-art generative models.
We propose a mask-based reconstruction module to make semantic gradients-aware to synthesize plausible images.
We show that our method is also applicable to text-to-image generation by regarding image-text foundation models.
arXiv Detail & Related papers (2022-11-27T11:25:35Z) - Intrusion Detection: Machine Learning Baseline Calculations for Image
Classification [0.0]
Cyber security can be enhanced through application of machine learning.
Most promising candidates for consideration are Light Machine, Random Forest Boost, and Extra Trees.
arXiv Detail & Related papers (2021-11-03T17:49:38Z) - Classification of Viral Pneumonia X-ray Images with the Aucmedi
Framework [0.0]
We use the AUCMEDI-Framework to train a deep neural network to classify chest X-ray images as either normal or viral pneumonia.
Grad-CAM and LIME explainable artificial intelligence (XAI) algorithms are applied to visualize the image features that are most important for the prediction.
arXiv Detail & Related papers (2021-10-03T14:57:43Z) - Ensembling with Deep Generative Views [72.70801582346344]
generative models can synthesize "views" of artificial images that mimic real-world variations, such as changes in color or pose.
Here, we investigate whether such views can be applied to real images to benefit downstream analysis tasks such as image classification.
We use StyleGAN2 as the source of generative augmentations and investigate this setup on classification tasks involving facial attributes, cat faces, and cars.
arXiv Detail & Related papers (2021-04-29T17:58:35Z) - Malware Detection Using Frequency Domain-Based Image Visualization and
Deep Learning [16.224649756613655]
We propose a novel method to detect and visualize malware through image classification.
The executable binaries are represented as grayscale images obtained from the count of N-grams (N=2) of bytes in the Discrete Cosine Transform domain.
A shallow neural network is trained for classification, and its accuracy is compared with deep-network architectures such as ResNet that are trained using transfer learning.
arXiv Detail & Related papers (2021-01-26T06:07:46Z) - Improved Slice-wise Tumour Detection in Brain MRIs by Computing
Dissimilarities between Latent Representations [68.8204255655161]
Anomaly detection for Magnetic Resonance Images (MRIs) can be solved with unsupervised methods.
We have proposed a slice-wise semi-supervised method for tumour detection based on the computation of a dissimilarity function in the latent space of a Variational AutoEncoder.
We show that by training the models on higher resolution images and by improving the quality of the reconstructions, we obtain results which are comparable with different baselines.
arXiv Detail & Related papers (2020-07-24T14:02:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.