Malware Classification using a Hybrid Hidden Markov Model-Convolutional Neural Network
- URL: http://arxiv.org/abs/2412.18932v1
- Date: Wed, 25 Dec 2024 15:34:57 GMT
- Title: Malware Classification using a Hybrid Hidden Markov Model-Convolutional Neural Network
- Authors: Ritik Mehta, Olha Jureckova, Mark Stamp,
- Abstract summary: We present a novel approach based on a hybrid architecture combining features extracted using a Hidden Markov Model (HMM) with a Convolutional Neural Network (CNN)
We demonstrate the effectiveness of our approach on the popular Malicia dataset, and we obtain superior performance, as compared to other machine learning methods.
- Score: 1.2289361708127875
- License:
- Abstract: The proliferation of malware variants poses a significant challenges to traditional malware detection approaches, such as signature-based methods, necessitating the development of advanced machine learning techniques. In this research, we present a novel approach based on a hybrid architecture combining features extracted using a Hidden Markov Model (HMM), with a Convolutional Neural Network (CNN) then used for malware classification. Inspired by the strong results in previous work using an HMM-Random Forest model, we propose integrating HMMs, which serve to capture sequential patterns in opcode sequences, with CNNs, which are adept at extracting hierarchical features. We demonstrate the effectiveness of our approach on the popular Malicia dataset, and we obtain superior performance, as compared to other machine learning methods -- our results surpass the aforementioned HMM-Random Forest model. Our findings underscore the potential of hybrid HMM-CNN architectures in bolstering malware classification capabilities, offering several promising avenues for further research in the field of cybersecurity.
Related papers
- AI-Aided Kalman Filters [65.35350122917914]
The Kalman filter (KF) and its variants are among the most celebrated algorithms in signal processing.
Recent developments illustrate the possibility of fusing deep neural networks (DNNs) with classic Kalman-type filtering.
This article provides a tutorial-style overview of design approaches for incorporating AI in aiding KF-type algorithms.
arXiv Detail & Related papers (2024-10-16T06:47:53Z) - Heterogenous Memory Augmented Neural Networks [84.29338268789684]
We introduce a novel heterogeneous memory augmentation approach for neural networks.
By introducing learnable memory tokens with attention mechanism, we can effectively boost performance without huge computational overhead.
We show our approach on various image and graph-based tasks under both in-distribution (ID) and out-of-distribution (OOD) conditions.
arXiv Detail & Related papers (2023-10-17T01:05:28Z) - Toward Transparent Sequence Models with Model-Based Tree Markov Model [0.46873264197900916]
We introduce the Model-Based tree Hidden Semi-Markov Model (MOB-HSMM), an inherently interpretable model aimed at detecting high mortality risk events.
This model leverages knowledge distilled from Deep Neural Networks (DNN) to enhance predictive performance while offering clear explanations.
arXiv Detail & Related papers (2023-07-28T07:34:44Z) - A Natural Language Processing Approach to Malware Classification [2.707154152696381]
In this research, we consider a hybrid architecture, where Hidden Markov Models (HMM) are trained on opcode sequences.
extracting the HMM hidden state sequences can be viewed as a form of feature engineering.
We find that this NLP-based approach outperforms other popular techniques on a challenging malware dataset.
arXiv Detail & Related papers (2023-07-07T23:16:23Z) - Batch-Ensemble Stochastic Neural Networks for Out-of-Distribution
Detection [55.028065567756066]
Out-of-distribution (OOD) detection has recently received much attention from the machine learning community due to its importance in deploying machine learning models in real-world applications.
In this paper we propose an uncertainty quantification approach by modelling the distribution of features.
We incorporate an efficient ensemble mechanism, namely batch-ensemble, to construct the batch-ensemble neural networks (BE-SNNs) and overcome the feature collapse problem.
We show that BE-SNNs yield superior performance on several OOD benchmarks, such as the Two-Moons dataset, the FashionMNIST vs MNIST dataset, FashionM
arXiv Detail & Related papers (2022-06-26T16:00:22Z) - Sparse Flows: Pruning Continuous-depth Models [107.98191032466544]
We show that pruning improves generalization for neural ODEs in generative modeling.
We also show that pruning finds minimal and efficient neural ODE representations with up to 98% less parameters compared to the original network, without loss of accuracy.
arXiv Detail & Related papers (2021-06-24T01:40:17Z) - Gone Fishing: Neural Active Learning with Fisher Embeddings [55.08537975896764]
There is an increasing need for active learning algorithms that are compatible with deep neural networks.
This article introduces BAIT, a practical representation of tractable, and high-performing active learning algorithm for neural networks.
arXiv Detail & Related papers (2021-06-17T17:26:31Z) - Robust Classification using Hidden Markov Models and Mixtures of
Normalizing Flows [25.543231171094384]
We use a generative model that combines the state transitions of a hidden Markov model (HMM) and the neural network based probability distributions for the hidden states of the HMM.
We verify the improved robustness of NMM-HMM classifiers in an application to speech recognition.
arXiv Detail & Related papers (2021-02-15T00:40:30Z) - A novel DL approach to PE malware detection: exploring Glove
vectorization, MCC_RCNN and feature fusion [0.0]
We propose the DL-based approaches for detection and use static-based features fed up into models.
We implement a neural network model called MCC_RCNN, comprising of the combination with CNN and RNN.
Our proposed classification methods can obtain a higher prediction accuracy than the other baseline methods.
arXiv Detail & Related papers (2021-01-22T07:08:10Z) - Scaling Hidden Markov Language Models [118.55908381553056]
This work revisits the challenge of scaling HMMs to language modeling datasets.
We propose methods for scaling HMMs to massive state spaces while maintaining efficient exact inference, a compact parameterization, and effective regularization.
arXiv Detail & Related papers (2020-11-09T18:51:55Z) - Rare-Event Simulation for Neural Network and Random Forest Predictors [16.701364984106092]
We study rare-event simulation for a class of problems where the target hitting sets of interest are defined via modern machine learning tools.
This problem is motivated from fast emerging studies on the safety evaluation of intelligent systems.
arXiv Detail & Related papers (2020-10-10T03:27:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.