Log-based Anomaly Detection Without Log Parsing
- URL: http://arxiv.org/abs/2108.01955v1
- Date: Wed, 4 Aug 2021 10:42:13 GMT
- Title: Log-based Anomaly Detection Without Log Parsing
- Authors: Van-Hoang Le and Hongyu Zhang
- Abstract summary: We propose NeuralLog, a novel log-based anomaly detection approach that does not require log parsing.
Our experimental results show that the proposed approach can effectively understand the semantic meaning of log messages.
Overall, NeuralLog achieves F1-scores greater than 0.95 on four public datasets, outperforming the existing approaches.
- Score: 7.66638994053231
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Software systems often record important runtime information in system logs
for troubleshooting purposes. There have been many studies that use log data to
construct machine learning models for detecting system anomalies. Through our
empirical study, we find that existing log-based anomaly detection approaches
are significantly affected by log parsing errors that are introduced by 1) OOV
(out-of-vocabulary) words, and 2) semantic misunderstandings. The log parsing
errors could cause the loss of important information for anomaly detection. To
address the limitations of existing methods, we propose NeuralLog, a novel
log-based anomaly detection approach that does not require log parsing.
NeuralLog extracts the semantic meaning of raw log messages and represents them
as semantic vectors. These representation vectors are then used to detect
anomalies through a Transformer-based classification model, which can capture
the contextual information from log sequences. Our experimental results show
that the proposed approach can effectively understand the semantic meaning of
log messages and achieve accurate anomaly detection results. Overall, NeuralLog
achieves F1-scores greater than 0.95 on four public datasets, outperforming the
existing approaches.
Related papers
- Log2graphs: An Unsupervised Framework for Log Anomaly Detection with Efficient Feature Extraction [1.474723404975345]
High cost of manual annotation and dynamic nature of usage scenarios present major challenges to effective log analysis.
This study proposes a novel log feature extraction model called DualGCN-LogAE, designed to adapt to various scenarios.
We also introduce Log2graphs, an unsupervised log anomaly detection method based on the feature extractor.
arXiv Detail & Related papers (2024-09-18T11:35:58Z) - LogELECTRA: Self-supervised Anomaly Detection for Unstructured Logs [0.0]
The goal of log-based anomaly detection is to automatically detect system anomalies by analyzing the large number of logs generated in a short period of time.
Previous studies have used a log to extract templates from unstructured log data and detect anomalies on the basis of patterns of the template occurrences.
We propose LogELECTRA, a new log anomaly detection model that analyzes a single line of log messages more deeply on the basis of self-supervised anomaly detection.
arXiv Detail & Related papers (2024-02-16T01:47:02Z) - LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection [73.69399219776315]
We propose a unified Transformer-based framework for Log anomaly detection (LogFormer) to improve the generalization ability across different domains.
Specifically, our model is first pre-trained on the source domain to obtain shared semantic knowledge of log data.
Then, we transfer such knowledge to the target domain via shared parameters.
arXiv Detail & Related papers (2024-01-09T12:55:21Z) - GLAD: Content-aware Dynamic Graphs For Log Anomaly Detection [49.9884374409624]
GLAD is a Graph-based Log Anomaly Detection framework designed to detect anomalies in system logs.
We introduce GLAD, a Graph-based Log Anomaly Detection framework designed to detect anomalies in system logs.
arXiv Detail & Related papers (2023-09-12T04:21:30Z) - Impact of Log Parsing on Deep Learning-Based Anomaly Detection [4.0719622481627376]
We show that there is no strong correlation between log parsing accuracy and anomaly detection accuracy.
We experimentally confirm existing theoretical results showing that it is a property that we refer to as distinguishability in log parsing results.
arXiv Detail & Related papers (2023-05-25T09:53:02Z) - PULL: Reactive Log Anomaly Detection Based On Iterative PU Learning [58.85063149619348]
We propose PULL, an iterative log analysis method for reactive anomaly detection based on estimated failure time windows.
Our evaluation shows that PULL consistently outperforms ten benchmark baselines across three different datasets.
arXiv Detail & Related papers (2023-01-25T16:34:43Z) - LogGD:Detecting Anomalies from System Logs by Graph Neural Networks [14.813971618949068]
We propose a novel graph-based log anomaly detection method, LogGD, to effectively address the issue.
We exploit the powerful capability of Graph Transformer Neural Network, which combines graph structure and node semantics for log-based anomaly detection.
arXiv Detail & Related papers (2022-09-16T11:51:58Z) - Robust and Transferable Anomaly Detection in Log Data using Pre-Trained
Language Models [59.04636530383049]
Anomalies or failures in large computer systems, such as the cloud, have an impact on a large number of users.
We propose a framework for anomaly detection in log data, as a major troubleshooting source of system information.
arXiv Detail & Related papers (2021-02-23T09:17:05Z) - Self-Attentive Classification-Based Anomaly Detection in Unstructured
Logs [59.04636530383049]
We propose Logsy, a classification-based method to learn log representations.
We show an average improvement of 0.25 in the F1 score, compared to the previous methods.
arXiv Detail & Related papers (2020-08-21T07:26:55Z) - Self-Supervised Log Parsing [59.04636530383049]
Large-scale software systems generate massive volumes of semi-structured log records.
Existing approaches rely on log-specifics or manual rule extraction.
We propose NuLog that utilizes a self-supervised learning model and formulates the parsing task as masked language modeling.
arXiv Detail & Related papers (2020-03-17T19:25:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.