Detecting Anomaly In Big Data System Logs Using Convolutional Neural Network
Keywords
Anomaly detection; Big Data; CNN; Log analysis
Abstract
Nowadays, big data systems are being widely adopted by many domains for offering effective data solutions, such as manufacturing, healthcare, education, and media. Big data systems produce tons of unstructured logs that contain buried valuable information. However, it is a daunting task to manually unearth the information and detect system anomalies. A few automatic methods have been developed, where the cutting-edge machine learning technique is one of the most promising ways. In this paper, we propose a novel approach for anomaly detection from big data system logs by leveraging Convolutional Neural Networks (CNN). Different from other existing statistical methods or traditional rule-based machine learning approaches, our CNN-based model can automatically learn event relationships in system logs and detect anomaly with high accuracy. Our deep neural network consists of logkey2vec embeddings, three 1D convolutional layers, dropout layer, and max-pooling. According to our experiment, our CNN-based approach has better accuracy(reaches to 99%) compared to other approaches using Long Short term memory (LSTM) and Multilayer Perceptron (MLP) on detecting anomaly in Hadoop Distributed File System (HDFS) logs.
Publication Date
10-26-2018
Publication Title
Proceedings - IEEE 16th International Conference on Dependable, Autonomic and Secure Computing, IEEE 16th International Conference on Pervasive Intelligence and Computing, IEEE 4th International Conference on Big Data Intelligence and Computing and IEEE 3rd Cyber Science and Technology Congress, DASC-PICom-DataCom-CyberSciTec 2018
Number of Pages
159-165
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
DOI Link
https://doi.org/10.1109/DASC/PiCom/DataCom/CyberSciTec.2018.00037
Copyright Status
Unknown
Socpus ID
85056867030 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/85056867030
STARS Citation
Lu, Siyang; Wei, Xiang; Li, Yandong; and Wang, Liqiang, "Detecting Anomaly In Big Data System Logs Using Convolutional Neural Network" (2018). Scopus Export 2015-2019. 8953.
https://stars.library.ucf.edu/scopus2015/8953