Investigation of Weaknesses in Typically Anomaly Detection Methods for Software Development
Abstract
Software systems are rapidly increasing and diversifying due to technological innovations such as IoT, artificial intelligence, and blockchain. Accordingly, automatic analysis of software logs has recently attracted particular attention as a research area to ensure system reliability. Currently, in the research domain, anomaly detection in text logs using CNN, LSTM, and Transformer-based DNN models has shown high accuracy of over 90%. However, contrary to these excellent results, there are reports that it has not been used in the field of the software development field. We predict that the reason for this lies in the way the models are evaluated and, in the datasets, so we investigate using a representative anomaly detection model and the common dataset BGL. First, we investigate the effect of the splitting ratio of the dataset. As a result, we confirm that the accuracy decreases as the number of unknown anomaly logs increases. As a result, we identify features that are over-learned in all supervised learning models. In addition, we validate the generality of the model with the validation datasets and learning curves. The results show signs of overfitting in both supervised and unsupervised learning models. These results suggest that the composition of the dataset used affects the accuracy of the log-text anomaly detection model. Therefore, we plan to create a dataset with multiple anomaly patterns based on the logs used in the software development domain and create a model that can detect anomalies with the created dataset.
Keywords: Anomaly Detection, Software Log, Log Analysis, Deep Learning
DOI: 10.54941/ahfe1004034
Cite this paper
More from this volume
- Human Risk-Informed Design Framework (HURID) for integrating human factors in the design of systems and operations
- On the development of an ergonomic approach for the design of an industrial robotic coworker
- AI Technology, Holocaust Survivors, and Human Interactions at Holocaust Museums
- Software Usability for Different Age Groups
- Post-Pandemic Impact Analysis for airport processes from security to boarding – How to respond to the next pandemic
- Human-centric decision for the Integrated Planning of Smart Port Systems
- Design and Evaluation of A Wearable Adaptable Setup System for Occupational Exoskeletons
- User Centered Design of a Digital Platform for Therapeutic Education and Respiratory Rehabilitation in Patients with Post-COVID-19
- Towards Smart Building: Visualization of Indoor CO2 Concentration. Adapting Modern Computational Tools for Informing Design Building Decisions
- The impact of automation frameworks on today's data science competencies
- Conceptual modeling for Human Systems Integration in Manned-Unmanned Teaming
- Sensing Intra-clothing Climate to Increase Comfort According to time, place, and occasion


AHFE Open Access