Sinusoidal time-based features and human error metrics: Advancing software defect prediction in safety-critical systems
Abstract
Defect detection in safety-critical software remains difficult despite advanced tools and mature quality assurance, largely due to the human origins of many errors. Building on prior work introducing human error–driven metrics that outperform traditional code measures, this study enhances predictive accuracy by prioritizing higher recall to strengthen defect triage in environments where missing defects carries severe risk and moderate false positives are acceptable. We integrate temporal cyclicality into defect prediction by transforming code commit timestamps into sine features via a parameterized sinusoidal model, optimized with a genetic algorithm to capture daily and periodic developer activity patterns. These features preserve non-linear, cyclical relationships linked to defect introduction, allowing machine learning models to exploit latent human-behavioural signals. Evaluation across three open-source safety-critical systems shows average recall gains of 48.68% over code metrics baselines and 9.27% over previously defined human error metrics. Embedding periodic human activity patterns alongside human-error features significantly improves defect prediction. The approach is interpretable, and generalizable, offering a pathway for broader application and future integration with adaptive, human-centric software quality models.
Keywords: Computer bugs, Human factors, Software defect prediction, Software development, Software quality, Software testing
DOI: 10.54941/ahfe1007069
Cite this paper
More from this volume
- Artificial Intelligence Maturity Model (AIMM)
- An Experimental Study on Consensus Building with an AI Chatbot Across Two Topics
- An Agent-Based Simulation Framework for ADHD: Modeling Attention Regulation and Adaptive Therapeutic Interventions
- CRMSON: Co-Designing Adaptive and Ethical AI Systems to Address Mental Health Barriers in Aviation
- Usability Evaluation of FAIR Data Planning in the Data Stewardship Wizard
- Seeing the Invisible Load: XR + Multimodal Sensing for Cognitive Ergonomics in Industrial Training
- Conceptual Framework for Designing Domain-Specific LLM-Based Information Systems
- Shaping Conversations: Custom GPTs to Spark Reflection in Design
- Privacy at the Core: Toward Automated Detection of Privacy-Sensitive Content in an LLM-Based Care Documentation Support System
- Dynamic Difficulty Adjustment via Dynamic Scripting: An Empirical Study of Player Flow in a Brawler Game
- Designing an Experimental Method for Evaluating Divergent Thinking with a Color Queue under Time Constraints
- Designing Experiments to Explore Optimal Timing for Refreshing Breaks During Cognitive Tasks Using Time-Series Changes


AHFE Open Access