Interactive Driving Turing Test with Think-Aloud Protocol: How Realistic Are Behavioural Driver Models?
Abstract
Driver models are essential for virtual safety assessments of automated vehicles. This study evaluates the realism of the i4Driving model, a behavioural driver model designed to emulate human-like driving, using an interactive driving Turing test combined with a think-aloud protocol in connected driving simulators. Thirty participants interacted with either a human-controlled vehicle or the i4Driving model across three motorway scenarios and rated realism, predictability, safety, and aggressiveness. Results showed that the i4Driving model was perceived as less realistic and predictable than human drivers (p < 0.001), yet participants could not reliably distinguish between the two (classification accuracy = 0.673). Think-aloud analysis revealed specific shortcomings, such as unrealistic merging and speed adaptation, alongside instances of naturalistic behaviour. These findings highlight the need for improvements in tactical decision-making and demonstrate the value of combining subjective ratings with qualitative insights for refining driver models.
Keywords: driving Turing test, driving behaviour, human-likeness, subjective rating, connected driving simulators, driver model
DOI: 10.54941/ahfe1007130
Cite this paper
More from this volume
- Artificial Intelligence Maturity Model (AIMM)
- An Experimental Study on Consensus Building with an AI Chatbot Across Two Topics
- An Agent-Based Simulation Framework for ADHD: Modeling Attention Regulation and Adaptive Therapeutic Interventions
- CRMSON: Co-Designing Adaptive and Ethical AI Systems to Address Mental Health Barriers in Aviation
- Usability Evaluation of FAIR Data Planning in the Data Stewardship Wizard
- Seeing the Invisible Load: XR + Multimodal Sensing for Cognitive Ergonomics in Industrial Training
- Conceptual Framework for Designing Domain-Specific LLM-Based Information Systems
- Shaping Conversations: Custom GPTs to Spark Reflection in Design
- Privacy at the Core: Toward Automated Detection of Privacy-Sensitive Content in an LLM-Based Care Documentation Support System
- Dynamic Difficulty Adjustment via Dynamic Scripting: An Empirical Study of Player Flow in a Brawler Game
- Sinusoidal time-based features and human error metrics: Advancing software defect prediction in safety-critical systems
- Designing an Experimental Method for Evaluating Divergent Thinking with a Color Queue under Time Constraints


AHFE Open Access