On Immersivity of Transmitted Spatial Sounds for Human-Machine Interaction
Abstract
Spatial sound perception is a natural way of perceiving sound (not only) by humans and serves for spatial orientation, escape from imminent danger, or shifting attention toward an object of interest. In the modern world, spatial sound, transmitted e.g. via telecommunications networks, is increasingly used to convey an additional dimension of information to the recipient (e.g. an air traffic controller can virtually hear a pilot from the direction where his aircraft is located, etc.). Given the expanding field of AI for generating spatial information, it is necessary to investigate the influence of basic technical parameters on the subjective experience of the recipient, represented, for example, by the immersiveness of communication, the intelligibility of transmitted speech spatial mix, or the spatial resolution of multiple sound stimuli. When using AI to generate spatial sound, the influence of (computational) delay on the quality of the subjective experience is crucial (the recipient's head movement is tracked using head-tracker embedded in the headphones, and the audio channels information is rendered in real time to compensate for any head movements of the subject, so that the sound appears to come from a fixed location regardless of the head's orientation). Any delay in this computational loop can compromise the subjective perception or quality of communication.The paper will present the results of extensive subjective tests that clarify the relationship between (computational) delay and the quality of subjective perception in various tested situations. The results can be directly used to specify the minimum technical requirements for design of AI-based audio-immersive communication and control systems.
Keywords: Audio man-machine communication, Subjective testing, Head-tracking, Immersiveness, Quality of experience
DOI: 10.54941/ahfe1007156
Cite this paper
More from this volume
- Teaching Multimodal Interaction in Cars to First-time Users
- Human-Centered optimization through Digital Twins, and Motion Capture Technologies of a manual activity in the logistics sector
- Exploring Empathy for Emotion-Aware Vehicles: How Should a Car Respond?
- Enhancing Usability in Crisis Management Training: Evaluation of the Virtual Reality-Based Situational Awareness Table
- Formal Verification for Human-Centred Trust in AI: A Critical Examination of Current Paradigms
- Designing Inclusive Mobile Government Services in the Middle East: A User Experience–Centered Framework
- Capturing Food Culture for Adaptive AI: Generative Insights from a Multimodal Profiling Study
- A methodical approach to AI-supported human learning in complex task environments
- Glossary as a Compass: Domain Knowledge Artifacts in Human-Centered AI Development
- Fiscolab: Co-Creation, Artificial Intelligence, and User-Centered Design in the Development of Educational Fiscal Solutions
- Thinking With AI: Human–AI Interaction and Critical Thinking in Scenario-Based Learning
- Traffic Evacuation Simulation and Development of Contingency Plan for a Flood-Prone Community in Puerto Rico


AHFE Open Access