Multimodal research and applications are becoming more commonplace as Virtual Reality (VR) technology integrates different sensory feedback, enabling the recreation of real spaces in an audio-visual context. Within VR experiences, numerous applications rely on the user's voice as a key element of interaction, including music performances and public speaking applications. Self-perception of our voice plays a crucial role in vocal production. When singing or speaking, our voice interacts with the acoustic properties of the environment, shaping the adjustment of vocal parameters in response to the perceived characteristics of the space.

This technical report presents a real-time auralization pipeline that leverages three-dimensional Spatial Impulse Responses (SIRs) for multimodal research applications in VR requiring first-person vocal interaction. It describes the impulse response creation and rendering workflow and the audio-visual integration, and addresses latency and computational considerations. The system enables users to explore acoustic spaces from various positions and orientations within a predefined area, supporting three and five Degrees of Freedom (3DoF and 5DoF) in audio-visual multimodal perception for both research and creative applications in VR.

The design of this pipeline arises from the limitations of existing audio tools and spatializers, particularly regarding signal latency, and from the lack of SIRs captured from a first-person perspective and in multiple adjacent distributions to enable translational rendering. By addressing these gaps, the system enables real-time auralization of self-generated vocal feedback.
Authors: Vargas, Mauricio Flores; Bates, Vargas Enda; McDonnell, Rachel
Affiliation:
(See document for exact affiliation information.)
AES Convention: 158
Paper Number: 323
Publication Date: 2025-05-12
Permalink: https://aes2.org/publications/elibrary-page/?id=22874
Vargas, Mauricio Flores; Bates, Vargas Enda; McDonnell, Rachel; 2025; Real-Time Auralization Pipeline for First-Person Vocal Interaction in Audio-Visual Virtual Environments [PDF]; Paper 323; Available from: https://aes2.org/publications/elibrary-page/?id=22874
Vargas, Mauricio Flores; Bates, Vargas Enda; McDonnell, Rachel; Real-Time Auralization Pipeline for First-Person Vocal Interaction in Audio-Visual Virtual Environments [PDF]; Paper 323; 2025; Available: https://aes2.org/publications/elibrary-page/?id=22874
@article{vargas2025real-time,
  author={Vargas, Mauricio Flores and Bates, Vargas Enda and McDonnell, Rachel},
  journal={Journal of the Audio Engineering Society},
  title={Real-Time Auralization Pipeline for First-Person Vocal Interaction in Audio-Visual Virtual Environments},
  year={2025},
  number={323},
  month={may},
}