Audiovisual Database with 360 Video and Higher-Order Ambisonics Audio for Perception, Cognition, Behavior, and QoE Evaluation Research
Thomas RobothamAshutosh SinglaOlli S. RummukainenAlexander RaakeEmanu\"el A. P. Habets
Thomas RobothamAshutosh SinglaOlli S. RummukainenAlexander RaakeEmanu\"el A. P. Habets
Dec 2022
0被引用
1笔记
读论文,拿好礼活动火爆进行中,iPad、蓝牙耳机、拍立得、键盘鼠标套装等你来拿!
摘要原文
Research into multi-modal perception, human cognition, behavior, andattention can benefit from high-fidelity content that may recreatereal-life-like scenes when rendered on head-mounted displays. Moreover, aspectsof audiovisual perception, cognitive processes, and behavior may complementquestionnaire-based Quality of Experience (QoE) evaluation of interactivevirtual environments. Currently, there is a lack of high-quality open-sourceaudiovisual databases that can be used to evaluate such aspects or systemscapable of reproducing high-quality content. With this paper, we provide apublicly available audiovisual database consisting of twelve scenes capturingreal-life nature and urban environments with a video resolution of 7680x3840 at60 frames-per-second and with 4th-order Ambisonics audio. These 360 videosequences, with an average duration of 60 seconds, represent real-life settingsfor systematically evaluating various dimensions of uni-/multi-modalperception, cognition, behavior, and QoE. The paper provides details of thescene requirements, recording approach, and scene descriptions. The databaseprovides high-quality reference material with a balanced focus on auditory andvisual sensory information. The database will be continuously updated withadditional scenes and further metadata such as human ratings and saliencyinformation.