This website requires JavaScript.
DOI: 10.1109/QoMEX55416.2022.9900893

Audiovisual Database with 360 Video and Higher-Order Ambisonics Audio for Perception, Cognition, Behavior, and QoE Evaluation Research

Thomas RobothamAshutosh SinglaOlli S. RummukainenAlexander RaakeEmanu\"el A. P. Habets
Dec 2022
摘要
Research into multi-modal perception, human cognition, behavior, andattention can benefit from high-fidelity content that may recreatereal-life-like scenes when rendered on head-mounted displays. Moreover, aspectsof audiovisual perception, cognitive processes, and behavior may complementquestionnaire-based Quality of Experience (QoE) evaluation of interactivevirtual environments. Currently, there is a lack of high-quality open-sourceaudiovisual databases that can be used to evaluate such aspects or systemscapable of reproducing high-quality content. With this paper, we provide apublicly available audiovisual database consisting of twelve scenes capturingreal-life nature and urban environments with a video resolution of 7680x3840 at60 frames-per-second and with 4th-order Ambisonics audio. These 360 videosequences, with an average duration of 60 seconds, represent real-life settingsfor systematically evaluating various dimensions of uni-/multi-modalperception, cognition, behavior, and QoE. The paper provides details of thescene requirements, recording approach, and scene descriptions. The databaseprovides high-quality reference material with a balanced focus on auditory andvisual sensory information. The database will be continuously updated withadditional scenes and further metadata such as human ratings and saliencyinformation.
展开全部
图表提取

暂无人提供速读十问回答

论文十问由沈向洋博士提出,鼓励大家带着这十个问题去阅读论文,用有用的信息构建认知模型。写出自己的十问回答,还有机会在当前页面展示哦。

Q1论文试图解决什么问题?
Q2这是否是一个新的问题?
Q3这篇文章要验证一个什么科学假设?
0
被引用
笔记
问答