polimi-ispl / homula-rir

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

HOMULA-RIR: A Room Impulse Response Dataset for Teleconferencing and Spatial Audio Applications Acquired Through Higher-Order Microphones and Uniform Linear Microphone Arrays

HOMULA-RIR is a dataset of room impulse responses (RIRs) acquired using both higher-order microphones (HOMs) and a uniform linear array (ULA), in order to model a remote attendance teleconferencing scenario. Specifically, measurements were performed in a seminar room, where a 64-microphone ULA (Eventide-Polimi eStick) was used as a multichannel audio acquisition system in the proximity of 2 speakers (Genelec 8020), while HOMs (Voyage Audio Spatial Mic) were used to model 25 attendees actually present in the seminar room.


plan room


RIRs have been recorded at a sample rate of fs = 48 kHz and truncated to a duration of 1 s. They are provided as multichannel wav files, saved at 32 bit per sample. Together with RIRs we also release acoustically calibrated positions of each microphone capsule.

Sample code

The repository contains two sample files: homula-rir.py and homula-rir.m, which consist of scripts (in Python and MATLAB respectively) for reading and visualizing both RIRs and microphone or sources positions.

Naming convention

RIRs of individual arrays are saved as separate files, following the naming convention: rir-source-array.wav. Here, source can be either S1 or S2, depending on the considered source, and array is an acronym representing a specific microphone array, as depicted in the previous picture. The term array can take on either ULA for the eSticks measures, or a pair row-HOM for the Spatial Mics measures. Specifically, row = {R1, R2, R3, R4, R5} designates the row where a particular Spatial Mic is positioned, and HOM = {HOM1, HOM2, HOM3, HOM4, HOM5} denotes a specific array within each row. The positions of each capsule in every array are released as csv files, adopting the naming convention pos-array.csv, where array is the same acronym denoting a specific microphone array. Additionally, the positions of the two sources are reported in the file pos-sources.csv.

Authors

Federico Miotello
Paolo Ostan
Mirco Pezzoli
Luca Comanducci
Alberto Bernardini
Fabio Antonacci
Augusto Sarti

About


Languages

Language:Python 56.1%Language:MATLAB 43.9%