This is the accompanying repository for the article Tweaking autoregressive methods for inpainting of gaps in audio signals authored by Ondřej Mokrý and Pavel Rajmic, presented at EUSIPCO 2025.
A novel variant of the Janssen method for audio inpainting is presented and compared to other popular audio inpainting methods based on autoregressive (AR) modeling. Both conceptual differences and practical implications are discussed. The experiments demonstrate the importance of the choice of the AR model estimator, window/context length, and model order. The results show the superiority of the proposed gap-wise Janssen approach using objective metrics, which is confirmed by a listening test.
The preprint is available at arXiv, the official version is published at IEEE Xplore.
The repository includes the MATLAB source codes needed to reproduce the research:
- plotting – Matlab scripts that load the results and replicate the figures used in the paper + some more.
- references – Implementation of reference methods, namely SPAIN and SPAIN-MOD, with the help of InpaintingRevisited and Dictionary learning for sparse audio inpainting.
- results –
.matfiles with all the SDR and ODG values from testing the methods. Recovered signals are not included due to file size limits. However, these can be reproduced using themaintest.mandmaintest_spain.mscripts. - utils – All the functions needed to run the main files, except for the codes for the Psychoacoustically motivated evaluation.
gaps_table.mat– Source signals and masks, taken from TestSignals repository.maintest.m– Main code running the test of the AR-based methods.maintest_spain.m– Main code running the test of the SPAIN variants.
Regarding the mid-scale experiment using the IRMAS dataset, the folder irmas includes a list of the files used in our experiment and a Matlab script which crops the files to a length of 7 seconds. The original files can be downloaded here.
For supplementary material (graphs, audio), see the accompanying website.
Note that the codes to compute the psychoacoustically motivated metrics (PEMO-Q, PEAQ) are not provided.
The PEAQ package can be acquired from TSP Lab of McGill University, the PEMO-Q software is available through University of Oldenburg. Because the processing is very time- and space-demanding, the provided .mat files with the results include all the data precomputed.
The experiments were run in Matlab R2023a using Signal Processing Toolbox (Version 9.2), Parallel Computing Toolbox (Version 7.8) and Large Time-Frequency Analysis Toolbox (Version 2.4.0).
For running SPAIN with dictionary learning as a reference method, the CVX toolbox is necessary.