Joint dereverberation and beamforming with blind estimation of the shape parameter of the desired source prior

Show simple item record

dc.contributor.author Yadav, Shekhar Kumar
dc.contributor.author George, Nithin V.
dc.coverage.spatial United States of America
dc.date.accessioned 2023-12-01T15:31:21Z
dc.date.available 2023-12-01T15:31:21Z
dc.date.issued 2024
dc.identifier.citation Yadav, Shekhar Kumar and George, Nithin V., “Joint dereverberation and beamforming with blind estimation of the shape parameter of the desired source prior”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, DOI: 10.1109/TASLP.2023.3335000, vol. 32, pp. 779-793, 2024.
dc.identifier.issn 2329-9290
dc.identifier.issn 2329-9304
dc.identifier.uri https://doi.org/10.1109/TASLP.2023.3335000
dc.identifier.uri https://repository.iitgn.ac.in/handle/123456789/9510
dc.description.abstract Dereverberation and acoustic beamforming is used to capture the speech of a desired speaker in the presence of interfering speakers in a reverberant room using an array of microphones. Traditionally, to perform these two tasks, the desired speech is modelled in the time-frequency domain using a complex Gaussian (CG) prior with time-varying variances. The shape parameter of the prior distribution is fixed at the same value for all time-frequency bins. In this work, we propose to model the inverse of the variance (i.e. the precision parameter) of the CG prior distribution which controls the shape of the distribution as a Gamma distributed random variable. The hyperparameters of the Gamma distribution are then estimated based on the data captured by the microphones. This data-dependent blind estimation of the shape of the prior distribution helps the proposed algorithm to accurately model the desired speech and adapt to different speakers and acoustic scenarios better than algorithms with a fixed shape parameter. We use maximum likelihood techniques to estimate the multi-channel linear prediction (MCLP) dereverberation coefficients and the beamforming weights using the proposed signal model. The stochastically latent precision parameters are obtained by estimating the hyperparameters using the expectation maximization (EM) method. For the online version of the algorithm, a recursive EM method is also proposed for real-time processing. Extensive simulation results show improved dereverberation and interference cancellation performance of the proposed method highlighting the importance of not choosing the shape parameter of the prior distribution manually.
dc.description.statementofresponsibility by Shekhar Kumar Yadav and Nithin V. George
dc.format.extent vol. 32, pp. 779-793
dc.language.iso en_US
dc.publisher Institute of Electrical and Electronics Engineers
dc.subject Microphone Array
dc.subject Dereverberation
dc.subject Acoustic Beamforming
dc.subject Student's t-distribution
dc.title Joint dereverberation and beamforming with blind estimation of the shape parameter of the desired source prior
dc.type Article
dc.relation.journal IEEE/ACM Transactions on Audio, Speech, and Language Processing


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search Digital Repository


Browse

My Account