dc.contributor.author |
Yadav, Shekhar Kumar |
|
dc.contributor.author |
George, Nithin V. |
|
dc.coverage.spatial |
United States of America |
|
dc.date.accessioned |
2023-12-01T15:31:21Z |
|
dc.date.available |
2023-12-01T15:31:21Z |
|
dc.date.issued |
2024 |
|
dc.identifier.citation |
Yadav, Shekhar Kumar and George, Nithin V., “Joint dereverberation and beamforming with blind estimation of the shape parameter of the desired source prior”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, DOI: 10.1109/TASLP.2023.3335000, vol. 32, pp. 779-793, 2024. |
|
dc.identifier.issn |
2329-9290 |
|
dc.identifier.issn |
2329-9304 |
|
dc.identifier.uri |
https://doi.org/10.1109/TASLP.2023.3335000 |
|
dc.identifier.uri |
https://repository.iitgn.ac.in/handle/123456789/9510 |
|
dc.description.abstract |
Dereverberation and acoustic beamforming is used to capture the speech of a desired speaker in the presence of interfering speakers in a reverberant room using an array of microphones. Traditionally, to perform these two tasks, the desired speech is modelled in the time-frequency domain using a complex Gaussian (CG) prior with time-varying variances. The shape parameter of the prior distribution is fixed at the same value for all time-frequency bins. In this work, we propose to model the inverse of the variance (i.e. the precision parameter) of the CG prior distribution which controls the shape of the distribution as a Gamma distributed random variable. The hyperparameters of the Gamma distribution are then estimated based on the data captured by the microphones. This data-dependent blind estimation of the shape of the prior distribution helps the proposed algorithm to accurately model the desired speech and adapt to different speakers and acoustic scenarios better than algorithms with a fixed shape parameter. We use maximum likelihood techniques to estimate the multi-channel linear prediction (MCLP) dereverberation coefficients and the beamforming weights using the proposed signal model. The stochastically latent precision parameters are obtained by estimating the hyperparameters using the expectation maximization (EM) method. For the online version of the algorithm, a recursive EM method is also proposed for real-time processing. Extensive simulation results show improved dereverberation and interference cancellation performance of the proposed method highlighting the importance of not choosing the shape parameter of the prior distribution manually. |
|
dc.description.statementofresponsibility |
by Shekhar Kumar Yadav and Nithin V. George |
|
dc.format.extent |
vol. 32, pp. 779-793 |
|
dc.language.iso |
en_US |
|
dc.publisher |
Institute of Electrical and Electronics Engineers |
|
dc.subject |
Microphone Array |
|
dc.subject |
Dereverberation |
|
dc.subject |
Acoustic Beamforming |
|
dc.subject |
Student's t-distribution |
|
dc.title |
Joint dereverberation and beamforming with blind estimation of the shape parameter of the desired source prior |
|
dc.type |
Article |
|
dc.relation.journal |
IEEE/ACM Transactions on Audio, Speech, and Language Processing |
|