Joint dereverberation and beamforming with blind estimation of the shape parameter of the desired source prior

DR Home
→
Electrical Engineering
→
Journal Articles
→
View Item

dc.contributor.author	Yadav, Shekhar Kumar
dc.contributor.author	George, Nithin V.
dc.coverage.spatial	United States of America
dc.date.accessioned	2023-12-01T15:31:21Z
dc.date.available	2023-12-01T15:31:21Z
dc.date.issued	2024
dc.identifier.citation	Yadav, Shekhar Kumar and George, Nithin V., “Joint dereverberation and beamforming with blind estimation of the shape parameter of the desired source prior”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, DOI: 10.1109/TASLP.2023.3335000, vol. 32, pp. 779-793, 2024.
dc.identifier.issn	2329-9290
dc.identifier.issn	2329-9304
dc.identifier.uri	https://doi.org/10.1109/TASLP.2023.3335000
dc.identifier.uri	https://repository.iitgn.ac.in/handle/123456789/9510
dc.description.abstract	Dereverberation and acoustic beamforming is used to capture the speech of a desired speaker in the presence of interfering speakers in a reverberant room using an array of microphones. Traditionally, to perform these two tasks, the desired speech is modelled in the time-frequency domain using a complex Gaussian (CG) prior with time-varying variances. The shape parameter of the prior distribution is fixed at the same value for all time-frequency bins. In this work, we propose to model the inverse of the variance (i.e. the precision parameter) of the CG prior distribution which controls the shape of the distribution as a Gamma distributed random variable. The hyperparameters of the Gamma distribution are then estimated based on the data captured by the microphones. This data-dependent blind estimation of the shape of the prior distribution helps the proposed algorithm to accurately model the desired speech and adapt to different speakers and acoustic scenarios better than algorithms with a fixed shape parameter. We use maximum likelihood techniques to estimate the multi-channel linear prediction (MCLP) dereverberation coefficients and the beamforming weights using the proposed signal model. The stochastically latent precision parameters are obtained by estimating the hyperparameters using the expectation maximization (EM) method. For the online version of the algorithm, a recursive EM method is also proposed for real-time processing. Extensive simulation results show improved dereverberation and interference cancellation performance of the proposed method highlighting the importance of not choosing the shape parameter of the prior distribution manually.
dc.description.statementofresponsibility	by Shekhar Kumar Yadav and Nithin V. George
dc.format.extent	vol. 32, pp. 779-793
dc.language.iso	en_US
dc.publisher	Institute of Electrical and Electronics Engineers
dc.subject	Microphone Array
dc.subject	Dereverberation
dc.subject	Acoustic Beamforming
dc.subject	Student's t-distribution
dc.title	Joint dereverberation and beamforming with blind estimation of the shape parameter of the desired source prior
dc.type	Article
dc.relation.journal	IEEE/ACM Transactions on Audio, Speech, and Language Processing

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Journal Articles [411]

Show simple item record

Search Digital Repository

Browse

All of DSpace
This Collection
- Titles
- Authors
- By Advisor
- By Issue Date
- Subjects
- By Type
- By Degree
- By Department

Joint dereverberation and beamforming with blind estimation of the shape parameter of the desired source prior

Files in this item

This item appears in the following Collection(s)

Search Digital Repository

Browse

All of DSpace

This Collection

My Account