dc.contributor.author |
Kumari, Seema |
|
dc.contributor.author |
Mishra, Utkarsh |
|
dc.contributor.author |
Mandal, Srimanta |
|
dc.contributor.author |
Raman, Shanmuganathan |
|
dc.coverage.spatial |
United States of America |
|
dc.date.accessioned |
2025-08-29T13:22:37Z |
|
dc.date.available |
2025-08-29T13:22:37Z |
|
dc.date.issued |
2025-09-14 |
|
dc.identifier.citation |
Kumari, Seema; Mishra, Utkarsh; Mandal, Srimanta and Raman, Shanmuganathan, "Transformer augmented multi-resolution hash encoding in diffusion model for 3D point cloud denoising", in the IEEE International Conference on Image Processing (ICIP 2025), Anchorage, US, Sep. 14-17, 2025. |
|
dc.identifier.uri |
https://doi.org/10.1109/ICIP55913.2025.11084368 |
|
dc.identifier.uri |
https://repository.iitgn.ac.in/handle/123456789/11824 |
|
dc.description.abstract |
Denoising 3D point cloud strives to remove noise from noisy data. Existing methods address the problem by estimating point-wise displacement from the point feature or by learning the distribution of noise. In this paper, we propose to embed the point cloud through a novel multi-resolution hash encoding, and utilize the embedding to learn an optimum transport plan between noisy and corresponding clean point cloud via transformer encoder and a shared-MLP based decoder. The multi-resolution hash encoding uses hierarchical hash-based representations to efficiently capture geometric details at multiple resolutions. Hence, it enables removal of noise while encoding global-to-local structural details. The transformer encoder further improves the model’s ability to learn long-range dependencies and contextual relationships, facilitating improved denoising performance. The optimum transport plan is devised by simulating a denoising diffusion probabilistic model through Schrödinger bridge problem. The proposed method advances state-of-the-art methods through extensive experiments and offers new insights into the synergy between hash encoding, and transformer architectures in the diffusion framework. |
|
dc.description.statementofresponsibility |
by Seema Kumari, Utkarsh Mishra, Srimanta Mandal and Shanmuganathan Raman |
|
dc.language.iso |
en_US |
|
dc.publisher |
Institute of Electrical and Electronics Engineers |
|
dc.subject |
Denoising |
|
dc.subject |
Point cloud |
|
dc.subject |
Multi-resolution |
|
dc.subject |
Hash encoding |
|
dc.subject |
Diffusion model |
|
dc.subject |
Ttransformer |
|
dc.title |
Transformer augmented multi-resolution hash encoding in diffusion model for 3D point cloud denoising |
|
dc.type |
Conference Paper |
|
dc.relation.journal |
IEEE International Conference on Image Processing (ICIP 2025) |
|