dc.contributor.author |
Verma, Shashikant |
|
dc.contributor.author |
Sebe, Nicu |
|
dc.contributor.author |
Raman, Shanmuganathan |
|
dc.coverage.spatial |
United States of America |
|
dc.date.accessioned |
2025-08-29T13:22:37Z |
|
dc.date.available |
2025-08-29T13:22:37Z |
|
dc.date.issued |
2025-09-14 |
|
dc.identifier.citation |
Verma, Shashikant; Sebe, Nicu and Raman, Shanmuganathan, "GMOT-Mamba: Mamba-based model prediction for generic multiple object tracking", in the IEEE International Conference on Image Processing (ICIP 2025), Anchorage, US, Sep. 14-17, 2025. |
|
dc.identifier.uri |
https://doi.org/10.1109/ICIP55913.2025.11084714 |
|
dc.identifier.uri |
https://repository.iitgn.ac.in/handle/123456789/11825 |
|
dc.description.abstract |
We introduce GMOT-Mamba, a novel Mamba-based model prediction framework for Generic Multiple Object Tracking (GMOT) in video sequences. Our approach features a Weighted Feature Pooling (WFP) layer, which processes encoded target states, and an innovative encoder-decoder architecture that leverages Vision-Mamba (ViM) to predict filter weights. We train our model on combinations of large-scale datasets to capture strong priors and discriminative features necessary for generic object tracking. Through extensive experiments and ablation studies, we demonstrate the effectiveness of our approach, showcasing its competitive performance against state-of-the-art GMOT methods while outperforming SOT methods in both accuracy and inference speed. Our findings underscore the potential of Mamba for enhancing model prediction in visual tracking applications. |
|
dc.description.statementofresponsibility |
by Shashikant Verma, Nicu Sebe and Shanmuganathan Raman |
|
dc.language.iso |
en_US |
|
dc.publisher |
Institute of Electrical and Electronics Engineers |
|
dc.subject |
Generic object tracking |
|
dc.subject |
Vision Mamba |
|
dc.subject |
State space models |
|
dc.subject |
Multiple object tracking |
|
dc.title |
GMOT-Mamba: Mamba-based model prediction for generic multiple object tracking |
|
dc.type |
Conference Paper |
|
dc.relation.journal |
IEEE International Conference on Image Processing (ICIP 2025) |
|