VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models

Authors: Haidong Xu, Guangwei Xu, Zhedong Zheng, Xiatian Zhu, wei-jiWei Ji, Xiangtai Li, Ruijie Guo, Meishan Zhang, Min Zhang, hao-feiHao Fei

Published in NeurIPS, 2025

Recommended citation: Haidong Xu, Guangwei Xu, Zhedong Zheng, Xiatian Zhu, Wei Ji, Xiangtai Li, Ruijie Guo, Meishan Zhang, Min Zhang, Hao Fei, "VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models." NeurIPS, 2025.
Download PDF: https://zdzheng.xyz/files/Haidong_VimoRAG.pdf

Code is available at: https://walkermitty.github.io/VimoRAG/

@inproceedings{xu2025VimoRAG,
author = "Xu, Haidong and Xu, Guangwei and Zheng, Zhedong and Zhu, Xiatian and Ji, Wei and Li, Xiangtai and Guo, Ruijie and Zhang, Meishan and Zhang, Min and Fei, Hao",
title = "VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models",
booktitle = "NeurIPS",
url = "https://zdzheng.xyz/files/Haidong\_VimoRAG.pdf",
code = "https://walkermitty.github.io/VimoRAG/",
year = "2025" }