Align and tell: Boosting text-video retrieval with local alignment and fine-grained supervision

Authors: Xiaohan Wang, Linchao Zhu, Zhedong Zheng, Mingliang Xu, Yi Yang

Published in IEEE Transactions on Multimedia, 2022

Recommended citation: Xiaohan Wang, Linchao Zhu, Zhedong Zheng, Mingliang Xu, Yi Yang, "Align and tell: Boosting text-video retrieval with local alignment and fine-grained supervision." IEEE Transactions on Multimedia, 2022. DOI: 10.1109/TMM.2022.3204444
Download PDF: https://zdzheng.xyz/files/TMM22-Xiaohan.pdf

@article{wang2022align,
author = "Wang, Xiaohan and Zhu, Linchao and Zheng, Zhedong and Xu, Mingliang and Yang, Yi",
doi = "10.1109/TMM.2022.3204444",
title = "Align and tell: Boosting text-video retrieval with local alignment and fine-grained supervision",
journal = "IEEE Transactions on Multimedia",
url = "https://zdzheng.xyz/files/TMM22-Xiaohan.pdf",
year = "2022",
publisher = "IEEE" }