Align and tell: Boosting text-video retrieval with local alignment and fine-grained supervision. Xiaohan Wang, Linchao Zhu, Zhedong Zheng, Mingliang Xu, Yi Yang. IEEE Transactions on Multimedia (TMM), 2022.