
Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
Papers citing "Align before Fuse: Vision and Language Representation Learning with Momentum Distillation"
50 / 1,231 papers shown
Title |
---|
Dataset Growth. Ziheng Qin, Zhaopan Xu, Yukun Zhou, Zangwei Zheng, Zebang Cheng, ..., Xiaojiang Peng, Radu Timofte, Hongxun Yao, Kai Wang, Yang You |