Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2212.07075
Cited By
Cross-Modal Similarity-Based Curriculum Learning for Image Captioning
14 December 2022
Hongkuan Zhang
Saku Sugawara
Akiko Aizawa
Lei Zhou
Ryohei Sasano
Koichi Takeda
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Cross-Modal Similarity-Based Curriculum Learning for Image Captioning"
3 / 3 papers shown
Title
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning
Xinzhi Dong
Chengjiang Long
Wenju Xu
Chunxia Xiao
ViT
73
66
0
05 Aug 2021
Unifying Vision-and-Language Tasks via Text Generation
Jaemin Cho
Jie Lei
Hao Tan
Mohit Bansal
MLLM
256
525
0
04 Feb 2021
Unified Vision-Language Pre-Training for Image Captioning and VQA
Luowei Zhou
Hamid Palangi
Lei Zhang
Houdong Hu
Jason J. Corso
Jianfeng Gao
MLLM
VLM
252
927
0
24 Sep 2019
1