Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.14525
Cited By
Curriculum Learning for Data-Efficient Vision-Language Alignment
29 July 2022
Tejas Srinivasan
Xiang Ren
Jesse Thomason
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Curriculum Learning for Data-Efficient Vision-Language Alignment"
7 / 7 papers shown
Title
Exploring Curriculum Learning for Vision-Language Tasks: A Study on Small-Scale Multimodal Training
Rohan Saha
Abrar Fahim
Alona Fyshe
Alex Murphy
26
0
0
20 Oct 2024
Can Vision Language Models Learn from Visual Demonstrations of Ambiguous Spatial Reasoning?
Bowen Zhao
Leo Parker Dirac
Paulina Varshavskaya
VLM
LRM
26
0
0
25 Sep 2024
Robust Domain Misinformation Detection via Multi-modal Feature Alignment
Hui Liu
Wenya Wang
Hao Sun
Anderson de Rezende Rocha
Haoliang Li
43
11
0
24 Nov 2023
UnIVAL: Unified Model for Image, Video, Audio and Language Tasks
Mustafa Shukor
Corentin Dancette
Alexandre Ramé
Matthieu Cord
MoMe
MLLM
61
42
0
30 Jul 2023
Learning from Children: Improving Image-Caption Pretraining via Curriculum
Hammad A. Ayyubi
R. Lokesh
Alireza Zareian
Bohong Wu
Shih-Fu Chang
VLM
CLIP
22
1
0
27 May 2023
Does Vision-and-Language Pretraining Improve Lexical Grounding?
Tian Yun
Chen Sun
Ellie Pavlick
VLM
CoGe
40
30
0
21 Sep 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
313
3,708
0
11 Feb 2021
1