ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2207.14525
  4. Cited By
Curriculum Learning for Data-Efficient Vision-Language Alignment

Curriculum Learning for Data-Efficient Vision-Language Alignment

29 July 2022
Tejas Srinivasan
Xiang Ren
Jesse Thomason
    VLM
ArXivPDFHTML

Papers citing "Curriculum Learning for Data-Efficient Vision-Language Alignment"

7 / 7 papers shown
Title
Exploring Curriculum Learning for Vision-Language Tasks: A Study on
  Small-Scale Multimodal Training
Exploring Curriculum Learning for Vision-Language Tasks: A Study on Small-Scale Multimodal Training
Rohan Saha
Abrar Fahim
Alona Fyshe
Alex Murphy
26
0
0
20 Oct 2024
Can Vision Language Models Learn from Visual Demonstrations of Ambiguous
  Spatial Reasoning?
Can Vision Language Models Learn from Visual Demonstrations of Ambiguous Spatial Reasoning?
Bowen Zhao
Leo Parker Dirac
Paulina Varshavskaya
VLM
LRM
26
0
0
25 Sep 2024
Robust Domain Misinformation Detection via Multi-modal Feature Alignment
Robust Domain Misinformation Detection via Multi-modal Feature Alignment
Hui Liu
Wenya Wang
Hao Sun
Anderson de Rezende Rocha
Haoliang Li
43
11
0
24 Nov 2023
UnIVAL: Unified Model for Image, Video, Audio and Language Tasks
UnIVAL: Unified Model for Image, Video, Audio and Language Tasks
Mustafa Shukor
Corentin Dancette
Alexandre Ramé
Matthieu Cord
MoMe
MLLM
61
42
0
30 Jul 2023
Learning from Children: Improving Image-Caption Pretraining via
  Curriculum
Learning from Children: Improving Image-Caption Pretraining via Curriculum
Hammad A. Ayyubi
R. Lokesh
Alireza Zareian
Bohong Wu
Shih-Fu Chang
VLM
CLIP
22
1
0
27 May 2023
Does Vision-and-Language Pretraining Improve Lexical Grounding?
Does Vision-and-Language Pretraining Improve Lexical Grounding?
Tian Yun
Chen Sun
Ellie Pavlick
VLM
CoGe
40
32
0
21 Sep 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy
  Text Supervision
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
310
3,708
0
11 Feb 2021
1