IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training

12 July 2022
Xinyu Huang
Youcai Zhang
Ying Cheng
Weiwei Tian
Ruiwei Zhao
Rui Feng
Yuejie Zhang
Yaqian Li
Yandong Guo
Xiao-Yong Zhang
    VLM

Papers citing "IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training"

10 / 10 papers shown
Reminding Multimodal Large Language Models of Object-aware Knowledge with Retrieved Tags
Daiqing Qi
Handong Zhao
Zijun Wei
Sheng Li
83
2
0
16 Jun 2024
A Survey on Incomplete Multi-label Learning: Recent Advances and Future Trends
Xiang Li
Jiexi Liu
Xinrui Wang
Songcan Chen
AI4TS
98
0
0
10 Jun 2024
Learning to Adapt CLIP for Few-Shot Monocular Depth Estimation
Xue-mei Hu
Ce Zhang
Yi Zhang
Bowen Hai
Ke Yu
Zhihai He
MDE, VLM
100
18
0
02 Nov 2023
Open-Set Image Tagging with Multi-Grained Text Supervision
Xinyu Huang
Yi-Jie Huang
Youcai Zhang
Weiwei Tian
Rui Feng
Yuejie Zhang
Yanchun Xie
Yaqian Li
Lei Zhang
VLM
87
35
0
23 Oct 2023
Unsupervised Prototype Adapter for Vision-Language Models
Yi Zhang
Ce Zhang
Xue-mei Hu
Z. He
VLM
79
4
0
22 Aug 2023
Recognize Anything: A Strong Image Tagging Model
Youcai Zhang
Xinyu Huang
Jinyu Ma
Zhaoyang Li
Zhaochuan Luo
...
Tong Luo
Yaqian Li
Siyi Liu
Yandong Guo
Lei Zhang
VLM
144
242
0
06 Jun 2023
Bi-VLGM : Bi-Level Class-Severity-Aware Vision-Language Graph Matching for Text Guided Medical Image Segmentation
Wenting Chen
Jie Liu
Yixuan Yuan
VLM
81
3
0
20 May 2023
LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition
Peng Xia
Di Xu
Ming Hu
Lie Ju
Zongyuan Ge
VLM
94
11
0
08 May 2023
Tag2Text: Guiding Vision-Language Model via Image Tagging
Xinyu Huang
Youcai Zhang
Jinyu Ma
Weiwei Tian
Rui Feng
Yuejie Zhang
Yaqian Li
Yandong Guo
Lei Zhang
CLIP, MLLM, VLM, 3DV
152
77
0
10 Mar 2023
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining
Xiaoyi Dong
Jianmin Bao
Yinglin Zheng
Ting Zhang
Dongdong Chen
...
Weiming Zhang
Lu Yuan
Dong Chen
Fang Wen
Nenghai Yu
CLIP, VLM
115
167
0
25 Aug 2022