2305.12311
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data
21 May 2023
Ziyi Yang
Mahmoud Khademi
Yichong Xu
Reid Pryzant
Yuwei Fang
Chenguang Zhu
Dongdong Chen
Yao Qian
Mei Gao
Yi-Ling Chen
R. Gmyr
Naoyuki Kanda
Noel Codella
Bin Xiao
Yu Shi
Lu Yuan
Takuya Yoshioka
Michael Zeng
Xuedong Huang
Papers citing
"i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data"
4 / 54 papers shown
Title
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
Justin Johnson
A. Karpathy
Li Fei-Fei
VLM
131
1,170
0
24 Nov 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
233
5,509
0
03 May 2015
Microsoft COCO Captions: Data Collection and Evaluation Server
Xinlei Chen
Hao Fang
Nayeon Lee
Ramakrishna Vedantam
Saurabh Gupta
Piotr Dollar
C. L. Zitnick
224
2,496
0
01 Apr 2015
Deep Visual-Semantic Alignments for Generating Image Descriptions
A. Karpathy
Li Fei-Fei
152
5,591
0
07 Dec 2014