Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.06558
Cited By
MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning
13 December 2021
Wenqiao Zhang
Haochen Shi
Jiannan Guo
Shengyu Zhang
Qingpeng Cai
Juncheng Li
Sihui Luo
Yueting Zhuang
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning"
30 / 30 papers shown
Title
End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding
Meng Li
Tianbao Wang
Haoyu Zhang
Shengyu Zhang
Zhou Zhao
...
Wenming Tan
Jin Wang
Peng Wang
Shi Pu
Leilei Gan
64
45
0
15 Mar 2022
Consensus Graph Representation Learning for Better Grounded Image Captioning
Wenqiao Zhang
Haochen Shi
Siliang Tang
Jun Xiao
Qiang Yu
Yueting Zhuang
34
56
0
02 Dec 2021
Relational Graph Learning for Grounded Video Description Generation
Wenqiao Zhang
Xinze Wang
Siliang Tang
Haizhou Shi
Haochen Shi
Jun Xiao
Yueting Zhuang
Wenjie Wang
42
33
0
02 Dec 2021
Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language Inference
Juncheng Li
Siliang Tang
Linchao Zhu
Haochen Shi
Xuanwen Huang
Leilei Gan
Yi Yang
Yueting Zhuang
70
28
0
26 Jul 2021
Exploring Semantic Relationships for Unpaired Image Captioning
Fenglin Liu
Meng Gao
Tianhao Zhang
Yuexian Zou
106
7
0
20 Jun 2021
Towards Accurate Text-based Image Captioning with Content Diversity Exploration
Guanghui Xu
Shuaicheng Niu
Mingkui Tan
Yucheng Luo
Qing Du
Qi Wu
DiffM
67
57
0
23 Apr 2021
Cross-Domain Facial Expression Recognition: A Unified Evaluation Benchmark and Adversarial Graph Learning
Tianshui Chen
Tao Pu
Hefeng Wu
Yuan Xie
Lingbo Liu
Liang Lin
61
99
0
03 Aug 2020
Unsupervised Learning of Visual Features by Contrasting Cluster Assignments
Mathilde Caron
Ishan Misra
Julien Mairal
Priya Goyal
Piotr Bojanowski
Armand Joulin
OCL
SSL
215
4,070
0
17 Jun 2020
TextCaps: a Dataset for Image Captioning with Reading Comprehension
Oleksii Sidorov
Ronghang Hu
Marcus Rohrbach
Amanpreet Singh
58
411
0
24 Mar 2020
Deconfounded Image Captioning: A Causal Retrospect
Xu Yang
Hanwang Zhang
Jianfei Cai
CML
42
124
0
09 Mar 2020
Meshed-Memory Transformer for Image Captioning
Marcella Cornia
Matteo Stefanini
Lorenzo Baraldi
Rita Cucchiara
59
874
0
17 Dec 2019
Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation
Juncheng Li
Xinze Wang
Siliang Tang
Haizhou Shi
Leilei Gan
Yueting Zhuang
William Yang Wang
SSL
73
69
0
18 Nov 2019
Rosetta: Large scale system for text detection and recognition in images
Fedor Borisyuk
Albert Gordo
V. Sivakumar
56
298
0
11 Oct 2019
Walking with MIND: Mental Imagery eNhanceD Embodied QA
Juncheng Li
Siliang Tang
Leilei Gan
Yueting Zhuang
34
23
0
05 Aug 2019
Relational Reasoning using Prior Knowledge for Visual Captioning
Jingyi Hou
Xinxiao Wu
Yayun Qi
Wentian Zhao
Jiebo Luo
Yunde Jia
60
14
0
04 Jun 2019
Multimodal Transformer with Multi-View Visual Representation for Image Captioning
Jun-chen Yu
Jing Li
Zhou Yu
Qingming Huang
ViT
61
383
0
20 May 2019
Towards VQA Models That Can Read
Amanpreet Singh
Vivek Natarajan
Meet Shah
Yu Jiang
Xinlei Chen
Dhruv Batra
Devi Parikh
Marcus Rohrbach
EgoV
69
1,210
0
18 Apr 2019
Describing like humans: on diversity in image captioning
Qingzhong Wang
Antoni B. Chan
60
99
0
28 Mar 2019
Unpaired Image Captioning via Scene Graph Alignments
Jiuxiang Gu
Shafiq Joty
Jianfei Cai
Handong Zhao
Xu Yang
G. Wang
GNN
51
174
0
26 Mar 2019
Exploring Visual Relationship for Image Captioning
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
74
831
0
19 Sep 2018
Unpaired Image Captioning by Language Pivoting
Jiuxiang Gu
Shafiq Joty
Jianfei Cai
G. Wang
59
83
0
14 Mar 2018
Unsupervised Machine Translation Using Monolingual Corpora Only
Guillaume Lample
Alexis Conneau
Ludovic Denoyer
MarcÁurelio Ranzato
SSL
110
1,094
0
31 Oct 2017
Graph Convolutional Matrix Completion
Rianne van den Berg
Thomas Kipf
Max Welling
GNN
110
1,256
0
07 Jun 2017
Encoding Sentences with Graph Convolutional Networks for Semantic Role Labeling
Diego Marcheggiani
Ivan Titov
GNN
NAI
64
832
0
14 Mar 2017
Self-critical Sequence Training for Image Captioning
Steven J. Rennie
E. Marcheret
Youssef Mroueh
Jerret Ross
Vaibhava Goel
105
1,886
0
02 Dec 2016
SPICE: Semantic Propositional Image Caption Evaluation
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
EGVM
90
1,909
0
29 Jul 2016
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
Justin Johnson
A. Karpathy
Li Fei-Fei
VLM
124
1,167
0
24 Nov 2015
A Diversity-Promoting Objective Function for Neural Conversation Models
Jiwei Li
Michel Galley
Chris Brockett
Jianfeng Gao
W. Dolan
129
2,387
0
11 Oct 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
471
62,122
0
04 Jun 2015
CIDEr: Consensus-based Image Description Evaluation
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
260
4,471
0
20 Nov 2014
1