Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.09522
Cited By
v1
v2 (latest)
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends
19 October 2020
Shagun Uppal
Sarthak Bhagat
Devamanyu Hazarika
Navonil Majumdar
Soujanya Poria
Roger Zimmermann
Amir Zadeh
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Multimodal Research in Vision and Language: A Review of Current and Emerging Trends"
30 / 180 papers shown
Title
Dynamic Coattention Networks For Question Answering
Caiming Xiong
Victor Zhong
R. Socher
AIMat
84
684
0
05 Nov 2016
Solving Visual Madlibs with Multiple Cues
Tatiana Tommasi
Arun Mallya
Bryan A. Plummer
Svetlana Lazebnik
Alexander C. Berg
Tamara L. Berg
62
18
0
11 Aug 2016
SPICE: Semantic Propositional Image Caption Evaluation
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
EGVM
102
1,914
0
29 Jul 2016
Hierarchical Attention Network for Action Recognition in Videos
Yilin Wang
Suhang Wang
Jiliang Tang
Neil O'Hare
Yi-Ju Chang
Baoxin Li
BDL
56
82
0
21 Jul 2016
Improved Techniques for Training GANs
Tim Salimans
Ian Goodfellow
Wojciech Zaremba
Vicki Cheung
Alec Radford
Xi Chen
GAN
483
9,052
0
10 Jun 2016
Adversarial Feature Learning
Jiasen Lu
Philipp Krahenbuhl
Trevor Darrell
GAN
111
1,610
0
31 May 2016
Does Multimodality Help Human and Machine for Translation and Image Captioning?
Ozan Caglayan
Walid Aransa
Yaxing Wang
Marc Masana
Mercedes García-Martínez
Fethi Bougares
Loïc Barrault
Joost van de Weijer
60
86
0
30 May 2016
Review Networks for Caption Generation
Zhilin Yang
Ye Yuan
Yuexin Wu
Ruslan Salakhutdinov
William W. Cohen
3DV
49
85
0
25 May 2016
Generative Adversarial Text to Image Synthesis
Scott E. Reed
Zeynep Akata
Xinchen Yan
Lajanugen Logeswaran
Bernt Schiele
Honglak Lee
GAN
203
3,146
0
17 May 2016
MovieQA: Understanding Stories in Movies through Question-Answering
Makarand Tapaswi
Yukun Zhu
Rainer Stiefelhagen
Antonio Torralba
R. Urtasun
Sanja Fidler
115
749
0
09 Dec 2015
Uncovering Temporal Context for Video Question and Answering
Linchao Zhu
Zhongwen Xu
Yi Yang
Alexander G. Hauptmann
BDL
68
45
0
15 Nov 2015
Stacked Attention Networks for Image Question Answering
Zichao Yang
Xiaodong He
Jianfeng Gao
Li Deng
Alex Smola
BDL
109
1,882
0
07 Nov 2015
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks
Haonan Yu
Jiang Wang
Zhiheng Huang
Yi Yang
Wenyuan Xu
90
560
0
26 Oct 2015
SentiCap: Generating Image Descriptions with Sentiments
A. Mathews
Lexing Xie
Xuming He
82
221
0
06 Oct 2015
Automatic Concept Discovery from Parallel Text and Visual Corpora
Chen Sun
Chuang Gan
Ram Nevatia
CoGe
42
107
0
24 Sep 2015
Alignment-based compositional semantics for instruction following
Jacob Andreas
Dan Klein
68
102
0
26 Aug 2015
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
385
7,964
0
17 Aug 2015
A Survey of Current Datasets for Vision and Language Research
Francis Ferraro
N. Mostafazadeh
Ting-Hao 'Kenneth' Huang
Huang
Lucy Vanderwende
Jacob Devlin
Michel Galley
Margaret Mitchell
VLM
49
75
0
23 Jun 2015
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
Bryan A. Plummer
Liwei Wang
Christopher M. Cervantes
Juan C. Caicedo
Julia Hockenmaier
Svetlana Lazebnik
199
2,060
0
19 May 2015
Sequence to Sequence -- Video to Text
Subhashini Venugopalan
Marcus Rohrbach
Jeff Donahue
Raymond J. Mooney
Trevor Darrell
Kate Saenko
142
1,418
0
03 May 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
208
5,478
0
03 May 2015
DRAW: A Recurrent Neural Network For Image Generation
Karol Gregor
Ivo Danihelka
Alex Graves
Danilo Jimenez Rezende
Daan Wierstra
GAN
DRL
168
1,961
0
16 Feb 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
346
10,070
0
10 Feb 2015
Deep Visual-Semantic Alignments for Generating Image Descriptions
A. Karpathy
Li Fei-Fei
127
5,585
0
07 Dec 2014
CIDEr: Consensus-based Image Description Evaluation
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
292
4,488
0
20 Nov 2014
Show and Tell: A Neural Image Caption Generator
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
246
6,029
0
17 Nov 2014
Neural Turing Machines
Alex Graves
Greg Wayne
Ivo Danihelka
97
2,328
0
20 Oct 2014
Unsupervised Domain Adaptation by Backpropagation
Yaroslav Ganin
Victor Lempitsky
OOD
233
6,030
0
26 Sep 2014
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
437
20,568
0
10 Sep 2014
Generating Sequences With Recurrent Neural Networks
Alex Graves
GAN
155
4,034
0
04 Aug 2013
Previous
1
2
3
4