v1v2 (latest)

Multimodal Research in Vision and Language: A Review of Current and Emerging Trends

19 October 2020

Roger Zimmermann

Papers citing "Multimodal Research in Vision and Language: A Review of Current and Emerging Trends"

30 / 180 papers shown

Title
Dynamic Coattention Networks For Question Answering Caiming Xiong Victor Zhong R. Socher AIMat 84 684 0 05 Nov 2016
Solving Visual Madlibs with Multiple Cues Tatiana Tommasi Arun Mallya Bryan A. Plummer Svetlana Lazebnik Alexander C. Berg Tamara L. Berg 62 18 0 11 Aug 2016
SPICE: Semantic Propositional Image Caption Evaluation Peter Anderson Basura Fernando Mark Johnson Stephen Gould EGVM 102 1,914 0 29 Jul 2016
Hierarchical Attention Network for Action Recognition in Videos Yilin Wang Suhang Wang Jiliang Tang Neil O'Hare Yi-Ju Chang Baoxin Li BDL 56 82 0 21 Jul 2016
Improved Techniques for Training GANs Tim Salimans Ian Goodfellow Wojciech Zaremba Vicki Cheung Alec Radford Xi Chen GAN 483 9,052 0 10 Jun 2016
Adversarial Feature Learning Jiasen Lu Philipp Krahenbuhl Trevor Darrell GAN 111 1,610 0 31 May 2016
Does Multimodality Help Human and Machine for Translation and Image Captioning? Ozan Caglayan Walid Aransa Yaxing Wang Marc Masana Mercedes García-Martínez Fethi Bougares Loïc Barrault Joost van de Weijer 60 86 0 30 May 2016
Review Networks for Caption Generation Zhilin Yang Ye Yuan Yuexin Wu Ruslan Salakhutdinov William W. Cohen 3DV 49 85 0 25 May 2016
Generative Adversarial Text to Image Synthesis Scott E. Reed Zeynep Akata Xinchen Yan Lajanugen Logeswaran Bernt Schiele Honglak Lee GAN 203 3,146 0 17 May 2016
MovieQA: Understanding Stories in Movies through Question-Answering Makarand Tapaswi Yukun Zhu Rainer Stiefelhagen Antonio Torralba R. Urtasun Sanja Fidler 115 749 0 09 Dec 2015
Uncovering Temporal Context for Video Question and Answering Linchao Zhu Zhongwen Xu Yi Yang Alexander G. Hauptmann BDL 68 45 0 15 Nov 2015
Stacked Attention Networks for Image Question Answering Zichao Yang Xiaodong He Jianfeng Gao Li Deng Alex Smola BDL 109 1,882 0 07 Nov 2015
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks Haonan Yu Jiang Wang Zhiheng Huang Yi Yang Wenyuan Xu 90 560 0 26 Oct 2015
SentiCap: Generating Image Descriptions with Sentiments A. Mathews Lexing Xie Xuming He 82 221 0 06 Oct 2015
Automatic Concept Discovery from Parallel Text and Visual Corpora Chen Sun Chuang Gan Ram Nevatia CoGe 42 107 0 24 Sep 2015
Alignment-based compositional semantics for instruction following Jacob Andreas Dan Klein 68 102 0 26 Aug 2015
Effective Approaches to Attention-based Neural Machine Translation Thang Luong Hieu H. Pham Christopher D. Manning 385 7,964 0 17 Aug 2015
A Survey of Current Datasets for Vision and Language Research Francis Ferraro N. Mostafazadeh Ting-Hao 'Kenneth' Huang Huang Lucy Vanderwende Jacob Devlin Michel Galley Margaret Mitchell VLM 49 75 0 23 Jun 2015
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models Bryan A. Plummer Liwei Wang Christopher M. Cervantes Juan C. Caicedo Julia Hockenmaier Svetlana Lazebnik 199 2,060 0 19 May 2015
Sequence to Sequence -- Video to Text Subhashini Venugopalan Marcus Rohrbach Jeff Donahue Raymond J. Mooney Trevor Darrell Kate Saenko 142 1,418 0 03 May 2015
VQA: Visual Question Answering Aishwarya Agrawal Jiasen Lu Stanislaw Antol Margaret Mitchell C. L. Zitnick Dhruv Batra Devi Parikh CoGe 208 5,478 0 03 May 2015
DRAW: A Recurrent Neural Network For Image Generation Karol Gregor Ivo Danihelka Alex Graves Danilo Jimenez Rezende Daan Wierstra GAN DRL 168 1,961 0 16 Feb 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention Ke Xu Jimmy Ba Ryan Kiros Kyunghyun Cho Aaron Courville Ruslan Salakhutdinov R. Zemel Yoshua Bengio DiffM 346 10,070 0 10 Feb 2015
Deep Visual-Semantic Alignments for Generating Image Descriptions A. Karpathy Li Fei-Fei 127 5,585 0 07 Dec 2014
CIDEr: Consensus-based Image Description Evaluation Ramakrishna Vedantam C. L. Zitnick Devi Parikh 292 4,488 0 20 Nov 2014
Show and Tell: A Neural Image Caption Generator Oriol Vinyals Alexander Toshev Samy Bengio D. Erhan 3DV 246 6,029 0 17 Nov 2014
Neural Turing Machines Alex Graves Greg Wayne Ivo Danihelka 97 2,328 0 20 Oct 2014
Unsupervised Domain Adaptation by Backpropagation Yaroslav Ganin Victor Lempitsky OOD 233 6,030 0 26 Sep 2014
Sequence to Sequence Learning with Neural Networks Ilya Sutskever Oriol Vinyals Quoc V. Le AIMat 437 20,568 0 10 Sep 2014
Generating Sequences With Recurrent Neural Networks Alex Graves GAN 155 4,034 0 04 Aug 2013