arXiv: 2012.05710
Look Before you Speak: Visually Contextualized Utterances
10 December 2020
Paul Hongsuck Seo
Arsha Nagrani
Cordelia Schmid
Papers citing "Look Before you Speak: Visually Contextualized Utterances"
16 / 66 papers shown
Title
Frames: A Corpus for Adding Memory to Goal-Oriented Dialogue Systems
Layla El Asri
Hannes Schulz
Shikhar Sharma
Jeremie Zumer
Justin Harris
Emery Fine
Rahul Mehrotra
Kaheer Suleman
115
270
0
31 Mar 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
Abhishek Das
Satwik Kottur
J. M. F. Moura
Stefan Lee
Dhruv Batra
OffRL
110
425
0
20 Mar 2017
Generating High-Quality and Informative Conversation Responses with Sequence-to-Sequence Models
Yuanlong Shao
Stephan Gouws
D. Britz
Anna Goldie
B. Strope
R. Kurzweil
130
212
0
11 Jan 2017
Text-guided Attention Model for Image Captioning
Jonghwan Mun
Minsu Cho
Bohyung Han
VLM
39
93
0
12 Dec 2016
MarioQA: Answering Questions by Watching Gameplay Videos
Jonghwan Mun
Paul Hongsuck Seo
Ilchae Jung
Bohyung Han
83
109
0
06 Dec 2016
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
336
3,238
0
02 Dec 2016
Visual Dialog
Abhishek Das
Satwik Kottur
Khushi Gupta
Avi Singh
Deshraj Yadav
José M. F. Moura
Devi Parikh
Dhruv Batra
142
997
0
26 Nov 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
296
1,465
0
06 Jun 2016
A Persona-Based Neural Conversation Model
Jiwei Li
Michel Galley
Chris Brockett
Georgios P. Spithourakis
Jianfeng Gao
W. Dolan
113
1,036
0
19 Mar 2016
Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction
Hyeonwoo Noh
Paul Hongsuck Seo
Bohyung Han
OOD
72
327
0
18 Nov 2015
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models
Iulian Serban
Alessandro Sordoni
Yoshua Bengio
Aaron Courville
Joelle Pineau
AILaw
166
1,754
0
17 Jul 2015
Multi-domain Dialog State Tracking using Recurrent Neural Networks
N. Mrksic
Diarmuid Ó Séaghdha
Blaise Thomson
Milica Gasic
Pei-hao Su
David Vandyke
Tsung-Hsien Wen
S. Young
65
183
0
23 Jun 2015
You Only Look Once: Unified, Real-Time Object Detection
Joseph Redmon
S. Divvala
Ross B. Girshick
Ali Farhadi
ObjD
697
36,935
0
08 Jun 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
502
62,270
0
04 Jun 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
199
5,470
0
03 May 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
340
10,069
0
10 Feb 2015