Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1701.08251
Cited By
v1
v2 (latest)
Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation
28 January 2017
N. Mostafazadeh
Chris Brockett
W. Dolan
Michel Galley
Jianfeng Gao
Georgios P. Spithourakis
Lucy Vanderwende
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation"
50 / 64 papers shown
Title
On the Effectiveness of Integration Methods for Multimodal Dialogue Response Retrieval
Seongbo Jang
Seonghyeon Lee
Dongha Lee
Hwanjo Yu
19
0
0
13 Jun 2025
KwaiChat: A Large-Scale Video-Driven Multilingual Mixed-Type Dialogue Corpus
Xiaoming Shi
Zeming Liu
Chenkai Zhang
Yiming Lei
Haitao Leng
...
Qingjie Liu
Wanxiang Che
Shaoguo Liu
Size Li
Yanjie Wang
161
1
0
10 Mar 2025
An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation
Peiming Guo
Sinuo Liu
Yanzhao Zhang
Dingkun Long
Pengjun Xie
Meishan Zhang
Hao Fei
DiffM
151
1
0
16 Aug 2024
A Survey of Personality, Persona, and Profile in Conversational Agents and Chatbots
Richard Sutcliffe
133
4
0
31 Dec 2023
See, Say, and Segment: Teaching LMMs to Overcome False Premises
Tsung-Han Wu
Giscard Biamby
David M. Chan
Lisa Dunlap
Ritwik Gupta
Xudong Wang
Joseph E. Gonzalez
Trevor Darrell
VLM
MLLM
113
21
0
13 Dec 2023
Large Language Models can Share Images, Too!
Young-Jun Lee
Dokyong Lee
Joo Won Sung
Jonghwan Hyeon
Ho-Jin Choi
MLLM
79
2
0
23 Oct 2023
PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts
Yunshui Li
Binyuan Hui
Zhichao Yin
Min Yang
Fei Huang
Yongbin Li
MoE
87
21
0
24 May 2023
Building Multimodal AI Chatbots
Mingyu Lee
59
3
0
21 Apr 2023
Models of symbol emergence in communication: a conceptual review and a guide for avoiding local minima
Julian Zubek
Tomasz Korbak
J. Rączaszek-Leonardi
60
3
0
08 Mar 2023
TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real World
Hongpeng Lin
Ludan Ruan
Wenke Xia
Peiyu Liu
Jing Wen
...
Di Hu
Ruihua Song
Wayne Xin Zhao
Qin Jin
Zhiwu Lu
VGen
85
11
0
14 Jan 2023
DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset
Young-Jun Lee
ByungSoo Ko
Han-Gyu Kim
Jonghwan Hyeon
Ho-Jin Choi
89
8
0
08 Dec 2022
Persona-Based Conversational AI: State of the Art and Challenges
Junfeng Liu
Christopher T. Symons
R. Vatsavai
63
12
0
04 Dec 2022
MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation
Jiazhan Feng
Qingfeng Sun
Can Xu
Pu Zhao
Yaming Yang
Chongyang Tao
Dongyan Zhao
Qingwei Lin
99
59
0
10 Nov 2022
Lexical Knowledge Internalization for Neural Dialog Generation
Zhiyong Wu
Wei Bi
Xiang Li
Lingpeng Kong
B. Kao
65
2
0
04 May 2022
Learning to Express in Knowledge-Grounded Conversation
Xueliang Zhao
Tingchen Fu
Chongyang Tao
Wei Wu
Dongyan Zhao
Rui Yan
58
6
0
12 Apr 2022
Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge
Yoonna Jang
J. Lim
Yuna Hur
Dongsuk Oh
Suhyune Son
Yeonsoo Lee
Donghoon Shin
Seungryong Kim
Heuiseok Lim
93
42
0
16 Dec 2021
The JDDC 2.0 Corpus: A Large-Scale Multimodal Multi-Turn Chinese Dialogue Dataset for E-commerce Customer Service
Nan Zhao
Haoran Li
Youzheng Wu
Xiaodong He
Bowen Zhou
46
9
0
27 Sep 2021
Constructing Multi-Modal Dialogue Dataset by Replacing Text with Semantically Relevant Images
Nyoungwoo Lee
Suwon Shin
Jaegul Choo
Ho‐Jin Choi
S. Myaeng
60
27
0
19 Jul 2021
PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling
Xiaoxue Zang
Lijuan Liu
Maria Wang
Yang Song
Hao Zhang
Jindong Chen
VLM
99
60
0
06 Jul 2021
Modeling Text-visual Mutual Dependency for Multi-modal Dialog Generation
Shuhe Wang
Yuxian Meng
Xiaofei Sun
Leilei Gan
Rongbin Ouyang
Rui Yan
Tianwei Zhang
Jiwei Li
66
15
0
30 May 2021
Maria: A Visual Experience Powered Conversational Agent
Zujie Liang
Huang Hu
Can Xu
Chongyang Tao
Xiubo Geng
Yining Chen
Fan Liang
Daxin Jiang
91
32
0
27 May 2021
Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey
Jinjie Ni
Tom Young
Vlad Pandelea
Fuzhao Xue
Min Zhang
225
280
0
10 May 2021
Focused Attention Improves Document-Grounded Generation
Shrimai Prabhumoye
Kazuma Hashimoto
Yingbo Zhou
A. Black
Ruslan Salakhutdinov
225
41
0
26 Apr 2021
Knowledge-Grounded Dialogue Generation with Pre-trained Language Models
Xueliang Zhao
Wei Wu
Can Xu
Chongyang Tao
Dongyan Zhao
Rui Yan
260
193
0
17 Oct 2020
Multi-Modal Open-Domain Dialogue
Kurt Shuster
Eric Michael Smith
Da Ju
Jason Weston
AI4CE
137
44
0
02 Oct 2020
The Adapter-Bot: All-In-One Controllable Conversational Model
Andrea Madotto
Zhaojiang Lin
Yejin Bang
Pascale Fung
98
63
0
28 Aug 2020
Open-Domain Conversational Agents: Current Progress, Open Problems, and Future Directions
Stephen Roller
Y-Lan Boureau
Jason Weston
Antoine Bordes
Emily Dinan
...
Kurt Shuster
Eric Michael Smith
Arthur Szlam
Jack Urbanek
Mary Williamson
LLMAG
AI4CE
132
52
0
22 Jun 2020
Open Domain Dialogue Generation with Latent Images
Ze Yang
Wei Wu
Huang Hu
Can Xu
Wei Wang
Zhoujun Li
76
30
0
04 Apr 2020
Low-Resource Knowledge-Grounded Dialogue Generation
Xueliang Zhao
Wei Wu
Chongyang Tao
Can Xu
Dongyan Zhao
Rui Yan
117
110
0
24 Feb 2020
Teaching Machines to Converse
Jiwei Li
87
4
0
31 Jan 2020
The Dialogue Dodecathlon: Open-Domain Knowledge and Image Grounded Conversational Agents
Kurt Shuster
Da Ju
Stephen Roller
Emily Dinan
Y-Lan Boureau
Jason Weston
109
82
0
09 Nov 2019
Contrastive Multi-document Question Generation
W. Cho
Yizhe Zhang
Sudha Rao
Asli Celikyilmaz
Chenyan Xiong
Jianfeng Gao
Mengdi Wang
Bill Dolan
SyDa
121
28
0
08 Nov 2019
Generating a Common Question from Multiple Documents using Multi-source Encoder-Decoder Models
W. Cho
Yizhe Zhang
Sudha Rao
Chris Brockett
Sungjin Lee
79
7
0
25 Oct 2019
Inverse Visual Question Answering with Multi-Level Attentions
Yaser Alwatter
Yuhong Guo
BDL
35
1
0
17 Sep 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
141
136
0
22 Jul 2019
Hindi Visual Genome: A Dataset for Multimodal English-to-Hindi Machine Translation
Shantipriya Parida
Ondrej Bojar
S. Dash
77
63
0
21 Jul 2019
"My Way of Telling a Story": Persona based Grounded Story Generation
Shrimai Prabhumoye
Khyathi Chandu
Ruslan Salakhutdinov
A. Black
76
35
0
14 Jun 2019
Conversing by Reading: Contentful Neural Conversation with On-demand Machine Reading
Lianhui Qin
Michel Galley
Chris Brockett
Xiaodong Liu
Xiang Gao
W. Dolan
Yejin Choi
Jianfeng Gao
91
110
0
06 Jun 2019
Challenges in Building Intelligent Open-domain Dialog Systems
Minlie Huang
Xiaoyan Zhu
Jianfeng Gao
VLM
150
316
0
13 May 2019
Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts
Julia Kruk
Jonah Lubin
Karan Sikka
Xiaoyu Lin
Dan Jurafsky
Ajay Divakaran
145
96
0
19 Apr 2019
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
Alex Schwing
130
110
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
Alex Schwing
Tamir Hazan
87
71
0
11 Apr 2019
An End-to-End Conversational Style Matching Agent
Rens Hoegen
Deepali Aneja
Daniel J. McDuff
Mary Czerwinski
81
57
0
04 Apr 2019
Answer-based Adversarial Training for Generating Clarification Questions
Sudha Rao
Hal Daumé
GAN
71
112
0
04 Apr 2019
Learning to Speak and Act in a Fantasy Text Adventure Game
Jack Urbanek
Angela Fan
Siddharth Karamcheti
Saachi Jain
Samuel Humeau
Emily Dinan
Tim Rocktaschel
Douwe Kiela
Arthur Szlam
Jason Weston
LLMAG
93
207
0
07 Mar 2019
Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog
Zhe Gan
Yu Cheng
Ahmed El Kholy
Linjie Li
Jingjing Liu
Jianfeng Gao
106
105
0
01 Feb 2019
Multi-modal dialog for browsing large visual catalogs using exploration-exploitation paradigm in a joint embedding space
Indrani Bhattacharya
Arkabandhu Chowdhury
V. Raykar
37
5
0
28 Jan 2019
The Design and Implementation of XiaoIce, an Empathetic Social Chatbot
Li Zhou
Jianfeng Gao
Di Li
Harry Shum
82
608
0
21 Dec 2018
Sequential Attention GAN for Interactive Image Editing
Yu Cheng
Zhe Gan
Yitong Li
Jingjing Liu
Jianfeng Gao
77
98
0
20 Dec 2018
From FiLM to Video: Multi-turn Question Answering with Multi-modal Context
T. Nguyen
Shikhar Sharma
Hannes Schulz
Layla El Asri
69
33
0
17 Dec 2018
1
2
Next