Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering

29 March 2018

Papers citing "Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering"

23 / 23 papers shown

Title
GenQ: Automated Question Generation to Support Caregivers While Reading Stories with Children Arun Balajiee Lekshmi Narayanan Ligia E. Gómez Martha Michelle Soto Fernandez Tri Minh Nguyen Chris Blais M. Restrepo A. Glenberg AI4Ed 21 1 0 26 May 2023
DiffCap: Exploring Continuous Diffusion on Image Captioning Yufeng He Zefan Cai Xu Gan Baobao Chang DiffM 31 5 0 20 May 2023
Affordances from Human Videos as a Versatile Representation for Robotics Shikhar Bahl Russell Mendonca Lili Chen Unnat Jain Deepak Pathak 44 164 0 17 Apr 2023
Progressive Tree-Structured Prototype Network for End-to-End Image Captioning Pengpeng Zeng Jinkuan Zhu Jingkuan Song Lianli Gao VLM 24 27 0 17 Nov 2022
Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review Hao Wang Bin Guo Y. Zeng Yasan Ding Chen Qiu Ying Zhang Li Yao Zhiwen Yu 32 2 0 02 Jul 2022
Asking for Knowledge: Training RL Agents to Query External Knowledge Using Language Iou-Jen Liu Xingdi Yuan Marc-Alexandre Côté Pierre-Yves Oudeyer A. Schwing RALM 19 12 0 12 May 2022
Learning to Respond with Your Favorite Stickers: A Framework of Unifying Multi-Modality and User Preference in Multi-Turn Dialog Shen Gao Xiuying Chen Li Liu Dongyan Zhao Rui Yan 24 14 0 05 Nov 2020
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies Itai Gat Idan Schwartz A. Schwing Tamir Hazan 55 90 0 21 Oct 2020
DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue X. Jiang Jiahao Yu Zengchang Qin Yingying Zhuang Xingxing Zhang Yue Hu Qi Wu 23 70 0 17 Nov 2019
TAB-VCR: Tags and Attributes based Visual Commonsense Reasoning Baselines Jingxiang Lin Unnat Jain A. Schwing LRM ReLM 34 9 0 31 Oct 2019
Probabilistic framework for solving Visual Dialog Badri N. Patro Anupriy Vinay P. Namboodiri BDL 30 13 0 11 Sep 2019
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning J. Aneja Harsh Agrawal Dhruv Batra A. Schwing BDL VLM 23 66 0 22 Aug 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods Aditya Mogadala M. Kalimuthu Dietrich Klakow VLM 20 132 0 22 Jul 2019
Factor Graph Attention Idan Schwartz Seunghak Yu Tamir Hazan A. Schwing 24 110 0 11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog Idan Schwartz A. Schwing Tamir Hazan 24 69 0 11 Apr 2019
Reasoning Visual Dialogs with Structural and Partial Observations Zilong Zheng Wenguan Wang Siyuan Qi Song-Chun Zhu 39 117 0 11 Apr 2019
Image-Question-Answer Synergistic Network for Visual Dialog Dalu Guo Chang Xu Dacheng Tao 19 74 0 26 Feb 2019
Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog Zhe Gan Yu Cheng Ahmed El Kholy Linjie Li Jingjing Liu Jianfeng Gao 11 104 0 01 Feb 2019
Audio-Visual Scene-Aware Dialog Huda AlAmri Vincent Cartillier Abhishek Das Jue Wang A. Cherian ... Tim K. Marks Chiori Hori Peter Anderson Stefan Lee Devi Parikh VGen 25 189 0 25 Jan 2019
Holistic Multi-modal Memory Network for Movie Question Answering Anran Wang Anh Tuan Luu Chuan-Sheng Foo Erik Cambria Yi Tay V. Chandrasekhar 30 20 0 12 Nov 2018
Diverse and Coherent Paragraph Generation from Images Moitreya Chatterjee A. Schwing 19 66 0 03 Sep 2018
Convolutional Image Captioning J. Aneja Aditya Deshpande A. Schwing VLM 37 359 0 24 Nov 2017
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding Akira Fukui Dong Huk Park Daylen Yang Anna Rohrbach Trevor Darrell Marcus Rohrbach 167 1,464 0 06 Jun 2016