Multimodal Conversational AI: A Survey of Datasets and Approaches

13 May 2022

Papers citing "Multimodal Conversational AI: A Survey of Datasets and Approaches"

14 / 14 papers shown

Title
Cross-Format Retrieval-Augmented Generation in XR with LLMs for Context-Aware Maintenance Assistance Á. Nagy Yannis Spyridis Vasileios Argyriou RALM 48 0 0 24 Feb 2025
iTBLS: A Dataset of Interactive Conversations Over Tabular Information Anirudh S. Sundar Christopher Richardson William Gay Larry Heck LMTD 47 1 0 19 Apr 2024
Dialogue Games for Benchmarking Language Understanding: Motivation, Taxonomy, Strategy David Schlangen ELM 24 13 0 14 Apr 2023
Bringing the State-of-the-Art to Customers: A Neural Agent Assistant Framework for Customer Service Support Stephen Obadinma Faiza Khan Khattak Shi Wang Tania Sidhom Elaine Lau ... Jaswinder Narain D. Pandya Xiao-Dan Zhu Frank Rudzicz Elham Dolatabadi AI4TS 24 3 0 07 Feb 2023
Interactive Question Answering Systems: Literature Review Giovanni Maria Biancofiore Yashar Deldjoo T. D. Noia E. Sciascio F. Narducci 34 13 0 04 Sep 2022
"Do you follow me?": A Survey of Recent Approaches in Dialogue State Tracking Léo Jacqmin L. Rojas-Barahona Benoit Favre 34 27 0 29 Jul 2022
Ego4D: Around the World in 3,000 Hours of Egocentric Video Kristen Grauman Andrew Westbury Eugene Byrne Zachary Chavis Antonino Furnari ... Mike Zheng Shou Antonio Torralba Lorenzo Torresani Mingfei Yan Jitendra Malik EgoV 229 1,019 0 13 Oct 2021
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding Hu Xu Gargi Ghosh Po-Yao (Bernie) Huang Dmytro Okhonko Armen Aghajanyan Florian Metze Luke Zettlemoyer Florian Metze Luke Zettlemoyer Christoph Feichtenhofer CLIP VLM 259 558 0 28 Sep 2021
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text Hassan Akbari Liangzhe Yuan Rui Qian Wei-Hong Chuang Shih-Fu Chang Yin Cui Boqing Gong ViT 248 577 0 22 Apr 2021
mForms : Multimodal Form-Filling with Question Answering Larry Heck S. Heck Anirudh S. Sundar 51 6 0 24 Nov 2020
Big Bird: Transformers for Longer Sequences Manzil Zaheer Guru Guruganesh Kumar Avinava Dubey Joshua Ainslie Chris Alberti ... Philip Pham Anirudh Ravula Qifan Wang Li Yang Amr Ahmed VLM 274 2,015 0 28 Jul 2020
Multi-modal Transformer for Video Retrieval Valentin Gabeur Chen Sun Alahari Karteek Cordelia Schmid ViT 424 596 0 21 Jul 2020
Aggregated Residual Transformations for Deep Neural Networks Saining Xie Ross B. Girshick Piotr Dollár Z. Tu Kaiming He 297 10,220 0 16 Nov 2016
Efficient Estimation of Word Representations in Vector Space Tomáš Mikolov Kai Chen G. Corrado J. Dean 3DV 239 31,257 0 16 Jan 2013