How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics

24 August 2020

Papers citing "How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics"


| Title | Authors | Tags | Counts | Date |
|---|---|---|---|---|
| Can Visual Dialogue Models Do Scorekeeping? Exploring How Dialogue Representations Incrementally Encode Shared Knowledge | Brielen Madureira, David Schlangen | | 78 4 0 | 14 Apr 2022 |
| Amnesic Probing: Behavioral Explanation with Amnesic Counterfactuals | Yanai Elazar, Shauli Ravfogel, Alon Jacovi, Yoav Goldberg | | 62 25 0 | 01 Jun 2020 |
| Unsupervised State Representation Learning in Atari | Ankesh Anand, Evan Racah, Sherjil Ozair, Yoshua Bengio, Marc-Alexandre Côté, R. Devon Hjelm | SSL | 64 255 0 | 19 Jun 2019 |
| Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study | Chinnadhurai Sankar, Sandeep Subramanian, C. Pal, A. Chandar, Yoshua Bengio | | 40 121 0 | 04 Jun 2019 |
| Analysis Methods in Neural Language Processing: A Survey | Yonatan Belinkov, James R. Glass | | 84 558 0 | 21 Dec 2018 |
| MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling | Paweł Budzianowski, Tsung-Hsien Wen, Bo-Hsiang Tseng, I. Casanueva, Stefan Ultes, Osman Ramadan, Milica Gasic | | 184 1,323 0 | 29 Sep 2018 |
| CoQA: A Conversational Question Answering Challenge | Siva Reddy, Danqi Chen, Christopher D. Manning | RALM, HAI | 114 1,209 0 | 21 Aug 2018 |
| What you can cram into a single vector: Probing sentence embeddings for linguistic properties | Alexis Conneau, Germán Kruszewski, Guillaume Lample, Loïc Barrault, Marco Baroni | | 349 895 0 | 03 May 2018 |
| Personalizing Dialogue Agents: I have a dog, do you have pets too? | Saizheng Zhang, Emily Dinan, Jack Urbanek, Arthur Szlam, Douwe Kiela, Jason Weston | | 118 1,464 0 | 22 Jan 2018 |
| Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses | Ryan J. Lowe, Michael Noseworthy, Iulian Serban, Nicolas Angelard-Gontier, Yoshua Bengio, Joelle Pineau | | 57 372 0 | 23 Aug 2017 |
| ParlAI: A Dialog Research Software Platform | Alexander H. Miller, Will Feng, Adam Fisch, Jiasen Lu, Dhruv Batra, Antoine Bordes, Devi Parikh, Jason Weston | | 86 376 0 | 18 May 2017 |
| Adversarial Learning for Neural Dialogue Generation | Jiwei Li, Will Monroe, Tianlin Shi, Sébastien Jean, Alan Ritter, Dan Jurafsky | | 63 899 0 | 23 Jan 2017 |
| SQuAD: 100,000+ Questions for Machine Comprehension of Text | Pranav Rajpurkar, Jian Zhang, Konstantin Lopyrev, Percy Liang | RALM | 303 8,160 0 | 16 Jun 2016 |
| Deep Reinforcement Learning for Dialogue Generation | Jiwei Li, Will Monroe, Alan Ritter, Michel Galley, Jianfeng Gao, Dan Jurafsky | | 285 1,338 0 | 05 Jun 2016 |
| Learning End-to-End Goal-Oriented Dialog | Antoine Bordes, Y-Lan Boureau, Jason Weston | | 82 782 0 | 24 May 2016 |
| The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems | Ryan J. Lowe, Nissan Pow, Iulian Serban, Joelle Pineau | | 80 950 0 | 30 Jun 2015 |
| A Neural Conversational Model | Oriol Vinyals, Quoc V. Le | BDL | 139 1,768 0 | 19 Jun 2015 |
| Show, Attend and Tell: Neural Image Caption Generation with Visual Attention | Ke Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, R. Zemel, Yoshua Bengio | DiffM | 348 10,079 0 | 10 Feb 2015 |
| Show and Tell: A Neural Image Caption Generator | Oriol Vinyals, Alexander Toshev, Samy Bengio, D. Erhan | 3DV | 249 6,035 0 | 17 Nov 2014 |
| Long-term Recurrent Convolutional Networks for Visual Recognition and Description | Jeff Donahue, Lisa Anne Hendricks, Marcus Rohrbach, Subhashini Venugopalan, S. Guadarrama, Kate Saenko, Trevor Darrell | VLM | 165 6,056 0 | 17 Nov 2014 |