ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.10427
  4. Cited By
How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for
  Token-level Evaluation Metrics

How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics

24 August 2020
Prasanna Parthasarathi
Joelle Pineau
Sarath Chandar
ArXiv (abs)PDFHTML

Papers citing "How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics"

20 / 20 papers shown
Title
Can Visual Dialogue Models Do Scorekeeping? Exploring How Dialogue Representations Incrementally Encode Shared Knowledge
Can Visual Dialogue Models Do Scorekeeping? Exploring How Dialogue Representations Incrementally Encode Shared Knowledge
Brielen Madureira
David Schlangen
78
4
0
14 Apr 2022
Amnesic Probing: Behavioral Explanation with Amnesic Counterfactuals
Amnesic Probing: Behavioral Explanation with Amnesic Counterfactuals
Yanai Elazar
Shauli Ravfogel
Alon Jacovi
Yoav Goldberg
62
25
0
01 Jun 2020
Unsupervised State Representation Learning in Atari
Unsupervised State Representation Learning in Atari
Ankesh Anand
Evan Racah
Sherjil Ozair
Yoshua Bengio
Marc-Alexandre Côté
R. Devon Hjelm
SSL
64
255
0
19 Jun 2019
Do Neural Dialog Systems Use the Conversation History Effectively? An
  Empirical Study
Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study
Chinnadhurai Sankar
Sandeep Subramanian
C. Pal
A. Chandar
Yoshua Bengio
40
121
0
04 Jun 2019
Analysis Methods in Neural Language Processing: A Survey
Analysis Methods in Neural Language Processing: A Survey
Yonatan Belinkov
James R. Glass
84
558
0
21 Dec 2018
MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for
  Task-Oriented Dialogue Modelling
MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling
Paweł Budzianowski
Tsung-Hsien Wen
Bo-Hsiang Tseng
I. Casanueva
Stefan Ultes
Osman Ramadan
Milica Gasic
184
1,323
0
29 Sep 2018
CoQA: A Conversational Question Answering Challenge
CoQA: A Conversational Question Answering Challenge
Siva Reddy
Danqi Chen
Christopher D. Manning
RALMHAI
114
1,209
0
21 Aug 2018
What you can cram into a single vector: Probing sentence embeddings for
  linguistic properties
What you can cram into a single vector: Probing sentence embeddings for linguistic properties
Alexis Conneau
Germán Kruszewski
Guillaume Lample
Loïc Barrault
Marco Baroni
349
895
0
03 May 2018
Personalizing Dialogue Agents: I have a dog, do you have pets too?
Personalizing Dialogue Agents: I have a dog, do you have pets too?
Saizheng Zhang
Emily Dinan
Jack Urbanek
Arthur Szlam
Douwe Kiela
Jason Weston
118
1,464
0
22 Jan 2018
Towards an Automatic Turing Test: Learning to Evaluate Dialogue
  Responses
Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses
Ryan J. Lowe
Michael Noseworthy
Iulian Serban
Nicolas Angelard-Gontier
Yoshua Bengio
Joelle Pineau
57
372
0
23 Aug 2017
ParlAI: A Dialog Research Software Platform
ParlAI: A Dialog Research Software Platform
Alexander H. Miller
Will Feng
Adam Fisch
Jiasen Lu
Dhruv Batra
Antoine Bordes
Devi Parikh
Jason Weston
86
376
0
18 May 2017
Adversarial Learning for Neural Dialogue Generation
Adversarial Learning for Neural Dialogue Generation
Jiwei Li
Will Monroe
Tianlin Shi
Sébastien Jean
Alan Ritter
Dan Jurafsky
63
899
0
23 Jan 2017
SQuAD: 100,000+ Questions for Machine Comprehension of Text
SQuAD: 100,000+ Questions for Machine Comprehension of Text
Pranav Rajpurkar
Jian Zhang
Konstantin Lopyrev
Percy Liang
RALM
303
8,160
0
16 Jun 2016
Deep Reinforcement Learning for Dialogue Generation
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
285
1,338
0
05 Jun 2016
Learning End-to-End Goal-Oriented Dialog
Learning End-to-End Goal-Oriented Dialog
Antoine Bordes
Y-Lan Boureau
Jason Weston
82
782
0
24 May 2016
The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured
  Multi-Turn Dialogue Systems
The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems
Ryan J. Lowe
Nissan Pow
Iulian Serban
Joelle Pineau
80
950
0
30 Jun 2015
A Neural Conversational Model
A Neural Conversational Model
Oriol Vinyals
Quoc V. Le
BDL
139
1,768
0
19 Jun 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
348
10,079
0
10 Feb 2015
Show and Tell: A Neural Image Caption Generator
Show and Tell: A Neural Image Caption Generator
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
249
6,035
0
17 Nov 2014
Long-term Recurrent Convolutional Networks for Visual Recognition and
  Description
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
VLM
165
6,056
0
17 Nov 2014
1