ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.11646
  4. Cited By
Of Human Criteria and Automatic Metrics: A Benchmark of the Evaluation
  of Story Generation
v1v2v3v4 (latest)

Of Human Criteria and Automatic Metrics: A Benchmark of the Evaluation of Story Generation

24 August 2022
Cyril Chhun
Pierre Colombo
Chloé Clavel
Fabian M. Suchanek
ArXiv (abs)PDFHTML

Papers citing "Of Human Criteria and Automatic Metrics: A Benchmark of the Evaluation of Story Generation"

50 / 67 papers shown
Title
SCORE: Story Coherence and Retrieval Enhancement for AI Narratives
SCORE: Story Coherence and Retrieval Enhancement for AI Narratives
Qiang Yi
Yangfan He
Jing Wang
Xinyuan Song
Shiyao Qian
...
Kuan Lu
Menghao Huo
Jiaqi Chen
Tianyu Shi
Tianyu Shi
RALM
108
16
0
30 Mar 2025
Analyzing and Evaluating Correlation Measures in NLG Meta-Evaluation
Analyzing and Evaluating Correlation Measures in NLG Meta-Evaluation
Mingqi Gao
Xinyu Hu
Li Lin
Xiaojun Wan
71
2
0
28 Jan 2025
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators
Yinhong Liu
Han Zhou
Zhijiang Guo
Ehsan Shareghi
Ivan Vulić
Anna Korhonen
Nigel Collier
ALM
180
83
0
20 Jan 2025
Evaluating Creative Short Story Generation in Humans and Large Language Models
Evaluating Creative Short Story Generation in Humans and Large Language Models
Mete Ismayilzada
Claire Stevenson
Lonneke van der Plas
LM&MALRM
107
5
0
04 Nov 2024
Agents' Room: Narrative Generation through Multi-step Collaboration
Agents' Room: Narrative Generation through Multi-step Collaboration
Fantine Huot
Reinald Kim Amplayo
Jennimaria Palomaki
Alice Shoshana Jakobovits
Elizabeth Clark
Mirella Lapata
92
15
0
03 Oct 2024
Navigating the Path of Writing: Outline-guided Text Generation with Large Language Models
Navigating the Path of Writing: Outline-guided Text Generation with Large Language Models
Yukyung Lee
Soonwon Ka
Bokyung Son
Pilsung Kang
Jaewook Kang
LLMAG
134
6
0
22 Apr 2024
LongStory: Coherent, Complete and Length Controlled Long story Generation
LongStory: Coherent, Complete and Length Controlled Long story Generation
Kyeongman Park
Nakyeong Yang
Kyomin Jung
102
5
0
26 Nov 2023
Learning Disentangled Textual Representations via Statistical Measures
  of Similarity
Learning Disentangled Textual Representations via Statistical Measures of Similarity
Pierre Colombo
Guillaume Staerman
Nathan Noiry
Pablo Piantanida
FaMLDRL
179
22
0
07 May 2022
A Differential Entropy Estimator for Training Neural Networks
A Differential Entropy Estimator for Training Neural Networks
Georg Pichler
Pierre Colombo
Malik Boudiaf
Günther Koliander
Pablo Piantanida
149
23
0
14 Feb 2022
What are the best systems? New perspectives on NLP Benchmarking
What are the best systems? New perspectives on NLP Benchmarking
Pierre Colombo
Nathan Noiry
Ekhine Irurozki
Stephan Clémençon
163
42
0
08 Feb 2022
InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation
InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation
Pierre Colombo
Chloe Clave
Pablo Piantanida
90
44
0
02 Dec 2021
Beam Search with Bidirectional Strategies for Neural Response Generation
Beam Search with Bidirectional Strategies for Neural Response Generation
Pierre Colombo
Chouchang Yang
Giovanna Varni
Chloé Clavel
182
13
0
07 Oct 2021
A Plug-and-Play Method for Controlled Text Generation
A Plug-and-Play Method for Controlled Text Generation
Damian Pascual
Béni Egressy
Clara Meister
Ryan Cotterell
Roger Wattenhofer
122
93
0
20 Sep 2021
The Perils of Using Mechanical Turk to Evaluate Open-Ended Text
  Generation
The Perils of Using Mechanical Turk to Evaluate Open-Ended Text Generation
Marzena Karpinska
Nader Akoury
Mohit Iyyer
273
108
0
14 Sep 2021
A Temporal Variational Model for Story Generation
A Temporal Variational Model for Story Generation
David Wilmot
Frank Keller
DRL
86
9
0
14 Sep 2021
Code-switched inspired losses for generic spoken dialog representations
Code-switched inspired losses for generic spoken dialog representations
E. Chapuis
Pierre Colombo
Matthieu Labeau
Chloe Clave
153
12
0
27 Aug 2021
Automatic Text Evaluation through the Lens of Wasserstein Barycenters
Automatic Text Evaluation through the Lens of Wasserstein Barycenters
Pierre Colombo
Guillaume Staerman
Chloé Clavel
Pablo Piantanida
173
41
0
27 Aug 2021
BARTScore: Evaluating Generated Text as Text Generation
BARTScore: Evaluating Generated Text as Text Generation
Weizhe Yuan
Graham Neubig
Pengfei Liu
123
849
0
22 Jun 2021
Long Text Generation by Modeling Sentence-Level and Discourse-Level
  Coherence
Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence
Jian Guan
Xiaoxi Mao
Changjie Fan
Zitao Liu
Wenbiao Ding
Minlie Huang
AuLLM
95
81
0
19 May 2021
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics
Jian Guan
Zhexin Zhang
Zhuoer Feng
Zitao Liu
Wenbiao Ding
Xiaoxi Mao
Changjie Fan
Minlie Huang
72
61
0
19 May 2021
A Novel Estimator of Mutual Information for Learning to Disentangle
  Textual Representations
A Novel Estimator of Mutual Information for Learning to Disentangle Textual Representations
Pierre Colombo
Chloé Clavel
Pablo Piantanida
AAMLDRL
155
51
0
06 May 2021
A Pseudo-Metric between Probability Distributions based on Depth-Trimmed
  Regions
A Pseudo-Metric between Probability Distributions based on Depth-Trimmed Regions
Guillaume Staerman
Pavlo Mozharovskyi
Pierre Colombo
Stéphan Clémenccon
Florence dÁlché-Buc
OOD
539
17
0
23 Mar 2021
Automatic Story Generation: Challenges and Attempts
Automatic Story Generation: Challenges and Attempts
Amal Alabdulkarim
Siyan Li
Xiangyu Peng
68
51
0
25 Feb 2021
Transformer-based Conditional Variational Autoencoder for Controllable
  Story Generation
Transformer-based Conditional Variational Autoencoder for Controllable Story Generation
Le Fang
Tao Zeng
Chao-Ning Liu
Liefeng Bo
Wen Dong
Changyou Chen
DRL
90
61
0
04 Jan 2021
Re-evaluating Evaluation in Text Summarization
Re-evaluating Evaluation in Text Summarization
Manik Bhandari
Pranav Narayan Gour
A. Ashfaq
Pengfei Liu
Graham Neubig
150
178
0
14 Oct 2020
Modeling Protagonist Emotions for Emotion-Aware Storytelling
Modeling Protagonist Emotions for Emotion-Aware Storytelling
Faeze Brahman
Snigdha Chaturvedi
64
50
0
14 Oct 2020
Narrative Text Generation with a Latent Discrete Plan
Narrative Text Generation with a Latent Discrete Plan
Harsh Jhamtani
Taylor Berg-Kirkpatrick
35
17
0
07 Oct 2020
Hierarchical Pre-training for Sequence Labelling in Spoken Dialog
Hierarchical Pre-training for Sequence Labelling in Spoken Dialog
E. Chapuis
Pierre Colombo
Matteo Manica
Matthieu Labeau
Chloé Clavel
147
59
0
23 Sep 2020
Narrative Interpolation for Generating and Understanding Stories
Narrative Interpolation for Generating and Understanding Stories
Su Wang
Greg Durrett
K. Erk
156
34
0
17 Aug 2020
SummEval: Re-evaluating Summarization Evaluation
SummEval: Re-evaluating Summarization Evaluation
Alexander R. Fabbri
Wojciech Kry'sciñski
Bryan McCann
Caiming Xiong
R. Socher
Dragomir R. Radev
HILM
100
720
0
24 Jul 2020
Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine
  Translation Evaluation Metrics
Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evaluation Metrics
Nitika Mathur
Tim Baldwin
Trevor Cohn
56
247
0
11 Jun 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
880
42,379
0
28 May 2020
SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for
  Multi-Document Summarization
SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization
Yang Gao
Wei Zhao
Steffen Eger
ELM
92
126
0
07 May 2020
PlotMachines: Outline-Conditioned Generation with Dynamic Plot State
  Tracking
PlotMachines: Outline-Conditioned Generation with Dynamic Plot State Tracking
Hannah Rashkin
Asli Celikyilmaz
Yejin Choi
Jianfeng Gao
54
154
0
30 Apr 2020
Semantics of the Unwritten: The Effect of End of Paragraph and Sequence
  Tokens on Text Generation with GPT2
Semantics of the Unwritten: The Effect of End of Paragraph and Sequence Tokens on Text Generation with GPT2
Richard He Bai
Peng Shi
Jimmy J. Lin
Luchen Tan
Kun Xiong
Wen Gao
Jie Liu
Ming Li
45
5
0
05 Apr 2020
Heavy-tailed Representations, Text Polarity Classification & Data
  Augmentation
Heavy-tailed Representations, Text Polarity Classification & Data Augmentation
Hamid Jalalzai
Pierre Colombo
Chloé Clavel
Éric Gaussier
Giovanna Varni
Emmanuel Vignon
Anne Sabourin
47
28
0
25 Mar 2020
Fill in the BLANC: Human-free quality estimation of document summaries
Fill in the BLANC: Human-free quality estimation of document summaries
Oleg V. Vasilyev
Vedant Dharnidharka
John Bohannon
3DH
83
119
0
23 Feb 2020
Guiding attention in Sequence-to-sequence models for Dialogue Act
  prediction
Guiding attention in Sequence-to-sequence models for Dialogue Act prediction
Pierre Colombo
E. Chapuis
Matteo Manica
Emmanuel Vignon
Giovanna Varni
Chloé Clavel
3DV
131
63
0
20 Feb 2020
Bringing Stories Alive: Generating Interactive Fiction Worlds
Bringing Stories Alive: Generating Interactive Fiction Worlds
Prithviraj Ammanabrolu
W. Cheung
Dan Tu
William Broniec
Mark O. Riedl
83
51
0
28 Jan 2020
A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation
A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation
Jian Guan
Fei Huang
Zhihao Zhao
Xiaoyan Zhu
Minlie Huang
LRMSyDa
70
247
0
15 Jan 2020
CTRL: A Conditional Transformer Language Model for Controllable
  Generation
CTRL: A Conditional Transformer Language Model for Controllable Generation
N. Keskar
Bryan McCann
Lav Varshney
Caiming Xiong
R. Socher
AI4CE
130
1,254
0
11 Sep 2019
MoverScore: Text Generation Evaluating with Contextualized Embeddings
  and Earth Mover Distance
MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance
Wei Zhao
Maxime Peyrard
Fei Liu
Yang Gao
Christian M. Meyer
Steffen Eger
187
602
0
05 Sep 2019
Answers Unite! Unsupervised Metrics for Reinforced Summarization Models
Answers Unite! Unsupervised Metrics for Reinforced Summarization Models
Thomas Scialom
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
74
150
0
04 Sep 2019
From the Token to the Review: A Hierarchical Multimodal approach to
  Opinion Mining
From the Token to the Review: A Hierarchical Multimodal approach to Opinion Mining
Alexandre Garcia
Pierre Colombo
S. Essid
Florence dÁlché-Buc
Chloé Clavel
125
21
0
29 Aug 2019
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
S. Rothe
Shashi Narayan
Aliaksei Severyn
SILM
127
436
0
29 Jul 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
680
24,541
0
26 Jul 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
236
8,447
0
19 Jun 2019
Handling Divergent Reference Texts when Evaluating Table-to-Text
  Generation
Handling Divergent Reference Texts when Evaluating Table-to-Text Generation
Bhuwan Dhingra
Manaal Faruqui
Ankur P. Parikh
Ming-Wei Chang
Dipanjan Das
William W. Cohen
96
196
0
03 Jun 2019
BERTScore: Evaluating Text Generation with BERT
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
352
5,868
0
21 Apr 2019
Crowdsourcing Lightweight Pyramids for Manual Summary Evaluation
Crowdsourcing Lightweight Pyramids for Manual Summary Evaluation
Ori Shapira
David Gabay
Yang Gao
H. Ronen
Ramakanth Pasunuru
Joey Tianyi Zhou
Yael Amsterdamer
Ido Dagan
66
60
0
11 Apr 2019
12
Next