v1v2v3v4 (latest)

Of Human Criteria and Automatic Metrics: A Benchmark of the Evaluation of Story Generation

24 August 2022

Papers citing "Of Human Criteria and Automatic Metrics: A Benchmark of the Evaluation of Story Generation"

50 / 67 papers shown

Title
SCORE: Story Coherence and Retrieval Enhancement for AI Narratives Qiang Yi Yangfan He Jing Wang Xinyuan Song Shiyao Qian ... Kuan Lu Menghao Huo Jiaqi Chen Tianyu Shi Tianyu Shi RALM 108 16 0 30 Mar 2025
Analyzing and Evaluating Correlation Measures in NLG Meta-Evaluation Mingqi Gao Xinyu Hu Li Lin Xiaojun Wan 71 2 0 28 Jan 2025
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators Yinhong Liu Han Zhou Zhijiang Guo Ehsan Shareghi Ivan Vulić Anna Korhonen Nigel Collier ALM 180 83 0 20 Jan 2025
Evaluating Creative Short Story Generation in Humans and Large Language Models Mete Ismayilzada Claire Stevenson Lonneke van der Plas LM&MA LRM 107 5 0 04 Nov 2024
Agents' Room: Narrative Generation through Multi-step Collaboration Fantine Huot Reinald Kim Amplayo Jennimaria Palomaki Alice Shoshana Jakobovits Elizabeth Clark Mirella Lapata 92 15 0 03 Oct 2024
Navigating the Path of Writing: Outline-guided Text Generation with Large Language Models Yukyung Lee Soonwon Ka Bokyung Son Pilsung Kang Jaewook Kang LLMAG 134 6 0 22 Apr 2024
LongStory: Coherent, Complete and Length Controlled Long story Generation Kyeongman Park Nakyeong Yang Kyomin Jung 102 5 0 26 Nov 2023
Learning Disentangled Textual Representations via Statistical Measures of Similarity Pierre Colombo Guillaume Staerman Nathan Noiry Pablo Piantanida FaML DRL 179 22 0 07 May 2022
A Differential Entropy Estimator for Training Neural Networks Georg Pichler Pierre Colombo Malik Boudiaf Günther Koliander Pablo Piantanida 149 23 0 14 Feb 2022
What are the best systems? New perspectives on NLP Benchmarking Pierre Colombo Nathan Noiry Ekhine Irurozki Stephan Clémençon 163 42 0 08 Feb 2022
InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation Pierre Colombo Chloe Clave Pablo Piantanida 90 44 0 02 Dec 2021
Beam Search with Bidirectional Strategies for Neural Response Generation Pierre Colombo Chouchang Yang Giovanna Varni Chloé Clavel 182 13 0 07 Oct 2021
A Plug-and-Play Method for Controlled Text Generation Damian Pascual Béni Egressy Clara Meister Ryan Cotterell Roger Wattenhofer 122 93 0 20 Sep 2021
The Perils of Using Mechanical Turk to Evaluate Open-Ended Text Generation Marzena Karpinska Nader Akoury Mohit Iyyer 273 108 0 14 Sep 2021
A Temporal Variational Model for Story Generation David Wilmot Frank Keller DRL 86 9 0 14 Sep 2021
Code-switched inspired losses for generic spoken dialog representations E. Chapuis Pierre Colombo Matthieu Labeau Chloe Clave 153 12 0 27 Aug 2021
Automatic Text Evaluation through the Lens of Wasserstein Barycenters Pierre Colombo Guillaume Staerman Chloé Clavel Pablo Piantanida 173 41 0 27 Aug 2021
BARTScore: Evaluating Generated Text as Text Generation Weizhe Yuan Graham Neubig Pengfei Liu 123 849 0 22 Jun 2021
Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence Jian Guan Xiaoxi Mao Changjie Fan Zitao Liu Wenbiao Ding Minlie Huang AuLLM 95 81 0 19 May 2021
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics Jian Guan Zhexin Zhang Zhuoer Feng Zitao Liu Wenbiao Ding Xiaoxi Mao Changjie Fan Minlie Huang 72 61 0 19 May 2021
A Novel Estimator of Mutual Information for Learning to Disentangle Textual Representations Pierre Colombo Chloé Clavel Pablo Piantanida AAML DRL 155 51 0 06 May 2021
A Pseudo-Metric between Probability Distributions based on Depth-Trimmed Regions Guillaume Staerman Pavlo Mozharovskyi Pierre Colombo Stéphan Clémenccon Florence dÁlché-Buc OOD 539 17 0 23 Mar 2021
Automatic Story Generation: Challenges and Attempts Amal Alabdulkarim Siyan Li Xiangyu Peng 68 51 0 25 Feb 2021
Transformer-based Conditional Variational Autoencoder for Controllable Story Generation Le Fang Tao Zeng Chao-Ning Liu Liefeng Bo Wen Dong Changyou Chen DRL 90 61 0 04 Jan 2021
Re-evaluating Evaluation in Text Summarization Manik Bhandari Pranav Narayan Gour A. Ashfaq Pengfei Liu Graham Neubig 150 178 0 14 Oct 2020
Modeling Protagonist Emotions for Emotion-Aware Storytelling Faeze Brahman Snigdha Chaturvedi 64 50 0 14 Oct 2020
Narrative Text Generation with a Latent Discrete Plan Harsh Jhamtani Taylor Berg-Kirkpatrick 35 17 0 07 Oct 2020
Hierarchical Pre-training for Sequence Labelling in Spoken Dialog E. Chapuis Pierre Colombo Matteo Manica Matthieu Labeau Chloé Clavel 147 59 0 23 Sep 2020
Narrative Interpolation for Generating and Understanding Stories Su Wang Greg Durrett K. Erk 156 34 0 17 Aug 2020
SummEval: Re-evaluating Summarization Evaluation Alexander R. Fabbri Wojciech Kry'sciñski Bryan McCann Caiming Xiong R. Socher Dragomir R. Radev HILM 100 720 0 24 Jul 2020
Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evaluation Metrics Nitika Mathur Tim Baldwin Trevor Cohn 56 247 0 11 Jun 2020
Language Models are Few-Shot Learners Tom B. Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared Kaplan ... Christopher Berner Sam McCandlish Alec Radford Ilya Sutskever Dario Amodei BDL 880 42,379 0 28 May 2020
SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization Yang Gao Wei Zhao Steffen Eger ELM 92 126 0 07 May 2020
PlotMachines: Outline-Conditioned Generation with Dynamic Plot State Tracking Hannah Rashkin Asli Celikyilmaz Yejin Choi Jianfeng Gao 54 154 0 30 Apr 2020
Semantics of the Unwritten: The Effect of End of Paragraph and Sequence Tokens on Text Generation with GPT2 Richard He Bai Peng Shi Jimmy J. Lin Luchen Tan Kun Xiong Wen Gao Jie Liu Ming Li 45 5 0 05 Apr 2020
Heavy-tailed Representations, Text Polarity Classification & Data Augmentation Hamid Jalalzai Pierre Colombo Chloé Clavel Éric Gaussier Giovanna Varni Emmanuel Vignon Anne Sabourin 47 28 0 25 Mar 2020
Fill in the BLANC: Human-free quality estimation of document summaries Oleg V. Vasilyev Vedant Dharnidharka John Bohannon 3DH 83 119 0 23 Feb 2020
Guiding attention in Sequence-to-sequence models for Dialogue Act prediction Pierre Colombo E. Chapuis Matteo Manica Emmanuel Vignon Giovanna Varni Chloé Clavel 3DV 131 63 0 20 Feb 2020
Bringing Stories Alive: Generating Interactive Fiction Worlds Prithviraj Ammanabrolu W. Cheung Dan Tu William Broniec Mark O. Riedl 83 51 0 28 Jan 2020
A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation Jian Guan Fei Huang Zhihao Zhao Xiaoyan Zhu Minlie Huang LRM SyDa 70 247 0 15 Jan 2020
CTRL: A Conditional Transformer Language Model for Controllable Generation N. Keskar Bryan McCann Lav Varshney Caiming Xiong R. Socher AI4CE 130 1,254 0 11 Sep 2019
MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance Wei Zhao Maxime Peyrard Fei Liu Yang Gao Christian M. Meyer Steffen Eger 187 602 0 05 Sep 2019
Answers Unite! Unsupervised Metrics for Reinforced Summarization Models Thomas Scialom Sylvain Lamprier Benjamin Piwowarski Jacopo Staiano 74 150 0 04 Sep 2019
From the Token to the Review: A Hierarchical Multimodal approach to Opinion Mining Alexandre Garcia Pierre Colombo S. Essid Florence dÁlché-Buc Chloé Clavel 125 21 0 29 Aug 2019
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks S. Rothe Shashi Narayan Aliaksei Severyn SILM 127 436 0 29 Jul 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach Yinhan Liu Myle Ott Naman Goyal Jingfei Du Mandar Joshi Danqi Chen Omer Levy M. Lewis Luke Zettlemoyer Veselin Stoyanov AIMat 680 24,541 0 26 Jul 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding Zhilin Yang Zihang Dai Yiming Yang J. Carbonell Ruslan Salakhutdinov Quoc V. Le AI4CE 236 8,447 0 19 Jun 2019
Handling Divergent Reference Texts when Evaluating Table-to-Text Generation Bhuwan Dhingra Manaal Faruqui Ankur P. Parikh Ming-Wei Chang Dipanjan Das William W. Cohen 96 196 0 03 Jun 2019
BERTScore: Evaluating Text Generation with BERT Tianyi Zhang Varsha Kishore Felix Wu Kilian Q. Weinberger Yoav Artzi 352 5,868 0 21 Apr 2019
Crowdsourcing Lightweight Pyramids for Manual Summary Evaluation Ori Shapira David Gabay Yang Gao H. Ronen Ramakanth Pasunuru Joey Tianyi Zhou Yael Amsterdamer Ido Dagan 66 60 0 11 Apr 2019