arXiv: 1901.00398
Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation

2 January 2019
Cristina Garbacea
Samuel Carton
Shiyan Yan
Qiaozhu Mei
    ELM

Papers citing "Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation"

18 / 18 papers shown
Title
Language Models Hallucinate, but May Excel at Fact Verification
Jian Guan
Jesse Dodge
David Wadden
Minlie Huang
Hao Peng
LRM
HILM
45
29
0
23 Oct 2023
Diffusing Gaussian Mixtures for Generating Categorical Data
Florence Regol
Mark Coates
DiffM
44
5
0
08 Mar 2023
Evaluation of Categorical Generative Models -- Bridging the Gap Between Real and Synthetic Data
Florence Regol
Anja Kroon
Mark Coates
ELM
EGVM
43
1
0
28 Oct 2022
A Comprehensive Survey of Natural Language Generation Advances from the Perspective of Digital Deception
Keenan I. Jones
Enes ALTUNCU
V. N. Franqueira
Yi-Chia Wang
Shujun Li
DeLMO
58
3
0
11 Aug 2022
The Authenticity Gap in Human Evaluation
Kawin Ethayarajh
Dan Jurafsky
87
24
0
24 May 2022
CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation
Pei Ke
Hao Zhou
Yankai Lin
Peng Li
Jie Zhou
Xiaoyan Zhu
Minlie Huang
34
40
0
02 Apr 2022
Attacking Open-domain Question Answering by Injecting Misinformation
Liangming Pan
Wenhu Chen
Min-Yen Kan
Wenjie Wang
HILM
AAML
217
23
0
15 Oct 2021
The Perils of Using Mechanical Turk to Evaluate Open-Ended Text Generation
Marzena Karpinska
Nader Akoury
Mohit Iyyer
236
107
0
14 Sep 2021
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text
Elizabeth Clark
Tal August
Sofia Serrano
Nikita Haduong
Suchin Gururangan
Noah A. Smith
DeLMO
59
401
0
30 Jun 2021
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics
Jian Guan
Zhexin Zhang
Zhuoer Feng
Zitao Liu
Wenbiao Ding
Xiaoxi Mao
Changjie Fan
Minlie Huang
25
61
0
19 May 2021
Adversarial Machine Learning in Text Analysis and Generation
I. Alsmadi
AAML
47
5
0
14 Jan 2021
UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation
Jian Guan
Minlie Huang
34
70
0
16 Sep 2020
Evaluation of Text Generation: A Survey
Asli Celikyilmaz
Elizabeth Clark
Jianfeng Gao
ELM
LM&MA
46
378
0
26 Jun 2020
A Survey on Generative Adversarial Networks: Variants, Applications, and Training
Abdul Jabbar
Xi Li
Bourahla Omar
38
266
0
09 Jun 2020
A Review on Generative Adversarial Networks: Algorithms, Theory, and Applications
Jie Gui
Zhenan Sun
Yonggang Wen
Dacheng Tao
Jieping Ye
EGVM
38
827
0
20 Jan 2020
Language GANs Falling Short
Massimo Caccia
Lucas Caccia
W. Fedus
Hugo Larochelle
Joelle Pineau
Laurent Charlin
136
216
0
06 Nov 2018
Adversarial Evaluation of Dialogue Models
Anjuli Kannan
Oriol Vinyals
AAML
ALM
146
76
0
27 Jan 2017
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
319
13,383
0
25 Aug 2014