Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.13648
Cited By
ChatGPT or Grammarly? Evaluating ChatGPT on Grammatical Error Correction Benchmark
15 March 2023
Hao Wu
Wenxuan Wang
Yuxuan Wan
Wenxiang Jiao
Michael Lyu
ELM
AI4MH
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ChatGPT or Grammarly? Evaluating ChatGPT on Grammatical Error Correction Benchmark"
25 / 25 papers shown
Title
Whitened CLIP as a Likelihood Surrogate of Images and Captions
Roy Betser
Meir Yossef Levi
Guy Gilboa
31
0
0
11 May 2025
Integration of LLM Quality Assurance into an NLG System
Ching-Yi Chen
Johanna Heininger
Adela Schneider
Christian Eckard
Andreas Madsack
Robert Weißgraeber
39
0
0
28 Jan 2025
ASR Error Correction using Large Language Models
Rao Ma
Mengjie Qian
Mark J. F. Gales
Kate Knill
KELM
46
1
0
14 Sep 2024
ChatLang-8: An LLM-Based Synthetic Data Generation Framework for Grammatical Error Correction
Jeiyoon Park
Chanjun Park
Heuiseok Lim
38
2
0
05 Jun 2024
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
Jen-tse Huang
E. Li
Man Ho Lam
Tian Liang
Wenxuan Wang
Youliang Yuan
Wenxiang Jiao
Xing Wang
Zhaopeng Tu
Michael R. Lyu
ELM
LLMAG
88
33
0
18 Mar 2024
Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs
Simone Balloccu
Patrícia Schmidtová
Mateusz Lango
Ondrej Dusek
SILM
ELM
PILM
21
156
0
06 Feb 2024
Inconsistent dialogue responses and how to recover from them
Mian Zhang
Lifeng Jin
Linfeng Song
Haitao Mi
Dong Yu
37
1
0
18 Jan 2024
Stability Analysis of ChatGPT-based Sentiment Analysis in AI Quality Assurance
Tinghui Ouyang
AprilPyone Maungmaung
Koichi Konishi
Yoshiki Seo
Isao Echizen
AI4MH
23
5
0
15 Jan 2024
Prompting open-source and commercial language models for grammatical error correction of English learner text
Christopher Davis
Andrew Caines
Oistein Andersen
Shiva Taslimipoor
H. Yannakoudakis
Zheng Yuan
Christopher Bryant
Marek Rei
P. Buttery
35
13
0
15 Jan 2024
The Earth is Flat? Unveiling Factual Errors in Large Language Models
Wenxuan Wang
Juluan Shi
Zhaopeng Tu
Youliang Yuan
Jen-tse Huang
Wenxiang Jiao
Michael R. Lyu
KELM
HILM
SyDa
47
1
0
01 Jan 2024
Evaluation Metrics in the Era of GPT-4: Reliably Evaluating Large Language Models on Sequence to Sequence Tasks
Andrea Sottana
Bin Liang
Kai Zou
Zheng Yuan
ALM
ELM
LM&MA
38
54
0
20 Oct 2023
Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models
Wenxuan Wang
Wenxiang Jiao
Jingyuan Huang
Ruyi Dai
Jen-tse Huang
Zhaopeng Tu
Michael R. Lyu
54
27
0
19 Oct 2023
RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation
Yue Zhang
Leyang Cui
Enbo Zhao
Wei Bi
Shuming Shi
38
6
0
11 Oct 2023
Evaluation of ChatGPT Feedback on ELL Writers' Coherence and Cohesion
Su-Youn Yoon
Eva Miszoglad
Lisa R. Pierce
16
11
0
10 Oct 2023
Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models
Qingyue Wang
Y. Fu
Yanan Cao
Zhiliang Tian
Shi Wang
Dacheng Tao
LLMAG
KELM
RALM
64
24
0
29 Aug 2023
On the (In)Effectiveness of Large Language Models for Chinese Text Correction
Hai-Tao Zheng
Haojing Huang
Shirong Ma
Yong-jia Jiang
Yongqian Li
F. Zhou
Haitao Zheng
Qingyu Zhou
34
43
0
18 Jul 2023
Evaluating the Capability of Large-scale Language Models on Chinese Grammatical Error Correction Task
Fanyi Qu
Yunfang Wu
Yunfang Wu
ELM
LRM
22
6
0
08 Jul 2023
Quilt-1M: One Million Image-Text Pairs for Histopathology
Wisdom O. Ikezogwo
M. S. Seyfioglu
Fatemeh Ghezloo
Dylan Stefan Chan Geva
Fatwir Sheikh Mohammed
Pavan Kumar Anand
Ranjay Krishna
Linda G. Shapiro
CLIP
VLM
139
114
0
20 Jun 2023
A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation
Xiaowei Huang
Wenjie Ruan
Wei Huang
Gao Jin
Yizhen Dong
...
Sihao Wu
Peipei Xu
Dengyu Wu
André Freitas
Mustafa A. Mustafa
ALM
42
82
0
19 May 2023
CoEdIT: Text Editing by Task-Specific Instruction Tuning
Vipul Raheja
Dhruv Kumar
Ryan Koo
Dongyeop Kang
ALM
23
56
0
17 May 2023
Error Analysis Prompting Enables Human-Like Translation Evaluation in Large Language Models
Qingyu Lu
Baopu Qiu
Liang Ding
Liping Xie
Tom Kocmi
Dacheng Tao
LRM
ALM
ELM
26
107
0
24 Mar 2023
Toward Human-Like Evaluation for Natural Language Generation with Error Analysis
Qingyu Lu
Liang Ding
Liping Xie
Kanjian Zhang
Derek F. Wong
Dacheng Tao
ELM
ALM
34
14
0
20 Dec 2022
Grammatical Error Correction: A Survey of the State of the Art
Christopher Bryant
Zheng Yuan
Muhammad Reza Qorib
Hannan Cao
Hwee Tou Ng
Ted Briscoe
3DV
26
79
0
09 Nov 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
319
11,953
0
04 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
389
8,495
0
28 Jan 2022
1