Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.02397
Cited By
Learning to Refine with Fine-Grained Natural Language Feedback
2 July 2024
Manya Wadhwa
Xinyu Zhao
Junyi Jessy Li
Greg Durrett
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning to Refine with Fine-Grained Natural Language Feedback"
18 / 18 papers shown
Title
Critique-Guided Distillation: Improving Supervised Fine-tuning via Better Distillation
Berkcan Kapusuzoglu
Supriyo Chakraborty
Chia-Hsuan Lee
Sambit Sahu
103
0
0
16 May 2025
Can AI writing be salvaged? Mitigating Idiosyncrasies and Improving Human-AI Alignment in the Writing Process through Edits
Tuhin Chakrabarty
Philippe Laban
Chien-Sheng Wu
75
13
0
22 Sep 2024
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
Seungone Kim
Juyoung Suk
Shayne Longpre
Bill Yuchen Lin
Jamin Shin
Sean Welleck
Graham Neubig
Moontae Lee
Kyungjae Lee
Minjoon Seo
MoMe
ALM
ELM
101
198
0
02 May 2024
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
Wei-Lin Chiang
Lianmin Zheng
Ying Sheng
Anastasios Nikolas Angelopoulos
Tianle Li
...
Hao Zhang
Banghua Zhu
Michael I. Jordan
Joseph E. Gonzalez
Ion Stoica
OSLM
150
574
0
07 Mar 2024
InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification
Jan Trienes
Sebastian Antony Joseph
Jorg Schlotterer
Christin Seifert
Kyle Lo
Wei Xu
Byron C. Wallace
Junyi Jessy Li
101
7
0
29 Jan 2024
AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback
Yann Dubois
Xuechen Li
Rohan Taori
Tianyi Zhang
Ishaan Gulrajani
Jimmy Ba
Carlos Guestrin
Percy Liang
Tatsunori B. Hashimoto
ALM
130
600
0
22 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM
LRM
68
386
0
19 May 2023
Learning Performance-Improving Code Edits
Alex Shypula
Aman Madaan
Yiming Yang
Uri Alon
Jacob R. Gardner
Milad Hashemi
Graham Neubig
Parthasarathy Ranganathan
Osbert Bastani
Amir Yazdanbakhsh
SyDa
74
87
0
15 Feb 2023
On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning
Omar Shaikh
Hongxin Zhang
William B. Held
Michael S. Bernstein
Diyi Yang
ReLM
LRM
118
199
0
15 Dec 2022
Improving Factual Consistency in Summarization with Compression-Based Post-Editing
Alexander R. Fabbri
Prafulla Kumar Choubey
Jesse Vig
Chien-Sheng Wu
Caiming Xiong
HILM
KELM
111
17
0
11 Nov 2022
CodeT: Code Generation with Generated Tests
Bei Chen
Fengji Zhang
A. Nguyen
Daoguang Zan
Zeqi Lin
Jian-Guang Lou
Weizhu Chen
91
339
0
21 Jul 2022
Self-critiquing models for assisting human evaluators
William Saunders
Catherine Yeh
Jeff Wu
Steven Bills
Ouyang Long
Jonathan Ward
Jan Leike
ALM
ELM
103
302
0
12 Jun 2022
Training Verifiers to Solve Math Word Problems
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
...
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
ReLM
OffRL
LRM
285
4,408
0
27 Oct 2021
MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization
Chenguang Zhu
Yang Liu
Jie Mei
Michael Zeng
59
137
0
11 Mar 2021
Evidence-based Factual Error Correction
James Thorne
Andreas Vlachos
KELM
OffRL
83
58
0
31 Dec 2020
What Can We Learn from Collective Human Opinions on Natural Language Inference Data?
Yixin Nie
Xiang Zhou
Joey Tianyi Zhou
81
138
0
07 Oct 2020
Copy that! Editing Sequences by Copying Spans
Sheena Panthaplackel
Miltiadis Allamanis
Marc Brockschmidt
BDL
50
28
0
08 Jun 2020
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
659
24,464
0
26 Jul 2019
1