2310.18679
N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics
28 October 2023
Sajad Mousavi
Ricardo Luna Gutierrez
Desik Rengarajan
Vineet Gundecha
Ashwin Ramesh Babu
Avisek Naug
Antonio Guillen-Perez
Soumyendu Sarkar
LRM
HILM
KELM
Papers citing
"N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics"
14 / 14 papers shown
Title
Function Approximation for Reinforcement Learning Controller for Energy from Spread Waves
Soumyendu Sarkar
Vineet Gundecha
Sahand Ghorbanpour
Alexander Shmakov
Ashwin Ramesh Babu
Avisek Naug
Alexandre Frederic Julien Pichard
Mathieu Cocho
64
8
0
17 Apr 2024
RTDK-BO: High Dimensional Bayesian Optimization with Reinforced Transformer Deep kernels
Alexander Shmakov
Avisek Naug
Vineet Gundecha
Sahand Ghorbanpour
Ricardo Luna Gutierrez
Ashwin Ramesh Babu
Antonio Guillen-Perez
Soumyendu Sarkar
69
11
0
05 Oct 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM
LRM
68
386
0
19 May 2023
Toxicity in ChatGPT: Analyzing Persona-assigned Language Models
Ameet Deshpande
Vishvak Murahari
Tanmay Rajpurohit
Ashwin Kalyan
Karthik Narasimhan
LM&MA
LLMAG
67
365
0
11 Apr 2023
SESCORE2: Learning Text Generation Evaluation via Synthesizing Realistic Mistakes
Wenda Xu
Xian Qian
Mingxuan Wang
Lei Li
William Yang Wang
35
10
0
19 Dec 2022
Skip Training for Multi-Agent Reinforcement Learning Controller for Industrial Wave Energy Converters
Soumyendu Sarkar
Vineet Gundecha
Sahand Ghorbanpour
Alexander Shmakov
Ashwin Ramesh Babu
Alexandre Frederic Julien Pichard
Mathieu Cocho
33
16
0
13 Sep 2022
Quark: Controllable Text Generation with Reinforced Unlearning
Ximing Lu
Sean Welleck
Jack Hessel
Liwei Jiang
Lianhui Qin
Peter West
Prithviraj Ammanabrolu
Yejin Choi
MU
144
216
0
26 May 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
820
9,576
0
28 Jan 2022
BERTTune: Fine-Tuning Neural Machine Translation with BERTScore
Inigo Jauregi Unanue
Jacob Parnell
Massimo Piccardi
56
33
0
04 Jun 2021
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models
Samuel Gehman
Suchin Gururangan
Maarten Sap
Yejin Choi
Noah A. Smith
158
1,209
0
24 Sep 2020
Don't Stop Pretraining: Adapt Language Models to Domains and Tasks
Suchin Gururangan
Ana Marasović
Swabha Swayamdipta
Kyle Lo
Iz Beltagy
Doug Downey
Noah A. Smith
VLM
AI4CE
CLL
155
2,428
0
23 Apr 2020
Plug and Play Language Models: A Simple Approach to Controlled Text Generation
Sumanth Dathathri
Andrea Madotto
Janice Lan
Jane Hung
Eric Frank
Piero Molino
J. Yosinski
Rosanne Liu
KELM
147
976
0
04 Dec 2019
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
Zhilin Yang
Peng Qi
Saizheng Zhang
Yoshua Bengio
William W. Cohen
Ruslan Salakhutdinov
Christopher D. Manning
RALM
182
2,689
0
25 Sep 2018
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
210
2,676
0
09 May 2017