Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.09332
Cited By
WebGPT: Browser-assisted question-answering with human feedback
17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
ALM
RALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"WebGPT: Browser-assisted question-answering with human feedback"
50 / 914 papers shown
Title
Natural Language Commanding via Program Synthesis
Apurva Gandhi
Thong Q. Nguyen
Huitian Jiao
R. Steen
Ameya Bhatawdekar
31
7
0
06 Jun 2023
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
Kenneth Li
Oam Patel
Fernanda Viégas
Hanspeter Pfister
Martin Wattenberg
KELM
HILM
58
495
0
06 Jun 2023
Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot Performance For Scoring and Providing Actionable Insights on Classroom Instruction
Rose E. Wang
Dorottya Demszky
32
59
0
05 Jun 2023
Orca: Progressive Learning from Complex Explanation Traces of GPT-4
Subhabrata Mukherjee
Arindam Mitra
Ganesh Jawahar
Sahaj Agarwal
Hamid Palangi
Ahmed Hassan Awadallah
ELM
ALM
LRM
51
264
0
05 Jun 2023
Fine-Tuning Language Models with Advantage-Induced Policy Alignment
Banghua Zhu
Hiteshi Sharma
Felipe Vieira Frujeri
Shi Dong
Chenguang Zhu
Michael I. Jordan
Jiantao Jiao
OSLM
36
39
0
04 Jun 2023
Question-Context Alignment and Answer-Context Dependencies for Effective Answer Sentence Selection
Minh Le Nguyen
K. Kishan
Toan Q. Nguyen
Thien Huu Nguyen
Ankit Chadha
Thuy Vu
26
0
0
03 Jun 2023
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Zeqiu Wu
Yushi Hu
Weijia Shi
Nouha Dziri
Alane Suhr
Prithviraj Ammanabrolu
Noah A. Smith
Mari Ostendorf
Hannaneh Hajishirzi
ALM
53
305
0
02 Jun 2023
TorchRL: A data-driven decision-making library for PyTorch
Albert Bou
Matteo Bettini
Sebastian Dittert
Vikash Kumar
Shagun Sodhani
Xiaomeng Yang
Gianni De Fabritiis
Vincent Moens
OffRL
AI4CE
37
37
0
01 Jun 2023
Challenges and Remedies to Privacy and Security in AIGC: Exploring the Potential of Privacy Computing, Blockchain, and Beyond
Chuan Chen
Zhenpeng Wu
Yan-Hao Lai
Wen-chao Ou
Tianchi Liao
Zibin Zheng
38
33
0
01 Jun 2023
From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Peter Shaw
Mandar Joshi
James Cohan
Jonathan Berant
Panupong Pasupat
Hexiang Hu
Urvashi Khandelwal
Kenton Lee
Kristina Toutanova
LLMAG
LM&Ro
23
54
0
31 May 2023
Let's Verify Step by Step
Hunter Lightman
V. Kosaraju
Yura Burda
Harrison Edwards
Bowen Baker
Teddy Lee
Jan Leike
John Schulman
Ilya Sutskever
K. Cobbe
ALM
OffRL
LRM
50
945
0
31 May 2023
Concise Answers to Complex Questions: Summarization of Long-form Answers
Abhilash Potluri
Fangyuan Xu
Eunsol Choi
ELM
39
11
0
30 May 2023
SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models
Hongxin Li
Jingran Su
Yuntao Chen
Qing Li
Zhaoxiang Zhang
LMTD
49
32
0
30 May 2023
Strategic Reasoning with Language Models
Kanishk Gandhi
Dorsa Sadigh
Noah D. Goodman
LM&Ro
LRM
42
37
0
30 May 2023
Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey
Chen Ling
Xujiang Zhao
Jiaying Lu
Chengyuan Deng
Can Zheng
...
Chris White
Quanquan Gu
Jian Pei
Carl Yang
Liang Zhao
ALM
33
126
0
30 May 2023
KEYword based Sampling (KEYS) for Large Language Models
V. JyothirS
Zuhaib Akhtar
12
1
0
30 May 2023
A Critical Evaluation of Evaluations for Long-form Question Answering
Fangyuan Xu
Yixiao Song
Mohit Iyyer
Eunsol Choi
ELM
42
97
0
29 May 2023
Provable Reward-Agnostic Preference-Based Reinforcement Learning
Wenhao Zhan
Masatoshi Uehara
Wen Sun
Jason D. Lee
27
8
0
29 May 2023
Taming AI Bots: Controllability of Neural States in Large Language Models
Stefano Soatto
Paulo Tabuada
Pratik Chaudhari
Tianwei Liu
LLMAG
LM&Ro
18
13
0
29 May 2023
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism
Zihao Li
Zhuoran Yang
Mengdi Wang
OffRL
39
55
0
29 May 2023
Reward Collapse in Aligning Large Language Models
Ziang Song
Tianle Cai
Jason D. Lee
Weijie J. Su
ALM
33
22
0
28 May 2023
MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting
Tatsuro Inaba
Hirokazu Kiyomaru
Fei Cheng
Sadao Kurohashi
KELM
LRM
32
23
0
26 May 2023
On the Tool Manipulation Capability of Open-source Large Language Models
Qiantong Xu
Fenglu Hong
Yangqiu Song
Changran Hu
Zheng Chen
Jian Zhang
LLMAG
35
69
0
25 May 2023
Inverse Preference Learning: Preference-based RL without a Reward Function
Joey Hejna
Dorsa Sadigh
OffRL
32
48
0
24 May 2023
Gorilla: Large Language Model Connected with Massive APIs
Shishir G. Patil
Tianjun Zhang
Xin Wang
Joseph E. Gonzalez
ELM
CLL
ALM
SyDa
42
521
0
24 May 2023
Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
Zhihong Shao
Yeyun Gong
Yelong Shen
Minlie Huang
Nan Duan
Weizhu Chen
RALM
LRM
KELM
36
220
0
24 May 2023
Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems
Marek Kadlcík
Michal Štefánik
Ondřej Sotolář
Vlastimil Martinek
LRM
30
13
0
24 May 2023
Provable Offline Preference-Based Reinforcement Learning
Wenhao Zhan
Masatoshi Uehara
Nathan Kallus
Jason D. Lee
Wen Sun
OffRL
43
12
0
24 May 2023
Allies: Prompting Large Language Model with Beam Search
Hao Sun
Xiao Liu
Yeyun Gong
Yan Zhang
Daxin Jiang
Linjun Yang
Nan Duan
RALM
33
5
0
24 May 2023
Mastering the ABCDs of Complex Questions: Answer-Based Claim Decomposition for Fine-grained Self-Evaluation
Nishant Balepur
Jie Huang
Samraj Moorjani
Hari Sundaram
Kevin Chen-Chuan Chang
ReLM
35
0
0
24 May 2023
Mixture-of-Experts Meets Instruction Tuning:A Winning Combination for Large Language Models
Sheng Shen
Le Hou
Yan-Quan Zhou
Nan Du
Shayne Longpre
...
Vincent Zhao
Hongkun Yu
Kurt Keutzer
Trevor Darrell
Denny Zhou
ALM
MoE
43
54
0
24 May 2023
DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4
Ye Hu
Kaiqiang Song
Sangwoo Cho
Xiaoyang Wang
H. Foroosh
Fei Liu
31
12
0
24 May 2023
Enabling Large Language Models to Generate Text with Citations
Tianyu Gao
Howard Yen
Jiatong Yu
Danqi Chen
LM&MA
HILM
40
319
0
24 May 2023
CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models
Cheng Qian
Chi Han
Yi R. Fung
Yujia Qin
Zhiyuan Liu
Heng Ji
LRM
20
30
0
23 May 2023
Skill-Based Few-Shot Selection for In-Context Learning
Shengnan An
Bo Zhou
Zeqi Lin
Qiang Fu
B. Chen
Nanning Zheng
Weizhu Chen
Jian-Guang Lou
36
31
0
23 May 2023
Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization
Jeonghoon Kim
J. H. Lee
Sungdong Kim
Joonsuk Park
Kang Min Yoo
S. Kwon
Dongsoo Lee
MQ
44
100
0
23 May 2023
Improving Language Models via Plug-and-Play Retrieval Feedback
Wenhao Yu
Zhihan Zhang
Zhenwen Liang
Meng Jiang
Ashish Sabharwal
KELM
LRM
44
71
0
23 May 2023
Aligning Large Language Models through Synthetic Feedback
Sungdong Kim
Sanghwan Bae
Jamin Shin
Soyoung Kang
Donghyun Kwak
Kang Min Yoo
Minjoon Seo
ALM
SyDa
81
67
0
23 May 2023
ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models
Binfeng Xu
Zhiyuan Peng
Bowen Lei
Subhabrata Mukherjee
Yuchen Liu
Dongkuan Xu
KELM
LLMAG
LRM
32
88
0
23 May 2023
Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding
Mutian He
Philip N. Garner
ELM
AI4MH
LRM
48
22
0
22 May 2023
Training Diffusion Models with Reinforcement Learning
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
EGVM
44
320
0
22 May 2023
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts
Jian Xie
Kai Zhang
Jiangjie Chen
Renze Lou
Yu-Chuan Su
RALM
227
159
0
22 May 2023
Album Storytelling with Iterative Story-aware Captioning and Large Language Models
Munan Ning
Yujia Xie
Dongdong Chen
Zeyin Song
Lu Yuan
Yonghong Tian
QiXiang Ye
Liuliang Yuan
33
8
0
22 May 2023
Can We Edit Factual Knowledge by In-Context Learning?
Ce Zheng
Lei Li
Qingxiu Dong
Yuxuan Fan
Zhiyong Wu
Jingjing Xu
Baobao Chang
KELM
39
187
0
22 May 2023
Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models
Oana Ignat
Zhijing Jin
Artem Abzaliev
Laura Biester
Santiago Castro
...
Verónica Pérez-Rosas
Siqi Shen
Zekun Wang
Winston Wu
Rada Mihalcea
LRM
41
6
0
21 May 2023
Continually Improving Extractive QA via Human Feedback
Ge Gao
Hung-Ting Chen
Yoav Artzi
Eunsol Choi
31
12
0
21 May 2023
Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning
Liangming Pan
Alon Albalak
Xinyi Wang
William Yang Wang
ReLM
LRM
AI4CE
54
236
0
20 May 2023
Complex Claim Verification with Evidence Retrieved in the Wild
Jifan Chen
Grace Kim
Aniruddh Sriram
Greg Durrett
Eunsol Choi
HILM
41
72
0
19 May 2023
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
Hiroki Furuta
Kuang-Huei Lee
Ofir Nachum
Yutaka Matsuo
Aleksandra Faust
S. Gu
Izzeddin Gur
LM&Ro
38
93
0
19 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM
LRM
36
360
0
19 May 2023
Previous
1
2
3
...
14
15
16
17
18
19
Next