2112.09332
Cited By
WebGPT: Browser-assisted question-answering with human feedback
17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Long Ouyang
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
ALM
RALM
Papers citing
"WebGPT: Browser-assisted question-answering with human feedback"
50 / 914 papers shown
Title
Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading
Howard Chen
Ramakanth Pasunuru
Jason Weston
Asli Celikyilmaz
RALM
68
73
0
08 Oct 2023
Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages
Shih-Cheng Huang
Pin-Zu Li
Yu-Chi Hsu
Kuang-Ming Chen
Yu Tung Lin
Shih-Kai Hsiao
Richard Tzong-Han Tsai
Hung-yi Lee
MoMe
34
14
0
07 Oct 2023
Reverse Chain: A Generic-Rule for LLMs to Master Multi-API Planning
Yinger Zhang
Hui Cai
Xeirui Song
Yicheng Chen
Rui Sun
Jing Zheng
LRM
21
7
0
06 Oct 2023
A Long Way to Go: Investigating Length Correlations in RLHF
Prasann Singhal
Tanya Goyal
Jiacheng Xu
Greg Durrett
44
143
0
05 Oct 2023
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Omar Khattab
Arnav Singhvi
Paridhi Maheshwari
Zhiyuan Zhang
Keshav Santhanam
...
Thomas T. Joshi
Hanna Moazam
Heather Miller
Matei A. Zaharia
Christopher Potts
RALM
38
236
0
05 Oct 2023
Redefining Digital Health Interfaces with Large Language Models
F. Imrie
Paulius Rauba
M. Schaar
AI4MH
LM&MA
27
3
0
05 Oct 2023
MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation
Qian Huang
Jian Vora
Percy Liang
J. Leskovec
ELM
LLMAG
35
72
0
05 Oct 2023
FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
Tu Vu
Mohit Iyyer
Xuezhi Wang
Noah Constant
Jerry W. Wei
...
Chris Tar
Yun-hsuan Sung
Denny Zhou
Quoc Le
Thang Luong
KELM
HILM
LRM
36
189
0
05 Oct 2023
Retrieval meets Long Context Large Language Models
Peng Xu
Ming-Yu Liu
Xianchao Wu
Lawrence C. McAfee
Chen Zhu
Zihan Liu
Sandeep Subramanian
Evelina Bakhturina
M. Shoeybi
Bryan Catanzaro
RALM
LRM
14
82
0
04 Oct 2023
Reward Model Ensembles Help Mitigate Overoptimization
Thomas Coste
Usman Anwar
Robert Kirk
David M. Krueger
NoLa
ALM
28
122
0
04 Oct 2023
CITING: Large Language Models Create Curriculum for Instruction Tuning
Tao Feng
Zifeng Wang
Jimeng Sun
ALM
33
14
0
04 Oct 2023
EcoAssistant: Using LLM Assistant More Affordably and Accurately
Jieyu Zhang
Ranjay Krishna
Ahmed Hassan Awadallah
Chi Wang
38
35
0
03 Oct 2023
The Empty Signifier Problem: Towards Clearer Paradigms for Operationalising "Alignment" in Large Language Models
Hannah Rose Kirk
Bertie Vidgen
Paul Röttger
Scott A. Hale
50
2
0
03 Oct 2023
Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond
Liang Chen
Yichi Zhang
Shuhuai Ren
Haozhe Zhao
Zefan Cai
Yuchi Wang
Peiyi Wang
Tianyu Liu
Baobao Chang
LM&Ro
LLMAG
33
41
0
03 Oct 2023
Tool-Augmented Reward Modeling
Lei Li
Yekun Chai
Shuohuan Wang
Yu Sun
Hao Tian
Ningyu Zhang
Hua Wu
OffRL
46
13
0
02 Oct 2023
Resolving Knowledge Conflicts in Large Language Models
Yike Wang
Shangbin Feng
Heng Wang
Weijia Shi
Vidhisha Balachandran
Tianxing He
Yulia Tsvetkov
61
12
0
02 Oct 2023
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration
Qiushi Sun
Zhangyue Yin
Xiang Li
Zhiyong Wu
Xipeng Qiu
Lingpeng Kong
LRM
LLMAG
28
44
0
30 Sep 2023
Voice2Action: Language Models as Agent for Efficient Real-Time Interaction in Virtual Reality
Yang Su
LLMAG
13
2
0
29 Sep 2023
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Minlie Huang
Nan Duan
Weizhu Chen
LRM
AI4CE
LLMAG
61
145
0
29 Sep 2023
CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets
Lifan Yuan
Yangyi Chen
Xingyao Wang
Yi R. Fung
Hao Peng
Heng Ji
LLMAG
KELM
38
58
0
29 Sep 2023
Intuitive or Dependent? Investigating LLMs' Behavior Style to Conflicting Prompts
Jiahao Ying
Yixin Cao
Kai Xiong
Yidong He
Long Cui
Yongbin Liu
33
7
0
29 Sep 2023
Qwen Technical Report
Jinze Bai
Shuai Bai
Yunfei Chu
Zeyu Cui
Kai Dang
...
Zhenru Zhang
Chang Zhou
Jingren Zhou
Xiaohuan Zhou
Tianhang Zhu
OSLM
73
1,622
0
28 Sep 2023
The Trickle-down Impact of Reward (In-)consistency on RLHF
Lingfeng Shen
Sihao Chen
Linfeng Song
Lifeng Jin
Baolin Peng
Haitao Mi
Daniel Khashabi
Dong Yu
42
21
0
28 Sep 2023
TPE: Towards Better Compositional Reasoning over Conceptual Tools with Multi-persona Collaboration
Hongru Wang
Huimin Wang
Lingzhi Wang
Minda Hu
Rui Wang
Boyang Xue
Hongyuan Lu
Fei Mi
Kam-Fai Wong
LRM
KELM
LLMAG
38
12
0
28 Sep 2023
How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions
Lorenzo Pacchiardi
A. J. Chan
Sören Mindermann
Ilan Moscovitz
Alexa Y. Pan
Y. Gal
Owain Evans
J. Brauner
LLMAG
HILM
22
49
0
26 Sep 2023
Teach AI How to Code: Using Large Language Models as Teachable Agents for Programming Education
Hyoungwook Jin
Seonghee Lee
Hyun Joon Shin
Juho Kim
LLMAG
26
52
0
25 Sep 2023
Can LLM-Generated Misinformation Be Detected?
Canyu Chen
Kai Shu
DeLMO
41
158
0
25 Sep 2023
An In-depth Survey of Large Language Model-based Artificial Intelligence Agents
Pengyu Zhao
Zijian Jin
Ning Cheng
LLMAG
41
20
0
23 Sep 2023
MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback
Xingyao Wang
Zihan Wang
Jiateng Liu
Yangyi Chen
Lifan Yuan
Hao Peng
Heng Ji
LRM
133
142
0
19 Sep 2023
Data Distribution Bottlenecks in Grounding Language Models to Knowledge Bases
Yiheng Shu
Zhiwei Yu
27
3
0
15 Sep 2023
Agents: An Open-source Framework for Autonomous Language Agents
Wangchunshu Zhou
Yuchen Eleanor Jiang
Long Li
Jialong Wu
Tiannan Wang
...
Xiangru Tang
Ningyu Zhang
Huajun Chen
Peng Cui
Mrinmaya Sachan
LLMAG
LM&Ro
AI4CE
39
90
0
14 Sep 2023
ExpertQA: Expert-Curated Questions and Attributed Answers
Chaitanya Malaviya
Subin Lee
Sihao Chen
Elizabeth Sieber
Mark Yatskar
Dan Roth
ELM
HILM
36
52
0
14 Sep 2023
RAIN: Your Language Models Can Align Themselves without Finetuning
Yuhui Li
Fangyun Wei
Jinjing Zhao
Chao Zhang
Hongyang R. Zhang
SILM
44
108
0
13 Sep 2023
Mitigating the Alignment Tax of RLHF
Yong Lin
Hangyu Lin
Wei Xiong
Shizhe Diao
Zeming Zheng
...
Han Zhao
Nan Jiang
Heng Ji
Yuan Yao
Tong Zhang
MoMe
CLL
29
69
0
12 Sep 2023
Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in Chinese
Hao Wang
Sendong Zhao
Zewen Qiang
Zijian Li
Nuwa Xi
...
Haoqiang Guo
Yuhan Chen
Haoming Xu
Bing Qin
Ting Liu
LM&MA
AI4MH
34
17
0
08 Sep 2023
Everyone Deserves A Reward: Learning Customized Human Preferences
Pengyu Cheng
Jiawen Xie
Ke Bai
Yong Dai
Nan Du
19
30
0
06 Sep 2023
Cognitive Architectures for Language Agents
T. Sumers
Shunyu Yao
Karthik Narasimhan
Thomas Griffiths
LLMAG
LM&Ro
56
154
0
05 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
A. Luu
Wei Bi
Freda Shi
Shuming Shi
RALM
LRM
HILM
48
524
0
03 Sep 2023
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model
Fengxiang Bie
Yibo Yang
Zhongzhu Zhou
Adam Ghanem
Minjia Zhang
...
Pareesa Ameneh Golnari
David A. Clifton
Yuxiong He
Dacheng Tao
Shuaiwen Leon Song
EGVM
36
20
0
02 Sep 2023
Efficient RLHF: Reducing the Memory Usage of PPO
Michael Santacroce
Yadong Lu
Han Yu
Yuan-Fang Li
Yelong Shen
35
27
0
01 Sep 2023
Ladder-of-Thought: Using Knowledge as Steps to Elevate Stance Detection
Kairui Hu
Ming Yan
Qiufeng Wang
Ivor W. Tsang
Wen-Haw Chong
Yong Keong Yap
LRM
25
3
0
31 Aug 2023
Recommender AI Agent: Integrating Large Language Models for Interactive Recommendations
Xu Huang
Jianxun Lian
Yuxuan Lei
Jing Yao
Defu Lian
Xing Xie
LLMAG
26
88
0
31 Aug 2023
Peering Through Preferences: Unraveling Feedback Acquisition for Aligning Large Language Models
Hritik Bansal
John Dang
Aditya Grover
ALM
35
20
0
30 Aug 2023
Optimizing Factual Accuracy in Text Generation through Dynamic Knowledge Selection
Hongjin Qian
Zhicheng Dou
Jiejun Tan
Haonan Chen
Haoqi Gu
Ruofei Lai
Xinyu Zhang
Bo Zhao
Ji-Rong Wen
29
2
0
30 Aug 2023
RecMind: Large Language Model Powered Agent For Recommendation
Yancheng Wang
Ziyan Jiang
Zheng Chen
Fan Yang
Yingxue Zhou
Eunah Cho
Xing Fan
Xiaojiang Huang
Yanbin Lu
Yingzhen Yang
LLMAG
LM&Ro
LRM
39
86
0
28 Aug 2023
Spoken Language Intelligence of Large Language Models for Language Learning
Linkai Peng
Baorian Nuchged
Yingming Gao
ELM
64
4
0
28 Aug 2023
Generations of Knowledge Graphs: The Crazy Ideas and the Business Impact
Xin Luna Dong
32
23
0
27 Aug 2023
Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on Language, Multimodal, and Scientific GPT Models
Kaiyuan Gao
Su He
Zhenyu He
Jiacheng Lin
Qizhi Pei
Jie Shao
Wei Zhang
LM&MA
SyDa
38
4
0
27 Aug 2023
Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum
Shen Gao
Zhengliang Shi
Minghang Zhu
Bowen Fang
Xin Xin
Pengjie Ren
Zhumin Chen
Jun Ma
Zhaochun Ren
LLMAG
CLL
40
37
0
27 Aug 2023
Rational Decision-Making Agent with Internalized Utility Judgment
Yining Ye
Xin Cong
Shizuo Tian
Yujia Qin
Chong Liu
Yankai Lin
Zhiyuan Liu
Maosong Sun
LLMAG
32
9
0
24 Aug 2023