ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.09332
  4. Cited By
WebGPT: Browser-assisted question-answering with human feedback

WebGPT: Browser-assisted question-answering with human feedback

17 December 2021
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
Christina Kim
Christopher Hesse
Shantanu Jain
V. Kosaraju
William Saunders
Xu Jiang
K. Cobbe
Tyna Eloundou
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
    ALM
    RALM
ArXivPDFHTML

Papers citing "WebGPT: Browser-assisted question-answering with human feedback"

50 / 907 papers shown
Title
MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA
MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA
Lang Yu
Qin Chen
Jie Zhou
Liang He
KELM
17
47
0
19 Dec 2023
Agent-based Learning of Materials Datasets from Scientific Literature
Agent-based Learning of Materials Datasets from Scientific Literature
Mehrad Ansari
S. M. Moosavi
AI4CE
25
14
0
18 Dec 2023
Iterative Preference Learning from Human Feedback: Bridging Theory and
  Practice for RLHF under KL-Constraint
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint
Wei Xiong
Hanze Dong
Chen Ye
Ziqi Wang
Han Zhong
Heng Ji
Nan Jiang
Tong Zhang
OffRL
38
163
0
18 Dec 2023
Explore 3D Dance Generation via Reward Model from Automatically-Ranked
  Demonstrations
Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations
Zilin Wang
Hao-Wen Zhuang
Lu Li
Yinmin Zhang
Junjie Zhong
Jun Chen
Yu Yang
Boshi Tang
Zhiyong Wu
53
3
0
18 Dec 2023
Retrieval-Augmented Generation for Large Language Models: A Survey
Retrieval-Augmented Generation for Large Language Models: A Survey
Yunfan Gao
Yun Xiong
Xinyu Gao
Kangxiang Jia
Jinliu Pan
Yuxi Bi
Yi Dai
Jiawei Sun
Meng Wang
Haofen Wang
3DV
RALM
61
1,530
1
18 Dec 2023
Let AI Entertain You: Increasing User Engagement with Generative AI and
  Rejection Sampling
Let AI Entertain You: Increasing User Engagement with Generative AI and Rejection Sampling
Jingying Zeng
Jaewon Yang
Waleed Malik
Xiao Yan
Richard Huang
Qi He
30
1
0
16 Dec 2023
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Renat Aksitov
Sobhan Miryoosefi
Zong-xiao Li
Daliang Li
Sheila Babayan
...
Sushant Prakash
Pranesh Srinivasan
Manzil Zaheer
Felix X. Yu
Sanjiv Kumar
LRM
ReLM
LLMAG
KELM
23
45
0
15 Dec 2023
Towards Verifiable Text Generation with Evolving Memory and
  Self-Reflection
Towards Verifiable Text Generation with Evolving Memory and Self-Reflection
Hao Sun
Hengyi Cai
Bo Wang
Yingyan Hou
Xiaochi Wei
Shuaiqiang Wang
Yan Zhang
Dawei Yin
47
9
0
14 Dec 2023
TAP4LLM: Table Provider on Sampling, Augmenting, and Packing
  Semi-structured Data for Large Language Model Reasoning
TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning
Yuan Sui
Jiaru Zou
Mengyu Zhou
Xinyi He
Lun Du
Shi Han
Dongmei Zhang
LRM
LMTD
24
23
0
14 Dec 2023
LDM$^2$: A Large Decision Model Imitating Human Cognition with Dynamic
  Memory Enhancement
LDM2^22: A Large Decision Model Imitating Human Cognition with Dynamic Memory Enhancement
Xingjin Wang
Linjing Li
D. Zeng
38
0
0
13 Dec 2023
AI capabilities can be significantly improved without expensive
  retraining
AI capabilities can be significantly improved without expensive retraining
Tom Davidson
Jean-Stanislas Denain
Pablo Villalobos
Guillem Bas
OffRL
VLM
26
26
0
12 Dec 2023
On Diversified Preferences of Large Language Model Alignment
On Diversified Preferences of Large Language Model Alignment
Dun Zeng
Yong Dai
Pengyu Cheng
Longyue Wang
Tianhao Hu
Wanshun Chen
Nan Du
Zenglin Xu
ALM
38
16
0
12 Dec 2023
Exploring Large Language Models to Facilitate Variable Autonomy for
  Human-Robot Teaming
Exploring Large Language Models to Facilitate Variable Autonomy for Human-Robot Teaming
Younes Lakhnati
Max Pascher
Jens Gerken
LLMAG
LM&Ro
40
3
0
12 Dec 2023
Building Open-Ended Embodied Agent via Language-Policy Bidirectional
  Adaptation
Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Fuxian Huang
Qi Zhang
Ming Zhou
Jing Hou
Yu Qiao
Yu Liu
LLMAG
LM&Ro
37
1
0
12 Dec 2023
Alignment for Honesty
Alignment for Honesty
Yuqing Yang
Ethan Chern
Xipeng Qiu
Graham Neubig
Pengfei Liu
44
30
0
12 Dec 2023
"I Want It That Way": Enabling Interactive Decision Support Using Large
  Language Models and Constraint Programming
"I Want It That Way": Enabling Interactive Decision Support Using Large Language Models and Constraint Programming
Connor Lawless
Jakob Schoeffer
Lindy Le
Kael Rowan
Shilad Sen
Cristina St. Hill
Jina Suh
Bahar Sarrafzadeh
41
8
0
12 Dec 2023
"What's important here?": Opportunities and Challenges of Using LLMs in
  Retrieving Information from Web Interfaces
"What's important here?": Opportunities and Challenges of Using LLMs in Retrieving Information from Web Interfaces
Faria Huq
Jeffrey P. Bigham
Nikolas Martelaro
25
7
0
11 Dec 2023
KwaiAgents: Generalized Information-seeking Agent System with Large
  Language Models
KwaiAgents: Generalized Information-seeking Agent System with Large Language Models
Haojie Pan
Zepeng Zhai
Hao Yuan
Yaojia Lv
Ruiji Fu
Ming Liu
Zhongyuan Wang
Bing Qin
LLMAG
RALM
26
10
0
08 Dec 2023
Learning to Break: Knowledge-Enhanced Reasoning in Multi-Agent Debate
  System
Learning to Break: Knowledge-Enhanced Reasoning in Multi-Agent Debate System
Haotian Wang
Xiyuan Du
Weijiang Yu
Qianglong Chen
Kun Zhu
Zheng Chu
Lian Yan
Yi Guan
32
10
0
08 Dec 2023
LLM as OS, Agents as Apps: Envisioning AIOS, Agents and the AIOS-Agent
  Ecosystem
LLM as OS, Agents as Apps: Envisioning AIOS, Agents and the AIOS-Agent Ecosystem
Yingqiang Ge
Yujie Ren
Wenyue Hua
Shuyuan Xu
Juntao Tan
Yongfeng Zhang
LLMAG
23
28
0
06 Dec 2023
Speculative Exploration on the Concept of Artificial Agents Conducting
  Autonomous Research
Speculative Exploration on the Concept of Artificial Agents Conducting Autonomous Research
Shiro Takagi
45
0
0
06 Dec 2023
Rethinking E-Commerce Search
Rethinking E-Commerce Search
Haixun Wang
Taesik Na
45
6
0
06 Dec 2023
ULMA: Unified Language Model Alignment with Human Demonstration and
  Point-wise Preference
ULMA: Unified Language Model Alignment with Human Demonstration and Point-wise Preference
Tianchi Cai
Xierui Song
Jiyan Jiang
Fei Teng
Jinjie Gu
Guannan Zhang
ALM
21
4
0
05 Dec 2023
Explore, Select, Derive, and Recall: Augmenting LLM with Human-like
  Memory for Mobile Task Automation
Explore, Select, Derive, and Recall: Augmenting LLM with Human-like Memory for Mobile Task Automation
Sunjae Lee
Junyoung Choi
Jungjae Lee
Munim Hasan Wasi
Hojun Choi
Steven Y. Ko
Sangeun Oh
Insik Shin
RALM
42
6
0
04 Dec 2023
D-Bot: Database Diagnosis System using Large Language Models
D-Bot: Database Diagnosis System using Large Language Models
Xuanhe Zhou
Guoliang Li
Zhaoyan Sun
Zhiyuan Liu
Weize Chen
Jianming Wu
Jiesi Liu
Ruohang Feng
Guoyang Zeng
LLMAG
65
14
0
03 Dec 2023
Nash Learning from Human Feedback
Nash Learning from Human Feedback
Rémi Munos
Michal Valko
Daniele Calandriello
M. G. Azar
Mark Rowland
...
Nikola Momchev
Olivier Bachem
D. Mankowitz
Doina Precup
Bilal Piot
42
126
0
01 Dec 2023
Griffon: Spelling out All Object Locations at Any Granularity with Large
  Language Models
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
Yufei Zhan
Yousong Zhu
Zhiyang Chen
Fan Yang
E. Goles
Jinqiao Wang
ObjD
52
14
0
24 Nov 2023
PrivateLoRA For Efficient Privacy Preserving LLM
PrivateLoRA For Efficient Privacy Preserving LLM
Yiming Wang
Yu Lin
Xiaodong Zeng
Guannan Zhang
66
11
0
23 Nov 2023
DaG LLM ver 1.0: Pioneering Instruction-Tuned Language Modeling for
  Korean NLP
DaG LLM ver 1.0: Pioneering Instruction-Tuned Language Modeling for Korean NLP
Dongjun Jang
Sangah Lee
Sungjoo Byun
Jinwoong Kim
Jean Seo
...
Soyeon Kim
Chaeyoung Oh
Jaeyoon Kim
Hyemi Jo
Hyopil Shin
ALM
24
0
0
23 Nov 2023
GAIA: a benchmark for General AI Assistants
GAIA: a benchmark for General AI Assistants
Grégoire Mialon
Clémentine Fourrier
Craig Swift
Thomas Wolf
Yann LeCun
Thomas Scialom
AI4MH
ALM
ELM
RALM
17
141
0
21 Nov 2023
Unifying Corroborative and Contributive Attributions in Large Language
  Models
Unifying Corroborative and Contributive Attributions in Large Language Models
Theodora Worledge
Judy Hanwen Shen
Nicole Meister
Caleb Winston
Carlos Guestrin
TDI
32
10
0
20 Nov 2023
Igniting Language Intelligence: The Hitchhiker's Guide From
  Chain-of-Thought Reasoning to Language Agents
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Zhuosheng Zhang
Yao Yao
Aston Zhang
Xiangru Tang
Xinbei Ma
...
Yiming Wang
Mark B. Gerstein
Rui Wang
Gongshen Liu
Hai Zhao
LLMAG
LM&Ro
LRM
42
53
0
20 Nov 2023
Towards Robust Text Retrieval with Progressive Learning
Towards Robust Text Retrieval with Progressive Learning
Tong Wu
Yulei Qin
Enwei Zhang
Zihan Xu
Yuting Gao
Ke Li
Xing Sun
RALM
VLM
55
1
0
20 Nov 2023
Behavior Optimized Image Generation
Behavior Optimized Image Generation
Varun Khurana
Yaman Kumar Singla
J. Subramanian
R. Shah
Changyou Chen
Zhiqiang Xu
Balaji Krishnamurthy
EGVM
10
4
0
18 Nov 2023
GEO: Generative Engine Optimization
GEO: Generative Engine Optimization
Pranjal Aggarwal
Vishvak Murahari
Tanmay Rajpurohit
Ashwin Kalyan
Karthik R. Narasimhan
Ameet Deshpande
43
2
0
16 Nov 2023
On Evaluating the Integration of Reasoning and Action in LLM Agents with
  Database Question Answering
On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering
Linyong Nan
Ellen Zhang
Weijin Zou
Yilun Zhao
Wenfei Zhou
Arman Cohan
LLMAG
46
13
0
16 Nov 2023
RLHFPoison: Reward Poisoning Attack for Reinforcement Learning with
  Human Feedback in Large Language Models
RLHFPoison: Reward Poisoning Attack for Reinforcement Learning with Human Feedback in Large Language Models
Jiong Wang
Junlin Wu
Muhao Chen
Yevgeniy Vorobeychik
Chaowei Xiao
AAML
29
13
0
16 Nov 2023
HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Zhilin Wang
Yi Dong
Jiaqi Zeng
Virginia Adams
Makesh Narsimhan Sreedhar
...
Olivier Delalleau
Jane Polak Scowcroft
Neel Kant
Aidan Swope
Oleksii Kuchaiev
3DV
14
66
0
16 Nov 2023
Rescue: Ranking LLM Responses with Partial Ordering to Improve Response
  Generation
Rescue: Ranking LLM Responses with Partial Ordering to Improve Response Generation
Yikun Wang
Rui Zheng
Haoming Li
Qi Zhang
Tao Gui
Fei Liu
OffRL
25
3
0
15 Nov 2023
Value FULCRA: Mapping Large Language Models to the Multidimensional
  Spectrum of Basic Human Values
Value FULCRA: Mapping Large Language Models to the Multidimensional Spectrum of Basic Human Values
Jing Yao
Xiaoyuan Yi
Xiting Wang
Yifan Gong
Xing Xie
41
22
0
15 Nov 2023
Towards Evaluating AI Systems for Moral Status Using Self-Reports
Towards Evaluating AI Systems for Moral Status Using Self-Reports
Ethan Perez
Robert Long
ELM
38
8
0
14 Nov 2023
Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM
  Game
Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM Game
Pengyu Cheng
Yifan Yang
Jian Li
Yong Dai
Tianhao Hu
Peixin Cao
Nan Du
Xiaolong Li
28
28
0
14 Nov 2023
LLatrieval: LLM-Verified Retrieval for Verifiable Generation
LLatrieval: LLM-Verified Retrieval for Verifiable Generation
Xiaonan Li
Changtai Zhu
Linyang Li
Zhangyue Yin
Tianxiang Sun
Xipeng Qiu
RALM
40
25
0
14 Nov 2023
Large Language Models are Zero Shot Hypothesis Proposers
Large Language Models are Zero Shot Hypothesis Proposers
Biqing Qi
Kaiyan Zhang
Haoxiang Li
Kai Tian
Sihang Zeng
Zhang-Ren Chen
Bowen Zhou
32
27
0
10 Nov 2023
A Survey of Large Language Models in Medicine: Progress, Application,
  and Challenge
A Survey of Large Language Models in Medicine: Progress, Application, and Challenge
Hongjian Zhou
Fenglin Liu
Boyang Gu
Xinyu Zou
Jinfa Huang
...
Yefeng Zheng
Lei A. Clifton
Zheng Li
Fenglin Liu
David A. Clifton
LM&MA
41
108
0
09 Nov 2023
A Survey of Large Language Models Attribution
A Survey of Large Language Models Attribution
Dongfang Li
Zetian Sun
Xinshuo Hu
Zhenyu Liu
Ziyang Chen
Baotian Hu
Aiguo Wu
Min Zhang
HILM
21
49
0
07 Nov 2023
Successor Features for Efficient Multisubject Controlled Text Generation
Successor Features for Efficient Multisubject Controlled Text Generation
Mengyao Cao
Mehdi Fatemi
Jackie Chi Kit Cheung
Samira Shabanian
BDL
37
0
0
03 Nov 2023
ProAgent: From Robotic Process Automation to Agentic Process Automation
ProAgent: From Robotic Process Automation to Agentic Process Automation
Yining Ye
Xin Cong
Shizuo Tian
Jian Cao
Hao Wang
...
Heyang Yu
Huadong Wang
Yankai Lin
Zhiyuan Liu
Maosong Sun
AI4CE
26
19
0
02 Nov 2023
The Alignment Ceiling: Objective Mismatch in Reinforcement Learning from
  Human Feedback
The Alignment Ceiling: Objective Mismatch in Reinforcement Learning from Human Feedback
Nathan Lambert
Roberto Calandra
ALM
29
32
0
31 Oct 2023
Language Agents with Reinforcement Learning for Strategic Play in the
  Werewolf Game
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
Zelai Xu
Chao Yu
Fei Fang
Yu Wang
Yi Wu
LLMAG
31
79
0
29 Oct 2023
Previous
123...101112...171819
Next