ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLM
    ALM
ArXivPDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 7,311 papers shown
Title
TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned
  Decision
TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision
Ruiwen Zhou
Yingxuan Yang
Kangrui Chen
Ying Wen
Wenhao Wang
Chunling Xi
Guoqiang Xu
Jiliang Tang
Lingjuan Lyu
LLMAG
32
8
0
10 Mar 2024
Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small
  Language Models
Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models
Minjie Zhu
Yichen Zhu
Xin Liu
Ning Liu
Zhiyuan Xu
Yaxin Peng
Chaomin Shen
Zhicai Ou
Feifei Feng
Jian Tang
VLM
57
20
0
10 Mar 2024
Detectors for Safe and Reliable LLMs: Implementations, Uses, and
  Limitations
Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations
Swapnaja Achintalwar
Adriana Alvarado Garcia
Ateret Anaby-Tavor
Ioana Baldini
Sara E. Berger
...
Aashka Trivedi
Kush R. Varshney
Dennis L. Wei
Shalisha Witherspooon
Marcel Zalmanovici
43
10
0
09 Mar 2024
A Generalized Acquisition Function for Preference-based Reward Learning
A Generalized Acquisition Function for Preference-based Reward Learning
Evan Ellis
Gaurav R. Ghosal
Stuart J. Russell
Anca Dragan
Erdem Biyik
42
2
0
09 Mar 2024
Reverse That Number! Decoding Order Matters in Arithmetic Learning
Reverse That Number! Decoding Order Matters in Arithmetic Learning
Daniel Zhang-Li
Nianyi Lin
Jifan Yu
Zheyuan Zhang
Zijun Yao
Xiaokang Zhang
Lei Hou
Jing Zhang
Juanzi Li
37
3
0
09 Mar 2024
$\textbf{S}^2$IP-LLM: Semantic Space Informed Prompt Learning with LLM
  for Time Series Forecasting
S2\textbf{S}^2S2IP-LLM: Semantic Space Informed Prompt Learning with LLM for Time Series Forecasting
Zijie Pan
Yushan Jiang
Sahil Garg
Anderson Schneider
Yuriy Nevmyvaka
Dongjin Song
AI4TS
55
7
0
09 Mar 2024
Are Large Language Models Aligned with People's Social Intuitions for
  Human-Robot Interactions?
Are Large Language Models Aligned with People's Social Intuitions for Human-Robot Interactions?
Lennart Wachowiak
Andrew Coles
Oya Celiktutan
Gerard Canal
43
0
0
08 Mar 2024
Concept-aware Data Construction Improves In-context Learning of Language
  Models
Concept-aware Data Construction Improves In-context Learning of Language Models
Michal Štefánik
Marek Kadlcík
Petr Sojka
54
0
0
08 Mar 2024
Bayesian Preference Elicitation with Language Models
Bayesian Preference Elicitation with Language Models
Kunal Handa
Yarin Gal
Ellie Pavlick
Noah D. Goodman
Jacob Andreas
Alex Tamkin
Belinda Z. Li
42
12
0
08 Mar 2024
Beyond Finite Data: Towards Data-free Out-of-distribution Generalization
  via Extrapolation
Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapolation
Yijiang Li
Sucheng Ren
Weipeng Deng
Yuzhi Xu
Ying Gao
Edith C.H. Ngai
Haohan Wang
OOD
51
1
0
08 Mar 2024
Bias-Augmented Consistency Training Reduces Biased Reasoning in
  Chain-of-Thought
Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought
James Chua
Edward Rees
Hunar Batra
Samuel R. Bowman
Julian Michael
Ethan Perez
Miles Turpin
LRM
50
13
0
08 Mar 2024
Unfamiliar Finetuning Examples Control How Language Models Hallucinate
Unfamiliar Finetuning Examples Control How Language Models Hallucinate
Katie Kang
Eric Wallace
Claire Tomlin
Aviral Kumar
Sergey Levine
HILM
LRM
49
49
0
08 Mar 2024
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in
  Long-Horizon Generation
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation
Zihao Wang
Hoang Trung-Dung
Haowei Lin
Jiaqi Li
Xiaojian Ma
Yitao Liang
ReLM
RALM
LRM
102
48
0
08 Mar 2024
Harnessing Multi-Role Capabilities of Large Language Models for
  Open-Domain Question Answering
Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering
Hongda Sun
Yuxuan Liu
Chengwei Wu
Haiyu Yan
Cheng Tai
Xin Gao
Shuo Shang
Rui Yan
36
7
0
08 Mar 2024
Overcoming Reward Overoptimization via Adversarial Policy Optimization
  with Lightweight Uncertainty Estimation
Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation
Xiaoying Zhang
Jean-François Ton
Wei Shen
Hongning Wang
Yang Liu
39
14
0
08 Mar 2024
On Protecting the Data Privacy of Large Language Models (LLMs): A Survey
On Protecting the Data Privacy of Large Language Models (LLMs): A Survey
Biwei Yan
Kun Li
Minghui Xu
Yueyan Dong
Yue Zhang
Zhaochun Ren
Xiuzhen Cheng
AILaw
PILM
80
78
0
08 Mar 2024
Evaluating Text-to-Image Generative Models: An Empirical Study on Human
  Image Synthesis
Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis
Mu-Hwa Chen
Yi Liu
Jian Yi
Changran Xu
Qiuxia Lai
Hongliang Wang
Tsung-Yi Ho
Qiang Xu
EGVM
40
7
0
08 Mar 2024
Benchmarking Large Language Models for Molecule Prediction Tasks
Benchmarking Large Language Models for Molecule Prediction Tasks
Zhiqiang Zhong
Kuangyu Zhou
Davide Mottin
40
8
0
08 Mar 2024
Aligning Large Language Models for Controllable Recommendations
Aligning Large Language Models for Controllable Recommendations
Wensheng Lu
Jianxun Lian
Wei Zhang
Guanghua Li
Mingyang Zhou
Hao Liao
Xing Xie
ALM
49
15
0
08 Mar 2024
Is this the real life? Is this just fantasy? The Misleading Success of
  Simulating Social Interactions With LLMs
Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs
Xuhui Zhou
Zhe Su
Tiwalayo Eisape
Hyunwoo J. Kim
Maarten Sap
39
38
0
08 Mar 2024
Provable Multi-Party Reinforcement Learning with Diverse Human Feedback
Provable Multi-Party Reinforcement Learning with Diverse Human Feedback
Huiying Zhong
Zhun Deng
Weijie J. Su
Zhiwei Steven Wu
Linjun Zhang
52
15
0
08 Mar 2024
Automatic and Universal Prompt Injection Attacks against Large Language
  Models
Automatic and Universal Prompt Injection Attacks against Large Language Models
Xiaogeng Liu
Zhiyuan Yu
Yizhe Zhang
Ning Zhang
Chaowei Xiao
SILM
AAML
51
35
0
07 Mar 2024
MEIT: Multi-Modal Electrocardiogram Instruction Tuning on Large Language
  Models for Report Generation
MEIT: Multi-Modal Electrocardiogram Instruction Tuning on Large Language Models for Report Generation
Zhongwei Wan
Che Liu
Xin Wang
Chaofan Tao
Hui Shen
Zhenwu Peng
Jie Fu
Rossella Arcucci
Huaxiu Yao
Mi Zhang
57
7
0
07 Mar 2024
A Survey on Human-AI Teaming with Large Pre-Trained Models
A Survey on Human-AI Teaming with Large Pre-Trained Models
Vanshika Vats
Marzia Binta Nizam
Minghao Liu
Ziyuan Wang
Richard Ho
...
Celeste Shen
Rachel Shen
Nafisa Hussain
Kesav Ravichandran
James Davis
LM&MA
62
8
0
07 Mar 2024
Fact-Checking the Output of Large Language Models via Token-Level
  Uncertainty Quantification
Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
Ekaterina Fadeeva
Aleksandr Rubashevskii
Artem Shelmanov
Sergey Petrakov
Haonan Li
...
Gleb Kuzmin
Alexander Panchenko
Timothy Baldwin
Preslav Nakov
Maxim Panov
HILM
45
42
0
07 Mar 2024
Yi: Open Foundation Models by 01.AI
Yi: Open Foundation Models by 01.AI
01. AI
Alex Young
01.AI Alex Young
Bei Chen
Chao Li
...
Yue Wang
Yuxuan Cai
Zhenyu Gu
Zhiyuan Liu
Zonghong Dai
OSLM
LRM
150
512
0
07 Mar 2024
Teaching Large Language Models to Reason with Reinforcement Learning
Teaching Large Language Models to Reason with Reinforcement Learning
Alex Havrilla
Yuqing Du
Sharath Chandra Raparthy
Christoforos Nalmpantis
Jane Dwivedi-Yu
Maksym Zhuravinskyi
Eric Hambro
Sainbayar Sukhbaatar
Roberta Raileanu
ReLM
LRM
39
71
0
07 Mar 2024
CAT: Enhancing Multimodal Large Language Model to Answer Questions in
  Dynamic Audio-Visual Scenarios
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios
Qilang Ye
Zitong Yu
Rui Shao
Xinyu Xie
Philip Torr
Xiaochun Cao
MLLM
56
24
0
07 Mar 2024
NLPre: a revised approach towards language-centric benchmarking of
  Natural Language Preprocessing systems
NLPre: a revised approach towards language-centric benchmarking of Natural Language Preprocessing systems
Martyna Wia̧cek
Piotr Rybak
Lukasz Pszenny
Alina Wróblewska
31
1
0
07 Mar 2024
GraphInstruct: Empowering Large Language Models with Graph Understanding
  and Reasoning Capability
GraphInstruct: Empowering Large Language Models with Graph Understanding and Reasoning Capability
Zihan Luo
Xiran Song
Hong Huang
Jianxun Lian
Chenhao Zhang
Jinqi Jiang
Xing Xie
LRM
31
32
0
07 Mar 2024
Pearl: A Review-driven Persona-Knowledge Grounded Conversational
  Recommendation Dataset
Pearl: A Review-driven Persona-Knowledge Grounded Conversational Recommendation Dataset
Minjin Kim
Minju Kim
Hana Kim
Beong-woo Kwak
Soyeon Chun
Hyunseo Kim
SeongKu Kang
Youngjae Yu
Jinyoung Yeo
Dongha Lee
RALM
31
10
0
07 Mar 2024
Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model
  with Proxy
Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy
Yu Zhu
Chuxiong Sun
Wenfei Yang
Wenqiang Wei
Simin Niu
...
Zhiyu Li
Shifeng Zhang
Zhiyu Li
Jie Hu
Mingchuan Yang
42
3
0
07 Mar 2024
Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks
Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks
Linyuan Gong
Sida Wang
Mostafa Elhoushi
Alvin Cheung
32
15
0
07 Mar 2024
Aligners: Decoupling LLMs and Alignment
Aligners: Decoupling LLMs and Alignment
Lilian Ngweta
Mayank Agarwal
Subha Maity
Alex Gittens
Yuekai Sun
Mikhail Yurochkin
36
1
0
07 Mar 2024
On the Essence and Prospect: An Investigation of Alignment Approaches
  for Big Models
On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models
Xinpeng Wang
Shitong Duan
Xiaoyuan Yi
Jing Yao
Shanlin Zhou
Zhihua Wei
Peng Zhang
Dongkuan Xu
Maosong Sun
Xing Xie
OffRL
50
16
0
07 Mar 2024
Preference optimization of protein language models as a multi-objective
  binder design paradigm
Preference optimization of protein language models as a multi-objective binder design paradigm
Pouria A. Mistani
Venkatesh Mysore
45
6
0
07 Mar 2024
Bridging Text and Molecule: A Survey on Multimodal Frameworks for
  Molecule
Bridging Text and Molecule: A Survey on Multimodal Frameworks for Molecule
Yi Xiao
Xiangxin Zhou
Qiang Liu
Liang Wang
AI4CE
37
3
0
07 Mar 2024
CoTBal: Comprehensive Task Balancing for Multi-Task Visual Instruction Tuning
CoTBal: Comprehensive Task Balancing for Multi-Task Visual Instruction Tuning
Yanqi Dai
Dong Jing
Nanyi Fei
Zhiwu Lu
Nanyi Fei
Guoxing Yang
Zhiwu Lu
61
3
0
07 Mar 2024
Reconciling Reality through Simulation: A Real-to-Sim-to-Real Approach
  for Robust Manipulation
Reconciling Reality through Simulation: A Real-to-Sim-to-Real Approach for Robust Manipulation
M. Torné
Anthony Simeonov
Zechu Li
April Chan
Tao Chen
Abhishek Gupta
Pulkit Agrawal
50
58
0
06 Mar 2024
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt
  Injection Attacks
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection Attacks
Dario Pasquini
Martin Strohmeier
Carmela Troncoso
AAML
48
22
0
06 Mar 2024
Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection
  from Remote Sensing Imagery
Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection from Remote Sensing Imagery
Wei Zhang
Miaoxin Cai
Tong Zhang
Guoqiang Lei
Zhuang Yin
Xuerui Mao
35
7
0
06 Mar 2024
MedSafetyBench: Evaluating and Improving the Medical Safety of Large
  Language Models
MedSafetyBench: Evaluating and Improving the Medical Safety of Large Language Models
Tessa Han
Aounon Kumar
Chirag Agarwal
Himabindu Lakkaraju
ELM
LM&MA
AI4MH
39
5
0
06 Mar 2024
Benchmarking Hallucination in Large Language Models based on
  Unanswerable Math Word Problem
Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem
Yuhong Sun
Zhangyue Yin
Qipeng Guo
Jiawen Wu
Xipeng Qiu
Hui Zhao
41
14
0
06 Mar 2024
Towards Efficient and Effective Unlearning of Large Language Models for
  Recommendation
Towards Efficient and Effective Unlearning of Large Language Models for Recommendation
Hangyu Wang
Jianghao Lin
Bo Chen
Yang Yang
Ruiming Tang
Weinan Zhang
Yong Yu
MU
39
10
0
06 Mar 2024
Human vs. Machine: Behavioral Differences Between Expert Humans and
  Language Models in Wargame Simulations
Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame Simulations
Max Lamparth
Anthony Corso
Jacob Ganz
O. Mastro
Jacquelyn G. Schneider
Harold Trinkunas
54
7
0
06 Mar 2024
"It's the only thing I can trust": Envisioning Large Language Model Use
  by Autistic Workers for Communication Assistance
"It's the only thing I can trust": Envisioning Large Language Model Use by Autistic Workers for Communication Assistance
JiWoong Jang
Sanika Moharana
Patrick Carrington
Andrew Begel
22
26
0
05 Mar 2024
AI Insights: A Case Study on Utilizing ChatGPT Intelligence for Research
  Paper Analysis
AI Insights: A Case Study on Utilizing ChatGPT Intelligence for Research Paper Analysis
Anjalee de Silva
Janaka Wijekoon
Rashini K. Liyanarachchi
Rrubaa Panchendrarajan
Weranga Rajapaksha
LM&MA
17
3
0
05 Mar 2024
Should We Fear Large Language Models? A Structural Analysis of the Human
  Reasoning System for Elucidating LLM Capabilities and Risks Through the Lens
  of Heidegger's Philosophy
Should We Fear Large Language Models? A Structural Analysis of the Human Reasoning System for Elucidating LLM Capabilities and Risks Through the Lens of Heidegger's Philosophy
Jianqiiu Zhang
ELM
40
1
0
05 Mar 2024
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Nathaniel Li
Alexander Pan
Anjali Gopal
Summer Yue
Daniel Berrios
...
Yan Shoshitaishvili
Jimmy Ba
K. Esvelt
Alexandr Wang
Dan Hendrycks
ELM
59
147
0
05 Mar 2024
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal
  Datasets
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Hossein Aboutalebi
Hwanjun Song
Yusheng Xie
Arshit Gupta
Justin Sun
Hang Su
Igor Shalyminov
Nikolaos Pappas
Siffi Singh
Saab Mansour
DiffM
EGVM
48
4
0
05 Mar 2024
Previous
123...808182...145146147
Next