Training language models to follow instructions with human feedback


4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM, ALM

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,384 papers shown
CodeChameleon: Personalized Encryption Framework for Jailbreaking Large Language Models
Huijie Lv
Xiao Wang
Yuan Zhang
Caishuang Huang
Shihan Dou
Junjie Ye
Tao Gui
Qi Zhang
Xuanjing Huang
AAML
88
36
0
26 Feb 2024
Navigating Complexity: Orchestrated Problem Solving with Multi-Agent LLMs
Sumedh Rasal
E. Hauer
69
0
0
26 Feb 2024
Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models
Anchun Gui
Jian Li
Yong Dai
Nan Du
Han Xiao
45
1
0
26 Feb 2024
LangGPT: Rethinking Structured Reusable Prompt Design Framework for LLMs from the Programming Language
Ming Wang
Yuanzhong Liu
Xiaoyu Liang
Songlian Li
Yijie Huang
...
Shi Feng
Chi Zhang
Yifei Zhang
Minghui Zheng
Jigang Li
133
15
0
26 Feb 2024
Long-Context Language Modeling with Parallel Context Encoding
Howard Yen
Tianyu Gao
Danqi Chen
95
50
0
26 Feb 2024
PAQA: Toward ProActive Open-Retrieval Question Answering
Pierre Erbacher
Jian-Yun Nie
P. Preux
Laure Soulier
RALM
39
2
0
26 Feb 2024
Rethinking Negative Instances for Generative Named Entity Recognition
Yuyang Ding
Juntao Li
Pinzheng Wang
Zecheng Tang
Bowen Yan
Min Zhang
80
13
0
26 Feb 2024
mEdIT: Multilingual Text Editing via Instruction Tuning
Vipul Raheja
Dimitris Alikaniotis
Vivek Kulkarni
Bashar Alhafni
Dhruv Kumar
VLM
102
8
0
26 Feb 2024
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions
Yuan Zhang
Xiao Wang
Zhiheng Xi
Han Xia
Tao Gui
Qi Zhang
Xuanjing Huang
93
4
0
26 Feb 2024
LLM Inference Unveiled: Survey and Roofline Model Insights
Zhihang Yuan
Yuzhang Shang
Yang Zhou
Zhen Dong
Zhe Zhou
...
Yong Jae Lee
Yan Yan
Beidi Chen
Guangyu Sun
Kurt Keutzer
240
91
0
26 Feb 2024
Feedback Efficient Online Fine-Tuning of Diffusion Models
Masatoshi Uehara
Yulai Zhao
Kevin Black
Ehsan Hajiramezanali
Gabriele Scalia
N. Diamant
Alex Tseng
Sergey Levine
Tommaso Biancalani
121
28
0
26 Feb 2024
CodeS: Towards Building Open-source Language Models for Text-to-SQL
Haoyang Li
Jing Zhang
Hanbing Liu
Ju Fan
Yanling Wang
Jun Zhu
Renjie Wei
Hongyan Pan
Cuiping Li
Hong Chen
ELM, AI4TS
114
119
0
26 Feb 2024
Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering
Mingxu Tao
Dongyan Zhao
Yansong Feng
LLMAG
61
3
0
26 Feb 2024
From Large Language Models and Optimization to Decision Optimization CoPilot: A Research Manifesto
Segev Wasserkrug
Léonard Boussioux
D. Hertog
F. Mirzazadeh
Ilker Birbil
Jannis Kurtz
Donato Maragno
LLMAG
100
3
0
26 Feb 2024
GenAINet: Enabling Wireless Collective Intelligence via Knowledge Transfer and Reasoning
Han Zou
Qiyang Zhao
Lina Bariah
Yu Tian
M. Bennis
S. Lasaulce
158
14
0
26 Feb 2024
HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs
Cem Uluoglakci
T. Taşkaya-Temizel
HILM
69
3
0
25 Feb 2024
Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
Jiabao Ji
Bairu Hou
Alexander Robey
George J. Pappas
Hamed Hassani
Yang Zhang
Eric Wong
Shiyu Chang
AAML
107
51
0
25 Feb 2024
No Free Lunch in LLM Watermarking: Trade-offs in Watermarking Design Choices
Qi Pang
Shengyuan Hu
Wenting Zheng
Virginia Smith
WaLM
136
15
0
25 Feb 2024
DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers
Xirui Li
Ruochen Wang
Minhao Cheng
Tianyi Zhou
Cho-Jui Hsieh
AAML
92
50
0
25 Feb 2024
PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
Xiangdi Meng
Damai Dai
Weiyao Luo
Zhe Yang
Shaoxiang Wu
Xiaochen Wang
Peiyi Wang
Qingxiu Dong
Liang Chen
Zhifang Sui
170
13
0
25 Feb 2024
AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation
Yasheng Sun
Wenqing Chu
Hang Zhou
Kaisiyuan Wang
Hideki Koike
79
4
0
25 Feb 2024
InstructEdit: Instruction-based Knowledge Editing for Large Language Models
Ningyu Zhang
Bo Tian
Siyuan Cheng
Xiaozhuan Liang
Yi Hu
Kouying Xue
Yanjie Gou
Xi Chen
Huajun Chen
KELM
97
5
0
25 Feb 2024
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis
Yao Mu
Junting Chen
Qinglong Zhang
Shoufa Chen
Qiaojun Yu
...
Wenhai Wang
Jifeng Dai
Yu Qiao
Mingyu Ding
Ping Luo
104
24
0
25 Feb 2024
Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions
Xuming Hu
Xiaochuan Li
Junzhe Chen
Hai-Tao Zheng
Yangning Li
...
Yasheng Wang
Qun Liu
Lijie Wen
Philip S. Yu
Zhijiang Guo
AAML, ELM
81
4
0
25 Feb 2024
Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression
Xinze Li
Zhenghao Liu
Chenyan Xiong
Shi Yu
Yukun Yan
Shuo Wang
Ge Yu
VLM
74
4
0
25 Feb 2024
Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy
Shuhai Zhang
Yiliao Song
Jiahao Yang
Yuanqing Li
Bo Han
Mingkui Tan
DeLMO
113
8
0
25 Feb 2024
Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration
Xin Mao
Fengming Li
Huimin Xu
Wei Zhang
Anh Tuan Luu
ALM
80
7
0
25 Feb 2024
GraphWiz: An Instruction-Following Language Model for Graph Problems
Nuo Chen
Yuhan Li
Jianheng Tang
Jia Li
143
29
0
25 Feb 2024
ASETF: A Novel Method for Jailbreak Attack on LLMs through Translate Suffix Embeddings
Hao Wang
Hao Li
Minlie Huang
Lei Sha
AAML
111
14
0
25 Feb 2024
Rethinking Software Engineering in the Foundation Model Era: A Curated Catalogue of Challenges in the Development of Trustworthy FMware
Ahmed E. Hassan
Dayi Lin
Gopi Krishnan Rajbahadur
Keheliya Gallaba
F. Côgo
...
Kishanthan Thangarajah
G. Oliva
Jiahuei Lin
Wali Mohammad Abdullah
Zhen Ming Jiang
66
7
0
25 Feb 2024
Citation-Enhanced Generation for LLM-based Chatbots
Weitao Li
Junkai Li
Weizhi Ma
Yang Liu
143
21
0
25 Feb 2024
PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails
Neal Mangaokar
Ashish Hooda
Jihye Choi
Shreyas Chandrashekaran
Kassem Fawaz
Somesh Jha
Atul Prakash
AAML
94
37
0
24 Feb 2024
Multimodal Instruction Tuning with Conditional Mixture of LoRA
Ying Shen
Zhiyang Xu
Qifan Wang
Yu Cheng
Wenpeng Yin
Lifu Huang
82
20
0
24 Feb 2024
Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method
Tian Xia
Zhiwei He
Tong Ren
Yibo Miao
Zhuosheng Zhang
Yang Yang
Rui Wang
92
18
0
24 Feb 2024
Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning in Large Language Models
Haoran Liao
Jidong Tian
Shaohua Hu
Hao He
Yaohui Jin
ReLM, LRM
86
0
0
24 Feb 2024
Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT
Sixiao Zheng
Jingyang Huo
Yu Wang
Yanwei Fu
VGen, DiffM
69
1
0
24 Feb 2024
MemeCraft: Contextual and Stance-Driven Multimodal Meme Generation
Han Wang
Roy Ka-wei Lee
69
8
0
24 Feb 2024
Social Convos: Capturing Agendas and Emotions on Social Media
Ankita Bhaumik
Ning Sa
Gregorios A. Katsios
T. Strzalkowski
69
1
0
23 Feb 2024
Fast Adversarial Attacks on Language Models In One GPU Minute
Vinu Sankar Sadasivan
Shoumik Saha
Gaurang Sriramanan
Priyatham Kattakinda
Atoosa Malemir Chegini
Soheil Feizi
MIALM
106
42
0
23 Feb 2024
Foundation Policies with Hilbert Representations
Seohong Park
Tobias Kreiman
Sergey Levine
SSL, OffRL
112
30
0
23 Feb 2024
Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts
Yuejiang Liu
Alexandre Alahi
96
25
0
23 Feb 2024
Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization
Swaroop Nath
Tejpalsingh Siledar
Sankara Sri Raghava Ravindra Muddu
Rupasai Rangaraju
H. Khadilkar
...
Suman Banerjee
Amey Patil
Sudhanshu Singh
M. Chelliah
Nikesh Garera
88
0
0
23 Feb 2024
Faithful Temporal Question Answering over Heterogeneous Sources
Zhen Jia
Philipp Christmann
Gerhard Weikum
74
10
0
23 Feb 2024
DEEM: Dynamic Experienced Expert Modeling for Stance Detection
Xiaolong Wang
Yile Wang
Sijie Cheng
Peng Li
Yang Liu
58
9
0
23 Feb 2024
GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection?
Yiping Jin
Leo Wanner
A. Shvets
56
2
0
23 Feb 2024
Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control
Masatoshi Uehara
Yulai Zhao
Kevin Black
Ehsan Hajiramezanali
Gabriele Scalia
N. Diamant
Alex Tseng
Tommaso Biancalani
Sergey Levine
94
52
0
23 Feb 2024
Break the Breakout: Reinventing LM Defense Against Jailbreak Attacks with Self-Refinement
Heegyu Kim
Sehyun Yuk
Hyunsouk Cho
AAML
65
21
0
23 Feb 2024
Advancing Parameter Efficiency in Fine-tuning via Representation Editing
Muling Wu
Tianlong Li
Xiaohua Wang
Changze Lv
Zixuan Ling
Jianhao Zhu
Cenyuan Zhang
Xiaoqing Zheng
Xuanjing Huang
79
25
0
23 Feb 2024
Entity-level Factual Adaptiveness of Fine-tuning based Abstractive Summarization Models
Jongyoon Song
Nohil Park
Bongkyu Hwang
Jaewoong Yun
Seongho Joe
Youngjune Gwon
Sungroh Yoon
KELM, HILM
68
1
0
23 Feb 2024
Machine Unlearning of Pre-trained Large Language Models
Jin Yao
Eli Chien
Minxin Du
Xinyao Niu
Tianhao Wang
Zezhou Cheng
Xiang Yue
MU
154
51
0
23 Feb 2024