ResearchTrend.AI
Training language models to follow instructions with human feedback
arXiv: 2203.02155

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM, ALM

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,381 papers shown
Title
AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations
Adam Dahlgren Lindstrom
Leila Methnani
Lea Krause
Petter Ericson
Ínigo Martínez de Rituerto de Troya
Dimitri Coelho Mollo
Roel Dobbe
ALM
88
2
0
26 Jun 2024
AI-native Memory: A Pathway from LLMs Towards AGI
Jingbo Shang
Zai Zheng
Jiale Wei
Xiang Ying
Felix Tao
Mindverse Team
LLMAG
115
8
0
26 Jun 2024
Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs
Lei Zhang
Yunshui Li
Jiaming Li
Xiaobo Xia
Jiaxi Yang
Run Luo
Minzheng Wang
Longze Chen
Junhao Liu
Min Yang
91
5
0
26 Jun 2024
Unveiling and Controlling Anomalous Attention Distribution in Transformers
Ruiqing Yan
Xingbo Du
Haoyu Deng
Linghan Zheng
Qiuzhuang Sun
Jifang Hu
Yuhang Shao
Penghao Jiang
Jinrong Jiang
Lian Zhao
69
1
0
26 Jun 2024
Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems
Italo Luis da Silva
Hanqi Yan
Lin Gui
Yulan He
CML
109
0
0
26 Jun 2024
Zero-shot prompt-based classification: topic labeling in times of foundation models in German Tweets
Simon Münker
Kai Kugler
Achim Rettinger
VLM
76
1
0
26 Jun 2024
SEED: Accelerating Reasoning Tree Construction via Scheduled Speculative Decoding
Zhenglin Wang
Jialong Wu
Yilong Lai
Congzhi Zhang
Deyu Zhou
LRM, ReLM
78
6
0
26 Jun 2024
Selective Prompting Tuning for Personalized Conversations with LLMs
Qiushi Huang
Xubo Liu
Tom Ko
Bo Wu
Wenwu Wang
Yu Zhang
Lilian H. Y. Tang
87
8
0
26 Jun 2024
ViPro: Enabling and Controlling Video Prediction for Complex Dynamical Scenarios using Procedural Knowledge
Patrick Takenaka
Johannes Maucher
Marco F. Huber
VGen
81
0
0
26 Jun 2024
Shimo Lab at "Discharge Me!": Discharge Summarization by Prompt-Driven Concatenation of Electronic Health Record Sections
Yunzhen He
Hiroaki Yamagiwa
Hidetoshi Shimodaira
96
3
0
26 Jun 2024
PharmaGPT: Domain-Specific Large Language Models for Bio-Pharmaceutical and Chemistry
Linqing Chen
Weilei Wang
Zilong Bai
Peng Xu
Yan Fang
...
Lisha Zhang
Fu Bian
Zhongkai Ye
Lidong Pei
Changyang Tu
AI4MH, LM&MA
107
3
0
26 Jun 2024
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace
Bernhard Schölkopf
Gunnar Rätsch
Giorgia Ramponi
OffRL
140
1
0
26 Jun 2024
RouteLLM: Learning to Route LLMs with Preference Data
Isaac Ong
Amjad Almahairi
Vincent Wu
Wei-Lin Chiang
Tianhao Wu
Joseph E. Gonzalez
M. W. Kadous
Ion Stoica
174
106
0
26 Jun 2024
Panacea: A foundation model for clinical trial search, summarization, design, and recruitment
J. Lin
H. Xu
Zifeng Wang
Sheng Wang
Jimeng Sun
ELM, LM&MA
106
10
0
25 Jun 2024
Domain Adaptation of Echocardiography Segmentation Via Reinforcement Learning
Arnaud Judge
Thierry Judge
Nicolas Duchateau
Roman A. Sandler
Joseph Z. Sokol
Olivier Bernard
Pierre-Marc Jodoin
OOD
59
0
0
25 Jun 2024
Following Length Constraints in Instructions
Weizhe Yuan
Ilia Kulikov
Ping Yu
Kyunghyun Cho
Sainbayar Sukhbaatar
Jason Weston
Jing Xu
FaML, ALM
102
26
0
25 Jun 2024
LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users
Elinor Poole-Dayan
Deb Roy
Jad Kabbara
66
5
0
25 Jun 2024
InFiConD: Interactive No-code Fine-tuning with Concept-based Knowledge Distillation
Jinbin Huang
Wenbin He
Liang Gou
Liu Ren
Chris Bryan
115
0
0
25 Jun 2024
FedBiOT: LLM Local Fine-tuning in Federated Learning without Full Model
Feijie Wu
Zitao Li
Yaliang Li
Bolin Ding
Jing Gao
117
55
0
25 Jun 2024
VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation
Kun Qian
Shunji Wan
Claudia Tang
Youzhi Wang
Xuanming Zhang
Maximillian Chen
Zhou Yu
AAML
93
12
0
25 Jun 2024
Self-assessment, Exhibition, and Recognition: a Review of Personality in Large Language Models
Zhiyuan Wen
Yu Yang
Jiannong Cao
Haoming Sun
Ruosong Yang
Shuaiqi Liu
106
5
0
25 Jun 2024
FrenchToxicityPrompts: a Large Benchmark for Evaluating and Mitigating Toxicity in French Texts
Caroline Brun
Vassilina Nikoulina
76
1
0
25 Jun 2024
Multi-property Steering of Large Language Models with Dynamic Activation Composition
Daniel Scalena
Gabriele Sarti
Malvina Nissim
KELM, LLMSV, AI4CE
89
15
0
25 Jun 2024
MoE-CT: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting
Tianhao Li
Shangjie Li
Binbin Xie
Deyi Xiong
Baosong Yang
CLL
122
4
0
25 Jun 2024
TALEC: Teach Your LLM to Evaluate in Specific Domain with In-house Criteria by Criteria Division and Zero-shot Plus Few-shot
Kaiqi Zhang
Shuai Yuan
Honghan Zhao
ALM, ELM
71
2
0
25 Jun 2024
A Three-Pronged Approach to Cross-Lingual Adaptation with Multilingual LLMs
Vaibhav Singh
Amrith Krishna
Karthika NJ
Ganesh Ramakrishnan
123
4
0
25 Jun 2024
ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback
Ju-Seung Byun
Jiyun Chun
Jihyung Kil
Andrew Perrault
ReLM, LRM
134
3
0
25 Jun 2024
Leveraging LLMs for Dialogue Quality Measurement
Jinghan Jia
A. Komma
Timothy Leffel
Xujun Peng
Ajay Nagesh
Tamer Soliman
Aram Galstyan
Anoop Kumar
74
5
0
25 Jun 2024
Predicting the Big Five Personality Traits in Chinese Counselling Dialogues Using Large Language Models
Yang Yan
Lizhi Ma
Anqi Li
Jingsong Ma
Zhenzhong Lan
55
3
0
25 Jun 2024
Large Language Models are Interpretable Learners
Ruochen Wang
Si Si
Felix X. Yu
Dorothea Wiesmann
Cho-Jui Hsieh
Inderjit Dhillon
100
3
0
25 Jun 2024
From Distributional to Overton Pluralism: Investigating Large Language Model Alignment
Thom Lake
Eunsol Choi
Greg Durrett
117
14
0
25 Jun 2024
Entropy-Based Decoding for Retrieval-Augmented Large Language Models
Zexuan Qiu
Zijing Ou
Bin Wu
Jingjing Li
Aiwei Liu
Irwin King
KELM, RALM
147
7
0
25 Jun 2024
Learning on Transformers is Provable Low-Rank and Sparse: A One-layer Analysis
Hongkang Li
Meng Wang
Shuai Zhang
Sijia Liu
Pin-Yu Chen
119
7
0
24 Jun 2024
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
Shengbang Tong
Ellis L Brown
Penghao Wu
Sanghyun Woo
Manoj Middepogu
...
Xichen Pan
Austin Wang
Rob Fergus
Yann LeCun
Saining Xie
3DV, MLLM
166
377
0
24 Jun 2024
From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models
Sean Welleck
Amanda Bertsch
Matthew Finlayson
Hailey Schoelkopf
Alex Xie
Graham Neubig
Ilia Kulikov
Zaid Harchaoui
161
77
0
24 Jun 2024
ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians
Yufei Liu
Junshu Tang
Chu Zheng
Shijie Zhang
Jinkun Hao
Junwei Zhu
Dongjin Huang
112
5
0
24 Jun 2024
Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation
Katherine M. Collins
Najoung Kim
Yonatan Bitton
Verena Rieser
Shayegan Omidshafiei
...
Gang Li
Adrian Weller
Junfeng He
Deepak Ramachandran
Krishnamurthy Dvijotham
EGVM
84
3
0
24 Jun 2024
Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs
Ashwinee Panda
Berivan Isik
Xiangyu Qi
Sanmi Koyejo
Tsachy Weissman
Prateek Mittal
MoMe
139
16
0
24 Jun 2024
Adam-mini: Use Fewer Learning Rates To Gain More
Yushun Zhang
Congliang Chen
Ziniu Li
Tian Ding
Chenwei Wu
Yinyu Ye
Zhi-Quan Luo
Ruoyu Sun
141
58
0
24 Jun 2024
Modulating Language Model Experiences through Frictions
Katherine M. Collins
Valerie Chen
Ilia Sucholutsky
Hannah Rose Kirk
Malak Sadek
Holli Sargeant
Ameet Talwalkar
Adrian Weller
Umang Bhatt
KELM
118
5
0
24 Jun 2024
WARP: On the Benefits of Weight Averaged Rewarded Policies
Alexandre Ramé
Johan Ferret
Nino Vieillard
Robert Dadashi
Léonard Hussenot
Pierre-Louis Cedoz
Pier Giuseppe Sessa
Sertan Girgin
Arthur Douillard
Olivier Bachem
136
23
0
24 Jun 2024
OCALM: Object-Centric Assessment with Language Models
Timo Kaufmann
Johannes Czech
Antonia Wüst
Quentin Delfosse
Kristian Kersting
Eyke Hüllermeier
LM&Ro, LRM
94
1
0
24 Jun 2024
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models
Jiale Cheng
Yida Lu
Xiaotao Gu
Pei Ke
Xiao-Yang Liu
Yuxiao Dong
Hongning Wang
Jie Tang
Minlie Huang
77
6
0
24 Jun 2024
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation
Markus Frohmann
Igor Sterner
Ivan Vulić
Benjamin Minixhofer
Markus Schedl
VLM
113
20
0
24 Jun 2024
Towards a Science Exocortex
Kevin G. Yager
116
2
0
24 Jun 2024
LionGuard: Building a Contextualized Moderation Classifier to Tackle Localized Unsafe Content
Jessica Foo
Shaun Khoo
82
4
0
24 Jun 2024
Towards Better Graph-based Cross-document Relation Extraction via Non-bridge Entity Enhancement and Prediction Debiasing
Hao Yue
Shaopeng Lai
Chengyi Yang
Liang Zhang
Junfeng Yao
Jinsong Su
75
3
0
24 Jun 2024
Towards Comprehensive Preference Data Collection for Reward Modeling
Yulan Hu
Qingyang Li
Sheng Ouyang
Ge Chen
Kaihui Chen
Lijun Mei
Xucheng Ye
Fuzheng Zhang
Yong Liu
SyDa
128
4
0
24 Jun 2024
UniCoder: Scaling Code Large Language Model via Universal Code
Tao Sun
Linzheng Chai
Jian Yang
Yuwei Yin
Hongcheng Guo
Jiaheng Liu
Bing Wang
Liqun Yang
Zhoujun Li
OffRL, LRM
112
21
0
24 Jun 2024
On the Transformations across Reward Model, Parameter Update, and In-Context Prompt
Deng Cai
Huayang Li
Tingchen Fu
Siheng Li
Weiwen Xu
...
Leyang Cui
Yan Wang
Lemao Liu
Taro Watanabe
Shuming Shi
KELM
78
2
0
24 Jun 2024