ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLMALM
ArXiv (abs)PDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,398 papers shown
Title
Instruction Tuning Vs. In-Context Learning: Revisiting Large Language
  Models in Few-Shot Computational Social Science
Instruction Tuning Vs. In-Context Learning: Revisiting Large Language Models in Few-Shot Computational Social Science
Taihang Wang
Xiaoman Xu
Yimin Wang
Ye Jiang
68
2
0
23 Sep 2024
Speechworthy Instruction-tuned Language Models
Speechworthy Instruction-tuned Language Models
Hyundong Justin Cho
Nicolaas Jedema
Leonardo F. R. Ribeiro
Karishma Sharma
Pedro Szekely
Alessandro Moschitti
Ruben Janssen
Jonathan May
ALM
87
1
0
23 Sep 2024
ToxiCraft: A Novel Framework for Synthetic Generation of Harmful Information
ToxiCraft: A Novel Framework for Synthetic Generation of Harmful Information
Zheng Hui
Zhaoxiao Guo
Hang Zhao
Juanyong Duan
Congrui Huang
167
7
0
23 Sep 2024
MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models
MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models
Mohammad Shahab Sepehri
Zalan Fabian
Maryam Soltanolkotabi
Mahdi Soltanolkotabi
MedIm
152
6
0
23 Sep 2024
Backtracking Improves Generation Safety
Backtracking Improves Generation Safety
Yiming Zhang
Jianfeng Chi
Hailey Nguyen
Kartikeya Upasani
Daniel M. Bikel
Jason Weston
Eric Michael Smith
SILM
128
8
0
22 Sep 2024
RACOON: An LLM-based Framework for Retrieval-Augmented Column Type
  Annotation with a Knowledge Graph
RACOON: An LLM-based Framework for Retrieval-Augmented Column Type Annotation with a Knowledge Graph
Linxi Wei
Guorui Xiao
Magdalena Balazinska
86
1
0
22 Sep 2024
Investigating Layer Importance in Large Language Models
Investigating Layer Importance in Large Language Models
Yang Zhang
Yanfei Dong
Kenji Kawaguchi
FAtt
100
10
0
22 Sep 2024
Can AI writing be salvaged? Mitigating Idiosyncrasies and Improving Human-AI Alignment in the Writing Process through Edits
Can AI writing be salvaged? Mitigating Idiosyncrasies and Improving Human-AI Alignment in the Writing Process through Edits
Tuhin Chakrabarty
Philippe Laban
Chien-Sheng Wu
131
13
0
22 Sep 2024
Thought-Path Contrastive Learning via Premise-Oriented Data Augmentation for Logical Reading Comprehension
Thought-Path Contrastive Learning via Premise-Oriented Data Augmentation for Logical Reading Comprehension
Chenxu Wang
Ping Jian
Zhen Yang
LRM
95
0
0
22 Sep 2024
Repairs in a Block World: A New Benchmark for Handling User Corrections
  with Multi-Modal Language Models
Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models
Javier Chiyah-Garcia
Alessandro Suglia
Arash Eshghi
KELM
58
2
0
21 Sep 2024
Interpreting Arithmetic Mechanism in Large Language Models through
  Comparative Neuron Analysis
Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis
Zeping Yu
Sophia Ananiadou
LRMMILM
114
14
0
21 Sep 2024
Towards Automated Patent Workflows: AI-Orchestrated Multi-Agent
  Framework for Intellectual Property Management and Analysis
Towards Automated Patent Workflows: AI-Orchestrated Multi-Agent Framework for Intellectual Property Management and Analysis
Sakhinana Sagar Srinivas
Vijay Sri Vaikunth
Venkataramana Runkana
66
1
0
21 Sep 2024
GroupDebate: Enhancing the Efficiency of Multi-Agent Debate Using Group
  Discussion
GroupDebate: Enhancing the Efficiency of Multi-Agent Debate Using Group Discussion
Tongxuan Liu
Xingyu Wang
Weizhe Huang
Wenjiang Xu
Yuting Zeng
Lei Jiang
Hailong Yang
Jing Li
LLMAG
81
13
0
21 Sep 2024
ChemEval: A Comprehensive Multi-Level Chemical Evaluation for Large
  Language Models
ChemEval: A Comprehensive Multi-Level Chemical Evaluation for Large Language Models
Yuqing Huang
Rongyang Zhang
Xiaoxiao He
Xuyang Zhi
Hao Wang
...
Guoping Hu
Guiquan Liu
Qi Liu
Defu Lian
Enhong Chen
ELM
93
8
0
21 Sep 2024
Applying Pre-trained Multilingual BERT in Embeddings for Improved
  Malicious Prompt Injection Attacks Detection
Applying Pre-trained Multilingual BERT in Embeddings for Improved Malicious Prompt Injection Attacks Detection
M. Rahman
Hossain Shahriar
Fan Wu
A. Cuzzocrea
AAML
88
6
0
20 Sep 2024
T2M-X: Learning Expressive Text-to-Motion Generation from Partially
  Annotated Data
T2M-X: Learning Expressive Text-to-Motion Generation from Partially Annotated Data
Mingdian Liu
Y. Liu
Gurunandan Krishnan
Karl S Bayer
Bing Zhou
VGen
81
0
0
20 Sep 2024
ControlMath: Controllable Data Generation Promotes Math Generalist
  Models
ControlMath: Controllable Data Generation Promotes Math Generalist Models
Nuo Chen
Ning Wu
Jianhui Chang
Jia Li
100
4
0
20 Sep 2024
CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Causal Significance and Consistency
CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Causal Significance and Consistency
Kangsheng Wang
Xiao Zhang
Zizheng Guo
Tianyu Hu
Huimin Ma
LRM
164
7
0
20 Sep 2024
RRM: Robust Reward Model Training Mitigates Reward Hacking
RRM: Robust Reward Model Training Mitigates Reward Hacking
Tianqi Liu
Wei Xiong
Jie Jessie Ren
Lichang Chen
Junru Wu
...
Yuan Liu
Bilal Piot
Abe Ittycheriah
Aviral Kumar
Mohammad Saleh
AAML
99
23
0
20 Sep 2024
STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions
STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions
Robert D Morabito
Sangmitra Madhusudan
Tyler McDonald
Ali Emami
60
2
0
20 Sep 2024
Guided Profile Generation Improves Personalization with LLMs
Guided Profile Generation Improves Personalization with LLMs
Jiarui Zhang
80
7
0
19 Sep 2024
TACO-RL: Task Aware Prompt Compression Optimization with Reinforcement
  Learning
TACO-RL: Task Aware Prompt Compression Optimization with Reinforcement Learning
Shivam Shandilya
Menglin Xia
Supriyo Ghosh
Huiqiang Jiang
Jue Zhang
Qianhui Wu
Victor Rühle
82
8
0
19 Sep 2024
Pay Attention to What Matters
Pay Attention to What Matters
Pedro Luiz Silva
Antonio De Domenico
Ali Maatouk
Fadhel Ayed
ALM
56
1
0
19 Sep 2024
Language Models Learn to Mislead Humans via RLHF
Language Models Learn to Mislead Humans via RLHF
Jiaxin Wen
Ruiqi Zhong
Akbir Khan
Ethan Perez
Jacob Steinhardt
Minlie Huang
Samuel R. Bowman
He He
Shi Feng
113
44
0
19 Sep 2024
Exploring Large Language Models for Product Attribute Value
  Identification
Exploring Large Language Models for Product Attribute Value Identification
Kassem Sabeh
Mouna Kacimi
Johann Gamper
Robert Litschko
Barbara Plank
75
2
0
19 Sep 2024
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward
Dongheng Li
Yongchang Hao
Lili Mou
114
2
0
19 Sep 2024
Unlocking Reasoning Potential in Large Langauge Models by Scaling
  Code-form Planning
Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning
Jiaxin Wen
Jian Guan
Hongning Wang
Wei Wu
Minlie Huang
ReLMOffRLLRM
79
11
0
19 Sep 2024
Enhancing Logical Reasoning in Large Language Models through Graph-based
  Synthetic Data
Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data
Jiaming Zhou
Abbas Ghaddar
Ge Zhang
Liheng Ma
Yaochen Hu
Soumyasundar Pal
Mark Coates
Bin Wang
Yingxue Zhang
Jianye Hao
ReLMLRM
100
4
0
19 Sep 2024
Preference Alignment Improves Language Model-Based TTS
Preference Alignment Improves Language Model-Based TTS
Jinchuan Tian
Chunlei Zhang
Jiatong Shi
Hao Zhang
Jianwei Yu
Shinji Watanabe
Dong Yu
69
8
0
19 Sep 2024
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models
Peiyi Zhang
Yazhou Zhang
Bo Wang
Lu Rong
Jing Qin
Jing Qin
AI4EdELM
145
2
0
19 Sep 2024
The Central Role of the Loss Function in Reinforcement Learning
The Central Role of the Loss Function in Reinforcement Learning
Kaiwen Wang
Nathan Kallus
Wen Sun
OffRL
317
10
0
19 Sep 2024
Prompts Are Programs Too! Understanding How Developers Build Software Containing Prompts
Prompts Are Programs Too! Understanding How Developers Build Software Containing Prompts
Jenny T Liang
Melissa Lin
Nikitha Rao
Brad A. Myers
154
7
0
19 Sep 2024
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via
  Self-Improvement
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement
An Yang
Beichen Zhang
Binyuan Hui
Bofei Gao
Bowen Yu
...
Mingfeng Xue
Runji Lin
Tianyu Liu
Xingzhang Ren
Zhenru Zhang
OSLMLRM
162
321
0
18 Sep 2024
Finding the Subjective Truth: Collecting 2 Million Votes for
  Comprehensive Gen-AI Model Evaluation
Finding the Subjective Truth: Collecting 2 Million Votes for Comprehensive Gen-AI Model Evaluation
Dimitrios Christodoulou
Mads Kuhlmann-Jørgensen
EGVM
45
6
0
18 Sep 2024
Multitask Mayhem: Unveiling and Mitigating Safety Gaps in LLMs
  Fine-tuning
Multitask Mayhem: Unveiling and Mitigating Safety Gaps in LLMs Fine-tuning
Essa Jan
Nouar Aldahoul
Moiz Ali
Faizan Ahmad
Fareed Zaffar
Yasir Zaki
57
3
0
18 Sep 2024
How to Build the Virtual Cell with Artificial Intelligence: Priorities
  and Opportunities
How to Build the Virtual Cell with Artificial Intelligence: Priorities and Opportunities
Charlotte Bunne
Yusuf Roohani
Yanay Rosen
Ankit Gupta
Xikun Zhang
...
Theofanis Karaletsos
Aviv Regev
Emma Lundberg
J. Leskovec
Stephen R. Quake
120
26
0
18 Sep 2024
Reward-Robust RLHF in LLMs
Reward-Robust RLHF in LLMs
Yuzi Yan
Xingzhou Lou
Jialian Li
Yiping Zhang
Jian Xie
Chao Yu
Yu Wang
Dong Yan
Yuan Shen
106
13
0
18 Sep 2024
From Lists to Emojis: How Format Bias Affects Model Alignment
From Lists to Emojis: How Format Bias Affects Model Alignment
Xuanchang Zhang
Wei Xiong
Lichang Chen
Dinesh Manocha
Heng Huang
Tong Zhang
ALM
122
13
0
18 Sep 2024
Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs
Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs
Guillermo Marco
Luz Rello
Julio Gonzalo
LM&MAALM
113
7
0
17 Sep 2024
Multi-Document Grounded Multi-Turn Synthetic Dialog Generation
Multi-Document Grounded Multi-Turn Synthetic Dialog Generation
Young-Suk Lee
Chulaka Gunasekara
Danish Contractor
Ramón Fernandez Astudillo
Radu Florian
59
1
0
17 Sep 2024
Enriching Datasets with Demographics through Large Language Models:
  What's in a Name?
Enriching Datasets with Demographics through Large Language Models: What's in a Name?
Khaled AlNuaimi
Gautier Marti
Mathieu Ravaut
Abdulla Alketbi
Andreas Henschel
Raed Jaradat
68
1
0
17 Sep 2024
NVLM: Open Frontier-Class Multimodal LLMs
NVLM: Open Frontier-Class Multimodal LLMs
Wenliang Dai
Nayeon Lee
Wei Ping
Zhuoling Yang
Zihan Liu
Jon Barker
Tuomas Rintamaki
Mohammad Shoeybi
Bryan Catanzaro
Ming-Yu Liu
MLLMVLMLRM
134
73
0
17 Sep 2024
CoCA: Regaining Safety-awareness of Multimodal Large Language Models
  with Constitutional Calibration
CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration
Jiahui Gao
Renjie Pi
Tianyang Han
Han Wu
Lanqing Hong
Lingpeng Kong
Xin Jiang
Zhenguo Li
136
8
0
17 Sep 2024
Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments
Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments
M. Rigaki
C. Catania
Sebastian Garcia
LLMAG
108
5
0
17 Sep 2024
LLM-as-a-Judge & Reward Model: What They Can and Cannot Do
LLM-as-a-Judge & Reward Model: What They Can and Cannot Do
Guijin Son
Hyunwoo Ko
Hoyoung Lee
Yewon Kim
Seunghyeok Hong
ALMELM
102
11
0
17 Sep 2024
Self-Evolutionary Large Language Models through Uncertainty-Enhanced
  Preference Optimization
Self-Evolutionary Large Language Models through Uncertainty-Enhanced Preference Optimization
Jianing Wang
Yang Zhou
Xiaocheng Zhang
Mengjiao Bao
Peng Yan
73
2
0
17 Sep 2024
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like
  Language Models
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models
Orion Weller
Benjamin Van Durme
Dawn J Lawrie
Ashwin Paranjape
Yuhao Zhang
Jack Hessel
LRMRALM
97
25
0
17 Sep 2024
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse
Maojia Song
Shang Hong Sim
Rishabh Bhardwaj
Hai Leong Chieu
Navonil Majumder
Soujanya Poria
147
12
0
17 Sep 2024
REAL: Response Embedding-based Alignment for LLMs
REAL: Response Embedding-based Alignment for LLMs
Honggen Zhang
Xufeng Zhao
Igor Molybog
June Zhang
93
2
0
17 Sep 2024
Semantics Preserving Emoji Recommendation with Large Language Models
Semantics Preserving Emoji Recommendation with Large Language Models
Zhongyi Qiu
Kangyi Qiu
Hanjia Lyu
Wei Xiong
Jiebo Luo
110
1
0
16 Sep 2024
Previous
123...505152...126127128
Next