ResearchTrend.AI
Training language models to follow instructions with human feedback


4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLM, ALM

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,392 papers shown
Title
Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning
Rongsheng Wang
Hao Chen
Ruizhe Zhou
Yaofei Duan
Kunyan Cai
...
Jiaxi Cui
Jian Li
P. Pang
Yapeng Wang
Tao Tan
MoE
75
2
0
22 Dec 2023
Exploiting Novel GPT-4 APIs
Kellin Pelrine
Mohammad Taufeeque
Michał Zając
Euan McLean
Adam Gleave
SILM
62
21
0
21 Dec 2023
T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step
Zehui Chen
Weihua Du
Wenwei Zhang
Kuikun Liu
Jiangning Liu
...
Jingming Zhuo
Songyang Zhang
Dahua Lin
Kai-xiang Chen
Feng Zhao
LLMAG, ELM
122
32
0
21 Dec 2023
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
Desai Xie
Jiahao Li
Hao Tan
Xin Sun
Zhixin Shu
Yi Zhou
Sai Bi
Soren Pirk
Arie E. Kaufman
127
12
0
21 Dec 2023
Typhoon: Thai Large Language Models
Kunat Pipatanakul
Phatrasek Jirabovonvisut
Potsawee Manakul
Sittipong Sripaisarnmongkol
Ruangsak Patomwong
Pathomporn Chokchainant
Kasima Tharnpipitchai
107
17
0
21 Dec 2023
LLM4VG: Large Language Models Evaluation for Video Grounding
Wei Feng
Xin Wang
Hong Chen
Zeyang Zhang
Zihan Song
Yuwei Zhou
Wenwu Zhu
112
8
0
21 Dec 2023
Speech Translation with Large Language Models: An Industrial Practice
Zhichao Huang
Rong Ye
Tom Ko
Qianqian Dong
Shanbo Cheng
Mingxuan Wang
Hang Li
128
19
0
21 Dec 2023
Contextual Code Switching for Machine Translation using Language Models
Arshad Kaji
Manan Shah
54
2
0
20 Dec 2023
Exploring Multimodal Large Language Models for Radiology Report Error-checking
Jinge Wu
Yunsoo Kim
Eva C. Keller
Jamie Chow
Adam P. Levine
Nikolas Pontikos
Zina M. Ibrahim
Paul Taylor
Michelle C. Williams
Honghan Wu
LM&MA
43
4
0
20 Dec 2023
Benchmarking and Analyzing In-context Learning, Fine-tuning and Supervised Learning for Biomedical Knowledge Curation: a focused study on chemical entities of biological interest
Emily Groves
Minhong Wang
Yusuf Abdulle
Holger Kunz
J. Hoelscher-Obermaier
Ronin Wu
Honghan Wu
61
2
0
20 Dec 2023
OpenRL: A Unified Reinforcement Learning Framework
Shiyu Huang
Wentse Chen
Yiwen Sun
Fuqing Bie
Weijuan Tu
83
3
0
20 Dec 2023
Assaying on the Robustness of Zero-Shot Machine-Generated Text Detectors
Yi-Fan Zhang
Zhang Zhang
Liang Wang
Tien-Ping Tan
Rong Jin
DeLMO
124
11
0
20 Dec 2023
Language Resources for Dutch Large Language Modelling
Bram Vanroy
MoE, ALM
57
9
0
20 Dec 2023
WaveCoder: Widespread And Versatile Enhancement For Code Large Language Models By Instruction Tuning
Zhaojian Yu
Xin Zhang
Ning Shang
Yangyu Huang
Can Xu
Yishujie Zhao
Wenxiang Hu
Qiufeng Yin
SyDa
137
28
0
20 Dec 2023
Fine-tuning Large Language Models for Adaptive Machine Translation
Yasmin Moslem
Rejwanul Haque
Andy Way
67
29
0
20 Dec 2023
FSscore: A Machine Learning-based Synthetic Feasibility Score Leveraging Human Expertise
Rebecca M. Neeser
Bruno Correia
Philippe Schwaller
47
1
0
20 Dec 2023
Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?
Tannon Kew
Florian Schottmann
Rico Sennrich
LRM
100
40
0
20 Dec 2023
ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training
Rongsheng Wang
Qingsong Yao
Zihang Jiang
Zhiyang He
Xiaodong Tao
Zihang Jiang
S. Kevin Zhou
MedIm, VLM
116
6
0
20 Dec 2023
Auto311: A Confidence-guided Automated System for Non-emergency Calls
Zirong Chen
Xutong Sun
Yuanhe Li
Meiyi Ma
95
1
0
19 Dec 2023
InstructVideo: Instructing Video Diffusion Models with Human Feedback
Hangjie Yuan
Shiwei Zhang
Xiang Wang
Yujie Wei
Tao Feng
Yining Pan
Yingya Zhang
Ziwei Liu
Samuel Albanie
Dong Ni
VGen
116
46
0
19 Dec 2023
LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction
Yucheng Li
Frank Guerin
Chenghua Lin
59
35
0
19 Dec 2023
Instruct-SCTG: Guiding Sequential Controlled Text Generation through Instructions
Yinhong Liu
Yixuan Su
Ehsan Shareghi
Nigel Collier
62
1
0
19 Dec 2023
VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering
Chun-Mei Feng
Yang Bai
Yaoyu Zhang
Zhen Li
Salman Khan
Wangmeng Zuo
Xinxing Xu
Rick Siow Mong Goh
Yong-Jin Liu
93
7
0
19 Dec 2023
Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model
Lingjun Zhang
Xinyuan Chen
Yaohui Wang
Yue Lu
Yu Qiao
DiffM
84
36
0
19 Dec 2023
HuTuMotion: Human-Tuned Navigation of Latent Motion Diffusion Models with Minimal Feedback
Gaoge Han
Shaoli Huang
Biwei Huang
Jinglei Tang
VGen
60
2
0
19 Dec 2023
On Early Detection of Hallucinations in Factual Question Answering
Ben Snyder
Marius Moisescu
Muhammad Bilal Zafar
HILM
128
28
0
19 Dec 2023
Neuron-Level Knowledge Attribution in Large Language Models
Zeping Yu
Sophia Ananiadou
FAtt, KELM
92
11
0
19 Dec 2023
Active Preference Inference using Language Models and Probabilistic Reasoning
Wasu Top Piriyakulkij
Volodymyr Kuleshov
Kevin Ellis
LRM
79
15
0
19 Dec 2023
Xpert: Empowering Incident Management with Query Recommendations via Large Language Models
Yuxuan Jiang
Chaoyun Zhang
Shilin He
Zhihao Yang
Ming-Jie Ma
...
Yu Kang
Yingnong Dang
Saravan Rajmohan
Qingwei Lin
Dongmei Zhang
95
22
0
19 Dec 2023
Big Learning Expectation Maximization
Yulai Cong
Sijia Li
70
2
0
19 Dec 2023
ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference
Ziqian Zeng
Yihuai Hong
Hongliang Dai
Huiping Zhuang
Cen Chen
91
10
0
19 Dec 2023
A Revisit of Fake News Dataset with Augmented Fact-checking by ChatGPT
Zizhong Li
Haopeng Zhang
Jiawei Zhang
91
6
0
19 Dec 2023
An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training
Youshao Xiao
Weichang Wu
Zhenglei Zhou
Fagui Mao
Shangchun Zhao
Lin Ju
Lei Liang
Xiaolu Zhang
Jun Zhou
83
6
0
19 Dec 2023
Urban Generative Intelligence (UGI): A Foundational Platform for Agents in Embodied City Environment
Fengli Xu
Jun Zhang
Chen Gao
J. Feng
Yong Li
AI4CE, LLMAG
99
32
0
19 Dec 2023
Designing LLM Chains by Adapting Techniques from Crowdsourcing Workflows
Madeleine Grunde-McLaughlin
Michelle S. Lam
Ranjay Krishna
Daniel S. Weld
Jeffrey Heer
AI4CE
124
21
0
18 Dec 2023
Traces of Memorisation in Large Language Models for Code
Ali Al-Kaswan
Maliheh Izadi
Arie van Deursen
ELM
62
17
0
18 Dec 2023
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint
Wei Xiong
Hanze Dong
Chen Ye
Ziqi Wang
Han Zhong
Heng Ji
Nan Jiang
Tong Zhang
OffRL
135
204
0
18 Dec 2023
An In-depth Look at Gemini's Language Abilities
Syeda Nahida Akter
Zichun Yu
Aashiq Muhamed
Tianyue Ou
Alex Bäuerle
Ángel Alexander Cabrera
Krish Dholakia
Chenyan Xiong
Graham Neubig
LRM, ELM
103
36
0
18 Dec 2023
Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations
Zilin Wang
Hao-Wen Zhuang
Lu Li
Yinmin Zhang
Junjie Zhong
Jun Chen
Yu Yang
Boshi Tang
Zhiyong Wu
84
3
0
18 Dec 2023
Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning
Bingchen Zhao
Haoqin Tu
Chen Wei
Jieru Mei
Cihang Xie
117
36
0
18 Dec 2023
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model
Jiahui Gao
Renjie Pi
Jipeng Zhang
Jiacheng Ye
Wanjun Zhong
...
Lanqing Hong
Jianhua Han
Hang Xu
Zhenguo Li
Lingpeng Kong
SyDa, ReLM, LRM
122
119
0
18 Dec 2023
Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge Graphs
Yuxuan Huang
Lida Shi
Anqi Liu
Hao Xu
LLMAG, ELM, KELM, LRM
55
4
0
18 Dec 2023
MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL
Bing Wang
Changyu Ren
Jian Yang
Xinnian Liang
Jiaqi Bai
...
Zhao Yan
Qian-Wen Zhang
Di Yin
Xing Sun
Zhoujun Li
131
68
0
18 Dec 2023
VinaLLaMA: LLaMA-based Vietnamese Foundation Model
Quan Van Nguyen
Huy Quang Pham
Dung Dao
ALM
64
8
0
18 Dec 2023
Retrieval-Augmented Generation for Large Language Models: A Survey
Yunfan Gao
Yun Xiong
Xinyu Gao
Kangxiang Jia
Jinliu Pan
Yuxi Bi
Yi Dai
Jiawei Sun
Meng Wang
Haofen Wang
3DV, RALM
341
1,846
1
18 Dec 2023
A Comprehensive Survey of Attack Techniques, Implementation, and Mitigation Strategies in Large Language Models
Aysan Esmradi
Daniel Wankit Yip
C. Chan
AAML
83
14
0
18 Dec 2023
Dynamic Retrieval Augmented Generation of Ontologies using Artificial Intelligence (DRAGON-AI)
Sabrina Toro
A. V. Anagnostopoulos
Sue Bello
Kai Blumberg
Rhiannon Cameron
...
Ray Stefancsik
Magalie Weber
Valerie Wood
M. Haendel
Christopher J. Mungall
3DV
79
28
0
18 Dec 2023
Demystifying Instruction Mixing for Fine-tuning Large Language Models
Renxi Wang
Haonan Li
Minghao Wu
Yuxia Wang
Xudong Han
Chiyu Zhang
Timothy Baldwin
54
0
0
17 Dec 2023
M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Mingsheng Li
Xin Chen
C. Zhang
Sijin Chen
Erik Cambria
Fukun Yin
Gang Yu
Tao Chen
88
26
0
17 Dec 2023
A Survey of Reasoning with Foundation Models
Jiankai Sun
Chuanyang Zheng
Enze Xie
Zhengying Liu
Ruihang Chu
...
Xipeng Qiu
Yi-Chen Guo
Hui Xiong
Qun Liu
Zhenguo Li
ReLM, LRM, AI4CE
209
85
0
17 Dec 2023