ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLM
    ALM
ArXivPDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 7,316 papers shown
Title
DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented
  Dialogue Representations
DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations
Weihao Zeng
Dayuan Fu
Keqing He
Yejie Wang
Yukai Xu
Weiran Xu
51
2
0
31 Mar 2024
LLMs are Good Action Recognizers
LLMs are Good Action Recognizers
Haoxuan Qu
Yujun Cai
Jun Liu
43
16
0
31 Mar 2024
Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization
Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization
Hritik Bansal
Ashima Suvarna
Gantavya Bhatt
Nanyun Peng
Kai-Wei Chang
Aditya Grover
ALM
64
9
0
31 Mar 2024
Dialectical Alignment: Resolving the Tension of 3H and Security Threats
  of LLMs
Dialectical Alignment: Resolving the Tension of 3H and Security Threats of LLMs
Shu Yang
Jiayuan Su
Han Jiang
Mengdi Li
Keyuan Cheng
Muhammad Asif Ali
Lijie Hu
Di Wang
53
5
0
30 Mar 2024
Edinburgh Clinical NLP at SemEval-2024 Task 2: Fine-tune your model
  unless you have access to GPT-4
Edinburgh Clinical NLP at SemEval-2024 Task 2: Fine-tune your model unless you have access to GPT-4
Aryo Pradipta Gema
Giwon Hong
Pasquale Minervini
Luke Daines
Beatrice Alex
35
4
0
30 Mar 2024
Small Language Models Learn Enhanced Reasoning Skills from Medical
  Textbooks
Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks
Hyunjae Kim
Hyeon Hwang
Jiwoo Lee
Sihyeon Park
Dain Kim
Taewhoo Lee
Chanwoong Yoon
Jiwoong Sohn
Donghee Choi
Jaewoo Kang
ELM
AI4MH
LRM
72
19
0
30 Mar 2024
ST-LLM: Large Language Models Are Effective Temporal Learners
ST-LLM: Large Language Models Are Effective Temporal Learners
Ruyang Liu
Chen Li
Haoran Tang
Yixiao Ge
Ying Shan
Ge Li
54
70
0
30 Mar 2024
Long-Tailed Recognition on Binary Networks by Calibrating A Pre-trained
  Model
Long-Tailed Recognition on Binary Networks by Calibrating A Pre-trained Model
Jihun Kim
Dahyun Kim
Hyungrok Jung
Taeil Oh
Jonghyun Choi
MQ
54
0
0
30 Mar 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept,
  Taxonomy, and Methods
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAG
KELM
OffRL
LM&Ro
52
51
0
30 Mar 2024
Instruction-Driven Game Engines on Large Language Models
Instruction-Driven Game Engines on Large Language Models
Hongqiu Wu
Xing-Chen Liu
Haizhen Zhao
Min Zhang
47
1
0
30 Mar 2024
Image-to-Image Matching via Foundation Models: A New Perspective for
  Open-Vocabulary Semantic Segmentation
Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation
Yuan Wang
Rui Sun
Naisong Luo
Yuwen Pan
Tianzhu Zhang
VLM
56
9
0
30 Mar 2024
A Survey of using Large Language Models for Generating Infrastructure as
  Code
A Survey of using Large Language Models for Generating Infrastructure as Code
Kalahasti Ganesh Srivatsa
Sabyasachi Mukhopadhyay
Ganesh Katrapati
Manish Shrivastava
41
1
0
30 Mar 2024
Rationale-based Opinion Summarization
Rationale-based Opinion Summarization
Haoyuan Li
Snigdha Chaturvedi
62
4
0
30 Mar 2024
Are We on the Right Way for Evaluating Large Vision-Language Models?
Are We on the Right Way for Evaluating Large Vision-Language Models?
Lin Chen
Jinsong Li
Xiao-wen Dong
Pan Zhang
Yuhang Zang
...
Haodong Duan
Jiaqi Wang
Yu Qiao
Dahua Lin
Feng Zhao
VLM
83
227
0
29 Mar 2024
ReALM: Reference Resolution As Language Modeling
ReALM: Reference Resolution As Language Modeling
Joel Ruben Antony Moniz
Soundarya Krishnan
Melis Ozyildirim
Prathamesh Saraf
Halim Cagri Ates
Yuan-kang Zhang
Hong-ye Yu
Nidhi Rajshree
50
6
0
29 Mar 2024
Embracing Unknown Step by Step: Towards Reliable Sparse Training in Real
  World
Embracing Unknown Step by Step: Towards Reliable Sparse Training in Real World
Bowen Lei
Dongkuan Xu
Ruqi Zhang
Bani Mallick
UQCV
52
0
0
29 Mar 2024
FABind+: Enhancing Molecular Docking through Improved Pocket Prediction and Pose Generation
FABind+: Enhancing Molecular Docking through Improved Pocket Prediction and Pose Generation
Kaiyuan Gao
Qizhi Pei
Jinhua Zhu
Kun He
Lijun Wu
Lijun Wu
39
6
0
29 Mar 2024
ChatTracer: Large Language Model Powered Real-time Bluetooth Device
  Tracking System
ChatTracer: Large Language Model Powered Real-time Bluetooth Device Tracking System
Qijun Wang
Shichen Zhang
Kunzhe Song
Huacheng Zeng
35
1
0
28 Mar 2024
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Kai Zhang
Yi Luan
Hexiang Hu
Kenton Lee
Siyuan Qiao
Wenhu Chen
Yu-Chuan Su
Ming-Wei Chang
VLM
LRM
47
34
0
28 Mar 2024
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
Bu Jin
Yupeng Zheng
Pengfei Li
Weize Li
Yuhang Zheng
...
Kun Zhan
Peng Jia
Xiaoxiao Long
Yilun Chen
Hao Zhao
3DV
79
16
0
28 Mar 2024
JDocQA: Japanese Document Question Answering Dataset for Generative
  Language Models
JDocQA: Japanese Document Question Answering Dataset for Generative Language Models
Eri Onami
Shuhei Kurita
Taiki Miyanishi
Taro Watanabe
37
1
0
28 Mar 2024
Mixed Preference Optimization: Reinforcement Learning with Data Selection and Better Reference Model
Mixed Preference Optimization: Reinforcement Learning with Data Selection and Better Reference Model
Qi Gou
Cam-Tu Nguyen
35
8
0
28 Mar 2024
Breaking the Length Barrier: LLM-Enhanced CTR Prediction in Long Textual
  User Behaviors
Breaking the Length Barrier: LLM-Enhanced CTR Prediction in Long Textual User Behaviors
Binzong Geng
Zhaoxin Huan
Xiaolu Zhang
Yong He
Liang Zhang
Fajie Yuan
Jun Zhou
Linjian Mo
29
19
0
28 Mar 2024
Plug-and-Play Grounding of Reasoning in Multimodal Large Language Models
Plug-and-Play Grounding of Reasoning in Multimodal Large Language Models
Jiaxing Chen
Yuxuan Liu
Dehu Li
Xiang An
Weimo Deng
Ziyong Feng
Yongle Zhao
Yin Xie
LRM
46
14
0
28 Mar 2024
Fine-Tuning Language Models with Reward Learning on Policy
Fine-Tuning Language Models with Reward Learning on Policy
Hao Lang
Fei Huang
Yongbin Li
ALM
45
5
0
28 Mar 2024
Text Data-Centric Image Captioning with Interactive Prompts
Text Data-Centric Image Captioning with Interactive Prompts
Yiyu Wang
Hao Luo
Jungang Xu
Yingfei Sun
Fan Wang
VLM
45
0
0
28 Mar 2024
Disentangling Length from Quality in Direct Preference Optimization
Disentangling Length from Quality in Direct Preference Optimization
Ryan Park
Rafael Rafailov
Stefano Ermon
Chelsea Finn
ALM
56
114
0
28 Mar 2024
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large
  Language Models
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
Patrick Chao
Edoardo Debenedetti
Alexander Robey
Maksym Andriushchenko
Francesco Croce
...
Nicolas Flammarion
George J. Pappas
F. Tramèr
Hamed Hassani
Eric Wong
ALM
ELM
AAML
57
101
0
28 Mar 2024
Learning From Correctness Without Prompting Makes LLM Efficient Reasoner
Learning From Correctness Without Prompting Makes LLM Efficient Reasoner
Yuxuan Yao
Han Wu
Zhijiang Guo
Biyan Zhou
Jiahui Gao
Sichun Luo
Hanxu Hou
Xiaojin Fu
Linqi Song
LLMAG
LRM
50
9
0
28 Mar 2024
RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Zeren Chen
Zhelun Shi
Xiaoya Lu
Lehan He
Sucheng Qian
...
Zhen-fei Yin
Jing Shao
Jing Shao
Cewu Lu
Cewu Lu
43
5
0
28 Mar 2024
TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios
TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios
Xiaokang Zhang
Jing Zhang
Zeyao Ma
Yang Li
Bohan Zhang
...
Didong Li
Shu Zhao
Juan-Zi Li
Jie Tang
J. Tang
LMTD
RALM
46
22
0
28 Mar 2024
IDGenRec: LLM-RecSys Alignment with Textual ID Learning
IDGenRec: LLM-RecSys Alignment with Textual ID Learning
Juntao Tan
Shuyuan Xu
Wenyue Hua
Yingqiang Ge
Zelong Li
Yongfeng Zhang
51
23
0
27 Mar 2024
A Survey on Large Language Models from Concept to Implementation
A Survey on Large Language Models from Concept to Implementation
Chen Wang
Jin Zhao
Jiaqi Gong
LLMAG
LM&MA
47
3
0
27 Mar 2024
Mini-Gemini: Mining the Potential of Multi-modality Vision Language
  Models
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Yanwei Li
Yuechen Zhang
Chengyao Wang
Zhisheng Zhong
Yixin Chen
Ruihang Chu
Shaoteng Liu
Jiaya Jia
VLM
MLLM
MoE
45
215
0
27 Mar 2024
CYCLE: Learning to Self-Refine the Code Generation
CYCLE: Learning to Self-Refine the Code Generation
Yangruibo Ding
Marcus J. Min
Gail E. Kaiser
Baishakhi Ray
41
29
0
27 Mar 2024
Understanding the Learning Dynamics of Alignment with Human Feedback
Understanding the Learning Dynamics of Alignment with Human Feedback
Shawn Im
Yixuan Li
ALM
37
11
0
27 Mar 2024
Non-Linear Inference Time Intervention: Improving LLM Truthfulness
Non-Linear Inference Time Intervention: Improving LLM Truthfulness
Jakub Hoscilowicz
Adam Wiacek
Jan Chojnacki
Adam Cieślak
Leszek Michon
Vitalii Urbanevych
Artur Janicki
KELM
38
2
0
27 Mar 2024
Vulnerability Detection with Code Language Models: How Far Are We?
Vulnerability Detection with Code Language Models: How Far Are We?
Yangruibo Ding
Yanjun Fu
Omniyyah Ibrahim
Chawin Sitawarin
Xinyun Chen
Basel Alomair
David Wagner
Baishakhi Ray
Yizheng Chen
AAML
51
45
0
27 Mar 2024
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text
Elliot Bolton
Abhinav Venigalla
Michihiro Yasunaga
David Leo Wright Hall
Betty Xiong
...
R. Daneshjou
Jonathan Frankle
Percy Liang
Michael Carbin
Christopher D. Manning
LM&MA
MedIm
34
52
0
27 Mar 2024
Improving Attributed Text Generation of Large Language Models via
  Preference Learning
Improving Attributed Text Generation of Large Language Models via Preference Learning
Dongfang Li
Zetian Sun
Baotian Hu
Zhenyu Liu
Xinshuo Hu
Xuebo Liu
Min Zhang
53
13
0
27 Mar 2024
BLADE: Enhancing Black-box Large Language Models with Small
  Domain-Specific Models
BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
Haitao Li
Qingyao Ai
Jia Chen
Qian Dong
Zhijing Wu
Yiqun Liu
Chong Chen
Qi Tian
AILaw
62
13
0
27 Mar 2024
Rejection Improves Reliability: Training LLMs to Refuse Unknown
  Questions Using RL from Knowledge Feedback
Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback
Hongshen Xu
Zichen Zhu
Situo Zhang
Da Ma
Shuai Fan
Lu Chen
Kai Yu
HILM
44
35
0
27 Mar 2024
Quantifying and Mitigating Unimodal Biases in Multimodal Large Language
  Models: A Causal Perspective
Quantifying and Mitigating Unimodal Biases in Multimodal Large Language Models: A Causal Perspective
Meiqi Chen
Yixin Cao
Yan Zhang
Chaochao Lu
37
13
0
27 Mar 2024
LC-LLM: Explainable Lane-Change Intention and Trajectory Predictions
  with Large Language Models
LC-LLM: Explainable Lane-Change Intention and Trajectory Predictions with Large Language Models
Mingxing Peng
Xusen Guo
Xianda Chen
Meixin Zhu
Kehua Chen
Hao
Hao Yang
Xuesong Wang
Yinhai Wang
LRM
30
16
0
27 Mar 2024
Beyond Embeddings: The Promise of Visual Table in Visual Reasoning
Beyond Embeddings: The Promise of Visual Table in Visual Reasoning
Yiwu Zhong
Zi-Yuan Hu
Michael R. Lyu
Liwei Wang
31
1
0
27 Mar 2024
Boosting Conversational Question Answering with Fine-Grained
  Retrieval-Augmentation and Self-Check
Boosting Conversational Question Answering with Fine-Grained Retrieval-Augmentation and Self-Check
Linhao Ye
Zhikai Lei
Jia-Peng Yin
Qin Chen
Jie Zhou
Liang He
3DV
RALM
34
17
0
27 Mar 2024
Exploring the Privacy Protection Capabilities of Chinese Large Language
  Models
Exploring the Privacy Protection Capabilities of Chinese Large Language Models
Yuqi Yang
Xiaowen Huang
Jitao Sang
ELM
PILM
AILaw
57
1
0
27 Mar 2024
FoC: Figure out the Cryptographic Functions in Stripped Binaries with LLMs
FoC: Figure out the Cryptographic Functions in Stripped Binaries with LLMs
Guoqiang Chen
Xiuwei Shang
Shaoyin Cheng
Yanming Zhang
Weiming Zhang
Neng H. Yu
N. Yu
94
2
0
27 Mar 2024
Large Language Models as Financial Data Annotators: A Study on
  Effectiveness and Efficiency
Large Language Models as Financial Data Annotators: A Study on Effectiveness and Efficiency
Toyin Aguda
S. Siddagangappa
Elena Kochkina
Simerjot Kaur
Dongsheng Wang
Charese Smiley
Sameena Shah
46
10
0
26 Mar 2024
ChatGPT Role-play Dataset: Analysis of User Motives and Model
  Naturalness
ChatGPT Role-play Dataset: Analysis of User Motives and Model Naturalness
Sabrina Bodmer
Ameeta Agrawal
Judit Dombi
Tetyana Sydorenko
Jung In Lee
32
4
0
26 Mar 2024
Previous
123...757677...145146147
Next