ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLMALM
ArXiv (abs)PDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,388 papers shown
Title
Multi-Modal Hallucination Control by Visual Information Grounding
Multi-Modal Hallucination Control by Visual Information Grounding
Alessandro Favero
Luca Zancato
Matthew Trager
Siddharth Choudhary
Pramuditha Perera
Alessandro Achille
Ashwin Swaminathan
Stefano Soatto
MLLM
132
80
0
20 Mar 2024
Testing the Limits of Jailbreaking Defenses with the Purple Problem
Testing the Limits of Jailbreaking Defenses with the Purple Problem
Taeyoun Kim
Suhas Kotha
Aditi Raghunathan
AAML
91
6
0
20 Mar 2024
Ink and Individuality: Crafting a Personalised Narrative in the Age of
  LLMs
Ink and Individuality: Crafting a Personalised Narrative in the Age of LLMs
Azmine Toushik Wasi
Raima Islam
Rafia Islam
81
4
0
20 Mar 2024
Train & Constrain: Phonologically Informed Tongue-Twister Generation
  from Topics and Paraphrases
Train & Constrain: Phonologically Informed Tongue-Twister Generation from Topics and Paraphrases
Tyler Loakman
Chen Tang
Chenghua Lin
103
4
0
20 Mar 2024
RewardBench: Evaluating Reward Models for Language Modeling
RewardBench: Evaluating Reward Models for Language Modeling
Nathan Lambert
Valentina Pyatkin
Jacob Morrison
Lester James V. Miranda
Bill Yuchen Lin
...
Sachin Kumar
Tom Zick
Yejin Choi
Noah A. Smith
Hanna Hajishirzi
ALM
195
260
0
20 Mar 2024
Chain-of-Interaction: Enhancing Large Language Models for Psychiatric
  Behavior Understanding by Dyadic Contexts
Chain-of-Interaction: Enhancing Large Language Models for Psychiatric Behavior Understanding by Dyadic Contexts
Guangzeng Han
Weisi Liu
Xiaolei Huang
Brian Borsari
79
22
0
20 Mar 2024
Teacher-Student Training for Debiasing: General Permutation Debiasing
  for Large Language Models
Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language Models
Adian Liusie
Yassir Fathullah
Mark Gales
54
5
0
20 Mar 2024
How Gender Interacts with Political Values: A Case Study on Czech BERT
  Models
How Gender Interacts with Political Values: A Case Study on Czech BERT Models
Adnan Al Ali
Jindvrich Libovický
50
0
0
20 Mar 2024
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal
  Large Language Models
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Wenqiao Zhang
Tianwei Lin
Jiang Liu
Fangxun Shu
Haoyuan Li
...
Zheqi Lv
Hao Jiang
Juncheng Li
Siliang Tang
Yueting Zhuang
VLMMLLM
95
6
0
20 Mar 2024
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
Yaowei Zheng
Richong Zhang
Junhao Zhang
Yanhan Ye
Zheyan Luo
Zhangchi Feng
Yongqiang Ma
184
559
0
20 Mar 2024
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in
  Text-to-Image Generation
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation
Jingkun An
Yinghao Zhu
Zongjian Li
Haoran Feng
Bohua Chen
Yemin Shi
Chengwei Pan
74
2
0
20 Mar 2024
Hyacinth6B: A large language model for Traditional Chinese
Hyacinth6B: A large language model for Traditional Chinese
Chih-Wei Song
Yin-Te Tsai
107
0
0
20 Mar 2024
Mapping LLM Security Landscapes: A Comprehensive Stakeholder Risk
  Assessment Proposal
Mapping LLM Security Landscapes: A Comprehensive Stakeholder Risk Assessment Proposal
Rahul Pankajakshan
Sumitra Biswal
Yuvaraj Govindarajulu
Gilad Gressel
90
15
0
20 Mar 2024
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large
  Vision Language Models
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
Tongtian Yue
Jie Cheng
Longteng Guo
Xingyuan Dai
Zijia Zhao
Xingjian He
Gang Xiong
Yisheng Lv
Jing Liu
115
11
0
20 Mar 2024
Facilitating Pornographic Text Detection for Open-Domain Dialogue
  Systems via Knowledge Distillation of Large Language Models
Facilitating Pornographic Text Detection for Open-Domain Dialogue Systems via Knowledge Distillation of Large Language Models
Huachuan Qiu
Shuai Zhang
Hongliang He
Anqi Li
Zhenzhong Lan
94
1
0
20 Mar 2024
Diffusion Model for Data-Driven Black-Box Optimization
Diffusion Model for Data-Driven Black-Box Optimization
Zihao Li
Hui Yuan
Kaixuan Huang
Chengzhuo Ni
Yinyu Ye
Minshuo Chen
Mengdi Wang
DiffM
109
13
0
20 Mar 2024
Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language
  Models
Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Zuyan Liu
Yuhao Dong
Yongming Rao
Jie Zhou
Jiwen Lu
LRM
79
21
0
19 Mar 2024
TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation
TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation
Yufei Liu
Junwei Zhu
Junshu Tang
Shijie Zhang
Jiangning Zhang
Weijian Cao
Chengjie Wang
Yunsheng Wu
Dongjin Huang
75
10
0
19 Mar 2024
HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
Fucai Ke
Zhixi Cai
Simindokht Jahangard
Weiqing Wang
P. D. Haghighi
Hamid Rezatofighi
LRM
99
12
0
19 Mar 2024
MELTing point: Mobile Evaluation of Language Transformers
MELTing point: Mobile Evaluation of Language Transformers
Stefanos Laskaridis
Kleomenis Katevas
Lorenzo Minto
Hamed Haddadi
95
24
0
19 Mar 2024
Contextual Moral Value Alignment Through Context-Based Aggregation
Contextual Moral Value Alignment Through Context-Based Aggregation
Pierre Dognin
Jesus Rios
Ronny Luss
Inkit Padhi
Matthew D Riemer
Miao Liu
P. Sattigeri
Manish Nagireddy
Kush R. Varshney
Djallel Bouneffouf
69
6
0
19 Mar 2024
WaterVG: Waterway Visual Grounding based on Text-Guided Vision and
  mmWave Radar
WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar
Runwei Guan
Liye Jia
Fengyufan Yang
Shanliang Yao
Erick Purwanto
...
Eng Gee Lim
Jeremy S. Smith
Ka Lok Man
Xuming Hu
Yutao Yue
119
9
0
19 Mar 2024
LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation
  Benchmark for Chinese Large Language Models
LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Chuang Liu
Renren Jin
Yuqi Ren
Deyi Xiong
ELM
120
0
0
19 Mar 2024
AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented
  Stock-Chain Framework
AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework
Xiang Li
Zhenyu Li
Chen Shi
Yong-mei Xu
Qing Du
Mingkui Tan
Jun Huang
Wei Lin
AIFin
119
29
0
19 Mar 2024
RigorLLM: Resilient Guardrails for Large Language Models against
  Undesired Content
RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content
Zhuowen Yuan
Zidi Xiong
Yi Zeng
Ning Yu
Ruoxi Jia
Basel Alomair
Yue Liu
AAMLKELM
130
45
0
19 Mar 2024
Embodied LLM Agents Learn to Cooperate in Organized Teams
Embodied LLM Agents Learn to Cooperate in Organized Teams
Xudong Guo
Kaixuan Huang
Jiale Liu
Wenhui Fan
Natalia Vélez
Qingyun Wu
Huazheng Wang
Thomas L. Griffiths
Mengdi Wang
LM&RoLLMAG
140
49
0
19 Mar 2024
CrossTune: Black-Box Few-Shot Classification with Label Enhancement
CrossTune: Black-Box Few-Shot Classification with Label Enhancement
Danqing Luo
Chen Zhang
Yan Zhang
Haizhou Li
84
2
0
19 Mar 2024
Third-Party Language Model Performance Prediction from Instruction
Third-Party Language Model Performance Prediction from Instruction
Rahul Nadkarni
Yizhong Wang
Noah A. Smith
ELMLRM
67
0
0
19 Mar 2024
Advancing Time Series Classification with Multimodal Language Modeling
Advancing Time Series Classification with Multimodal Language Modeling
Mingyue Cheng
Yiheng Chen
Qi Liu
Zhiding Liu
Yucong Luo
AI4TS
89
12
0
19 Mar 2024
Characteristic AI Agents via Large Language Models
Characteristic AI Agents via Large Language Models
Xi Wang
Hongliang Dai
Shen Gao
Piji Li
131
3
0
19 Mar 2024
Securing Large Language Models: Threats, Vulnerabilities and Responsible Practices
Securing Large Language Models: Threats, Vulnerabilities and Responsible Practices
Sara Abdali
Richard Anarfi
C. Barberan
Jia He
Erfan Shayegani
PILM
139
31
0
19 Mar 2024
OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and
  Safety
OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety
Chuang Liu
Linhao Yu
Jiaxuan Li
Renren Jin
Yufei Huang
...
Tao Liu
Jinwang Song
Hongying Zan
Sun Li
Deyi Xiong
ELM
100
7
0
18 Mar 2024
Leveraging Large Language Models to Extract Information on Substance Use
  Disorder Severity from Clinical Notes: A Zero-shot Learning Approach
Leveraging Large Language Models to Extract Information on Substance Use Disorder Severity from Clinical Notes: A Zero-shot Learning Approach
Maria Mahbub
Gregory M Dams
Sudarshan Srinivasan
Caitlin Rizy
Ioana Danciu
Jodie Trafton
Kathryn Knight
69
4
0
18 Mar 2024
Large language models in 6G security: challenges and opportunities
Large language models in 6G security: challenges and opportunities
Tri Nguyen
Huong Nguyen
Ahmad Ijaz
Saeid Sheikhi
Athanasios V. Vasilakos
Panos Kostakos
ELM
73
13
0
18 Mar 2024
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination
  for Simulated-World Control
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control
Enshen Zhou
Yiran Qin
Zhen-fei Yin
Yuzhou Huang
Ruimao Zhang
Lu Sheng
Yu Qiao
Jing Shao
LM&RoAI4CE
116
36
0
18 Mar 2024
Enhancing Taiwanese Hokkien Dual Translation by Exploring and
  Standardizing of Four Writing Systems
Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems
Bo-Han Lu
Yi-Hsuan Lin
En-Shiun Annie Lee
Richard Tzong-Han Tsai
60
0
0
18 Mar 2024
Language Evolution with Deep Learning
Language Evolution with Deep Learning
Mathieu Rita
Paul Michel
Rahma Chaabouni
Olivier Pietquin
Emmanuel Dupoux
Florian Strub
65
3
0
18 Mar 2024
From Explainable to Interpretable Deep Learning for Natural Language
  Processing in Healthcare: How Far from Reality?
From Explainable to Interpretable Deep Learning for Natural Language Processing in Healthcare: How Far from Reality?
Guangming Huang
Yingya Li
Shoaib Jameel
Yunfei Long
G. Papanastasiou
82
23
0
18 Mar 2024
Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for
  Language Models
Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models
Yi Luo
Zheng-Wen Lin
Yuhao Zhang
Jiashuo Sun
Chen Lin
Chengjin Xu
Xiangdong Su
Yelong Shen
Jian Guo
Yeyun Gong
LM&MAELMALMAI4TS
47
1
0
18 Mar 2024
Agent3D-Zero: An Agent for Zero-shot 3D Understanding
Agent3D-Zero: An Agent for Zero-shot 3D Understanding
Sha Zhang
Di Huang
Jiajun Deng
Shixiang Tang
Wanli Ouyang
Tong He
Yanyong Zhang
VGen
71
18
0
18 Mar 2024
LogicalDefender: Discovering, Extracting, and Utilizing Common-Sense
  Knowledge
LogicalDefender: Discovering, Extracting, and Utilizing Common-Sense Knowledge
Yuhe Liu
Mengxue Kang
Zengchang Qin
Xiangxiang Chu
NAIVLM
56
0
0
18 Mar 2024
Visual Preference Inference: An Image Sequence-Based Preference
  Reasoning in Tabletop Object Manipulation
Visual Preference Inference: An Image Sequence-Based Preference Reasoning in Tabletop Object Manipulation
Joonhyung Lee
Sangbeom Park
Yongin Kwon
Jemin Lee
Minwook Ahn
Sungjoon Choi
76
0
0
18 Mar 2024
Word Order's Impacts: Insights from Reordering and Generation Analysis
Word Order's Impacts: Insights from Reordering and Generation Analysis
Qinghua Zhao
Jiaang Li
Lei Li
Zenghui Zhou
Junfeng Liu
75
0
0
18 Mar 2024
StyleChat: Learning Recitation-Augmented Memory in LLMs for Stylized
  Dialogue Generation
StyleChat: Learning Recitation-Augmented Memory in LLMs for Stylized Dialogue Generation
Jinpeng Li
Zekai Zhang
Quan Tu
Xin Cheng
Dongyan Zhao
Rui Yan
116
3
0
18 Mar 2024
InsCL: A Data-efficient Continual Learning Paradigm for Fine-tuning
  Large Language Models with Instructions
InsCL: A Data-efficient Continual Learning Paradigm for Fine-tuning Large Language Models with Instructions
Yifan Wang
Yafei Liu
Chufan Shi
Haoling Li
Chen Chen
H. Lu
Yujiu Yang
CLL
100
35
0
18 Mar 2024
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient
  LLMs Under Compression
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression
Junyuan Hong
Jinhao Duan
Chenhui Zhang
Zhangheng Li
Chulin Xie
...
B. Kailkhura
Dan Hendrycks
Dawn Song
Zhangyang Wang
Yue Liu
112
28
0
18 Mar 2024
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Kevin Xu
Yeganeh Kordi
Kate Sanders
Yizhong Wang
Adam Byerly
Kate Sanders
Adam Byerly
Jingyu Zhang
Benjamin Van Durme
Daniel Khashabi
LLMAG
129
9
0
18 Mar 2024
Driving Style Alignment for LLM-powered Driver Agent
Driving Style Alignment for LLM-powered Driver Agent
Ruoxuan Yang
Xinyue Zhang
Anais Fernandez-Laaksonen
Xin Ding
Jiangtao Gong
70
11
0
17 Mar 2024
Improving Dialogue Agents by Decomposing One Global Explicit Annotation
  with Local Implicit Multimodal Feedback
Improving Dialogue Agents by Decomposing One Global Explicit Annotation with Local Implicit Multimodal Feedback
Dong Won Lee
Hae Won Park
Yoon Kim
C. Breazeal
Louis-Philippe Morency
111
0
0
17 Mar 2024
SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant
SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant
Guohao Sun
Can Qin
Jiamian Wang
Zeyuan Chen
Ran Xu
Zhiqiang Tao
MLLMVLMLRM
92
13
0
17 Mar 2024
Previous
123...878889...126127128
Next