ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLM
    ALM
ArXivPDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 7,311 papers shown
Title
GPT-Connect: Interaction between Text-Driven Human Motion Generator and
  3D Scenes in a Training-free Manner
GPT-Connect: Interaction between Text-Driven Human Motion Generator and 3D Scenes in a Training-free Manner
Haoxuan Qu
Ziyan Guo
Jun Liu
VGen
56
3
0
22 Mar 2024
On Zero-Shot Counterspeech Generation by LLMs
On Zero-Shot Counterspeech Generation by LLMs
Punyajoy Saha
Aalok Agrawal
Abhik Jana
Chris Biemann
Animesh Mukherjee
48
12
0
22 Mar 2024
Stance Reasoner: Zero-Shot Stance Detection on Social Media with
  Explicit Reasoning
Stance Reasoner: Zero-Shot Stance Detection on Social Media with Explicit Reasoning
Maksym Taranukhin
Vered Shwartz
E. Milios
LRM
44
6
0
22 Mar 2024
Can 3D Vision-Language Models Truly Understand Natural Language?
Can 3D Vision-Language Models Truly Understand Natural Language?
Weipeng Deng
Jihan Yang
Runyu Ding
Jiahui Liu
Yijiang Li
Xiaojuan Qi
Edith C.H. Ngai
47
4
0
21 Mar 2024
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual
  Math Problems?
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Renrui Zhang
Dongzhi Jiang
Yichi Zhang
Haokun Lin
Ziyu Guo
...
Aojun Zhou
Pan Lu
Kai-Wei Chang
Peng Gao
Hongsheng Li
34
173
0
21 Mar 2024
DreamReward: Text-to-3D Generation with Human Preference
DreamReward: Text-to-3D Generation with Human Preference
Junliang Ye
Fangfu Liu
Qixiu Li
Zhengyi Wang
Yikai Wang
Xinzhou Wang
Yueqi Duan
Jun Zhu
74
21
0
21 Mar 2024
MyVLM: Personalizing VLMs for User-Specific Queries
MyVLM: Personalizing VLMs for User-Specific Queries
Yuval Alaluf
Elad Richardson
Sergey Tulyakov
Kfir Aberman
Daniel Cohen-Or
MLLM
VLM
43
18
0
21 Mar 2024
ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy
ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy
Zonghan Yang
Peng Li
Ming Yan
Ji Zhang
Fei Huang
Yang Liu
LLMAG
LRM
57
9
0
21 Mar 2024
From Large to Tiny: Distilling and Refining Mathematical Expertise for
  Math Word Problems with Weakly Supervision
From Large to Tiny: Distilling and Refining Mathematical Expertise for Math Word Problems with Weakly Supervision
Qingwen Lin
Boyan Xu
Zhengting Huang
Ruichu Cai
36
2
0
21 Mar 2024
FIT-RAG: Black-Box RAG with Factual Information and Token Reduction
FIT-RAG: Black-Box RAG with Factual Information and Token Reduction
Yuren Mao
Xuemei Dong
Wenyi Xu
Yunjun Gao
Bin Wei
Ying Zhang
43
9
0
21 Mar 2024
ChainLM: Empowering Large Language Models with Improved Chain-of-Thought
  Prompting
ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting
Xiaoxue Cheng
Junyi Li
Wayne Xin Zhao
Ji-Rong Wen
LRM
AI4CE
ReLM
57
7
0
21 Mar 2024
ERD: A Framework for Improving LLM Reasoning for Cognitive Distortion
  Classification
ERD: A Framework for Improving LLM Reasoning for Cognitive Distortion Classification
Sehee Lim
Yejin Kim
Chi-Hyun Choi
Jy-yong Sohn
Byung-Hoon Kim
41
3
0
21 Mar 2024
Reinforcement Learning from Reflective Feedback (RLRF): Aligning and
  Improving LLMs via Fine-Grained Self-Reflection
Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection
Kyungjae Lee
Dasol Hwang
Sunghyun Park
Youngsoo Jang
Moontae Lee
48
8
0
21 Mar 2024
Improving the Robustness of Large Language Models via Consistency
  Alignment
Improving the Robustness of Large Language Models via Consistency Alignment
Zhao Yukun
Lingyong Yan
Weiwei Sun
Guoliang Xing
Shuaiqiang Wang
Meng Chong
Zhicong Cheng
Zhaochun Ren
Yin Dawei
35
19
0
21 Mar 2024
MMIDR: Teaching Large Language Model to Interpret Multimodal
  Misinformation via Knowledge Distillation
MMIDR: Teaching Large Language Model to Interpret Multimodal Misinformation via Knowledge Distillation
Longzheng Wang
Xiaohan Xu
Lei Zhang
Jiarui Lu
Yongxiu Xu
Hongbo Xu
Xuancheng Huang
Chuang Zhang
48
4
0
21 Mar 2024
Policy Mirror Descent with Lookahead
Policy Mirror Descent with Lookahead
Kimon Protopapas
Anas Barakat
29
1
0
21 Mar 2024
Empowering Segmentation Ability to Multi-modal Large Language Models
Empowering Segmentation Ability to Multi-modal Large Language Models
Yuqi Yang
Peng-Tao Jiang
Jing Wang
Hao Zhang
Kai Zhao
Jinwei Chen
Yue Liu
LRM
VLM
35
3
0
21 Mar 2024
Protected group bias and stereotypes in Large Language Models
Protected group bias and stereotypes in Large Language Models
Hadas Kotek
David Q. Sun
Zidi Xiu
Margit Bowler
Christopher Klein
AILaw
ALM
33
3
0
21 Mar 2024
On Prompt Sensitivity of ChatGPT in Affective Computing
On Prompt Sensitivity of ChatGPT in Affective Computing
Mostafa M. Amin
Björn W. Schuller
27
6
0
20 Mar 2024
Multi-Modal Hallucination Control by Visual Information Grounding
Multi-Modal Hallucination Control by Visual Information Grounding
Alessandro Favero
L. Zancato
Matthew Trager
Siddharth Choudhary
Pramuditha Perera
Alessandro Achille
Ashwin Swaminathan
Stefano Soatto
MLLM
90
63
0
20 Mar 2024
Testing the Limits of Jailbreaking Defenses with the Purple Problem
Testing the Limits of Jailbreaking Defenses with the Purple Problem
Taeyoun Kim
Suhas Kotha
Aditi Raghunathan
AAML
49
6
0
20 Mar 2024
Ink and Individuality: Crafting a Personalised Narrative in the Age of
  LLMs
Ink and Individuality: Crafting a Personalised Narrative in the Age of LLMs
Azmine Toushik Wasi
Raima Islam
Rafia Islam
35
3
0
20 Mar 2024
Train & Constrain: Phonologically Informed Tongue-Twister Generation
  from Topics and Paraphrases
Train & Constrain: Phonologically Informed Tongue-Twister Generation from Topics and Paraphrases
Tyler Loakman
Chen Tang
Chenghua Lin
48
4
0
20 Mar 2024
RewardBench: Evaluating Reward Models for Language Modeling
RewardBench: Evaluating Reward Models for Language Modeling
Nathan Lambert
Valentina Pyatkin
Jacob Morrison
Lester James V. Miranda
Bill Yuchen Lin
...
Sachin Kumar
Tom Zick
Yejin Choi
Noah A. Smith
Hanna Hajishirzi
ALM
85
220
0
20 Mar 2024
Chain-of-Interaction: Enhancing Large Language Models for Psychiatric
  Behavior Understanding by Dyadic Contexts
Chain-of-Interaction: Enhancing Large Language Models for Psychiatric Behavior Understanding by Dyadic Contexts
Guangzeng Han
Weisi Liu
Xiaolei Huang
Brian Borsari
41
21
0
20 Mar 2024
Teacher-Student Training for Debiasing: General Permutation Debiasing
  for Large Language Models
Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language Models
Adian Liusie
Yassir Fathullah
Mark Gales
30
5
0
20 Mar 2024
How Gender Interacts with Political Values: A Case Study on Czech BERT
  Models
How Gender Interacts with Political Values: A Case Study on Czech BERT Models
Adnan Al Ali
Jindvrich Libovický
33
0
0
20 Mar 2024
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal
  Large Language Models
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Wenqiao Zhang
Tianwei Lin
Jiang Liu
Fangxun Shu
Haoyuan Li
...
Zheqi Lv
Hao Jiang
Juncheng Li
Siliang Tang
Yueting Zhuang
VLM
MLLM
43
4
0
20 Mar 2024
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
Yaowei Zheng
Richong Zhang
Junhao Zhang
Yanhan Ye
Zheyan Luo
Zhangchi Feng
Yongqiang Ma
55
401
0
20 Mar 2024
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in
  Text-to-Image Generation
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation
Jingkun An
Yinghao Zhu
Zongjian Li
Haoran Feng
Bohua Chen
Yemin Shi
Chengwei Pan
43
2
0
20 Mar 2024
Hyacinth6B: A large language model for Traditional Chinese
Hyacinth6B: A large language model for Traditional Chinese
Chih-Wei Song
Yin-Te Tsai
37
0
0
20 Mar 2024
Mapping LLM Security Landscapes: A Comprehensive Stakeholder Risk
  Assessment Proposal
Mapping LLM Security Landscapes: A Comprehensive Stakeholder Risk Assessment Proposal
Rahul Pankajakshan
Sumitra Biswal
Yuvaraj Govindarajulu
Gilad Gressel
43
15
0
20 Mar 2024
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large
  Vision Language Models
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
Tongtian Yue
Jie Cheng
Longteng Guo
Xingyuan Dai
Zijia Zhao
Xingjian He
Gang Xiong
Yisheng Lv
Jing Liu
48
9
0
20 Mar 2024
Facilitating Pornographic Text Detection for Open-Domain Dialogue
  Systems via Knowledge Distillation of Large Language Models
Facilitating Pornographic Text Detection for Open-Domain Dialogue Systems via Knowledge Distillation of Large Language Models
Huachuan Qiu
Shuai Zhang
Hongliang He
Anqi Li
Zhenzhong Lan
48
1
0
20 Mar 2024
Diffusion Model for Data-Driven Black-Box Optimization
Diffusion Model for Data-Driven Black-Box Optimization
Zihao Li
Hui Yuan
Kaixuan Huang
Chengzhuo Ni
Yinyu Ye
Minshuo Chen
Mengdi Wang
DiffM
45
10
0
20 Mar 2024
Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language
  Models
Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Zuyan Liu
Yuhao Dong
Yongming Rao
Jie Zhou
Jiwen Lu
LRM
27
13
0
19 Mar 2024
TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation
TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation
Yufei Liu
Junwei Zhu
Junshu Tang
Shijie Zhang
Jiangning Zhang
Weijian Cao
Chengjie Wang
Yunsheng Wu
Dongjin Huang
57
8
0
19 Mar 2024
HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
Fucai Ke
Zhixi Cai
Simindokht Jahangard
Weiqing Wang
P. D. Haghighi
Hamid Rezatofighi
LRM
56
10
0
19 Mar 2024
MELTing point: Mobile Evaluation of Language Transformers
MELTing point: Mobile Evaluation of Language Transformers
Stefanos Laskaridis
Kleomenis Katevas
Lorenzo Minto
Hamed Haddadi
29
21
0
19 Mar 2024
Contextual Moral Value Alignment Through Context-Based Aggregation
Contextual Moral Value Alignment Through Context-Based Aggregation
Pierre Dognin
Jesus Rios
Ronny Luss
Inkit Padhi
Matthew D Riemer
Miao Liu
P. Sattigeri
Manish Nagireddy
Kush R. Varshney
Djallel Bouneffouf
44
5
0
19 Mar 2024
WaterVG: Waterway Visual Grounding based on Text-Guided Vision and
  mmWave Radar
WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar
Runwei Guan
Liye Jia
Fengyufan Yang
Shanliang Yao
Erick Purwanto
...
Eng Gee Lim
Jeremy S. Smith
Ka Lok Man
Xuming Hu
Yutao Yue
47
9
0
19 Mar 2024
LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation
  Benchmark for Chinese Large Language Models
LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Chuang Liu
Renren Jin
Yuqi Ren
Deyi Xiong
ELM
43
0
0
19 Mar 2024
AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented
  Stock-Chain Framework
AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework
Xiang Li
Zhenyu Li
Chen Shi
Yong-mei Xu
Qing Du
Mingkui Tan
Jun Huang
Wei Lin
AIFin
45
26
0
19 Mar 2024
RigorLLM: Resilient Guardrails for Large Language Models against
  Undesired Content
RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content
Zhuowen Yuan
Zidi Xiong
Yi Zeng
Ning Yu
Ruoxi Jia
D. Song
Yue Liu
AAML
KELM
42
38
0
19 Mar 2024
Securing Large Language Models: Threats, Vulnerabilities and Responsible
  Practices
Securing Large Language Models: Threats, Vulnerabilities and Responsible Practices
Sara Abdali
Richard Anarfi
C. Barberan
Jia He
PILM
73
25
0
19 Mar 2024
Embodied LLM Agents Learn to Cooperate in Organized Teams
Embodied LLM Agents Learn to Cooperate in Organized Teams
Xudong Guo
Kaixuan Huang
Jiale Liu
Wenhui Fan
Natalia Vélez
Qingyun Wu
Huazheng Wang
Thomas L. Griffiths
Mengdi Wang
LM&Ro
LLMAG
54
38
0
19 Mar 2024
CrossTune: Black-Box Few-Shot Classification with Label Enhancement
CrossTune: Black-Box Few-Shot Classification with Label Enhancement
Danqing Luo
Chen Zhang
Yan Zhang
Haizhou Li
45
2
0
19 Mar 2024
Third-Party Language Model Performance Prediction from Instruction
Third-Party Language Model Performance Prediction from Instruction
Rahul Nadkarni
Yizhong Wang
Noah A. Smith
ELM
LRM
53
0
0
19 Mar 2024
Advancing Time Series Classification with Multimodal Language Modeling
Advancing Time Series Classification with Multimodal Language Modeling
Mingyue Cheng
Yiheng Chen
Qi Liu
Zhiding Liu
Yucong Luo
AI4TS
45
11
0
19 Mar 2024
Characteristic AI Agents via Large Language Models
Characteristic AI Agents via Large Language Models
Xi Wang
Hongliang Dai
Shen Gao
Piji Li
53
3
0
19 Mar 2024
Previous
123...777879...145146147
Next