ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLMALM
ArXiv (abs)PDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,381 papers shown
Title
Self-QA: Unsupervised Knowledge Guided Language Model Alignment
Self-QA: Unsupervised Knowledge Guided Language Model Alignment
Xuanyu Zhang
Qing Yang
ALM
51
12
0
19 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive
  Critiquing
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELMLRM
156
399
0
19 May 2023
Attributable and Scalable Opinion Summarization
Attributable and Scalable Opinion Summarization
Tom Hosking
Hao Tang
Mirella Lapata
71
9
0
19 May 2023
InstructIE: A Bilingual Instruction-based Information Extraction Dataset
InstructIE: A Bilingual Instruction-based Information Extraction Dataset
Honghao Gui
Shuofei Qiao
Jintian Zhang
Hongbin Ye
Mengshu Sun
Lei Liang
Jeff Z. Pan
Huajun Chen
Ningyu Zhang
76
9
0
19 May 2023
Shattering the Agent-Environment Interface for Fine-Tuning Inclusive
  Language Models
Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models
Wanqiao Xu
Shi Dong
Dilip Arumugam
Benjamin Van Roy
78
8
0
19 May 2023
A Survey of Safety and Trustworthiness of Large Language Models through
  the Lens of Verification and Validation
A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation
Xiaowei Huang
Wenjie Ruan
Wei Huang
Gao Jin
Yizhen Dong
...
Sihao Wu
Peipei Xu
Dengyu Wu
André Freitas
Mustafa A. Mustafa
ALM
132
96
0
19 May 2023
UniControl: A Unified Diffusion Model for Controllable Visual Generation
  In the Wild
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
Can Qin
Shu Zhen Zhang
Ning Yu
Yihao Feng
Xinyi Yang
...
Caiming Xiong
Silvio Savarese
Stefano Ermon
Yun Fu
Ran Xu
113
136
0
18 May 2023
Temporal Knowledge Graph Forecasting Without Knowledge Using In-Context
  Learning
Temporal Knowledge Graph Forecasting Without Knowledge Using In-Context Learning
Dong-Ho Lee
Kian Ahrabian
Woojeong Jin
Fred Morstatter
Jay Pujara
118
43
0
17 May 2023
PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering
PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering
Xiaoman Zhang
Chaoyi Wu
Ziheng Zhao
Weixiong Lin
Ya Zhang
Yanfeng Wang
Weidi Xie
LM&MA
159
183
0
17 May 2023
What You See is What You Read? Improving Text-Image Alignment Evaluation
What You See is What You Read? Improving Text-Image Alignment Evaluation
Michal Yarom
Yonatan Bitton
Soravit Changpinyo
Roee Aharoni
Jonathan Herzig
Oran Lang
E. Ofek
Idan Szpektor
EGVM
166
85
0
17 May 2023
Controllable Speaking Styles Using a Large Language Model
Controllable Speaking Styles Using a Large Language Model
A. Sigurgeirsson
Simon King
55
3
0
17 May 2023
LeTI: Learning to Generate from Textual Interactions
LeTI: Learning to Generate from Textual Interactions
Xingyao Wang
Hao Peng
Reyhaneh Jabbarvand
Heng Ji
116
30
0
17 May 2023
Prompt-Tuning Decision Transformer with Preference Ranking
Prompt-Tuning Decision Transformer with Preference Ranking
Shengchao Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
85
14
0
16 May 2023
StructGPT: A General Framework for Large Language Model to Reason over
  Structured Data
StructGPT: A General Framework for Large Language Model to Reason over Structured Data
Jinhao Jiang
Kun Zhou
Zican Dong
Keming Ye
Wayne Xin Zhao
Ji-Rong Wen
LRMLMTDRALM
174
301
0
16 May 2023
Pre-Training to Learn in Context
Pre-Training to Learn in Context
Yuxian Gu
Li Dong
Furu Wei
Minlie Huang
CLIPLRMReLM
167
38
0
16 May 2023
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca
Zhengxuan Wu
Atticus Geiger
Thomas Icard
Christopher Potts
Noah D. Goodman
MILM
87
93
0
15 May 2023
Schema-adaptable Knowledge Graph Construction
Schema-adaptable Knowledge Graph Construction
Hongbin Ye
Honghao Gui
Xin Xu
Xi Chen
Huajun Chen
Ningyu Zhang
114
4
0
15 May 2023
Symbol tuning improves in-context learning in language models
Symbol tuning improves in-context learning in language models
Jerry W. Wei
Le Hou
Andrew Kyle Lampinen
Xiangning Chen
Da Huang
...
Xinyun Chen
Yifeng Lu
Denny Zhou
Tengyu Ma
Quoc V. Le
LRM
90
80
0
15 May 2023
Helping the Helper: Supporting Peer Counselors via AI-Empowered Practice and Feedback
Helping the Helper: Supporting Peer Counselors via AI-Empowered Practice and Feedback
Shang-ling Hsu
Raj Sanjay Shah
Prathik Senthil
Zahra Ashktorab
Casey Dugan
Werner Geyer
Diyi Yang
119
24
0
15 May 2023
$SmartProbe$: A Virtual Moderator for Market Research Surveys
SmartProbeSmartProbeSmartProbe: A Virtual Moderator for Market Research Surveys
Joshua Seltzer
Jia Pan
Kathy Cheng
Yuxiao Sun
Santosh Kolagati
Jimmy Lin
Shi Zong
47
1
0
14 May 2023
Learning to Simulate Natural Language Feedback for Interactive Semantic
  Parsing
Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing
Hao Yan
Saurabh Srivastava
Yintao Tai
Sida I. Wang
Wen-tau Yih
Ziyu Yao
73
19
0
14 May 2023
CodeT5+: Open Code Large Language Models for Code Understanding and
  Generation
CodeT5+: Open Code Large Language Models for Code Understanding and Generation
Yue Wang
Hung Le
Akhilesh Deepak Gotmare
Nghi D. Q. Bui
Junnan Li
Steven C. H. Hoi
ALM
140
504
0
13 May 2023
Synergistic Interplay between Search and Large Language Models for
  Information Retrieval
Synergistic Interplay between Search and Large Language Models for Information Retrieval
Jiazhan Feng
Chongyang Tao
Xiubo Geng
Tao Shen
Can Xu
Guodong Long
Dongyan Zhao
Daxin Jiang
KELM
129
6
0
12 May 2023
GFlowNets with Human Feedback
GFlowNets with Human Feedback
Yinchuan Li
Shuang Luo
Yunfeng Shao
Jianye Hao
AI4CE
68
5
0
11 May 2023
Evaluating Embedding APIs for Information Retrieval
Evaluating Embedding APIs for Information Retrieval
Ehsan Kamalloo
Xinyu Crystina Zhang
Odunayo Ogundepo
Nandan Thakur
David Alfonso-Hermelo
Mehdi Rezagholizadeh
Jimmy J. Lin
RALM
114
22
0
10 May 2023
A Glimpse in ChatGPT Capabilities and its impact for AI research
A Glimpse in ChatGPT Capabilities and its impact for AI research
Frank Joublin
Antonello Ceravola
Joerg Deigmoeller
Michael Gienger
M. Franzius
Julian Eggert
SILMAI4MHALMELM
75
15
0
10 May 2023
MAUPQA: Massive Automatically-created Polish Question Answering Dataset
MAUPQA: Massive Automatically-created Polish Question Answering Dataset
Piotr Rybak
76
12
0
09 May 2023
Distilling Script Knowledge from Large Language Models for Constrained
  Language Planning
Distilling Script Knowledge from Large Language Models for Constrained Language Planning
Siyu Yuan
Jiangjie Chen
Ziquan Fu
Xuyang Ge
Soham Shah
C. R. Jankowski
Yanghua Xiao
Deqing Yang
96
56
0
09 May 2023
Coherent Wave Dynamics and Language Generation of a Generative
  Pre-trained Transformer
Coherent Wave Dynamics and Language Generation of a Generative Pre-trained Transformer
Tao Hong
36
0
0
08 May 2023
The Current State of Summarization
The Current State of Summarization
Fabian Retkowski
78
6
0
08 May 2023
Influence of External Information on Large Language Models Mirrors
  Social Cognitive Patterns
Influence of External Information on Large Language Models Mirrors Social Cognitive Patterns
Ning Bian
Hongyu Lin
Peilin Liu
Yaojie Lu
Chunkang Zhang
Xianpei Han
Xianpei Han
Le Sun
78
14
0
08 May 2023
Augmented Large Language Models with Parametric Knowledge Guiding
Augmented Large Language Models with Parametric Knowledge Guiding
Ziyang Luo
Can Xu
Pu Zhao
Xiubo Geng
Chongyang Tao
Jing Ma
Qingwei Lin
Daxin Jiang
KELMRALM
109
47
0
08 May 2023
Enhancing Knowledge Graph Construction Using Large Language Models
Enhancing Knowledge Graph Construction Using Large Language Models
Milena Trajanoska
Riste Stojanov
D. Trajanov
92
56
0
08 May 2023
Improving Cross-Task Generalization with Step-by-Step Instructions
Improving Cross-Task Generalization with Step-by-Step Instructions
Yang Wu
Yanyan Zhao
Zhongyang Li
Bing Qin
Kai Xiong
LRMALM
77
9
0
08 May 2023
Unified Demonstration Retriever for In-Context Learning
Unified Demonstration Retriever for In-Context Learning
Xiaonan Li
Kai Lv
Hang Yan
Tianya Lin
Wei-wei Zhu
Yuan Ni
Guotong Xie
Xiaoling Wang
Xipeng Qiu
RALMVPVLM
84
142
0
07 May 2023
Refining the Responses of LLMs by Themselves
Refining the Responses of LLMs by Themselves
Tianqiang Yan
Tiansheng Xu
51
3
0
06 May 2023
DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System
  for Multilingual Named Entity Recognition
DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition
Zeqi Tan
Shen Huang
Zixia Jia
Jiong Cai
Hai-Tao Zheng
...
Yueting Zhuang
Kewei Tu
Pengjun Xie
Fei Huang
Yong Jiang
74
8
0
05 May 2023
Towards Applying Powerful Large AI Models in Classroom Teaching:
  Opportunities, Challenges and Prospects
Towards Applying Powerful Large AI Models in Classroom Teaching: Opportunities, Challenges and Prospects
Kehui Tan
Tianqi Pang
Chenyou Fan
Song Yu
68
16
0
05 May 2023
VicunaNER: Zero/Few-shot Named Entity Recognition using Vicuna
VicunaNER: Zero/Few-shot Named Entity Recognition using Vicuna
Shezheng Song
69
13
0
05 May 2023
Can LLM Already Serve as A Database Interface? A BIg Bench for
  Large-Scale Database Grounded Text-to-SQLs
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
Jinyang Li
Binyuan Hui
Ge Qu
Jiaxi Yang
Binhua Li
...
Guoliang Li
Kevin C. C. Chang
Fei Huang
Reynold Cheng
Yongbin Li
LMTD
184
422
0
04 May 2023
Semantic Space Grounded Weighted Decoding for Multi-Attribute
  Controllable Dialogue Generation
Semantic Space Grounded Weighted Decoding for Multi-Attribute Controllable Dialogue Generation
Zhiling Zhang
Mengyue Wu
Ke Zhu
AI4CE
76
1
0
04 May 2023
Towards Weakly-Supervised Hate Speech Classification Across Datasets
Towards Weakly-Supervised Hate Speech Classification Across Datasets
Yiping Jin
Leo Wanner
Vishakha Kadam
A. Shvets
74
5
0
04 May 2023
"Oops, Did I Just Say That?" Testing and Repairing Unethical Suggestions
  of Large Language Models with Suggest-Critique-Reflect Process
"Oops, Did I Just Say That?" Testing and Repairing Unethical Suggestions of Large Language Models with Suggest-Critique-Reflect Process
Anna Glazkova
Zongjie Li
Michael Kadantsev
Maksim Glazkov
KELM
86
14
0
04 May 2023
Personalized Abstractive Summarization by Tri-agent Generation Pipeline
Personalized Abstractive Summarization by Tri-agent Generation Pipeline
Md Aminul Haque Palash
Sourav Saha
Faria Afrin
Pengcheng He
108
3
0
04 May 2023
Entity Tracking in Language Models
Entity Tracking in Language Models
Najoung Kim
Sebastian Schuster
147
22
0
03 May 2023
Can Large Language Models Be an Alternative to Human Evaluations?
Can Large Language Models Be an Alternative to Human Evaluations?
Cheng-Han Chiang
Hung-yi Lee
ALMLM&MA
303
634
0
03 May 2023
Distill or Annotate? Cost-Efficient Fine-Tuning of Compact Models
Distill or Annotate? Cost-Efficient Fine-Tuning of Compact Models
Junmo Kang
Wei Xu
Alan Ritter
128
15
0
02 May 2023
How to Unleash the Power of Large Language Models for Few-shot Relation
  Extraction?
How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?
Xin Xu
Yuqi Zhu
Xiaohan Wang
Ningyu Zhang
KELMLRM
131
55
0
02 May 2023
Generating images of rare concepts using pre-trained diffusion models
Generating images of rare concepts using pre-trained diffusion models
Dvir Samuel
Rami Ben-Ari
Simon Raviv
N. Darshan
Gal Chechik
243
45
0
27 Apr 2023
CONSCENDI: A Contrastive and Scenario-Guided Distillation Approach to
  Guardrail Models for Virtual Assistants
CONSCENDI: A Contrastive and Scenario-Guided Distillation Approach to Guardrail Models for Virtual Assistants
A. Sun
Varun Nair
Elliot Schumacher
Anitha Kannan
80
3
0
27 Apr 2023
Previous
123...119120121...126127128
Next