ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.11416
  4. Cited By
Scaling Instruction-Finetuned Language Models

Scaling Instruction-Finetuned Language Models

20 October 2022
Hyung Won Chung
Le Hou
Shayne Longpre
Barret Zoph
Yi Tay
W. Fedus
Eric Li
Xuezhi Wang
Mostafa Dehghani
Siddhartha Brahma
Albert Webson
S. Gu
Zhuyun Dai
Mirac Suzgun
Xinyun Chen
Aakanksha Chowdhery
Alex Castro-Ros
Marie Pellat
Kevin Robinson
Dasha Valter
Sharan Narang
Gaurav Mishra
Adams Wei Yu
Vincent Zhao
Yanping Huang
Andrew M. Dai
Hongkun Yu
Slav Petrov
Ed H. Chi
J. Dean
Jacob Devlin
Adam Roberts
Denny Zhou
Quoc V. Le
Jason W. Wei
    ReLM
    LRM
ArXivPDFHTML

Papers citing "Scaling Instruction-Finetuned Language Models"

50 / 549 papers shown
Title
Teaching the Pre-trained Model to Generate Simple Texts for Text
  Simplification
Teaching the Pre-trained Model to Generate Simple Texts for Text Simplification
Renliang Sun
Wei-ping Xu
Xiaojun Wan
CLL
24
17
0
21 May 2023
Lifting the Curse of Capacity Gap in Distilling Language Models
Lifting the Curse of Capacity Gap in Distilling Language Models
Chen Zhang
Yang Yang
Jiahao Liu
Jingang Wang
Yunsen Xian
Benyou Wang
Dawei Song
MoE
32
19
0
20 May 2023
OPT-R: Exploring the Role of Explanations in Finetuning and Prompting
  for Reasoning Skills of Large Language Models
OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models
Badr AlKhamissi
Siddharth Verma
Ping Yu
Zhijing Jin
Asli Celikyilmaz
Mona T. Diab
LRM
ReLM
35
10
0
19 May 2023
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented
  Languages
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Sebastian Ruder
J. Clark
Alexander Gutkin
Mihir Kale
Min Ma
...
Dan Garrette
R. Ingle
Melvin Johnson
Dmitry Panteleev
Partha P. Talukdar
ELM
22
38
0
19 May 2023
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
Hiroki Furuta
Kuang-Huei Lee
Ofir Nachum
Yutaka Matsuo
Aleksandra Faust
S. Gu
Izzeddin Gur
LM&Ro
36
93
0
19 May 2023
Pengi: An Audio Language Model for Audio Tasks
Pengi: An Audio Language Model for Audio Tasks
Soham Deshmukh
Benjamin Elizalde
Rita Singh
Huaming Wang
MLLM
AuLLM
39
159
0
19 May 2023
Prompting with Pseudo-Code Instructions
Prompting with Pseudo-Code Instructions
Mayank Mishra
Prince Kumar
Riyaz Ahmad Bhat
V. Rudramurthy
Danish Contractor
Srikanth G. Tamilselvam
45
13
0
19 May 2023
Examining Inter-Consistency of Large Language Models Collaboration: An
  In-depth Analysis via Debate
Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via Debate
Kai Xiong
Xiao Ding
Yixin Cao
Ting Liu
Bing Qin
21
60
0
19 May 2023
MedBLIP: Bootstrapping Language-Image Pre-training from 3D Medical
  Images and Texts
MedBLIP: Bootstrapping Language-Image Pre-training from 3D Medical Images and Texts
Qiuhui Chen
Xinyue Hu
Zirui Wang
Yi Hong
LM&MA
MedIm
22
34
0
18 May 2023
Language Models Meet World Models: Embodied Experiences Enhance Language
  Models
Language Models Meet World Models: Embodied Experiences Enhance Language Models
Jiannan Xiang
Tianhua Tao
Yi Gu
Tianmin Shu
Zirui Wang
Zichao Yang
Zhiting Hu
ALM
LLMAG
LM&Ro
CLL
36
94
0
18 May 2023
PaLM 2 Technical Report
PaLM 2 Technical Report
Rohan Anil
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
...
Ce Zheng
Wei Zhou
Denny Zhou
Slav Petrov
Yonghui Wu
ReLM
LRM
122
1,152
0
17 May 2023
Large-Scale Text Analysis Using Generative Language Models: A Case Study
  in Discovering Public Value Expressions in AI Patents
Large-Scale Text Analysis Using Generative Language Models: A Case Study in Discovering Public Value Expressions in AI Patents
Sergio Pelaez
Gaurav Verma
Barbara Ribeiro
P. Shapira
26
13
0
17 May 2023
MemoryBank: Enhancing Large Language Models with Long-Term Memory
MemoryBank: Enhancing Large Language Models with Long-Term Memory
Wanjun Zhong
Lianghong Guo
Qi-Fei Gao
He Ye
Yanlin Wang
LLMAG
RALM
KELM
28
125
0
17 May 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Language Model Tokenizers Introduce Unfairness Between Languages
Aleksandar Petrov
Emanuele La Malfa
Philip Torr
Adel Bibi
34
97
0
17 May 2023
CoEdIT: Text Editing by Task-Specific Instruction Tuning
CoEdIT: Text Editing by Task-Specific Instruction Tuning
Vipul Raheja
Dhruv Kumar
Ryan Koo
Dongyeop Kang
ALM
23
56
0
17 May 2023
PALR: Personalization Aware LLMs for Recommendation
PALR: Personalization Aware LLMs for Recommendation
Fan Yang
Zheng Chen
Ziyan Jiang
Eunah Cho
Xiaojiang Huang
Yanbin Lu
33
106
0
12 May 2023
Self-Chained Image-Language Model for Video Localization and Question
  Answering
Self-Chained Image-Language Model for Video Localization and Question Answering
Shoubin Yu
Jaemin Cho
Prateek Yadav
Joey Tianyi Zhou
54
130
0
11 May 2023
How Good are Commercial Large Language Models on African Languages?
How Good are Commercial Large Language Models on African Languages?
Jessica Ojo
Kelechi Ogueji
26
5
0
11 May 2023
MoT: Memory-of-Thought Enables ChatGPT to Self-Improve
MoT: Memory-of-Thought Enables ChatGPT to Self-Improve
Xiaonan Li
Xipeng Qiu
ReLM
KELM
LRM
AI4MH
26
32
0
09 May 2023
The Current State of Summarization
The Current State of Summarization
Fabian Retkowski
23
6
0
08 May 2023
Prompted LLMs as Chatbot Modules for Long Open-domain Conversation
Prompted LLMs as Chatbot Modules for Long Open-domain Conversation
Gibbeum Lee
Volker Hartmann
Jongho Park
Dimitris Papailiopoulos
Kangwook Lee
27
63
0
08 May 2023
VPGTrans: Transfer Visual Prompt Generator across LLMs
VPGTrans: Transfer Visual Prompt Generator across LLMs
Ao Zhang
Hao Fei
Yuan Yao
Wei Ji
Li Li
Zhiyuan Liu
Tat-Seng Chua
MLLM
VLM
32
85
0
02 May 2023
AI, write an essay for me: A large-scale comparison of human-written
  versus ChatGPT-generated essays
AI, write an essay for me: A large-scale comparison of human-written versus ChatGPT-generated essays
Steffen Herbold
Annette Hautli-Janisz
Ute Heuer
Zlata Kikteva
Alexander Trautsch
DeLMO
85
23
0
24 Apr 2023
Domain Mastery Benchmark: An Ever-Updating Benchmark for Evaluating Holistic Domain Knowledge of Large Language Model--A Preliminary Release
Zhouhong Gu
Xiaoxuan Zhu
Haoning Ye
Lin Zhang
Zhuozhi Xiong
Zihan Li
Qi He
Sihang Jiang
Hongwei Feng
Yanghua Xiao
ELM
ALM
47
2
0
23 Apr 2023
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
Archiki Prasad
Swarnadeep Saha
Xiang Zhou
Joey Tianyi Zhou
LRM
32
45
0
21 Apr 2023
A Latent Space Theory for Emergent Abilities in Large Language Models
A Latent Space Theory for Emergent Abilities in Large Language Models
Hui Jiang
LRM
25
35
0
19 Apr 2023
Learning to Compress Prompts with Gist Tokens
Learning to Compress Prompts with Gist Tokens
Jesse Mu
Xiang Lisa Li
Noah D. Goodman
VLM
50
206
0
17 Apr 2023
SikuGPT: A Generative Pre-trained Model for Intelligent Information
  Processing of Ancient Texts from the Perspective of Digital Humanities
SikuGPT: A Generative Pre-trained Model for Intelligent Information Processing of Ancient Texts from the Perspective of Digital Humanities
Chang Liu
Dongbo Wang
Zhixiao Zhao
Die Hu
Mengcheng Wu
...
Si Shen
Bin Li
Jiangfeng Liu
Hai Zhang
Lianzheng Zhao
21
9
0
16 Apr 2023
Semantic Feature Verification in FLAN-T5
Semantic Feature Verification in FLAN-T5
Siddharth Suresh
Kushin Mukherjee
Timothy T. Rogers
11
8
0
12 Apr 2023
OpenAGI: When LLM Meets Domain Experts
OpenAGI: When LLM Meets Domain Experts
Yingqiang Ge
Wenyue Hua
Kai Mei
Jianchao Ji
Juntao Tan
Shuyuan Xu
Zelong Li
Yongfeng Zhang
VLM
LRM
40
212
0
10 Apr 2023
Video ChatCaptioner: Towards Enriched Spatiotemporal Descriptions
Video ChatCaptioner: Towards Enriched Spatiotemporal Descriptions
Jun Chen
Deyao Zhu
Kilichbek Haydarov
Xiang Li
Mohamed Elhoseiny
25
37
0
09 Apr 2023
Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models
Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models
Eric Zhang
Kai Wang
Xingqian Xu
Zhangyang Wang
Humphrey Shi
DiffM
46
175
0
30 Mar 2023
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init
  Attention
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
Renrui Zhang
Jiaming Han
Chris Liu
Peng Gao
Aojun Zhou
Xiangfei Hu
Shilin Yan
Pan Lu
Hongsheng Li
Yu Qiao
MLLM
71
745
0
28 Mar 2023
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Kunchang Li
Yali Wang
Yizhuo Li
Yi Wang
Yinan He
Limin Wang
Yu Qiao
VGen
57
156
0
28 Mar 2023
Natural Language Reasoning, A Survey
Natural Language Reasoning, A Survey
Fei Yu
Hongbo Zhang
Prayag Tiwari
Benyou Wang
ReLM
LRM
49
53
0
26 Mar 2023
Error Analysis Prompting Enables Human-Like Translation Evaluation in
  Large Language Models
Error Analysis Prompting Enables Human-Like Translation Evaluation in Large Language Models
Qingyu Lu
Baopu Qiu
Liang Ding
Liping Xie
Tom Kocmi
Dacheng Tao
LRM
ALM
ELM
26
108
0
24 Mar 2023
Mirror: A Natural Language Interface for Data Querying, Summarization,
  and Visualization
Mirror: A Natural Language Interface for Data Querying, Summarization, and Visualization
Canwen Xu
Julian McAuley
Penghan Wang
VLM
24
4
0
15 Mar 2023
Exploring ChatGPT's Ability to Rank Content: A Preliminary Study on
  Consistency with Human Preferences
Exploring ChatGPT's Ability to Rank Content: A Preliminary Study on Consistency with Human Preferences
Yunjie Ji
Yan Gong
Yiping Peng
Chao Ni
Peiyan Sun
Dongyu Pan
Baochang Ma
Xiangang Li
ELM
ALM
AI4MH
30
37
0
14 Mar 2023
Larger language models do in-context learning differently
Larger language models do in-context learning differently
Jerry W. Wei
Jason W. Wei
Yi Tay
Dustin Tran
Albert Webson
...
Xinyun Chen
Hanxiao Liu
Da Huang
Denny Zhou
Tengyu Ma
ReLM
LRM
49
354
0
07 Mar 2023
UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and
  Distillation of Rerankers
UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Jon Saad-Falcon
Omar Khattab
Keshav Santhanam
Radu Florian
M. Franz
Salim Roukos
Avirup Sil
Md Arafat Sultan
Christopher Potts
24
41
0
01 Mar 2023
Goal Driven Discovery of Distributional Differences via Language
  Descriptions
Goal Driven Discovery of Distributional Differences via Language Descriptions
Ruiqi Zhong
Peter Zhang
Steve Li
Jinwoo Ahn
Dan Klein
Jacob Steinhardt
47
48
0
28 Feb 2023
CHiLL: Zero-shot Custom Interpretable Feature Extraction from Clinical
  Notes with Large Language Models
CHiLL: Zero-shot Custom Interpretable Feature Extraction from Clinical Notes with Large Language Models
Denis Jered McInerney
Geoffrey S. Young
Jan-Willem van de Meent
Byron C. Wallace
12
18
0
23 Feb 2023
Language Model Crossover: Variation through Few-Shot Prompting
Language Model Crossover: Variation through Few-Shot Prompting
Elliot Meyerson
M. Nelson
Herbie Bradley
Adam Gaier
Arash Moradi
Amy K. Hoover
Joel Lehman
VLM
40
79
0
23 Feb 2023
Can Pre-trained Vision and Language Models Answer Visual
  Information-Seeking Questions?
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?
Yang Chen
Hexiang Hu
Yi Luan
Haitian Sun
Soravit Changpinyo
Alan Ritter
Ming-Wei Chang
48
80
0
23 Feb 2023
Exploring the Potential of Machine Translation for Generating Named
  Entity Datasets: A Case Study between Persian and English
Exploring the Potential of Machine Translation for Generating Named Entity Datasets: A Case Study between Persian and English
A. Sartipi
A. Fatemi
26
3
0
19 Feb 2023
Do Deep Neural Networks Capture Compositionality in Arithmetic
  Reasoning?
Do Deep Neural Networks Capture Compositionality in Arithmetic Reasoning?
Keito Kudo
Yoichi Aoki
Tatsuki Kuribayashi
Ana Brassard
Masashi Yoshikawa
Keisuke Sakaguchi
Kentaro Inui
CoGe
24
10
0
15 Feb 2023
Towards Agile Text Classifiers for Everyone
Towards Agile Text Classifiers for Everyone
Maximilian Mozes
Jessica Hoffmann
Katrin Tomanek
Muhamed Kouate
Nithum Thain
Ann Yuan
Tolga Bolukbasi
Lucas Dixon
40
13
0
13 Feb 2023
Scaling Vision Transformers to 22 Billion Parameters
Scaling Vision Transformers to 22 Billion Parameters
Mostafa Dehghani
Josip Djolonga
Basil Mustafa
Piotr Padlewski
Jonathan Heek
...
Mario Luvcić
Xiaohua Zhai
Daniel Keysers
Jeremiah Harmsen
N. Houlsby
MLLM
63
572
0
10 Feb 2023
CodeLMSec Benchmark: Systematically Evaluating and Finding Security
  Vulnerabilities in Black-Box Code Language Models
CodeLMSec Benchmark: Systematically Evaluating and Finding Security Vulnerabilities in Black-Box Code Language Models
Hossein Hajipour
Keno Hassler
Thorsten Holz
Lea Schonherr
Mario Fritz
ELM
40
20
0
08 Feb 2023
Large Language Models Can Be Easily Distracted by Irrelevant Context
Large Language Models Can Be Easily Distracted by Irrelevant Context
Freda Shi
Xinyun Chen
Kanishka Misra
Nathan Scales
David Dohan
Ed H. Chi
Nathanael Scharli
Denny Zhou
ReLM
RALM
LRM
33
537
0
31 Jan 2023
Previous
123...10119
Next