ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.11416
  4. Cited By
Scaling Instruction-Finetuned Language Models

Scaling Instruction-Finetuned Language Models

20 October 2022
Hyung Won Chung
Le Hou
Shayne Longpre
Barret Zoph
Yi Tay
W. Fedus
Eric Li
Xuezhi Wang
Mostafa Dehghani
Siddhartha Brahma
Albert Webson
S. Gu
Zhuyun Dai
Mirac Suzgun
Xinyun Chen
Aakanksha Chowdhery
Alex Castro-Ros
Marie Pellat
Kevin Robinson
Dasha Valter
Sharan Narang
Gaurav Mishra
Adams Wei Yu
Vincent Zhao
Yanping Huang
Andrew M. Dai
Hongkun Yu
Slav Petrov
Ed H. Chi
J. Dean
Jacob Devlin
Adam Roberts
Denny Zhou
Quoc V. Le
Jason W. Wei
    ReLM
    LRM
ArXivPDFHTML

Papers citing "Scaling Instruction-Finetuned Language Models"

50 / 549 papers shown
Title
Llama 2: Open Foundation and Fine-Tuned Chat Models
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
120
11,099
0
18 Jul 2023
Unsupervised Calibration through Prior Adaptation for Text
  Classification using Large Language Models
Unsupervised Calibration through Prior Adaptation for Text Classification using Large Language Models
Lautaro Estienne
Luciana Ferrer
Matías Vera
Pablo Piantanida
VLM
34
1
0
13 Jul 2023
TIM: Teaching Large Language Models to Translate with Comparison
TIM: Teaching Large Language Models to Translate with Comparison
Jiali Zeng
Fandong Meng
Yongjing Yin
Jie Zhou
37
55
0
10 Jul 2023
Linear Alignment of Vision-language Models for Image Captioning
Linear Alignment of Vision-language Models for Image Captioning
Fabian Paischer
M. Hofmarcher
Sepp Hochreiter
Thomas Adler
CLIP
VLM
50
0
0
10 Jul 2023
Large Language Models as Batteries-Included Zero-Shot ESCO Skills
  Matchers
Large Language Models as Batteries-Included Zero-Shot ESCO Skills Matchers
Benjamin Clavié
Guillaume Soulié
26
11
0
07 Jul 2023
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
Shilong Zhang
Pei Sun
Shoufa Chen
Min Xiao
Wenqi Shao
Wenwei Zhang
Yu Liu
Kai-xiang Chen
Ping Luo
VLM
MLLM
85
224
0
07 Jul 2023
Lost in the Middle: How Language Models Use Long Contexts
Lost in the Middle: How Language Models Use Long Contexts
Nelson F. Liu
Kevin Lin
John Hewitt
Ashwin Paranjape
Michele Bevilacqua
Fabio Petroni
Percy Liang
RALM
40
1,424
0
06 Jul 2023
KoRC: Knowledge oriented Reading Comprehension Benchmark for Deep Text
  Understanding
KoRC: Knowledge oriented Reading Comprehension Benchmark for Deep Text Understanding
Zijun Yao
Yantao Liu
Xin Lv
S. Cao
Jifan Yu
Lei Hou
Juanzi Li
37
10
0
06 Jul 2023
PULSAR at MEDIQA-Sum 2023: Large Language Models Augmented by Synthetic
  Dialogue Convert Patient Dialogues to Medical Records
PULSAR at MEDIQA-Sum 2023: Large Language Models Augmented by Synthetic Dialogue Convert Patient Dialogues to Medical Records
Viktor Schlegel
Hao Li
Yuping Wu
Anand Subramanian
Thanh-Tung Nguyen
...
Daniel Beck
Xiaojun Zeng
R. Batista-Navarro
Stefan Winkler
Goran Nenadic
LM&MA
MedIm
29
9
0
05 Jul 2023
Personality Traits in Large Language Models
Personality Traits in Large Language Models
Gregory Serapio-García
Mustafa Safdari
Clément Crepy
Luning Sun
Stephen Fitz
P. Romero
Marwa Abdulhai
Aleksandra Faust
Maja J. Matarić
LM&MA
LLMAG
58
119
0
01 Jul 2023
On the Exploitability of Instruction Tuning
On the Exploitability of Instruction Tuning
Manli Shu
Jiong Wang
Chen Zhu
Jonas Geiping
Chaowei Xiao
Tom Goldstein
SILM
36
92
0
28 Jun 2023
Differentiable Instruction Optimization for Cross-Task Generalization
Differentiable Instruction Optimization for Cross-Task Generalization
Masaru Isonuma
Junichiro Mori
Ichiro Sakata
29
0
0
16 Jun 2023
Investigating Prompting Techniques for Zero- and Few-Shot Visual Question Answering
Investigating Prompting Techniques for Zero- and Few-Shot Visual Question Answering
Rabiul Awal
Le Zhang
Aishwarya Agrawal
LRM
46
12
0
16 Jun 2023
Instruct-ReID: A Multi-purpose Person Re-identification Task with Instructions
Instruct-ReID: A Multi-purpose Person Re-identification Task with Instructions
Weizhen He
Yihe Deng
Shixiang Tang
Qihao Chen
Qingsong Xie
...
Feng Zhu
Rui Zhao
Wanli Ouyang
Donglian Qi
Yunfeng Yan
77
19
0
13 Jun 2023
INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large
  Language Models
INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models
Yew Ken Chia
Pengfei Hong
Lidong Bing
Soujanya Poria
ELM
25
63
0
07 Jun 2023
PokemonChat: Auditing ChatGPT for Pokémon Universe Knowledge
PokemonChat: Auditing ChatGPT for Pokémon Universe Knowledge
Laura Cabello
Jiaang Li
Ilias Chalkidis
ELM
AI4MH
LRM
18
2
0
05 Jun 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Shentao Yang
Shujian Zhang
Congying Xia
Yihao Feng
Caiming Xiong
Mi Zhou
29
23
0
01 Jun 2023
What does the Failure to Reason with "Respectively" in Zero/Few-Shot
  Settings Tell Us about Language Models?
What does the Failure to Reason with "Respectively" in Zero/Few-Shot Settings Tell Us about Language Models?
Ruixiang Cui
Seolhwa Lee
Daniel Hershcovich
Anders Søgaard
33
2
0
31 May 2023
Faith and Fate: Limits of Transformers on Compositionality
Faith and Fate: Limits of Transformers on Compositionality
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jian
...
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLM
LRM
30
334
0
29 May 2023
Information Association for Language Model Updating by Mitigating
  LM-Logical Discrepancy
Information Association for Language Model Updating by Mitigating LM-Logical Discrepancy
Pengfei Yu
Heng Ji
KELM
36
9
0
29 May 2023
A Practical Toolkit for Multilingual Question and Answer Generation
A Practical Toolkit for Multilingual Question and Answer Generation
Asahi Ushio
Fernando Alva-Manchego
Jose Camacho-Collados
SyDa
28
13
0
27 May 2023
On the Tool Manipulation Capability of Open-source Large Language Models
On the Tool Manipulation Capability of Open-source Large Language Models
Qiantong Xu
Fenglu Hong
Yangqiu Song
Changran Hu
Zheng Chen
Jian Zhang
LLMAG
35
69
0
25 May 2023
ChatBridge: Bridging Modalities with Large Language Model as a Language
  Catalyst
ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst
Zijia Zhao
Longteng Guo
Tongtian Yue
Si-Qing Chen
Shuai Shao
Xinxin Zhu
Zehuan Yuan
Jing Liu
MLLM
40
52
0
25 May 2023
The False Promise of Imitating Proprietary LLMs
The False Promise of Imitating Proprietary LLMs
Arnav Gudibande
Eric Wallace
Charles Burton Snell
Xinyang Geng
Hao Liu
Pieter Abbeel
Sergey Levine
Dawn Song
ALM
44
198
0
25 May 2023
Large Language Models for User Interest Journeys
Large Language Models for User Interest Journeys
Konstantina Christakopoulou
Alberto Lalama
Cj Adams
Iris Qu
Yifat Amir
...
Dina Bseiso
Sarah Scodel
Lucas Dixon
Ed H. Chi
Minmin Chen
21
25
0
24 May 2023
EvEval: A Comprehensive Evaluation of Event Semantics for Large Language
  Models
EvEval: A Comprehensive Evaluation of Event Semantics for Large Language Models
Zhengwei Tao
Zhi Jin
Xiaoying Bai
Haiyan Zhao
Yanlin Feng
Jia Li
Wenpeng Hu
37
4
0
24 May 2023
Neural Summarization of Electronic Health Records
Neural Summarization of Electronic Health Records
Koyena Pal
Seyed Ali Bahrainian
Laura Y. Mercurio
Carsten Eickhoff
25
3
0
24 May 2023
A RelEntLess Benchmark for Modelling Graded Relations between Named
  Entities
A RelEntLess Benchmark for Modelling Graded Relations between Named Entities
Asahi Ushio
Jose Camacho-Collados
Steven Schockaert
29
1
0
24 May 2023
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual
  Transfer
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer
Akari Asai
Sneha Kudugunta
Xinyan Velocity Yu
Terra Blevins
Hila Gonen
Machel Reid
Yulia Tsvetkov
Sebastian Ruder
Hannaneh Hajishirzi
44
54
0
24 May 2023
PromptNER: Prompting For Named Entity Recognition
PromptNER: Prompting For Named Entity Recognition
D. Ashok
Zachary Chase Lipton
30
37
0
24 May 2023
ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space
  Manipulation
ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation
Dongxu Yue
Qin Guo
Munan Ning
Jiaxi Cui
Yuesheng Zhu
Liuliang Yuan
DiffM
29
11
0
24 May 2023
Mixture-of-Experts Meets Instruction Tuning:A Winning Combination for
  Large Language Models
Mixture-of-Experts Meets Instruction Tuning:A Winning Combination for Large Language Models
Sheng Shen
Le Hou
Yan-Quan Zhou
Nan Du
Shayne Longpre
...
Vincent Zhao
Hongkun Yu
Kurt Keutzer
Trevor Darrell
Denny Zhou
ALM
MoE
38
54
0
24 May 2023
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance
Chenxi Whitehouse
Monojit Choudhury
Alham Fikri Aji
SyDa
LRM
32
68
0
23 May 2023
Weakly-Supervised Learning of Visual Relations in Multimodal Pretraining
Weakly-Supervised Learning of Visual Relations in Multimodal Pretraining
Emanuele Bugliarello
Aida Nematzadeh
Lisa Anne Hendricks
SSL
30
5
0
23 May 2023
Active Learning Principles for In-Context Learning with Large Language
  Models
Active Learning Principles for In-Context Learning with Large Language Models
Katerina Margatina
Timo Schick
Nikolaos Aletras
Jane Dwivedi-Yu
32
39
0
23 May 2023
Pre-training Multi-task Contrastive Learning Models for Scientific
  Literature Understanding
Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding
Yu Zhang
Hao Cheng
Zhihong Shen
Xiaodong Liu
Yejiang Wang
Jianfeng Gao
32
14
0
23 May 2023
Dr.ICL: Demonstration-Retrieved In-context Learning
Dr.ICL: Demonstration-Retrieved In-context Learning
Man Luo
Xin Xu
Zhuyun Dai
Panupong Pasupat
Mehran Kazemi
Chitta Baral
Vaiva Imbrasaite
Vincent Zhao
RALM
26
48
0
23 May 2023
CTQScorer: Combining Multiple Features for In-context Example Selection
  for Machine Translation
CTQScorer: Combining Multiple Features for In-context Example Selection for Machine Translation
Aswanth Kumar
Ratish Puduppully
Raj Dabre
Anoop Kunchukuttan
44
12
0
23 May 2023
Evaluating Factual Consistency of Summaries with Large Language Models
Evaluating Factual Consistency of Summaries with Large Language Models
Shiqi Chen
Siyang Gao
Junxian He
ELM
LRM
HILM
37
6
0
23 May 2023
Target-Agnostic Gender-Aware Contrastive Learning for Mitigating Bias in
  Multilingual Machine Translation
Target-Agnostic Gender-Aware Contrastive Learning for Mitigating Bias in Multilingual Machine Translation
Minwoo Lee
Hyukhun Koh
Kang-il Lee
Dongdong Zhang
Minsu Kim
Kyomin Jung
35
9
0
23 May 2023
Learning from Mistakes via Cooperative Study Assistant for Large
  Language Models
Learning from Mistakes via Cooperative Study Assistant for Large Language Models
Danqing Wang
Lei Li
37
6
0
23 May 2023
Images in Language Space: Exploring the Suitability of Large Language
  Models for Vision & Language Tasks
Images in Language Space: Exploring the Suitability of Large Language Models for Vision & Language Tasks
Sherzod Hakimov
David Schlangen
VLM
36
5
0
23 May 2023
Do All Languages Cost the Same? Tokenization in the Era of Commercial
  Language Models
Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models
Orevaoghene Ahia
Sachin Kumar
Hila Gonen
Jungo Kasai
David R. Mortensen
Noah A. Smith
Yulia Tsvetkov
51
82
0
23 May 2023
TaskWeb: Selecting Better Source Tasks for Multi-task NLP
TaskWeb: Selecting Better Source Tasks for Multi-task NLP
Joongwon Kim
Akari Asai
Gabriel Ilharco
Hannaneh Hajishirzi
29
11
0
22 May 2023
Multi-Task Instruction Tuning of LLaMa for Specific Scenarios: A
  Preliminary Study on Writing Assistance
Multi-Task Instruction Tuning of LLaMa for Specific Scenarios: A Preliminary Study on Writing Assistance
Yue Zhang
Leyang Cui
Deng Cai
Xinting Huang
Tao Fang
Wei Bi
ALM
29
36
0
22 May 2023
MaNtLE: Model-agnostic Natural Language Explainer
MaNtLE: Model-agnostic Natural Language Explainer
Rakesh R Menon
Kerem Zaman
Shashank Srivastava
FAtt
LRM
24
2
0
22 May 2023
Fact-Checking Complex Claims with Program-Guided Reasoning
Fact-Checking Complex Claims with Program-Guided Reasoning
Liangming Pan
Xiaobao Wu
Xinyuan Lu
A. Luu
William Yang Wang
Min-Yen Kan
Preslav Nakov
LRM
48
116
0
22 May 2023
Beyond Labels: Empowering Human Annotators with Natural Language
  Explanations through a Novel Active-Learning Architecture
Beyond Labels: Empowering Human Annotators with Natural Language Explanations through a Novel Active-Learning Architecture
Bingsheng Yao
Ishan Jindal
Lucian Popa
Yannis Katsis
Sayan Ghosh
...
Yuxuan Lu
Shashank Srivastava
Yunyao Li
James A. Hendler
Dakuo Wang
34
10
0
22 May 2023
Automated Few-shot Classification with Instruction-Finetuned Language
  Models
Automated Few-shot Classification with Instruction-Finetuned Language Models
Rami Aly
Xingjian Shi
Kaixiang Lin
Aston Zhang
A. Wilson
38
9
0
21 May 2023
Multilingual Simplification of Medical Texts
Multilingual Simplification of Medical Texts
Sebastian Antony Joseph
Kathryn Kazanas
Keziah Reina
Vishnesh J. Ramanathan
Wei-ping Xu
Byron C. Wallace
Junyi Jessy Li
38
13
0
21 May 2023
Previous
123...101189
Next