ResearchTrend.AI

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,984 papers shown
Title
Enhancing Robustness of AI Offensive Code Generators via Data Augmentation
Cristina Improta
Pietro Liguori
R. Natella
B. Cukic
Domenico Cotroneo
AAML
112
5
0
08 Jun 2023
K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization
Cheng Deng
Tianhang Zhang
Zhongmou He
Yi Xu
Qiyuan Chen
...
Weinan Zhang
Xinbing Wang
Cheng Zhou
Zhouhan Lin
Junxian He
ALM
94
70
0
08 Jun 2023
Mapping the Challenges of HCI: An Application and Evaluation of ChatGPT and GPT-4 for Mining Insights at Scale
Jonas Oppenlaender
Joonas Hamalainen
92
6
0
08 Jun 2023
T3L: Translate-and-Test Transfer Learning for Cross-Lingual Text Classification
Inigo Jauregi Unanue
Gholamreza Haffari
Massimo Piccardi
VLM
64
11
0
08 Jun 2023
Unified Embedding Based Personalized Retrieval in Etsy Search
Rishikesh Jha
Siddharth Subramaniyam
E. Benjamin
T. Taula
DML
60
3
0
07 Jun 2023
Data Augmentation for Improving Tail-traffic Robustness in Skill-routing for Dialogue Systems
Ting-Wei Wu
Fatemeh Sheikholeslami
Mohammad Kachuee
Jaeyoung Do
Sungjin Lee
50
0
0
07 Jun 2023
Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization
M. Trabelsi
H. Uzunalioglu
81
2
0
07 Jun 2023
INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models
Yew Ken Chia
Pengfei Hong
Lidong Bing
Soujanya Poria
ELM
87
65
0
07 Jun 2023
ScienceBenchmark: A Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems
Yi Zhang
Jan Deriu
George Katsogiannis-Meimarakis
Catherine Kosten
Georgia Koutrika
Kurt Stockinger
81
25
0
07 Jun 2023
Prompter: Zero-shot Adaptive Prefixes for Dialogue State Tracking Domain Adaptation
Taha İbrahim Aksu
MingSung Kan
Nancy F. Chen
VLM
89
9
0
07 Jun 2023
UniBoost: Unsupervised Unimodal Pre-training for Boosting Zero-shot Vision-Language Tasks
Yanan Sun
Zi-Qi Zhong
Qi Fan
Chi-Keung Tang
Yu-Wing Tai
VLM
78
4
0
07 Jun 2023
Improving Open Language Models by Learning from Organic Interactions
Jing Xu
Da Ju
Joshua Lane
M. Komeili
Eric Michael Smith
...
Rashel Moritz
Sainbayar Sukhbaatar
Y-Lan Boureau
Jason Weston
Kurt Shuster
79
9
0
07 Jun 2023
On the Reliability of Watermarks for Large Language Models
John Kirchenbauer
Jonas Geiping
Yuxin Wen
Manli Shu
Khalid Saifullah
Kezhi Kong
Kasun Fernando
Aniruddha Saha
Micah Goldblum
Tom Goldstein
WaLM
77
123
0
07 Jun 2023
Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations
Lifan Yuan
Yangyi Chen
Ganqu Cui
Hongcheng Gao
Fangyuan Zou
Xingyi Cheng
Heng Ji
Zhiyuan Liu
Maosong Sun
150
84
0
07 Jun 2023
Multi-Task Training with In-Domain Language Models for Diagnostic Reasoning
Brihat Sharma
Yanjun Gao
Timothy A. Miller
M. Churpek
Majid Afshar
Dmitriy Dligach
ELM, LRM
85
9
0
07 Jun 2023
Git-Theta: A Git Extension for Collaborative Development of Machine Learning Models
Nikhil Kandpal
Brian Lester
Mohammed Muqeeth
Anisha Mascarenhas
Monty Evans
Vishal Baskaran
Tenghao Huang
Haokun Liu
Colin Raffel
VLM
77
12
0
07 Jun 2023
Enhancing In-Context Learning with Answer Feedback for Multi-Span Question Answering
Zixian Huang
Jiaying Zhou
Gengyang Xiao
Gong Cheng
KELM
55
10
0
07 Jun 2023
Examining Bias in Opinion Summarisation Through the Perspective of Opinion Diversity
Nannan Huang
Lin Tian
Haytham M. Fayek
Xiuzhen Zhang
71
9
0
07 Jun 2023
Transfer Learning of Transformer-based Speech Recognition Models from Czech to Slovak
Jan Lehecka
J. Psutka
Josef Psutka
52
2
0
07 Jun 2023
Fine-Grained Visual Prompting
Lingfeng Yang
Yueze Wang
Xiang Li
Xinlong Wang
Jian Yang
ObjD, VLM
126
68
0
07 Jun 2023
Cross-Genre Argument Mining: Can Language Models Automatically Fill in Missing Discourse Markers?
Gil Rocha
Henrique Lopes Cardoso
Jonas Belouadi
Steffen Eger
61
5
0
07 Jun 2023
Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization
Kohei Matsuura
Takanori Ashihara
Takafumi Moriya
Tomohiro Tanaka
Takatomo Kano
A. Ogawa
Marc Delcroix
77
8
0
07 Jun 2023
MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
Jielin Qiu
Jiacheng Zhu
William Jongwon Han
Aditesh Kumar
Karthik Mittal
...
Linjie Li
Jianfeng Wang
Ding Zhao
Bo Li
Lijuan Wang
VGen
79
8
0
07 Jun 2023
When to Read Documents or QA History: On Unified and Selective Open-domain QA
Kyungjae Lee
Sanghyun Han
Seung-won Hwang
Moontae Lee
RALM
76
4
0
07 Jun 2023
From the One, Judge of the Whole: Typed Entailment Graph Construction with Predicate Generation
Zhibin Chen
Yansong Feng
Dongyan Zhao
58
0
0
07 Jun 2023
Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation
Xiusi Chen
Yu Zhang
Jinliang Deng
Jyun-Yu Jiang
Wei Wang
84
12
0
07 Jun 2023
World Models for Math Story Problems
Andreas Opedal
Niklas Stoehr
Abulhair Saparov
Mrinmaya Sachan
ReLM
124
13
0
07 Jun 2023
An Empirical Analysis of Parameter-Efficient Methods for Debiasing Pre-Trained Language Models
Zhongbin Xie
Thomas Lukasiewicz
65
13
0
06 Jun 2023
Triggering Multi-Hop Reasoning for Question Answering in Language Models using Soft Prompts and Random Walks
Kanishka Misra
Cicero Nogueira dos Santos
Siamak Shakeri
KELM, LRM
73
2
0
06 Jun 2023
Büyük dil modellerinin Türkçe verisetleri ile eğitilmesi ve ince ayarlanması [Training and fine-tuning large language models with Turkish datasets]
A. Arslan
VLM
32
0
0
06 Jun 2023
Leveraging Explicit Procedural Instructions for Data-Efficient Action Prediction
Julia White
Arushi Raghuvanshi
Yada Pruksachatkun
50
0
0
06 Jun 2023
MISGENDERED: Limits of Large Language Models in Understanding Pronouns
Tamanna Hossain
Sunipa Dev
Sameer Singh
AILaw
106
41
0
06 Jun 2023
CL-UZH at SemEval-2023 Task 10: Sexism Detection through Incremental Fine-Tuning and Multi-Task Learning with Label Descriptions
Janis Goldzycher
65
1
0
06 Jun 2023
ATT3D: Amortized Text-to-3D Object Synthesis
Jonathan Lorraine
Kevin Xie
Fangyin Wei
Chen-Hsuan Lin
Towaki Takikawa
Nicholas Sharp
Nayeon Lee
Xuan Li
Sanja Fidler
James Lucas
DiffM
100
89
0
06 Jun 2023
Correction of Errors in Preference Ratings from Automated Metrics for Text Generation
Jan Deriu
Pius von Daniken
Don Tuggener
Mark Cieliebak
74
2
0
06 Jun 2023
Prompt Space Optimizing Few-shot Reasoning Success with Large Language Models
Fobo Shi
Peijun Qing
Ke Wang
Nan Wang
Youbo Lei
H. Lu
Xiaodong Lin
Duantengchuan Li
VLM, ReLM, LLMAG, LRM
95
12
0
06 Jun 2023
Soft Merging of Experts with Adaptive Routing
Mohammed Muqeeth
Haokun Liu
Colin Raffel
MoMe, MoE
111
54
0
06 Jun 2023
Evaluating the Effectiveness of Natural Language Inference for Hate Speech Detection in Languages with Limited Labeled Data
Janis Goldzycher
Moritz Preisig
Chantal Amrhein
Gerold Schneider
71
3
0
06 Jun 2023
MolFM: A Multimodal Molecular Foundation Model
Yi Luo
Kai Yang
Massimo Hong
Xingyi Liu
Zaiqing Nie
78
40
0
06 Jun 2023
SciLit: A Platform for Joint Scientific Literature Discovery, Summarization and Citation Generation
Nianlong Gu
Richard H. R. Hahnloser
114
5
0
06 Jun 2023
Towards Adaptable and Interactive Image Captioning with Data Augmentation and Episodic Memory
Aliki Anagnostopoulou
Mareike Hartmann
Daniel Sonntag
CLL, VLM
80
0
0
06 Jun 2023
Putting Humans in the Image Captioning Loop
Aliki Anagnostopoulou
Mareike Hartmann
Daniel Sonntag
VLM
57
1
0
06 Jun 2023
TwistList: Resources and Baselines for Tongue Twister Generation
Tyler Loakman
Chen Tang
Chenghua Lin
79
15
0
06 Jun 2023
Diversifying Joint Vision-Language Tokenization Learning
Vardaan Pahuja
A. Piergiovanni
A. Angelova
85
0
0
06 Jun 2023
Generate-then-Retrieve: Intent-Aware FAQ Retrieval in Product Search
Zhiyu Zoey Chen
J. Choi
B. Fetahu
Oleg Rokhlenko
S. Malmasi
RALM
64
6
0
06 Jun 2023
Few Shot Rationale Generation using Self-Training with Dual Teachers
Aditya Srikanth Veerubhotla
Lahari Poddar
J. Yin
Gyuri Szarvas
S. Eswaran
LRM
137
2
0
05 Jun 2023
A Scalable and Adaptive System to Infer the Industry Sectors of Companies: Prompt + Model Tuning of Generative Language Models
Le-le Cao
Vilhelm von Ehrenheim
Astrid Berghult
Cecilia Henje
Richard Anselmo Stahl
Joar Wandborg
S. Stan
Armin Catovic
Erik Ferm
Hannes Ingelhag
68
4
0
05 Jun 2023
Zero-Shot 3D Shape Correspondence
Ahmed Abdelreheem
Abdelrahman Eldesokey
M. Ovsjanikov
Peter Wonka
113
25
0
05 Jun 2023
Information Flow Control in Machine Learning through Modular Model Architecture
Trishita Tiwari
Suchin Gururangan
Chuan Guo
Weizhe Hua
Sanjay Kariyappa
Udit Gupta
Wenjie Xiong
Kiwan Maeng
Hsien-Hsin S. Lee
G. E. Suh
75
6
0
05 Jun 2023
Infusing Lattice Symmetry Priors in Attention Mechanisms for Sample-Efficient Abstract Geometric Reasoning
Mattia Atzeni
Mrinmaya Sachan
Andreas Loukas
LRM
68
3
0
05 Jun 2023