ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,910 papers shown
Title
Assessment of Transformer-Based Encoder-Decoder Model for Human-Like
  Summarization
Assessment of Transformer-Based Encoder-Decoder Model for Human-Like Summarization
Sindhu Nair
Y. S. Rao
Radha Shankarmani
53
1
0
22 Oct 2024
Can Large Language Models Act as Ensembler for Multi-GNNs?
Can Large Language Models Act as Ensembler for Multi-GNNs?
Hanqi Duan
Yao Cheng
Jianxiang Yu
Xiang Li
AI4CE
77
0
0
22 Oct 2024
Correct after Answer: Enhancing Multi-Span Question Answering with
  Post-Processing Method
Correct after Answer: Enhancing Multi-Span Question Answering with Post-Processing Method
Jiayi Lin
Chenyang Zhang
Haibo Tong
Dongyu Zhang
Qingqing Hong
Bingxuan Hou
Junli Wang
84
0
0
22 Oct 2024
Beyond Retrieval: Generating Narratives in Conversational Recommender
  Systems
Beyond Retrieval: Generating Narratives in Conversational Recommender Systems
Krishna Sayana
Raghavendra Vasudeva
Yuri Vasilevski
Kun Su
Liam Hebert
H. Pham
Ambarish Jash
Sukhdeep S. Sodhi
3DV
74
4
0
22 Oct 2024
MotionGlot: A Multi-Embodied Motion Generation Model
MotionGlot: A Multi-Embodied Motion Generation Model
Sudarshan Harithas
Srinath Sridhar
176
2
0
22 Oct 2024
Self-calibration for Language Model Quantization and Pruning
Self-calibration for Language Model Quantization and Pruning
Miles Williams
G. Chrysostomou
Nikolaos Aletras
MQ
492
0
0
22 Oct 2024
SoK: Dataset Copyright Auditing in Machine Learning Systems
SoK: Dataset Copyright Auditing in Machine Learning Systems
L. Du
Xuanru Zhou
M. Chen
Chusong Zhang
Zhou Su
Peng Cheng
Jiming Chen
Zhikun Zhang
MLAU
128
6
0
22 Oct 2024
ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage
ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage
Taewhoo Lee
Chanwoong Yoon
Kyochul Jang
Donghyeon Lee
Minju Song
Hyunjae Kim
Jaewoo Kang
ELM
80
1
0
22 Oct 2024
Foundation Models for Rapid Autonomy Validation
Foundation Models for Rapid Autonomy Validation
Alec Farid
Peter Schleede
Aaron Huang
Christoffer Heckman
103
0
0
22 Oct 2024
DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding
DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding
Manan Suri
Puneet Mathur
Franck Dernoncourt
R. Jain
Vlad I. Morariu
Ramit Sawhney
Preslav Nakov
Dinesh Manocha
122
3
0
21 Oct 2024
Elucidating the design space of language models for image generation
Elucidating the design space of language models for image generation
Xuantong Liu
Shaozhe Hao
Xianbiao Qi
Tianyang Hu
Jun Wang
Rong Xiao
Yuan Yao
VLM
80
3
0
21 Oct 2024
Building A Coding Assistant via the Retrieval-Augmented Language Model
Building A Coding Assistant via the Retrieval-Augmented Language Model
Xinze Li
Hanbin Wang
Zhenghao Liu
S. Yu
Shuo Wang
Yukun Yan
Yukai Fu
Yu Gu
Ge Yu
3DVRALM
55
4
0
21 Oct 2024
Small Contributions, Small Networks: Efficient Neural Network Pruning
  Based on Relative Importance
Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance
Mostafa Hussien
Mahmoud Afifi
K. Nguyen
M. Cheriet
94
0
0
21 Oct 2024
Natural GaLore: Accelerating GaLore for memory-efficient LLM Training
  and Fine-tuning
Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning
Arijit Das
36
2
0
21 Oct 2024
PROMPTHEUS: A Human-Centered Pipeline to Streamline SLRs with LLMs
PROMPTHEUS: A Human-Centered Pipeline to Streamline SLRs with LLMs
João Pedro Fernandes Torres
Catherine Mulligan
Joaquim A. Jorge
Catarina Moreira
80
3
0
21 Oct 2024
Zero-Shot Scene Reconstruction from Single Images with Deep Prior
  Assembly
Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly
Junsheng Zhou
Yu-Shen Liu
Zhizhong Han
ViT
118
11
0
21 Oct 2024
Mitigating Object Hallucination via Concentric Causal Attention
Mitigating Object Hallucination via Concentric Causal Attention
Yun Xing
Yiheng Li
Ivan Laptev
Shijian Lu
108
23
0
21 Oct 2024
Mesa-Extrapolation: A Weave Position Encoding Method for Enhanced
  Extrapolation in LLMs
Mesa-Extrapolation: A Weave Position Encoding Method for Enhanced Extrapolation in LLMs
Xin Ma
Yang Liu
Qingbin Liu
Xiaoxu Ma
46
1
0
21 Oct 2024
Improve Dense Passage Retrieval with Entailment Tuning
Improve Dense Passage Retrieval with Entailment Tuning
Lu Dai
Hao Liu
Hui Xiong
RALM
125
4
0
21 Oct 2024
Reducing Hallucinations in Vision-Language Models via Latent Space
  Steering
Reducing Hallucinations in Vision-Language Models via Latent Space Steering
Sheng Liu
Haotian Ye
Lei Xing
James Zou
VLMLLMSV
167
9
0
21 Oct 2024
A Survey of Conversational Search
A Survey of Conversational Search
Fengran Mo
Kelong Mao
Ziliang Zhao
Hongjin Qian
Haonan Chen
Yiruo Cheng
Xiaochen Li
Yinlin Zhu
Zhicheng Dou
Jian-Yun Nie
KELM
96
6
0
21 Oct 2024
Pruning Foundation Models for High Accuracy without Retraining
Pruning Foundation Models for High Accuracy without Retraining
Pu Zhao
Fei Sun
Xuan Shen
Pinrui Yu
Zhenglun Kong
Yanzhi Wang
Xue Lin
82
13
0
21 Oct 2024
LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics
LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics
Thomas Robert
M. Safaryan
Ionut-Vlad Modoranu
Dan Alistarh
ODL
81
7
0
21 Oct 2024
Allegro: Open the Black Box of Commercial-Level Video Generation Model
Allegro: Open the Black Box of Commercial-Level Video Generation Model
Yuan Zhou
Qiuyue Wang
Yuxuan Cai
Huan Yang
VGenVLM
155
37
0
20 Oct 2024
Upsampling DINOv2 features for unsupervised vision tasks and weakly
  supervised materials segmentation
Upsampling DINOv2 features for unsupervised vision tasks and weakly supervised materials segmentation
Ronan Docherty
Antonis Vamvakeros
Samuel J. Cooper
69
2
0
20 Oct 2024
Causality for Large Language Models
Causality for Large Language Models
Anpeng Wu
Kun Kuang
Minqin Zhu
Yingrong Wang
Yujia Zheng
Kairong Han
Yangqiu Song
Guangyi Chen
Leilei Gan
Kun Zhang
LRM
115
9
0
20 Oct 2024
Contextual Augmented Multi-Model Programming (CAMP): A Hybrid Local-Cloud Copilot Framework
Contextual Augmented Multi-Model Programming (CAMP): A Hybrid Local-Cloud Copilot Framework
Yuchen Wang
Shangxin Guo
C. Tan
86
0
0
20 Oct 2024
Grammatical Error Correction for Low-Resource Languages: The Case of Zarma
Grammatical Error Correction for Low-Resource Languages: The Case of Zarma
Mamadou K. Keita
Christopher Homan
Sofiane Abdoulaye Hamani
Adwoa Bremang
Marcos Zampieri
Habibatou Abdoulaye Alfari
Elysabhete Amadou Ibrahim
127
0
0
20 Oct 2024
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant
Alan Dao
Dinh Bach Vu
Huy Hoang Ha
AuLLMVLM
141
5
0
20 Oct 2024
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary
Hao-Tang Tsui
Chien-Yao Wang
H. Liao
ObjDVLM
155
0
0
20 Oct 2024
BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression
BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression
Yuankai Li
Jia-Chen Gu
Di Wu
Kai-Wei Chang
Nanyun Peng
RALMMQ
72
0
0
20 Oct 2024
A survey of neural-network-based methods utilising comparable data for
  finding translation equivalents
A survey of neural-network-based methods utilising comparable data for finding translation equivalents
Michaela Denisová
Pavel Rychlý
81
0
0
19 Oct 2024
Are LLMs Good Zero-Shot Fallacy Classifiers?
Are LLMs Good Zero-Shot Fallacy Classifiers?
Fengjun Pan
Xiaobao Wu
Zongrui Li
Anh Tuan Luu
LRM
137
14
0
19 Oct 2024
Group Diffusion Transformers are Unsupervised Multitask Learners
Group Diffusion Transformers are Unsupervised Multitask Learners
Lianghua Huang
Wei Wang
Zhi-Fan Wu
Huanzhang Dou
Yupeng Shi
Yutong Feng
C. Liang
Yu Liu
Jingren Zhou
VLM
126
13
0
19 Oct 2024
Cross-Document Event-Keyed Summarization
Cross-Document Event-Keyed Summarization
William Walden
Pavlo Kuchmiichuk
Alexander Martin
Chihsheng Jin
Angela Cao
Claire Sun
Curisia Allen
Aaron Steven White
RALM
53
0
0
18 Oct 2024
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities
Shaozhe Hao
Xuantong Liu
Xianbiao Qi
Shihao Zhao
Bojia Zi
Rong Xiao
Kai Han
Kwan-Yee K. Wong
198
3
0
18 Oct 2024
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens
Zhepeng Cen
Yao Liu
Siliang Zeng
Pratik Chaudhar
Huzefa Rangwala
George Karypis
Rasool Fakoor
SyDaAIFin
135
3
0
18 Oct 2024
Large Language Models Are Overparameterized Text Encoders
Large Language Models Are Overparameterized Text Encoders
Thennal D K
Tim Fischer
Chris Biemann
85
2
0
18 Oct 2024
MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts
MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts
R. Teo
Tan M. Nguyen
MoE
94
3
0
18 Oct 2024
LEAD: Latent Realignment for Human Motion Diffusion
LEAD: Latent Realignment for Human Motion Diffusion
Nefeli Andreou
Xi Wang
Victoria Fernandez-Abrevaya
Marie-Paule Cani
Y. Chrysanthou
Vicky Kalogeiton
VGenDiffM
88
4
0
18 Oct 2024
HYPNOS : Highly Precise Foreground-focused Diffusion Finetuning for
  Inanimate Objects
HYPNOS : Highly Precise Foreground-focused Diffusion Finetuning for Inanimate Objects
Oliverio Theophilus Nathanael
Jonathan Samuel Lumentut
Nicholas Hans Muliawan
Edbert Valencio Angky
Felix Indra Kurniadi
Alfi Yusrotis Zakiyyah
Jeklin Harefa
DiffM
49
1
0
18 Oct 2024
Speciesism in Natural Language Processing Research
Speciesism in Natural Language Processing Research
Masashi Takeshita
Rafal Rzepka
68
2
0
18 Oct 2024
RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping
  Language-Image Pre-training
RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training
Muhe Ding
Yang Ma
Pengda Qin
Jianlong Wu
Yuhong Li
Liqiang Nie
78
1
0
18 Oct 2024
Leveraging Large Language Models for Enhancing Public Transit Services
Leveraging Large Language Models for Enhancing Public Transit Services
Jiahao Wang
Amer Shalaby
51
2
0
18 Oct 2024
ViConsFormer: Constituting Meaningful Phrases of Scene Texts using
  Transformer-based Method in Vietnamese Text-based Visual Question Answering
ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering
Nghia Hieu Nguyen
Tho Thanh Quan
Ngan Luu-Thuy Nguyen
75
0
0
18 Oct 2024
EvoPress: Accurate Dynamic Model Compression via Evolutionary Search
EvoPress: Accurate Dynamic Model Compression via Evolutionary Search
Oliver Sieberling
Denis Kuznedelev
Eldar Kurtic
Dan Alistarh
MQ
71
5
0
18 Oct 2024
Rationale Behind Essay Scores: Enhancing S-LLM's Multi-Trait Essay Scoring with Rationale Generated by LLMs
Rationale Behind Essay Scores: Enhancing S-LLM's Multi-Trait Essay Scoring with Rationale Generated by LLMs
SeongYeub Chu
JongWoo Kim
Bryan Wong
MunYong Yi
LRM
94
3
0
18 Oct 2024
Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts
Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts
German Gritsai
Anastasia Voznyuk
Andrey Grabovoy
Yury Chekhovich
DeLMO
143
2
0
18 Oct 2024
Fluid: Scaling Autoregressive Text-to-image Generative Models with
  Continuous Tokens
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens
Lijie Fan
Tianhong Li
Siyang Qin
Yuanzhen Li
Chen Sun
Michael Rubinstein
Deqing Sun
Kaiming He
Yonglong Tian
VLMDiffM
131
57
0
17 Oct 2024
VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
Runsen Xu
Zhiwei Huang
Tai Wang
Yuxiao Chen
Jiangmiao Pang
Dahua Lin
VGen
97
18
0
17 Oct 2024
Previous
123...313233...197198199
Next