ResearchTrend.AI

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,920 papers shown
Predicting from Strings: Language Model Embeddings for Bayesian Optimization
Tung Nguyen
Qiuyi Zhang
Bangding Yang
Chansoo Lee
J. Bornschein
Yingjie Miao
Sagi Perel
Yutian Chen
Xingyou Song
BDL
99
4
0
14 Oct 2024
AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
Haiquan Lu
Yefan Zhou
Shiwei Liu
Zhangyang Wang
Michael W. Mahoney
Yaoqing Yang
72
10
0
14 Oct 2024
3DArticCyclists: Generating Synthetic Articulated 8D Pose-Controllable Cyclist Data for Computer Vision Applications
Eduardo R. Corral-Soto
Yang Liu
Tongtong Cao
Y. Ren
Liu Bingbing
151
0
0
14 Oct 2024
Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs
Haozhen Zhang
Tao Feng
Jiaxuan You
AI4TS RALM
130
5
0
14 Oct 2024
Depth Any Video with Scalable Synthetic Data
Honghui Yang
Di Huang
Wei Yin
Chunhua Shen
Haifeng Liu
Xiaofei He
Binbin Lin
Wanli Ouyang
Tong He
VGen MDE
129
19
0
14 Oct 2024
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
Guorui Zheng
Xidong Wang
Juhao Liang
Nuo Chen
Yuping Zheng
Benyou Wang
MoE
136
5
0
14 Oct 2024
FunnelRAG: A Coarse-to-Fine Progressive Retrieval Paradigm for RAG
X. Zhao
Yan Zhong
Zetian Sun
Xinshuo Hu
Zhenyu Liu
Dongfang Li
Baotian Hu
Min Zhang
254
8
0
14 Oct 2024
An Annotated Dataset of Errors in Premodern Greek and Baselines for Detecting Them
Creston Brooks
J. Haubold
Charlie Cowen-Breen
Jay White
Desmond DeVaul
Frederick Riemenschneider
Karthik Narasimhan
B. Graziosi
133
0
0
14 Oct 2024
Spatio-Temporal Control for Masked Motion Synthesis
Ekkasit Pinyoanuntapong
Muhammad Usama Saleem
Korrawe Karunratanakul
Pu Wang
Hongfei Xue
Chong Chen
Chuan Guo
Junli Cao
J. Ren
Sergey Tulyakov
VGen
92
7
0
14 Oct 2024
LoRE: Logit-Ranked Retriever Ensemble for Enhancing Open-Domain Question Answering
Saikrishna Sanniboina
Shiv Trivedi
Sreenidhi Vijayaraghavan
RALM
28
1
0
13 Oct 2024
NARAIM: Native Aspect Ratio Autoregressive Image Models
Daniel Gallo Fernández
Robert van der Klis
Rǎzvan-Andrei Matişan
Janusz Partyka
E. Gavves
Samuele Papa
Phillip Lippe
29
0
0
13 Oct 2024
Leveraging Customer Feedback for Multi-modal Insight Extraction
Sandeep Sricharan Mukku
Abinesh Kanagarajan
Pushpendu Ghosh
Chetan Aggarwal
29
0
0
13 Oct 2024
ECIS-VQG: Generation of Entity-centric Information-seeking Questions from Videos
Arpan Phukan
Manish Gupta
Asif Ekbal
VGen
82
0
0
13 Oct 2024
'Quis custodiet ipsos custodes?' Who will watch the watchmen? On Detecting AI-generated peer-reviews
Sandeep Kumar
Mohit Sahu
Vardhan Gacche
Tirthankar Ghosal
Asif Ekbal
DeLMO
93
2
0
13 Oct 2024
ChartKG: A Knowledge-Graph-Based Representation for Chart Images
Zhiguang Zhou
Haoxuan Wang
Zhengqing Zhao
Fengling Zheng
Yongheng Wang
Wei Chen
Yong Wang
134
1
0
13 Oct 2024
Empirical Study of Mutual Reinforcement Effect and Application in Few-shot Text Classification Tasks via Prompt
Chengguang Gan
Tatsunori Mori
72
0
0
13 Oct 2024
Reverse Modeling in Large Language Models
S. Yu
Yuanchen Xu
Cunxiao Du
Yanying Zhou
Minghui Qiu
Q. Sun
Hao Zhang
Jiawei Wu
162
2
0
13 Oct 2024
Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization
Alireza Salemi
Hamed Zamani
RALM
98
6
0
13 Oct 2024
TULIP: Token-length Upgraded CLIP
Ivona Najdenkoska
Mohammad Mahdi Derakhshani
Yuki M. Asano
Nanne van Noord
Marcel Worring
Cees G. M. Snoek
VLM
143
4
0
13 Oct 2024
Transformer-based Language Models for Reasoning in the Description Logic ALCQ
Angelos Poulis
Eleni Tsalapati
Manolis Koubarakis
ReLM LRM
59
1
0
12 Oct 2024
The Future of Learning in the Age of Generative AI: Automated Question Generation and Assessment with Large Language Models
Subhankar Maity
Aniket Deroy
AI4Ed ELM
103
6
0
12 Oct 2024
LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning
Jiachun Li
Pengfei Cao
Chenhao Wang
Zhuoran Jin
Yubo Chen
Kang Liu
Xiaojian Jiang
Jiexin Xu
Jun Zhao
LRM KELM
64
1
0
12 Oct 2024
LexSumm and LexT5: Benchmarking and Modeling Legal Summarization Tasks in English
T. Y. S. S. Santosh
Cornelius Weiss
Matthias Grabmair
AILaw ELM
99
2
0
12 Oct 2024
Text Classification using Graph Convolutional Networks: A Comprehensive Survey
Syed Mustafa Haider Rizvi
Ramsha Imran
Arif Mahmood
GNN OOD FaML
51
2
0
12 Oct 2024
CLIP-SCGI: Synthesized Caption-Guided Inversion for Person Re-Identification
Qianru Han
Xinwei He
Zhi Liu
Sannyuya Liu
Ying Zhang
Jinhai Xiang
50
2
0
12 Oct 2024
Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation
Jinyoung Park
Minseok Joo
Joo-Kyung Kim
H. Kim
RALM
60
1
0
12 Oct 2024
FlatQuant: Flatness Matters for LLM Quantization
Yuxuan Sun
Ruikang Liu
Haoli Bai
Han Bao
Kang Zhao
...
Lu Hou
Chun Yuan
Xin Jiang
Wen Liu
Jun Yao
MQ
176
11
0
12 Oct 2024
CollabEdit: Towards Non-destructive Collaborative Knowledge Editing
Jiamu Zheng
Jinghuai Zhang
Tianyu Du
Xuhong Zhang
Jianwei Yin
Tao Lin
KELM
254
0
0
12 Oct 2024
CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation
Yifeng Xu
Zhenliang He
Shiguang Shan
Xilin Chen
DiffM
69
6
0
12 Oct 2024
nach0-pc: Multi-task Language Model with Molecular Point Cloud Encoder
Maksim Kuznetsov
Airat Valiev
Alex Aliper
Daniil Polykovskiy
E. Tutubalina
Rim Shayakhmetov
Z. Miftahutdinov
64
0
0
11 Oct 2024
M3Hop-CoT: Misogynous Meme Identification with Multimodal Multi-hop Chain-of-Thought
G. Kumari
Kirtan Jain
Asif Ekbal
113
4
0
11 Oct 2024
ACER: Automatic Language Model Context Extension via Retrieval
Luyu Gao
Yunyi Zhang
Jamie Callan
RALM
56
0
0
11 Oct 2024
Mentor-KD: Making Small Language Models Better Multi-step Reasoners
Hojae Lee
Junho Kim
SangKeun Lee
LRM
67
3
0
11 Oct 2024
Lifelong Event Detection via Optimal Transport
Viet Dao
Van-Cuong Pham
Quyen Tran
Thanh-Thien Le
Linh Ngo Van
Truong Nguyen
CLL
87
1
0
11 Oct 2024
A Benchmark for Cross-Domain Argumentative Stance Classification on Social Media
Jiaqing Yuan
Ruijie Xi
Munindar P. Singh
47
0
0
11 Oct 2024
Humanity in AI: Detecting the Personality of Large Language Models
Baohua Zhan
Yongyi Huang
Wenyao Cui
Huaping Zhang
Jianyun Shang
39
0
0
11 Oct 2024
Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models
Zióu Zheng
Christopher Malon
Martin Renqiang Min
Xiaodan Zhu
LRM
354
0
0
11 Oct 2024
Data Processing for the OpenGPT-X Model Family
Nicolo' Brandizzi
Hammam Abdelwahab
Anirban Bhowmick
Lennard Helmer
Benny Jörg Stein
...
Georg Rehm
Dennis Wegener
Nicolas Flores-Herr
Joachim Kohler
Johannes Leveling
VLM
138
2
0
11 Oct 2024
SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction
Yang Zhou
Hao Shao
Letian Wang
Steven Waslander
Hongsheng Li
Yu Liu
86
2
0
11 Oct 2024
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Jiatao Gu
Yuyang Wang
Yizhe Zhang
Qihang Zhang
Dinghuai Zhang
Navdeep Jaitly
Josh Susskind
Shuangfei Zhai
DiffM
137
17
0
10 Oct 2024
A Target-Aware Analysis of Data Augmentation for Hate Speech Detection
Camilla Casula
Sara Tonelli
59
1
0
10 Oct 2024
Disease Entity Recognition and Normalization is Improved with Large Language Model Derived Synthetic Normalized Mentions
Kuleen Sasse
Shinjitha Vadlakonda
Richard Kennedy
J. D. Osborne
AI4CE
68
0
0
10 Oct 2024
Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation
Sweta Agrawal
José G. C. de Souza
Ricardo Rei
António Farinhas
Gonçalo Faria
Patrick Fernandes
Nuno M. Guerreiro
Andre Martins
68
5
0
10 Oct 2024
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Yu-Chen Lin
Wei-Hua Li
Jun-Cheng Chen
Chu-Song Chen
65
1
0
10 Oct 2024
When and Where Did it Happen? An Encoder-Decoder Model to Identify Scenario Context
Enrique Noriega-Atala
Robert Vacareanu
Salena Torres Ashton
A. Pyarelal
Clayton T. Morrison
Mihai Surdeanu
48
0
0
10 Oct 2024
CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression
Wenyuan Liu
Xindian Ma
Peng Zhang
Yan Wang
MQ
58
1
0
10 Oct 2024
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Cristian Meo
Mircea Lica
Zarif Ikram
Akihiro Nakano
Vedant Shah
Aniket Didolkar
Dianbo Liu
Anirudh Goyal
Justin Dauwels
OffRL
242
0
0
10 Oct 2024
Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation
Qingwen Bu
Hongyang Li
Li Chen
Jisong Cai
Jia Zeng
Heming Cui
Maoqing Yao
Yu Qiao
159
11
0
10 Oct 2024
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Jinbin Bai
Tian-Chun Ye
Wei Chow
Enxin Song
Qing-Guo Chen
Hefei Ling
Zhen Dong
Lei Zhu
162
19
0
10 Oct 2024
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Songming Liu
Lingxuan Wu
Bangguo Li
Hengkai Tan
Huayu Chen
Zhengyi Wang
Ke Xu
Hang Su
Jun Zhu
146
126
0
10 Oct 2024