ResearchTrend.AI

arXiv: 1910.10683

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,938 papers shown
Title
Open-domain Implicit Format Control for Large Language Model Generation
Yiqun Yao
Wenjia Ma
Xuezhi Fang
Xin Jiang
Xiang Li
Xuying Meng
Peng Han
Jing Li
Aixin Sun
Yequan Wang
75
2
0
08 Aug 2024
Explicating the Implicit: Argument Detection Beyond Sentence Boundaries
Paul Roit
Aviv Slobodkin
Eran Hirsch
Arie Cattan
Ayal Klein
Valentina Pyatkin
Ido Dagan
99
1
0
08 Aug 2024
Diffusion Guided Language Modeling
Justin Lovelace
Varsha Kishore
Yiwei Chen
Kilian Q. Weinberger
124
8
0
08 Aug 2024
Towards Linguistic Neural Representation Learning and Sentence Retrieval from Electroencephalogram Recordings
Jinzhao Zhou
Yiqun Duan
Ziyi Zhao
Yu-Cheng Chang
Yu-Kai Wang
T. Do
Chin-Teng Lin
130
1
0
08 Aug 2024
Zero-shot Factual Consistency Evaluation Across Domains
Raunak Agarwal
HILM
120
0
0
07 Aug 2024
Improving Large Language Model (LLM) fidelity through context-aware grounding: A systematic approach to reliability and veracity
Wrick Talukdar
Anjanava Biswas
KELM
59
6
0
07 Aug 2024
MathBridge: A Large Corpus Dataset for Translating Spoken Mathematical Expressions into LaTeX Formulas for Improved Readability
Kyudan Jung
Sieun Hyeon
Jeong Youn Kwon
N. Kim
Hyun Gon Ryu
Hyuk-Jae Lee
Jaeyoung Do
84
3
0
07 Aug 2024
Generative Language Models with Retrieval Augmented Generation for Automated Short Answer Scoring
Zifan Wang
Christopher Ormerod
ELM
65
1
0
07 Aug 2024
Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation
Jingjing Xie
Yuxin Zhang
Mingbao Lin
Liujuan Cao
Rongrong Ji
MQ
82
5
0
07 Aug 2024
A Convex-optimization-based Layer-wise Post-training Pruner for Large Language Models
Pengxiang Zhao
Hanyu Hu
Ping Li
Yi Zheng
Zhefeng Wang
Xiaoming Yuan
71
1
0
07 Aug 2024
Improving the quality of Persian clinical text with a novel spelling correction system
Seyed Mohammad Sadegh Dashti
S. F. Dashti
89
0
0
07 Aug 2024
Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments
Angie Boggust
Venkatesh Sivaraman
Yannick Assogba
Donghao Ren
Dominik Moritz
Fred Hohman
VLM
87
3
0
06 Aug 2024
Synthesizing Text-to-SQL Data from Weak and Strong LLMs
Jiaxi Yang
Binyuan Hui
Min Yang
Jian Yang
Junyang Lin
Chang Zhou
SyDa
102
34
0
06 Aug 2024
Lighthouse: A User-Friendly Library for Reproducible Video Moment Retrieval and Highlight Detection
Taichi Nishimura
Shota Nakada
Hokuto Munakata
Tatsuya Komatsu
VLM
92
2
0
06 Aug 2024
Empathy Level Alignment via Reinforcement Learning for Empathetic Response Generation
Hui Ma
Bo Zhang
Bo Xu
Jian Wang
Hongfei Lin
Xiao Sun
142
1
0
06 Aug 2024
Development of REGAI: Rubric Enabled Generative Artificial Intelligence
Zach Johnson
Jeremy Straub
105
1
0
05 Aug 2024
Entity Retrieval for Answering Entity-Centric Questions
Hassan S. Shavarani
Anoop Sarkar
RALM
61
3
0
05 Aug 2024
Leveraging the Power of LLMs: A Fine-Tuning Approach for High-Quality Aspect-Based Summarization
Ankan Mullick
Sombit Bose
Rounak Saha
Ayan Kumar Bhowmick
Aditya Vempaty
Pawan Goyal
Niloy Ganguly
Prasenjit Dey
Ravi Kokku
82
0
0
05 Aug 2024
Contrastive Learning-based Multi Modal Architecture for Emoticon Prediction by Employing Image-Text Pairs
Ananya Pandey
Dinesh Kumar Vishwakarma
30
0
0
05 Aug 2024
Enhancing AI-based Generation of Software Exploits with Contextual Information
Pietro Liguori
Cristina Improta
R. Natella
B. Cukic
Domenico Cotroneo
99
2
0
05 Aug 2024
From Generalist to Specialist: Exploring CWE-Specific Vulnerability Detection
Syafiq Al Atiiq
Christian Gehrmann
Kevin Dahlén
Karim Khalil
39
1
0
05 Aug 2024
DRFormer: Multi-Scale Transformer Utilizing Diverse Receptive Fields for Long Time-Series Forecasting
Ruixin Ding
Yuqi Chen
Yu-Ting Lan
Wei Zhang
AI4TS
78
3
0
05 Aug 2024
Advancing Post-OCR Correction: A Comparative Study of Synthetic Data
Shuhao Guan
Derek Greene
102
8
0
05 Aug 2024
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Dongyang Liu
Shitian Zhao
Le Zhuo
Weifeng Lin
Ping Luo
Xinyue Li
Qi Qin
Yu Qiao
Hongsheng Li
Peng Gao
MLLM
168
59
0
05 Aug 2024
Pre-trained Encoder Inference: Revealing Upstream Encoders In Downstream Machine Learning Services
Shaopeng Fu
Xuexue Sun
Ke Qing
Tianhang Zheng
Di Wang
AAML
MIACV
SILM
125
0
0
05 Aug 2024
Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models
Zi Liang
Haibo Hu
Qingqing Ye
Yaxin Xiao
Haoyang Li
AAML
ELM
SILM
146
9
0
05 Aug 2024
Generative Retrieval with Few-shot Indexing
Arian Askari
Chuan Meng
Mohammad Aliannejadi
Zhaochun Ren
Evangelos Kanoulas
Suzan Verberne
RALM
111
3
0
04 Aug 2024
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets
Shima Foolad
Kourosh Kiani
R. Rastgoo
FaML
94
0
0
04 Aug 2024
Optimal and efficient text counterfactuals using Graph Neural Networks
Dimitris Lymperopoulos
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
73
1
0
04 Aug 2024
STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs
Peijie Dong
Lujun Li
Dayou Du
Yuhan Chen
Zhenheng Tang
...
Wei Xue
Wenhan Luo
Qi-fei Liu
Yi-Ting Guo
Xiaowen Chu
MQ
91
10
0
03 Aug 2024
Summarization of Investment Reports Using Pre-trained Model
Hiroki Sakaji
Ryotaro Kobayashi
Kiyoshi Izumi
Hiroyuki Mitsugi
Wataru Kuramoto
45
0
0
03 Aug 2024
Transforming Slot Schema Induction with Generative Dialogue State Inference
Ange Lou
Yike Zhang
Jinho D. Choi
73
1
0
03 Aug 2024
The Phantom Menace: Unmasking Privacy Leakages in Vision-Language Models
Simone Caldarella
Massimiliano Mancini
Elisa Ricci
Rahaf Aljundi
PILM
80
2
0
02 Aug 2024
VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling
Qian Zhang
Xiangzi Dai
Ninghua Yang
Xiang An
Ziyong Feng
Xingyu Ren
VLM
CLIP
126
22
0
02 Aug 2024
Actra: Optimized Transformer Architecture for Vision-Language-Action Models in Robot Learning
Yueen Ma
Dafeng Chi
Shiguang Wu
Yuecheng Liu
Yuzheng Zhuang
Jianye Hao
Irwin King
71
5
0
02 Aug 2024
Bridging Information Gaps in Dialogues With Grounded Exchanges Using Knowledge Graphs
Phillip Schneider
Nektarios Machner
Kristiina Jokinen
Florian Matthes
59
1
0
02 Aug 2024
PERSOMA: PERsonalized SOft ProMpt Adapter Architecture for Personalized Language Prompting
Liam Hebert
Krishna Sayana
Ambarish Jash
Alexandros Karatzoglou
Geordie Williamson
Sumanth Doddapaneni
Yanli Cai
Dima Kuzmin
81
4
0
02 Aug 2024
Task Prompt Vectors: Effective Initialization through Multi-Task Soft-Prompt Transfer
Wei Chen
Long Chen
Ivan Srba
Yu Wu
MoMe
VLM
80
4
0
02 Aug 2024
Deep Learning based Visually Rich Document Content Understanding: A Survey
Muhammad Ali
Jean Lee
Salman Khan
Eduard Hovy
115
6
0
02 Aug 2024
Automatic Pull Request Description Generation Using LLMs: A T5 Model Approach
Md. Nazmus Sakib
Alexandru Drimbarean
Md Mashrur Arifin
33
1
0
01 Aug 2024
Are Bigger Encoders Always Better in Vision Large Models?
Bozhou Li
Hao Liang
Zimo Meng
Wentao Zhang
VLM
82
3
0
01 Aug 2024
Intermittent Semi-working Mask: A New Masking Paradigm for LLMs
Mingcong Lu
Jiangcai Zhu
Wang Hao
Zheng Li
Shusheng Zhang
Kailai Shao
Chao Chen
Nan Li
Feng Wang
Xin Lu
67
0
0
01 Aug 2024
GalleryGPT: Analyzing Paintings with Large Multimodal Models
Yi Bin
Wenhao Shi
Yujuan Ding
Zhiqiang Hu
Zheng Wang
Yang Yang
See-Kiong Ng
H. Shen
MLLM
96
11
0
01 Aug 2024
What comes after transformers? -- A selective survey connecting ideas in deep learning
Johannes Schneider
AI4CE
125
2
0
01 Aug 2024
Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation
Kohei Matsuura
Takanori Ashihara
Takafumi Moriya
Masato Mimura
Takatomo Kano
A. Ogawa
Marc Delcroix
70
2
0
01 Aug 2024
UniMoT: Unified Molecule-Text Language Model with Discrete Token Representation
Jiayuan Zhu
Yunli Qi
Yongqiang Chen
Nan Yin
Zhen Wang
Quanming Yao
125
11
0
01 Aug 2024
Automatic Generation of Behavioral Test Cases For Natural Language Processing Using Clustering and Prompting
Ying Li
Rahul Singh
Tarun Joshi
Agus Sudjianto
44
1
0
31 Jul 2024
Gemma 2: Improving Open Language Models at a Practical Size
Gemma Team
Morgane Riviere
Shreya Pathak
Pier Giuseppe Sessa
Cassidy Hardin
...
Noah Fiedel
Armand Joulin
Kathleen Kenealy
Robert Dadashi
Alek Andreev
VLM
MoE
OSLM
154
924
0
31 Jul 2024
GPT-3 Powered Information Extraction for Building Robust Knowledge Bases
Ritabrata Roy Choudhury
Soumik Dey
69
1
0
31 Jul 2024
Generative Sentiment Analysis via Latent Category Distribution and Constrained Decoding
Jun Zhou
Dongyang Yu
Kamran Aziz
Fangfang Su
Qing Zhang
Fei Li
Donghong Ji
41
1
0
31 Jul 2024