Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,924 papers shown
Title
Do We Need Domain-Specific Embedding Models? An Empirical Investigation
Yixuan Tang
Yi Yang
AIFin
197
6
0
27 Sep 2024
Word2Wave: Language Driven Mission Programming for Efficient Subsea Deployments of Marine Robots
Ruo Chen
David Blow
Adnan Abdullah
Md Jahidul Islam
134
1
0
27 Sep 2024
Realistic Evaluation of Model Merging for Compositional Generalization
Derek Tam
Yash Kant
Brian Lester
Igor Gilitschenski
Colin Raffel
MoMe
99
9
0
26 Sep 2024
Trustworthy AI: Securing Sensitive Data in Large Language Models
G. Feretzakis
V. Verykios
65
17
0
26 Sep 2024
Trustworthy Text-to-Image Diffusion Models: A Timely and Focused Survey
Yi Zhang
Zhen Chen
Chih-Hong Cheng
Wenjie Ruan
Xiaowei Huang
Dezong Zhao
David Flynn
Siddartha Khastgir
Xingyu Zhao
MedIm
99
4
0
26 Sep 2024
EgoLM: Multi-Modal Language Model of Egocentric Motions
Fangzhou Hong
Vladimir Guzov
Hyo Jin Kim
Yuting Ye
Richard Newcombe
Ziwei Liu
Lingni Ma
83
4
0
26 Sep 2024
BeanCounter: A low-toxicity, large-scale, and open dataset of business-oriented text
Siyan Wang
Bradford Levy
66
2
0
26 Sep 2024
Autoregressive Generation Strategies for Top-K Sequential Recommendations
Anna Volodkevich
Danil Gusak
Anton Klenitskiy
Alexey Vasilev
46
0
0
26 Sep 2024
A Survey of Spatio-Temporal EEG data Analysis: from Models to Applications
Pengfei Wang
Huanran Zheng
Silong Dai
Yiqiao Wang
Xiaotian Gu
Yuanbin Wu
Xiaoling Wang
SyDa
AI4TS
159
4
0
26 Sep 2024
EAGLE: Egocentric AGgregated Language-video Engine
Jing Bi
Yunlong Tang
Luchuan Song
Ali Vosoughi
Nguyen Nguyen
Chenliang Xu
97
11
0
26 Sep 2024
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
Gongfan Fang
Hongxu Yin
Saurav Muralidharan
Greg Heinrich
Jeff Pool
Jan Kautz
Pavlo Molchanov
Xinchao Wang
73
10
0
26 Sep 2024
Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards
Heejin Do
Sangwon Ryu
Gary Geunbae Lee
78
2
0
26 Sep 2024
CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches
Sifan Wu
Amir Khasahmadi
Mor Katz
P. Jayaraman
Yewen Pu
K. Willis
Bang Liu
3DV
78
9
0
26 Sep 2024
JoyType: A Robust Design for Multilingual Visual Text Creation
Chao Li
Chen Jiang
Xiaolong Liu
Jun Zhao
Guoxin Wang
DiffM
130
7
0
26 Sep 2024
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models
Shaoxiong Ji
Zihao Li
Indraneil Paul
Jaakko Paavola
Peiqin Lin
...
Dayyán O'Brien
Hengyu Luo
Hinrich Schütze
Jörg Tiedemann
Barry Haddow
CLL
120
8
0
26 Sep 2024
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
Yifei Liu
Jicheng Wen
Yang Wang
Shengyu Ye
Li Lyna Zhang
Ting Cao
Cheng Li
Mao Yang
MQ
241
16
0
25 Sep 2024
ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology Analysis
Fangshuo Zhou
Huaxia Li
Rui Hu
Sensen Wu
Hailin Feng
Zhenhong Du
Liuchang Xu
DiffM
68
2
0
25 Sep 2024
Detecting Temporal Ambiguity in Questions
Bhawna Piryani
Abdelrahman Abdallah
Jamshid Mozafari
Adam Jatowt
65
1
0
25 Sep 2024
Harnessing Diversity for Important Data Selection in Pretraining Large Language Models
Chi Zhang
Huaping Zhong
Kuan Zhang
Chengliang Chai
Rui Wang
...
Lei Cao
Ju Fan
Ye Yuan
Guoren Wang
Conghui He
TDI
110
10
0
25 Sep 2024
Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions
Zeyneb N. Kaya
Souvick Ghosh
55
0
0
25 Sep 2024
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
Kyuheon Jung
Yongdeuk Seo
Seongwoo Cho
Jaeyoung Kim
Hyun-seok Min
Sungchul Choi
33
1
0
25 Sep 2024
Enhancing Temporal Sensitivity and Reasoning for Time-Sensitive Question Answering
Wanqi Yang
Yanda Li
Meng Fang
Ling Chen
93
8
0
25 Sep 2024
E-SQL: Direct Schema Linking via Question Enrichment in Text-to-SQL
Hasan Alp Caferoğlu
Özgür Ulusoy
126
22
0
25 Sep 2024
Probing Omissions and Distortions in Transformer-based RDF-to-Text Models
J. Faille
Albert Gatt
Claire Gardent
84
0
0
25 Sep 2024
SynTQA: Synergistic Table-based Question Answering via Mixture of Text-to-SQL and E2E TQA
Siyue Zhang
Anh Tuan Luu
Chen Zhao
LMTD
77
6
0
25 Sep 2024
Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated Texts
Taehun Cha
Donghun Lee
HILM
62
1
0
25 Sep 2024
Domain-Independent Automatic Generation of Descriptive Texts for Time-Series Data
Kota Dohi
Aoi Ito
Harsh Purohit
Tomoya Nishida
Takashi Endo
Yohei Kawaguchi
55
3
0
25 Sep 2024
Ascend HiFloat8 Format for Deep Learning
Yuanyong Luo
Zhongxing Zhang
Richard Wu
Hu Liu
Ying Jin
...
Korviakov Vladimir
Bobrin Maxim
Yuhao Hu
Guanfu Chen
Zeyi Huang
MQ
47
2
0
25 Sep 2024
Entailment-Driven Privacy Policy Classification with LLMs
Bhanuka Silva
Dishanika Denipitiyage
Suranga Seneviratne
Anirban Mahanti
Aruna Seneviratne
AILaw
55
0
0
25 Sep 2024
Unsupervised Text Representation Learning via Instruction-Tuning for Zero-Shot Dense Retrieval
Qiuhai Zeng
Zimeng Qiu
Dae Yon Hwang
Xin He
William M. Campbell
RALM
52
0
0
24 Sep 2024
Strategies for Improving NL-to-FOL Translation with LLMs: Data Generation, Incremental Fine-Tuning, and Verification
Ramya Keerthy Thatikonda
Paul Burgess
Wray Buntine
Jiuzhou Han
LRM
43
3
0
24 Sep 2024
Unlocking Markets: A Multilingual Benchmark to Cross-Market Question Answering
Yifei Yuan
Yang Deng
Anders Søgaard
Mohammad Aliannejadi
55
0
0
24 Sep 2024
Konstruktor: A Strong Baseline for Simple Knowledge Graph Question Answering
M. Lysyuk
Mikhail Salnikov
Pavel Braslavski
Alexander Panchenko
71
1
0
24 Sep 2024
Hyperbolic Image-and-Pointcloud Contrastive Learning for 3D Classification
Naiwen Hu
Haozhe Cheng
Yifan Xie
Pengcheng Shi
Jihua Zhu
3DPC
105
0
0
24 Sep 2024
Making Text Embedders Few-Shot Learners
Chaofan Li
Minghao Qin
Shitao Xiao
Jianlyu Chen
Kun Luo
Yingxia Shao
Defu Lian
Zheng Liu
111
37
0
24 Sep 2024
Qualitative Insights Tool (QualIT): LLM Enhanced Topic Modeling
Satya Kapoor
Alex Gil
S. Bhaduri
Anshul Mittal
Rutu Mulkar
60
4
0
24 Sep 2024
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts
Xiaoming Shi
Shiyu Wang
Yuqi Nie
Dianqi Li
Zhou Ye
Qingsong Wen
Ming Jin
AI4TS
185
56
0
24 Sep 2024
Enhancing Text-to-SQL Capabilities of Large Language Models via Domain Database Knowledge Injection
Xingyu Ma
Xin Tian
Lingxiang Wu
Xuepeng Wang
Xueming Tang
Jinqiao Wang
201
1
0
24 Sep 2024
Zero-shot forecasting of chaotic systems
Yuanzhao Zhang
William Gilpin
AI4TS
271
8
0
24 Sep 2024
Improving Academic Skills Assessment with NLP and Ensemble Learning
Xinyi Huang
Yingyi Wu
Danyang Zhang
Jiacheng Hu
Yujian Long
51
7
0
23 Sep 2024
In-Context Learning May Not Elicit Trustworthy Reasoning: A-Not-B Errors in Pretrained Language Models
Pengrui Han
Peiyang Song
Haofei Yu
Jiaxuan You
ReLM
LRM
80
1
0
23 Sep 2024
Steward: Natural Language Web Automation
Brian Tang
Kang G. Shin
LLMAG
66
1
0
23 Sep 2024
Efficiently Dispatching Flash Attention For Partially Filled Attention Masks
Agniv Sharma
Jonas Geiping
61
1
0
23 Sep 2024
Scaling Laws of Decoder-Only Models on the Multilingual Machine Translation Task
Gaëtan Caillaut
Raheel Qader
Mariam Nakhlé
Jingshu Liu
Jean-Gabriel Barthélemy
64
1
0
23 Sep 2024
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely
Siyun Zhao
Yuqing Yang
Zilong Wang
Zhiyuan He
Luna Qiu
Lili Qiu
SyDa
RALM
3DV
118
42
0
23 Sep 2024
With Ears to See and Eyes to Hear: Sound Symbolism Experiments with Multimodal Large Language Models
Tyler Loakman
Yucheng Li
Chenghua Lin
VLM
56
1
0
23 Sep 2024
Knowledge Planning in Large Language Models for Domain-Aligned Counseling Summarization
Aseem Srivastava
Smriti Joshi
Tanmoy Chakraborty
Md. Shad Akhtar
52
4
0
23 Sep 2024
Pre-trained Language Model and Knowledge Distillation for Lightweight Sequential Recommendation
Li Li
Mingyue Cheng
Zhiding Liu
Hao Zhang
Qi Liu
Enhong Chen
VLM
75
0
0
23 Sep 2024
OMPar: Automatic Parallelization with AI-Driven Source-to-Source Compilation
Tal Kadosh
N. Hasabnis
Prema Soundararajan
Vy A. Vo
Mihai Capota
Nesreen Ahmed
Yuval Pinter
Gal Oren
VLM
64
2
0
23 Sep 2024
VLM's Eye Examination: Instruct and Inspect Visual Competency of Vision Language Models
Nam Hyeon-Woo
Moon Ye-Bin
Wonseok Choi
Lee Hyun
Tae-Hyun Oh
CoGe
68
3
0
23 Sep 2024
Previous
1
2
3
...
37
38
39
...
197
198
199
Next