ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.05131
  4. Cited By
UL2: Unifying Language Learning Paradigms
v1v2v3 (latest)

UL2: Unifying Language Learning Paradigms

10 May 2022
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
Xavier Garcia
Jason W. Wei
Xuezhi Wang
Hyung Won Chung
Siamak Shakeri
Dara Bahri
Tal Schuster
H. Zheng
Denny Zhou
N. Houlsby
Donald Metzler
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "UL2: Unifying Language Learning Paradigms"

50 / 227 papers shown
Title
S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Modelwith Spatio-Temporal Visual Representation
S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Modelwith Spatio-Temporal Visual Representation
Yichen Xie
Runsheng Xu
Tong He
Jyh-Jing Hwang
Katie Luo
...
Letian Chen
Yiren Lu
Zhaoqi Leng
Dragomir Anguelov
Mingxing Tan
VLMLRM
59
0
0
30 May 2025
Test-Time Learning for Large Language Models
Test-Time Learning for Large Language Models
Jinwu Hu
Zhitian Zhang
Guohao Chen
Xutao Wen
Chao Shuai
Wei Luo
Bin Xiao
Yuanqing Li
Mingkui Tan
59
0
0
27 May 2025
Learning Extrapolative Sequence Transformations from Markov Chains
Learning Extrapolative Sequence Transformations from Markov Chains
Sophia Hager
Aleem Khan
Andrew Wang
Nicholas Andrews
BDL
44
0
0
26 May 2025
FlowKV: Enhancing Multi-Turn Conversational Coherence in LLMs via Isolated Key-Value Cache Management
FlowKV: Enhancing Multi-Turn Conversational Coherence in LLMs via Isolated Key-Value Cache Management
Xiang Liu
Hong Chen
Xuming Hu
Xiaowen Chu
96
0
0
21 May 2025
Large Language Models and Their Applications in Roadway Safety and Mobility Enhancement: A Comprehensive Review
Large Language Models and Their Applications in Roadway Safety and Mobility Enhancement: A Comprehensive Review
Muhammad Monjurul Karim
Yan Shi
Shucheng Zhang
Bingzhang Wang
Mehrdad Nasri
Yinhai Wang
35
0
0
19 May 2025
Neural Encoding and Decoding at Scale
Neural Encoding and Decoding at Scale
Yizi Zhang
Yanchen Wang
Mehdi Azabou
Alexandre Andre
Zixuan Wang
Hanrui Lyu
International Brain Laboratory
Eva L. Dyer
Liam Paninski
Cole Hurwitz
AI4CE
172
1
0
11 Apr 2025
Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation
Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation
Biao Zhang
Fedor Moiseev
Joshua Ainslie
Paul Suganthan
Min Ma
Surya Bhupatiraju
Fede Lebron
Orhan Firat
Armand Joulin
Zhe Dong
AI4CE
61
0
0
08 Apr 2025
DDPT: Diffusion-Driven Prompt Tuning for Large Language Model Code Generation
DDPT: Diffusion-Driven Prompt Tuning for Large Language Model Code Generation
Jinyang Li
Sangwon Hyun
Muhammad Ali Babar
74
0
0
06 Apr 2025
Do LLMs Surpass Encoders for Biomedical NER?
Do LLMs Surpass Encoders for Biomedical NER?
Motasem S Obeidat
Md Sultan al Nahian
R. Kavuluru
89
0
0
01 Apr 2025
Large Language Model as Meta-Surrogate for Data-Driven Many-Task Optimization: A Proof-of-Principle Study
Wei Wei
Yue-Jiao Gong
Jun Zhang
110
0
0
11 Mar 2025
Reasoning with Latent Thoughts: On the Power of Looped Transformers
Reasoning with Latent Thoughts: On the Power of Looped Transformers
Nikunj Saunshi
Nishanth Dikkala
Zhiyuan Li
Sanjiv Kumar
Sashank J. Reddi
OffRLLRMAI4CE
159
22
0
24 Feb 2025
Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasks
Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasks
Jing Yang
Max Glockner
Anderson de Rezende Rocha
Iryna Gurevych
LRM
171
1
0
07 Feb 2025
Multi-Task Model Merging via Adaptive Weight Disentanglement
Multi-Task Model Merging via Adaptive Weight Disentanglement
Feng Xiong
Runxi Cheng
Wang Chen
Zhanqiu Zhang
Yiwen Guo
Chun Yuan
Ruifeng Xu
MoMe
212
8
0
10 Jan 2025
iServe: An Intent-based Serving System for LLMs
iServe: An Intent-based Serving System for LLMs
Dimitrios Liakopoulos
Tianrui Hu
Prasoon Sinha
N. Yadwadkar
VLM
532
0
0
08 Jan 2025
MM-Path: Multi-modal, Multi-granularity Path Representation Learning -- Extended Version
MM-Path: Multi-modal, Multi-granularity Path Representation Learning -- Extended Version
Ronghui Xu
Hanyin Cheng
Chenjuan Guo
Hongfan Gao
Jiaxi Hu
Sean Bin Yang
Bin Yang
200
5
0
03 Jan 2025
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
Hongjie Wang
Chih-Yao Ma
Yen-Cheng Liu
Ji Hou
Tao Xu
...
Peizhao Zhang
Tingbo Hou
Peter Vajda
N. Jha
Xiaoliang Dai
LMTDVGenVLMDiffM
200
11
0
13 Dec 2024
Can bidirectional encoder become the ultimate winner for downstream
  applications of foundation models?
Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?
Lewen Yang
Xuanyu Zhou
Juao Fan
Xinyi Xie
Shengxin Zhu
AI4CE
127
0
0
27 Nov 2024
CoA: Chain-of-Action for Generative Semantic Labels
CoA: Chain-of-Action for Generative Semantic Labels
Meng Wei
Zhongnian Li
Peng Ying
Xinzheng Xu
VLM
129
0
0
26 Nov 2024
MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language
  Model
MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language Model
Yifan Wu
Min Zeng
Yang Li
Yize Zhang
Min Li
178
1
0
23 Nov 2024
Character-level Tokenizations as Powerful Inductive Biases for RNA Foundational Models
Adrián Morales-Pastor
Raquel Vázquez-Reza
Miłosz Wieczór
Clàudia Valverde
Manel Gil-Sorribes
Bertran Miquel-Oliver
Álvaro Ciudad
Alexis Molina
AI4CE
96
0
0
05 Nov 2024
Training Compute-Optimal Protein Language Models
Training Compute-Optimal Protein Language Models
Xingyi Cheng
Bo Chen
Pan Li
Jing Gong
Jie Tang
Le Song
133
17
0
04 Nov 2024
P-Masking: Power Law Masking Improves Multi-attribute Controlled
  Generation
P-Masking: Power Law Masking Improves Multi-attribute Controlled Generation
Mohamed Elgaar
Hadi Amiri
AI4CE
76
0
0
31 Oct 2024
Demystifying Large Language Models for Medicine: A Primer
Demystifying Large Language Models for Medicine: A Primer
Qiao Jin
Nicholas Wan
Robert Leaman
Shubo Tian
Zhizheng Wang
...
Chunhua Weng
Ronald M. Summers
Qingyu Chen
Yifan Peng
Zhiyong Lu
LM&MA
111
5
0
24 Oct 2024
A Little Help Goes a Long Way: Efficient LLM Training by Leveraging
  Small LMs
A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs
A. S. Rawat
Veeranjaneyulu Sadhanala
Afshin Rostamizadeh
Ayan Chakrabarti
Wittawat Jitkrittum
...
Rakesh Shivanna
Sashank J. Reddi
A. Menon
Rohan Anil
Sanjiv Kumar
152
3
0
24 Oct 2024
Responsible Multilingual Large Language Models: A Survey of Development,
  Applications, and Societal Impact
Responsible Multilingual Large Language Models: A Survey of Development, Applications, and Societal Impact
Junhua Liu
Bin Fu
LRM
44
1
0
23 Oct 2024
MiniPLM: Knowledge Distillation for Pre-Training Language Models
MiniPLM: Knowledge Distillation for Pre-Training Language Models
Yuxian Gu
Hao Zhou
Fandong Meng
Jie Zhou
Minlie Huang
233
7
0
22 Oct 2024
Reducing Hallucinations in Vision-Language Models via Latent Space
  Steering
Reducing Hallucinations in Vision-Language Models via Latent Space Steering
Sheng Liu
Haotian Ye
Lei Xing
James Zou
VLMLLMSV
173
9
0
21 Oct 2024
A Benchmark for Cross-Domain Argumentative Stance Classification on
  Social Media
A Benchmark for Cross-Domain Argumentative Stance Classification on Social Media
Jiaqing Yuan
Ruijie Xi
Munindar P. Singh
47
0
0
11 Oct 2024
LongGenBench: Long-context Generation Benchmark
LongGenBench: Long-context Generation Benchmark
Xiang Liu
Peijie Dong
Xuming Hu
Xiaowen Chu
RALM
118
9
0
05 Oct 2024
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge
  Distillation for Large Language Models in Code Generation
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation
Ziyang Luo
Xin Li
Hongzhan Lin
Jing Ma
Lidong Bing
VLM
70
0
0
01 Oct 2024
On the Inductive Bias of Stacking Towards Improving Reasoning
On the Inductive Bias of Stacking Towards Improving Reasoning
Nikunj Saunshi
Stefani Karp
Shankar Krishnan
Sobhan Miryoosefi
Sashank J. Reddi
Sanjiv Kumar
LRMAI4CE
90
7
0
27 Sep 2024
Imagine yourself: Tuning-Free Personalized Image Generation
Imagine yourself: Tuning-Free Personalized Image Generation
Zecheng He
Bo Sun
Felix Juefei-Xu
Haoyu Ma
Ankit Ramchandani
...
Ning Zhang
Peizhao Zhang
Roshan Sumbaly
Peter Vajda
Animesh Sinha
DiffM
105
19
0
20 Sep 2024
From Linguistic Giants to Sensory Maestros: A Survey on Cross-Modal
  Reasoning with Large Language Models
From Linguistic Giants to Sensory Maestros: A Survey on Cross-Modal Reasoning with Large Language Models
Shengsheng Qian
Zuyi Zhou
Dizhan Xue
Bing Wang
Changsheng Xu
LRM
162
2
0
19 Sep 2024
A Survey of Large Language Models for European Languages
A Survey of Large Language Models for European Languages
Wazir Ali
S. Pyysalo
163
3
0
27 Aug 2024
Natural Language Outlines for Code: Literate Programming in the LLM Era
Natural Language Outlines for Code: Literate Programming in the LLM Era
Kensen Shi
Deniz Altınbüken
Saswat Anand
Mihai Christodorescu
Katja Grünwedel
...
Tobias Welp
Pengcheng Yin
Manzil Zaheer
Satish Chandra
Charles Sutton
157
7
0
09 Aug 2024
Coalitions of Large Language Models Increase the Robustness of AI Agents
Coalitions of Large Language Models Increase the Robustness of AI Agents
Prattyush Mangal
Carol Mak
Theo Kanakis
Timothy Donovan
Dave Braines
Edward Pyzer-Knapp
58
1
0
02 Aug 2024
Intermittent Semi-working Mask: A New Masking Paradigm for LLMs
Intermittent Semi-working Mask: A New Masking Paradigm for LLMs
Mingcong Lu
Jiangcai Zhu
Wang Hao
Zheng Li
Shusheng Zhang
Kailai Shao
Chao Chen
Nan Li
Feng Wang
Xin Lu
72
0
0
01 Aug 2024
Towards a "universal translator" for neural dynamics at single-cell,
  single-spike resolution
Towards a "universal translator" for neural dynamics at single-cell, single-spike resolution
Yizi Zhang
Yanchen Wang
Donato Jimenez-Beneto
Zixuan Wang
Mehdi Azabou
...
Olivier Winter
The International Brain Laboratory
Eva L. Dyer
Liam Paninski
Cole Hurwitz
MedImAI4CE
81
14
0
19 Jul 2024
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding
Ofir Abramovich
Niv Nayman
Sharon Fogel
I. Lavi
Ron Litman
Shahar Tsiper
Royee Tichauer
Srikar Appalaraju
Shai Mazor
R. Manmatha
VLM
109
3
0
17 Jul 2024
A Survey on Symbolic Knowledge Distillation of Large Language Models
A Survey on Symbolic Knowledge Distillation of Large Language Models
Kamal Acharya
Alvaro Velasquez
Haoze Song
SyDa
78
7
0
12 Jul 2024
HDT: Hierarchical Document Transformer
HDT: Hierarchical Document Transformer
Haoyu He
Markus Flicke
Jan Buchmann
Iryna Gurevych
Andreas Geiger
92
0
0
11 Jul 2024
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via
  Dynamic Sparse Attention
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention
Huiqiang Jiang
Yucheng Li
Chengruidong Zhang
Qianhui Wu
Xufang Luo
...
Amir H. Abdi
Dongsheng Li
Chin-Yew Lin
Yuqing Yang
L. Qiu
165
122
0
02 Jul 2024
Reliable Confidence Intervals for Information Retrieval Evaluation Using
  Generative A.I
Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I
Harrie Oosterhuis
R. Jagerman
Zhen Qin
Xuanhui Wang
Michael Bendersky
89
5
0
02 Jul 2024
RVISA: Reasoning and Verification for Implicit Sentiment Analysis
RVISA: Reasoning and Verification for Implicit Sentiment Analysis
Wenna Lai
H. Xie
Guandong Xu
Qing Li
LRM
93
3
0
02 Jul 2024
Eliminating Position Bias of Language Models: A Mechanistic Approach
Eliminating Position Bias of Language Models: A Mechanistic Approach
Ziqi Wang
Hanlin Zhang
Xiner Li
Kuan-Hao Huang
Chi Han
Shuiwang Ji
Sham Kakade
Hao Peng
Heng Ji
181
20
0
01 Jul 2024
Structured Unrestricted-Rank Matrices for Parameter Efficient
  Fine-tuning
Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning
Arijit Sehanobish
Avinava Dubey
Krzysztof Choromanski
Somnath Basu Roy Chowdhury
Deepali Jain
Vikas Sindhwani
Snigdha Chaturvedi
ALM
100
3
0
25 Jun 2024
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning
Brandon Huang
Chancharik Mitra
Assaf Arbelle
Leonid Karlinsky
Trevor Darrell
Roei Herzig
101
21
0
21 Jun 2024
Enhancing Question Answering on Charts Through Effective Pre-training
  Tasks
Enhancing Question Answering on Charts Through Effective Pre-training Tasks
Ashim Gupta
Vivek Gupta
Shuo Zhang
Yujie He
Ning Zhang
Shalin S Shah
47
3
0
14 Jun 2024
Large Language Models Meet Text-Centric Multimodal Sentiment Analysis: A
  Survey
Large Language Models Meet Text-Centric Multimodal Sentiment Analysis: A Survey
Hao Yang
Yanyan Zhao
Yang Wu
Shilong Wang
Tian Zheng
Hongbo Zhang
Zongyang Ma
Wanxiang Che
Bing Qin
135
14
0
12 Jun 2024
The Factorization Curse: Which Tokens You Predict Underlie the Reversal
  Curse and More
The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More
O. Kitouni
Niklas Nolte
Diane Bouchacourt
Adina Williams
Mike Rabbat
Mark Ibrahim
LRMCLL
104
12
0
07 Jun 2024
12345
Next