ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.05131
  4. Cited By
UL2: Unifying Language Learning Paradigms
v1v2v3 (latest)

UL2: Unifying Language Learning Paradigms

10 May 2022
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
Xavier Garcia
Jason W. Wei
Xuezhi Wang
Hyung Won Chung
Siamak Shakeri
Dara Bahri
Tal Schuster
H. Zheng
Denny Zhou
N. Houlsby
Donald Metzler
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "UL2: Unifying Language Learning Paradigms"

50 / 227 papers shown
Title
CIRCUITSYNTH: Leveraging Large Language Models for Circuit Topology
  Synthesis
CIRCUITSYNTH: Leveraging Large Language Models for Circuit Topology Synthesis
Prashanth Vijayaraghavan
Luyao Shi
Ehsan Degan
Xin Zhang
113
2
0
06 Jun 2024
Landscape-Aware Growing: The Power of a Little LAG
Landscape-Aware Growing: The Power of a Little LAG
Stefani Karp
Nikunj Saunshi
Sobhan Miryoosefi
Sashank J. Reddi
Sanjiv Kumar
87
1
0
04 Jun 2024
LLMs Could Autonomously Learn Without External Supervision
LLMs Could Autonomously Learn Without External Supervision
Ke Ji
Junying Chen
Anningzhe Gao
Wenya Xie
Xiang Wan
Benyou Wang
98
4
0
02 Jun 2024
Large Language Model Confidence Estimation via Black-Box Access
Large Language Model Confidence Estimation via Black-Box Access
Tejaswini Pedapati
Amit Dhurandhar
Soumya Ghosh
Soham Dan
P. Sattigeri
251
5
0
01 Jun 2024
LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking
LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking
Yifan Zeng
Ojas Tendolkar
Raymond Baartmans
Qingyun Wu
Huazheng Wang
Lizhong Chen
84
0
0
31 May 2024
Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits
  Multimodal Reasoning
Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning
Cheng Tan
Jingxuan Wei
Linzhuang Sun
Zhangyang Gao
Siyuan Li
Bihui Yu
Ruifeng Guo
Stan Z. Li
ReLMLRM3DV
127
7
0
31 May 2024
Is My Data in Your Retrieval Database? Membership Inference Attacks Against Retrieval Augmented Generation
Is My Data in Your Retrieval Database? Membership Inference Attacks Against Retrieval Augmented Generation
Maya Anderson
Guy Amit
Abigail Goldsteen
AAML
154
19
0
30 May 2024
Faster Cascades via Speculative Decoding
Faster Cascades via Speculative Decoding
Harikrishna Narasimhan
Wittawat Jitkrittum
A. S. Rawat
Seungyeon Kim
Neha Gupta
A. Menon
Sanjiv Kumar
LRM
118
10
0
29 May 2024
Wavelet-Based Image Tokenizer for Vision Transformers
Wavelet-Based Image Tokenizer for Vision Transformers
Zhenhai Zhu
Radu Soricut
ViT
109
5
0
28 May 2024
Scaling Laws for Discriminative Classification in Large Language Models
Scaling Laws for Discriminative Classification in Large Language Models
Dean Wyatte
Fatemeh Tahmasbi
Ming Li
Thomas Markovich
99
2
0
24 May 2024
A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large
  Language Models Reveal Human-like Patterns
A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns
Asaf Yehudai
Taelin Karidi
Gabriel Stanovsky
Ariel Goldstein
Omri Abend
91
1
0
23 May 2024
Bitune: Bidirectional Instruction-Tuning
Bitune: Bidirectional Instruction-Tuning
D. J. Kopiczko
Tijmen Blankevoort
Yuki Markus Asano
50
3
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
353
54
0
23 May 2024
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
Minghao Wu
Jiahao Xu
Yulin Yuan
Gholamreza Haffari
Longyue Wang
Weihua Luo
Kaifu Zhang
LLMAG
199
27
0
20 May 2024
IGOT: Information Gain Optimized Tokenizer on Domain Adaptive
  Pretraining
IGOT: Information Gain Optimized Tokenizer on Domain Adaptive Pretraining
Dawei Feng
Yihai Zhang
Zhixuan Xu
SyDa
55
0
0
16 May 2024
DEPTH: Discourse Education through Pre-Training Hierarchically
DEPTH: Discourse Education through Pre-Training Hierarchically
Zachary Bamberger
Ofek Glick
Chaim Baskin
Yonatan Belinkov
128
0
0
13 May 2024
OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage
  Pruning
OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage Pruning
Dan Qiao
Yi Su
Pinzheng Wang
Jing Ye
Wen Xie
...
Wenliang Chen
Guohong Fu
Guodong Zhou
Qiaoming Zhu
Min Zhang
MQ
65
0
0
09 May 2024
Better & Faster Large Language Models via Multi-token Prediction
Better & Faster Large Language Models via Multi-token Prediction
Fabian Gloeckle
Badr Youbi Idrissi
Baptiste Rozière
David Lopez-Paz
Gabriele Synnaeve
116
121
0
30 Apr 2024
Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
Timin Gao
Peixian Chen
Mengdan Zhang
Chaoyou Fu
Yunhang Shen
...
Shengchuan Zhang
Xiawu Zheng
Xing Sun
Liujuan Cao
Rongrong Ji
MLLMLRM
126
22
0
24 Apr 2024
Pillars of Grammatical Error Correction: Comprehensive Inspection Of
  Contemporary Approaches In The Era of Large Language Models
Pillars of Grammatical Error Correction: Comprehensive Inspection Of Contemporary Approaches In The Era of Large Language Models
Kostiantyn Omelianchuk
Andrii Liubonko
Oleksandr Skurzhanskyi
Artem Chernodub
Oleksandr Korniienko
Igor Samokhin
88
2
0
23 Apr 2024
Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?
Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?
Chengwei Qin
Wenhan Xia
Tan Wang
Fangkai Jiao
Yuchen Hu
Bosheng Ding
Ruirui Chen
Shafiq Joty
LRM
129
5
0
19 Apr 2024
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Shusheng Xu
Wei Fu
Jiaxuan Gao
Wenjie Ye
Weiling Liu
Zhiyu Mei
Guangju Wang
Chao Yu
Yi Wu
174
165
0
16 Apr 2024
Language Model Cascades: Token-level uncertainty and beyond
Language Model Cascades: Token-level uncertainty and beyond
Neha Gupta
Harikrishna Narasimhan
Wittawat Jitkrittum
A. S. Rawat
A. Menon
Sanjiv Kumar
UQLM
144
56
0
15 Apr 2024
Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning
  Skills in Large Language Models
Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models
Yantao Liu
Zijun Yao
Xin Lv
Yuchen Fan
S. Cao
Jifan Yu
Lei Hou
Juanzi Li
105
3
0
04 Apr 2024
Using Large Language Models to Enrich the Documentation of Datasets for
  Machine Learning
Using Large Language Models to Enrich the Documentation of Datasets for Machine Learning
Joan Giner-Miguelez
Abel Gómez
Jordi Cabot
LLMAG
85
4
0
04 Apr 2024
Query Performance Prediction using Relevance Judgments Generated by Large Language Models
Query Performance Prediction using Relevance Judgments Generated by Large Language Models
Chuan Meng
Negar Arabzadeh
Arian Askari
Mohammad Aliannejadi
Maarten de Rijke
LRM
149
12
0
01 Apr 2024
Multi-Level Explanations for Generative Language Models
Multi-Level Explanations for Generative Language Models
Lucas Monteiro Paes
Dennis L. Wei
Hyo Jin Do
Hendrik Strobelt
Ronny Luss
...
Manish Nagireddy
Karthikeyan N. Ramamurthy
P. Sattigeri
Werner Geyer
Soumya Ghosh
FAtt
103
8
0
21 Mar 2024
Generalizing Denoising to Non-Equilibrium Structures Improves
  Equivariant Force Fields
Generalizing Denoising to Non-Equilibrium Structures Improves Equivariant Force Fields
Yi-Lun Liao
Tess E. Smidt
Abhishek Das
DiffMAI4CE
75
12
0
14 Mar 2024
Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and
  Understanding -- A Survey
Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey
Xi Fang
Weijie Xu
Fiona Anting Tan
Jiani Zhang
Ziqing Hu
Yanjun Qi
Scott Nickleach
Diego Socolinsky
Srinivasan H. Sengamedu
Christos Faloutsos
LMTDALM
191
81
0
27 Feb 2024
When Scaling Meets LLM Finetuning: The Effect of Data, Model and
  Finetuning Method
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Biao Zhang
Zhongtao Liu
Colin Cherry
Orhan Firat
LRM
119
160
0
27 Feb 2024
StructLM: Towards Building Generalist Models for Structured Knowledge
  Grounding
StructLM: Towards Building Generalist Models for Structured Knowledge Grounding
Alex Zhuang
Ge Zhang
Tianyu Zheng
Xinrun Du
Junjie Wang
Weiming Ren
Stephen W. Huang
Jie Fu
Xiang Yue
Wenhu Chen
LMTD
160
21
0
26 Feb 2024
AttributionBench: How Hard is Automatic Attribution Evaluation?
AttributionBench: How Hard is Automatic Attribution Evaluation?
Yifei Li
Xiang Yue
Zeyi Liao
Huan Sun
HILM
86
13
0
23 Feb 2024
Chain-of-Thought Unfaithfulness as Disguised Accuracy
Chain-of-Thought Unfaithfulness as Disguised Accuracy
Oliver Bentham
Nathan Stringham
Ana Marasović
LRMHILM
90
16
0
22 Feb 2024
GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
Kundan Krishna
S. Ramprasad
Prakhar Gupta
Byron C. Wallace
Zachary Chase Lipton
Jeffrey P. Bigham
HILMKELMSyDa
147
9
0
19 Feb 2024
The Revolution of Multimodal Large Language Models: A Survey
The Revolution of Multimodal Large Language Models: A Survey
Davide Caffagni
Federico Cocchi
Luca Barsellotti
Nicholas Moratelli
Sara Sarto
Lorenzo Baraldi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
LRMVLM
139
64
0
19 Feb 2024
Structured Chain-of-Thought Prompting for Few-Shot Generation of
  Content-Grounded QA Conversations
Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations
M. Sultan
Jatin Ganhotra
Ramón Fernández Astudillo
LRM
71
3
0
19 Feb 2024
Multi-dimensional Evaluation of Empathetic Dialog Responses
Multi-dimensional Evaluation of Empathetic Dialog Responses
Zhichao Xu
Jiepu Jiang
68
3
0
18 Feb 2024
Efficient Stagewise Pretraining via Progressive Subnetworks
Efficient Stagewise Pretraining via Progressive Subnetworks
Abhishek Panigrahi
Nikunj Saunshi
Kaifeng Lyu
Sobhan Miryoosefi
Sashank J. Reddi
Satyen Kale
Sanjiv Kumar
69
6
0
08 Feb 2024
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Gilles Baechler
Srinivas Sunkara
Maria Wang
Fedir Zubach
Hassan Mansoor
Vincent Etter
Victor Carbune
Jason Lin
Jindong Chen
Abhanshu Sharma
199
59
0
07 Feb 2024
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Fuzhao Xue
Zian Zheng
Yao Fu
Jinjie Ni
Zangwei Zheng
Wangchunshu Zhou
Yang You
MoE
112
104
0
29 Jan 2024
TURNA: A Turkish Encoder-Decoder Language Model for Enhanced
  Understanding and Generation
TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation
Gokcce Uludougan
Zeynep Yirmibecsouglu Balal
Furkan Akkurt
Melikcsah Turker
Onur Gungor
S. Uskudarli
78
12
0
25 Jan 2024
MM-LLMs: Recent Advances in MultiModal Large Language Models
MM-LLMs: Recent Advances in MultiModal Large Language Models
Duzhen Zhang
Yahan Yu
Jiahua Dong
Chenxing Li
Dan Su
Chenhui Chu
Dong Yu
OffRLLRM
174
217
0
24 Jan 2024
MaLA-500: Massive Language Adaptation of Large Language Models
MaLA-500: Massive Language Adaptation of Large Language Models
Peiqin Lin
Shaoxiong Ji
Jörg Tiedemann
André F. T. Martins
Hinrich Schütze
ELM
125
18
0
24 Jan 2024
Health-LLM: Large Language Models for Health Prediction via Wearable
  Sensor Data
Health-LLM: Large Language Models for Health Prediction via Wearable Sensor Data
Y. Kim
X. Xu
Daniel J. McDuff
C. Breazeal
Hae Won Park
AI4MHLM&MA
109
72
0
12 Jan 2024
Structsum Generation for Faster Text Comprehension
Structsum Generation for Faster Text Comprehension
Parag Jain
Andreea Marzoca
Francesco Piccinno
ReLM
74
8
0
12 Jan 2024
Distilling Vision-Language Models on Millions of Videos
Distilling Vision-Language Models on Millions of Videos
Yue Zhao
Long Zhao
Xingyi Zhou
Jialin Wu
Chun-Te Chu
...
Hartwig Adam
Ting Liu
Boqing Gong
Philipp Krahenbuhl
Liangzhe Yuan
VLM
96
14
0
11 Jan 2024
xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering
  the Language of Protein
xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein
Bo Chen
Xingyi Cheng
Pan Li
Yangli-ao Geng
Jing Gong
...
Chiming Liu
Aohan Zeng
Yuxiao Dong
Jie Tang
Leo T. Song
83
113
0
11 Jan 2024
Attendre: Wait To Attend By Retrieval With Evicted Queries in
  Memory-Based Transformers for Long Context Processing
Attendre: Wait To Attend By Retrieval With Evicted Queries in Memory-Based Transformers for Long Context Processing
Zi Yang
Nan Hua
RALM
97
4
0
10 Jan 2024
Enhanced Automated Code Vulnerability Repair using Large Language Models
Enhanced Automated Code Vulnerability Repair using Large Language Models
David de-Fitero-Dominguez
Eva García-López
Antonio Garcia-Cabot
J. Martínez-Herráiz
67
16
0
08 Jan 2024
A Simple LLM Framework for Long-Range Video Question-Answering
A Simple LLM Framework for Long-Range Video Question-Answering
Ce Zhang
Taixi Lu
Md. Mohaiminul Islam
Ziyang Wang
Shoubin Yu
Mohit Bansal
Gedas Bertasius
195
92
0
28 Dec 2023
Previous
12345
Next