Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.02311
Cited By
v1
v2
v3
v4
v5 (latest)
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"PaLM: Scaling Language Modeling with Pathways"
50 / 4,332 papers shown
Title
Self-Discover: Large Language Models Self-Compose Reasoning Structures
Pei Zhou
Jay Pujara
Xiang Ren
Xinyun Chen
Heng-Tze Cheng
Quoc V. Le
Ed H. Chi
Denny Zhou
Swaroop Mishra
Huaixiu Steven Zheng
LRM
ReLM
82
56
0
06 Feb 2024
TexShape: Information Theoretic Sentence Embedding for Language Models
Kaan Kale
H. Esfahanizadeh
Noel Elias
Oguzhan Baser
Muriel Médard
S. Vishwanath
72
3
0
05 Feb 2024
Shortened LLaMA: Depth Pruning for Large Language Models with Comparison of Retraining Methods
Bo-Kyeong Kim
Geonmin Kim
Tae-Ho Kim
Thibault Castells
Shinkook Choi
Junho Shin
Hyoung-Kyu Song
118
40
0
05 Feb 2024
KS-Lottery: Finding Certified Lottery Tickets for Multilingual Language Models
Fei Yuan
Chang Ma
Shuai Yuan
Qiushi Sun
Lei Li
74
3
0
05 Feb 2024
DeAL: Decoding-time Alignment for Large Language Models
James Y. Huang
Sailik Sengupta
Daniele Bonadiman
Yi-An Lai
Arshit Gupta
Nikolaos Pappas
Saab Mansour
Katrin Kirchoff
Dan Roth
131
36
0
05 Feb 2024
Position: What Can Large Language Models Tell Us about Time Series Analysis
Ming Jin
Yifan Zhang
Wei Chen
Kexin Zhang
Yuxuan Liang
Bin Yang
Jindong Wang
Shirui Pan
Qingsong Wen
AI4TS
90
23
0
05 Feb 2024
Integration of cognitive tasks into artificial general intelligence test for large models
Youzhi Qu
Chen Wei
Penghui Du
Wenxin Che
Chi Zhang
...
Bin Hu
Kai Du
Haiyan Wu
Jia Liu
Quanying Liu
ELM
68
10
0
04 Feb 2024
LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model
Dilxat Muhtar
Zhenshi Li
Feng-Xue Gu
Xue-liang Zhang
Pengfeng Xiao
206
62
0
04 Feb 2024
GLaPE: Gold Label-agnostic Prompt Evaluation and Optimization for Large Language Model
Xuanchang Zhang
Zhuosheng Zhang
Hai Zhao
LRM
ALM
67
3
0
04 Feb 2024
NetLLM: Adapting Large Language Models for Networking
Duo Wu
Xianda Wang
Yaqi Qiao
Zhi Wang
Junchen Jiang
Shuguang Cui
Fangxin Wang
107
49
0
04 Feb 2024
A Survey on Data Selection for LLM Instruction Tuning
Bolin Zhang
Jiahao Wang
Qianlong Du
Jiajun Zhang
Zhiying Tu
Dianhui Chu
113
48
0
04 Feb 2024
Beyond the Limits: A Survey of Techniques to Extend the Context Length in Large Language Models
Xindi Wang
Mahsa Salmani
Parsa Omidi
Xiangyu Ren
Mehdi Rezagholizadeh
A. Eshaghi
LRM
99
48
0
03 Feb 2024
Do Moral Judgment and Reasoning Capability of LLMs Change with Language? A Study using the Multilingual Defining Issues Test
Aditi Khandelwal
Utkarsh Agarwal
Kumar Tanmay
Monojit Choudhury
ELM
LRM
68
7
0
03 Feb 2024
Code Representation Learning At Scale
Dejiao Zhang
W. Ahmad
Ming Tan
Hantian Ding
Ramesh Nallapati
Dan Roth
Xiaofei Ma
Bing Xiang
OffRL
68
12
0
02 Feb 2024
Natural language guidance of high-fidelity text-to-speech with synthetic annotations
Daniel Lyth
Simon King
115
49
0
02 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
86
10
0
02 Feb 2024
Fractal Patterns May Illuminate the Success of Next-Token Prediction
Ibrahim Alabdulmohsin
Vinh Q. Tran
Mostafa Dehghani
55
2
0
02 Feb 2024
Enhancing Stochastic Gradient Descent: A Unified Framework and Novel Acceleration Methods for Faster Convergence
Yichuan Deng
Zhao Song
Chiwun Yang
56
1
0
02 Feb 2024
LimSim++: A Closed-Loop Platform for Deploying Multimodal LLMs in Autonomous Driving
Daocheng Fu
Wenjie Lei
Licheng Wen
Pinlong Cai
Song Mao
Min Dou
Botian Shi
Yu Qiao
122
31
0
02 Feb 2024
Large Language Models for Time Series: A Survey
Xiyuan Zhang
Ranak Roy Chowdhury
Rajesh K. Gupta
Jingbo Shang
AI4TS
165
67
0
02 Feb 2024
CorpusLM: Towards a Unified Language Model on Corpus for Knowledge-Intensive Tasks
Xiaoxi Li
Zhicheng Dou
Yujia Zhou
Fangchao Liu
RALM
104
17
0
02 Feb 2024
Efficient Prompt Caching via Embedding Similarity
Hanlin Zhu
Banghua Zhu
Jiantao Jiao
RALM
92
9
0
02 Feb 2024
When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards
Norah A. Alzahrani
H. A. Alyahya
Sultan Yazeed Alnumay
Muhtasim Tahmid
Shaykhah Alsubaie
...
Saleh Soltan
Nathan Scales
Marie-Anne Lachaux
Samuel R. Bowman
Haidar Khan
ELM
151
80
0
01 Feb 2024
Evaluating Large Language Models for Generalization and Robustness via Data Compression
Yucheng Li
Yunhao Guo
Frank Guerin
Chenghua Lin
ELM
102
6
0
01 Feb 2024
Can Large Language Models Understand Context?
Yilun Zhu
Joel Ruben Antony Moniz
Shruti Bhargava
Jiarui Lu
Dhivya Piraviperumal
Site Li
Yuan-kang Zhang
Hong-ye Yu
Bo-Hsiang Tseng
104
26
0
01 Feb 2024
Towards Efficient Exact Optimization of Language Model Alignment
Haozhe Ji
Cheng Lu
Yilin Niu
Pei Ke
Hongning Wang
Jun Zhu
Jie Tang
Minlie Huang
104
20
0
01 Feb 2024
Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?
Xue-Yong Fu
Md Tahmid Rahman Laskar
Elena Khasanova
Cheng-Hsiung Chen
TN ShashiBhushan
ALM
100
23
0
01 Feb 2024
OLMo: Accelerating the Science of Language Models
Dirk Groeneveld
Iz Beltagy
Pete Walsh
Akshita Bhagia
Rodney Michael Kinney
...
Jesse Dodge
Kyle Lo
Luca Soldaini
Noah A. Smith
Hanna Hajishirzi
OSLM
226
413
0
01 Feb 2024
Ocassionally Secure: A Comparative Analysis of Code Generation Assistants
Ran Elgedawy
John Sadik
Senjuti Dutta
Anuj Gautam
Konstantinos Georgiou
Farzin Gholamrezae
Fujiao Ji
Kyungchan Lim
Qian Liu
Scott Ruoti
110
8
0
01 Feb 2024
A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains
Alon Jacovi
Yonatan Bitton
Bernd Bohnet
Jonathan Herzig
Or Honovich
Michael Tseng
Michael Collins
Roee Aharoni
Mor Geva
LRM
144
27
0
01 Feb 2024
EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models
Xuchen Pan
Yanxi Chen
Yaliang Li
Bolin Ding
Jingren Zhou
79
8
0
01 Feb 2024
Disentangling the Roles of Target-Side Transfer and Regularization in Multilingual Machine Translation
Yan Meng
Christof Monz
LRM
81
2
0
01 Feb 2024
Computational Experiments Meet Large Language Model Based Agents: A Survey and Perspective
Qun Ma
Xiao Xue
Deyu Zhou
Xiangning Yu
Donghua Liu
...
Yifan Shen
Peilin Ji
Juanjuan Li
Gang Wang
Wanpeng Ma
AI4CE
LM&Ro
LLMAG
99
9
0
01 Feb 2024
CroissantLLM: A Truly Bilingual French-English Language Model
Manuel Faysse
Patrick Fernandes
Nuno M. Guerreiro
António Loison
Duarte M. Alves
...
François Yvon
André F.T. Martins
Gautier Viaud
C´eline Hudelot
Pierre Colombo
184
37
0
01 Feb 2024
Large Scale Generative AI Text Applied to Sports and Music
Aaron Baughman
Stephen Hammer
Rahul Agarwal
Gozde Akay
Eduardo Morales
Tony Johnson
Leonid Karlinsky
Rogerio Feris
48
3
0
31 Jan 2024
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Parth Sarthi
Salman Abdullah
Aditi Tuli
Shubh Khanna
Anna Goldie
Christopher D. Manning
RALM
125
148
0
31 Jan 2024
Desiderata for the Context Use of Question Answering Systems
Sagi Shaier
Lawrence E Hunter
Katharina von der Wense
131
5
0
31 Jan 2024
SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language models
Xiao Shao
Weifu Jiang
Fei Zuo
Mengqing Liu
LLMAG
97
7
0
31 Jan 2024
Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning
Tinghui Zhu
Kai Zhang
Jian Xie
Yu-Chuan Su
LRM
106
17
0
31 Jan 2024
Rethinking Interpretability in the Era of Large Language Models
Chandan Singh
J. Inala
Michel Galley
Rich Caruana
Jianfeng Gao
LRM
AI4CE
136
72
0
30 Jan 2024
Transfer Learning for Text Diffusion Models
Kehang Han
Kathleen Kenealy
Aditya Barua
Noah Fiedel
Noah Constant
VLM
AI4CE
117
4
0
30 Jan 2024
Large Language Model Evaluation via Matrix Entropy
Lai Wei
Zhiquan Tan
Chenghai Li
Jindong Wang
Weiran Huang
77
5
0
30 Jan 2024
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis
Zecheng Tang
Chenfei Wu
Zekai Zhang
Mingheng Ni
Sheng-Siang Yin
...
Zhengyuan Yang
Lijuan Wang
Zicheng Liu
Juntao Li
Nan Duan
93
13
0
30 Jan 2024
Towards Unified Interactive Visual Grounding in The Wild
Jie Xu
Hanbo Zhang
Qingyi Si
Yifeng Li
Xuguang Lan
Tao Kong
LM&Ro
66
5
0
30 Jan 2024
When Large Language Models Meet Vector Databases: A Survey
Zhi Jing
Yongye Su
Yikun Han
Bo Yuan
Haiyun Xu
Chunjiang Liu
Kehai Chen
Min Zhang
152
38
0
30 Jan 2024
Leveraging Professional Radiologists' Expertise to Enhance LLMs' Evaluation for Radiology Reports
Qingqing Zhu
Preslav Nakov
Qiao Jin
Benjamin Hou
T. Mathai
Pritam Mukherjee
Xin Gao
Ronald M. Summers
Zhiyong Lu
LM&MA
86
6
0
29 Jan 2024
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
Xiao-wen Dong
Pan Zhang
Yuhang Zang
Yuhang Cao
Bin Wang
...
Conghui He
Xingcheng Zhang
Yu Qiao
Dahua Lin
Jiaqi Wang
VLM
MLLM
178
268
0
29 Jan 2024
Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF
Banghua Zhu
Michael I. Jordan
Jiantao Jiao
86
33
0
29 Jan 2024
LLaMandement: Large Language Models for Summarization of French Legislative Proposals
Joseph Gesnouin
Yannis Tannier
Christophe Gomes Da Silva
Hatim Tapory
Camille Brier
...
Emmanuel Cortes
Pierre-Etienne Devineau
Ulrich Tan
Esther Mac Namara
Su Yang
AILaw
98
8
0
29 Jan 2024
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Fuzhao Xue
Zian Zheng
Yao Fu
Jinjie Ni
Zangwei Zheng
Wangchunshu Zhou
Yang You
MoE
112
104
0
29 Jan 2024
Previous
1
2
3
...
36
37
38
...
85
86
87
Next