2204.02311
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Papers citing
"PaLM: Scaling Language Modeling with Pathways"
50 / 4,332 papers shown
Title
Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models
Abhishek Kumar
Sarfaroz Yunusov
Ali Emami
75
3
0
23 May 2024
Scalable Visual State Space Model with Fractal Scanning
Lv Tang
Haoke Xiao
Peng-Tao Jiang
Hao Zhang
Jinwei Chen
Yue Liu
Mamba
100
8
0
23 May 2024
Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration
Yang Zhang
Shixin Yang
Chenjia Bai
Fei Wu
Xiu Li
Zhen Wang
Xuelong Li
LLMAG
117
32
0
23 May 2024
How Do Transformers "Do" Physics? Investigating the Simple Harmonic Oscillator
Subhash Kantamneni
Ziming Liu
Max Tegmark
177
2
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
344
54
0
23 May 2024
ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous Vehicles
Jiawei Zhang
Chejian Xu
Yue Liu
118
48
0
22 May 2024
Image-of-Thought Prompting for Visual Reasoning Refinement in Multimodal Large Language Models
Qiji Zhou
Ruochen Zhou
Zike Hu
Panzhong Lu
Siyang Gao
Yue Zhang
LRM
106
17
0
22 May 2024
Dense Connector for MLLMs
Huanjin Yao
Wenhao Wu
Taojiannan Yang
Yuxin Song
Mengxi Zhang
Haocheng Feng
Yifan Sun
Zhiheng Li
Wanli Ouyang
Jingdong Wang
MLLM
VLM
102
25
0
22 May 2024
Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation
Cyril Chhun
Fabian M. Suchanek
Chloé Clavel
LRM
114
18
0
22 May 2024
A social path to human-like artificial intelligence
Edgar A. Duénez-Guzmán
Suzanne Sadedin
Jane X. Wang
Kevin R. McKee
Joel Z Leibo
GNN
100
30
0
22 May 2024
360Zhinao Technical Report
360Zhinao Team
67
0
0
22 May 2024
Learning Manipulation Skills through Robot Chain-of-Thought with Sparse Failure Guidance
Kaifeng Zhang
Zhao-Heng Yin
Weirui Ye
Yang Gao
159
4
0
22 May 2024
Adapting Multi-modal Large Language Model to Concept Drift From Pre-training Onwards
Xiaoyu Yang
Jie Lu
Enshui Yu
VLM
115
1
0
22 May 2024
Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models
Zhangyue Yin
Qiushi Sun
Qipeng Guo
Zhiyuan Zeng
Xiaonan Li
...
Qinyuan Cheng
Ding Wang
Xiaofeng Mou
Xipeng Qiu
XuanJing Huang
LRM
96
4
0
21 May 2024
Large Language Models Meet NLP: A Survey
Libo Qin
Qiguang Chen
Xiachong Feng
Yang Wu
Yongheng Zhang
Hai-Tao Zheng
Min Li
Wanxiang Che
Philip S. Yu
ALM
LM&MA
ELM
LRM
123
59
0
21 May 2024
FAdam: Adam is a natural gradient optimizer using diagonal empirical Fisher information
Dongseong Hwang
ODL
104
9
0
21 May 2024
A Survey of Deep Learning-based Radiology Report Generation Using Multimodal Data
Xinyi Wang
Grazziela Figueredo
Ruizhe Li
Wei Emma Zhang
Weitong Chen
Xin Chen
MedIm
ViT
124
2
0
21 May 2024
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving
Aniket Didolkar
Anirudh Goyal
Nan Rosemary Ke
Siyuan Guo
Michal Valko
Timothy Lillicrap
Danilo Jimenez Rezende
Yoshua Bengio
Michael C. Mozer
Sanjeev Arora
LRM
72
30
0
20 May 2024
Multiple-Choice Questions are Efficient and Robust LLM Evaluators
Ziyin Zhang
Zhaokun Jiang
Lizhen Xu
Hong-ping Hao
Rui Wang
100
19
0
20 May 2024
Data Contamination Calibration for Black-box LLMs
Wen-song Ye
Jiaqi Hu
Liyao Li
Haobo Wang
Gang Chen
Junbo Zhao
64
9
0
20 May 2024
CPS-LLM: Large Language Model based Safe Usage Plan Generator for Human-in-the-Loop Human-in-the-Plant Cyber-Physical System
Ayan Banerjee
Aranyak Maity
Payal Kamboj
Sandeep K. S. Gupta
34
5
0
19 May 2024
MapCoder: Multi-Agent Code Generation for Competitive Problem Solving
Md. Ashraful Islam
Mohammed Eunus Ali
Md. Rizwan Parvez
SyDa
117
69
0
18 May 2024
Automating PTSD Diagnostics in Clinical Interviews: Leveraging Large Language Models for Trauma Assessments
Sichang Tu
Abigail Powers
Natalie Merrill
Negar Fani
Sierra Carter
S. Doogan
Jinho D. Choi
LM&MA
59
2
0
18 May 2024
LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions
Chuanneng Sun
Songjun Huang
D. Pompili
LLMAG
125
35
0
17 May 2024
Open-Vocabulary Spatio-Temporal Action Detection
Tao Wu
Shuqiu Ge
Jie Qin
Gangshan Wu
Limin Wang
ObjD
77
7
0
17 May 2024
Lean Attention: Hardware-Aware Scalable Attention Mechanism for the Decode-Phase of Transformers
Rya Sanovar
Srikant Bharadwaj
Renée St. Amant
Victor Rühle
Saravan Rajmohan
167
7
0
17 May 2024
HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models
R. Sukthanker
Arber Zela
B. Staffler
Aaron Klein
Lennart Purucker
Jorg K. H. Franke
Frank Hutter
ELM
103
4
0
16 May 2024
A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks
Xuanfan Ni
Piji Li
ELM
LRM
69
9
0
16 May 2024
When Large Language Model Meets Optimization
Sen Huang
Kaixiang Yang
Sheng Qi
Rui Wang
104
14
0
16 May 2024
Crowdsourcing with Enhanced Data Quality Assurance: An Efficient Approach to Mitigate Resource Scarcity Challenges in Training Large Language Models for Healthcare
Prosanta Barai
Gondy Leroy
Prakash Bisht
Joshua M Rothman
Sumi Lee
Jennifer G. Andrews
Sydney A Rice
Arif Ahmed
66
3
0
16 May 2024
Leveraging Human Revisions for Improving Text-to-Layout Models
Amber Xie
Chin-Yi Cheng
Forrest Huang
Yang Li
73
1
0
16 May 2024
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Chameleon Team
MLLM
226
338
0
16 May 2024
IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Diji Yang
Jinmeng Rao
Kezhen Chen
Xiaoyuan Guo
Yawen Zhang
Jie Yang
Yi Zhang
LRM
RALM
115
20
0
15 May 2024
A Survey on Transformers in NLP with Focus on Efficiency
Wazib Ansar
Saptarsi Goswami
Amlan Chakrabarti
MedIm
100
2
0
15 May 2024
A Systematic Analysis on the Temporal Generalization of Language Models in Social Media
Asahi Ushio
Jose Camacho-Collados
50
0
0
15 May 2024
Towards Next-Generation Steganalysis: LLMs Unleash the Power of Detecting Steganography
Minhao Bai
Kaiyi Pang
Huili Wang
Yongfeng Huang
44
4
0
15 May 2024
LLM-Assisted Rule Based Machine Translation for Low/No-Resource Languages
J. Coleman
Bhaskar Krishnamachari
Khalil Iskarous
Ruben Rosales
76
9
0
14 May 2024
Contextual Emotion Recognition using Large Vision Language Models
Yasaman Etesam
Özge Nilay Yalçin
Chuxuan Zhang
Angelica Lim
VLM
138
4
0
14 May 2024
SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
Jonathan Roberts
Kai Han
N. Houlsby
Samuel Albanie
94
16
0
14 May 2024
Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory
Xueyan Niu
Bo Bai
Lei Deng
Wei Han
88
8
0
14 May 2024
QCRD: Quality-guided Contrastive Rationale Distillation for Large Language Models
Wei Wang
Zhaowei Li
Qi Xu
Yiqing Cai
Hang Song
Qi Qi
Ran Zhou
Zhida Huang
Tao Wang
Li Xiao
ALM
91
1
0
14 May 2024
Improving Transformers with Dynamically Composable Multi-Head Attention
Da Xiao
Qingye Meng
Shengping Li
Xingyuan Yuan
58
4
0
14 May 2024
SpeechVerse: A Large-scale Generalizable Audio Language Model
Nilaksh Das
Saket Dingliwal
S. Ronanki
Rohit Paturi
David Huang
...
Monica Sunkara
S. Srinivasan
Kyu J. Han
Katrin Kirchhoff
118
44
0
14 May 2024
Russian-Language Multimodal Dataset for Automatic Summarization of Scientific Papers
Alena Tsanda
E. Bruches
53
0
0
13 May 2024
OpenLLM-Ro -- Technical Report on Open-source Romanian LLMs
Mihai Masala
Denis C. Ilie-Ablachim
D. Corlatescu
Miruna Zavelca
Marius Leordeanu
Horia Velicu
Marius Popescu
Mihai Dascalu
Traian Rebedea
105
4
0
13 May 2024
MathDivide: Improved mathematical reasoning by large language models
S. Srivastava
Ashutosh Gandhi
LRM
ReLM
58
0
0
12 May 2024
MUD: Towards a Large-Scale and Noise-Filtered UI Dataset for Modern Style UI Modeling
Sidong Feng
Suyu Ma
Han Wang
David Kong
Chunyang Chen
107
11
0
11 May 2024
TacoERE: Cluster-aware Compression for Event Relation Extraction
Yong Guan
Xiaozhi Wang
Lei Hou
Juanzi Li
Jeff Z. Pan
Jiaoyan Chen
Freddy Lecue
61
2
0
11 May 2024
Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs
Harsh Patel
Buvaneswari A. Ramanan
Manzoor A. Khan
Thomas Williams
Brian D. Friedman
Lawrence Drabeck
ELM
52
0
0
10 May 2024
The Ghanaian NLP Landscape: A First Look
Sheriff Issaka
Zhaoyi Zhang
Mihir Heda
Keyi Wang
Yinka Ajibola
Ryan DeMar
Xuefeng Du
77
2
0
10 May 2024