2204.02311
Cited By
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Papers citing
"PaLM: Scaling Language Modeling with Pathways"
50 / 4,332 papers shown
Title
AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language Models
Daqin Luo
Chengjian Feng
Yuxuan Nong
Yiqing Shen
61
6
0
01 Aug 2024
Intermittent Semi-working Mask: A New Masking Paradigm for LLMs
Mingcong Lu
Jiangcai Zhu
Wang Hao
Zheng Li
Shusheng Zhang
Kailai Shao
Chao Chen
Nan Li
Feng Wang
Xin Lu
67
0
0
01 Aug 2024
GalleryGPT: Analyzing Paintings with Large Multimodal Models
Yi Bin
Wenhao Shi
Yujuan Ding
Zhiqiang Hu
Zheng Wang
Yang Yang
See-Kiong Ng
H. Shen
MLLM
96
11
0
01 Aug 2024
What comes after transformers? -- A selective survey connecting ideas in deep learning
Johannes Schneider
AI4CE
125
2
0
01 Aug 2024
Memorization Capacity for Additive Fine-Tuning with Small ReLU Networks
Jy-yong Sohn
Dohyun Kwon
Seoyeon An
Kangwook Lee
111
0
0
01 Aug 2024
Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?
Richard Ren
Steven Basart
Adam Khoja
Alice Gatti
Long Phan
...
Alexander Pan
Gabriel Mukobi
Ryan H. Kim
Stephen Fitz
Dan Hendrycks
ELM
87
26
0
31 Jul 2024
TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities
Ming Zhang
Caishuang Huang
Yilong Wu
Shichun Liu
Huiyuan Zheng
...
Jun Zhao
Junjie Ye
Qi Zhang
Tao Gui
Xuanjing Huang
93
2
0
31 Jul 2024
Evaluating SAM2's Role in Camouflaged Object Detection: From SAM to SAM2
Lv Tang
Bo Li
VLM
74
7
0
31 Jul 2024
Data Contamination Report from the 2024 CONDA Shared Task
Oscar Sainz
Iker García-Ferrero
Alon Jacovi
Jonas Hanselle
Yanai Elazar
...
Yu-Min Tseng
Vishaal Udandarao
Zengzhi Wang
Ruijie Xu
Jinglin Yang
121
6
0
31 Jul 2024
KemenkeuGPT: Leveraging a Large Language Model on Indonesia's Government Financial Data and Regulations to Enhance Decision Making
Kuo Wang
Pingping Zhang
AIFin
42
2
0
31 Jul 2024
Autonomous Improvement of Instruction Following Skills via Foundation Models
Zhiyuan Zhou
P. Atreya
Abraham Lee
Homer Walke
Oier Mees
Sergey Levine
95
14
0
30 Jul 2024
Prompt2DeModel: Declarative Neuro-Symbolic Modeling with Natural Language
Hossein Rajaby Faghihi
Aliakbar Nafar
Andrzej Uszok
Hamid Karimian
Parisa Kordjamshidi
82
0
0
30 Jul 2024
AI-Assisted Generation of Difficult Math Questions
Vedant Shah
Dingli Yu
Kaifeng Lyu
Simon Park
Nan Rosemary Ke
...
Yoshua Bengio
Sanjeev Arora
Anirudh Goyal
123
18
0
30 Jul 2024
Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
Ekaterina Iakovleva
Fabio Pizzati
Philip Torr
Stéphane Lathuiliere
DiffM
96
0
0
29 Jul 2024
Efficient Training of Large Language Models on Distributed Infrastructures: A Survey
Jiangfei Duan
Shuo Zhang
Zerui Wang
Lijuan Jiang
Wenwen Qu
...
Dahua Lin
Yonggang Wen
Xin Jin
Tianwei Zhang
Peng Sun
159
13
0
29 Jul 2024
Urban Safety Perception Assessments via Integrating Multimodal Large Language Models with Street View Images
Jiaxin Zhang
Yunqin Li
Tomohiro Fukuda
Bowen Wang
76
1
0
29 Jul 2024
Logic Distillation: Learning from Code Function by Function for Planning and Decision-making
Dong Chen
Shilin Zhang
Fei Gao
Yueting Zhuang
Siliang Tang
Qidong Liu
Mingliang Xu
LRM
45
1
0
28 Jul 2024
NVC-1B: A Large Neural Video Coding Model
Xihua Sheng
Chuanbo Tang
Li Li
Dong Liu
Feng Wu
3DV
VLM
90
3
0
28 Jul 2024
Stochastic Parrots or ICU Experts? Large Language Models in Critical Care Medicine: A Scoping Review
Tongyue Shi
Jun Ma
Zihan Yu
Haowei Xu
Minqi Xiong
Meirong Xiao
Yilin Li
Huiying Zhao
Guilan Kong
76
2
0
27 Jul 2024
Optimizing Numerical Estimation and Operational Efficiency in the Legal Domain through Large Language Models
Jia-Hong Huang
Chao-Chun Yang
Yixian Shen
A. M. Pacces
Evangelos Kanoulas
ELM
AILaw
100
6
0
26 Jul 2024
The power of Prompts: Evaluating and Mitigating Gender Bias in MT with LLMs
Aleix Sant
Carlos Escolano
Audrey Mash
Francesca de Luca Fornaciari
Maite Melero
75
6
0
26 Jul 2024
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Tianduo Wang
Shichen Li
Wei Lu
LRM
AI4CE
88
20
1
25 Jul 2024
Difficulty Estimation and Simplification of French Text Using LLMs
Henri Jamet
Yash Raj Shrestha
Michalis Vlachos
68
2
0
25 Jul 2024
Ontology of Belief Diversity: A Community-Based Epistemological Approach
Tyler Fischella
Erin van Liemt
Qiuyi Zhang
51
0
0
25 Jul 2024
Know Your Limits: A Survey of Abstention in Large Language Models
Bingbing Wen
Jihan Yao
Shangbin Feng
Chenjun Xu
Yulia Tsvetkov
Bill Howe
Lucy Lu Wang
131
5
0
25 Jul 2024
u-μP: The Unit-Scaled Maximal Update Parametrization
Charlie Blake
C. Eichenberg
Josef Dean
Lukas Balles
Luke Y. Prince
Bjorn Deiseroth
Andres Felipe Cruz Salinas
Carlo Luschi
Samuel Weinbach
Douglas Orr
134
10
0
24 Jul 2024
VILA^2: VILA Augmented VILA
Yunhao Fang
Ligeng Zhu
Yao Lu
Yan Wang
Pavlo Molchanov
Jang Hyun Cho
Marco Pavone
Song Han
Hongxu Yin
VLM
90
10
0
24 Jul 2024
Exploring Domain Robust Lightweight Reward Models based on Router Mechanism
Hyuk Namgoong
Jeesu Jung
Sangkeun Jung
Yoonhyung Roh
74
1
0
24 Jul 2024
Bailicai: A Domain-Optimized Retrieval-Augmented Generation Framework for Medical Applications
Cui Long
Yongbin Liu
Chunping Ouyang
Ying Yu
88
5
0
24 Jul 2024
Course-Correction: Safety Alignment Using Synthetic Preferences
Rongwu Xu
Yishuo Cai
Zhenhong Zhou
Renjie Gu
Haiqin Weng
Yan Liu
Tianwei Zhang
Wei Xu
Han Qiu
76
7
0
23 Jul 2024
Enhancing LLM's Cognition via Structurization
Kai-Chun Liu
Zhihang Fu
Chao Chen
Wei Zhang
Rongxin Jiang
Fan Zhou
Yao-Shen Chen
Yue-bo Wu
Jieping Ye
83
1
0
23 Jul 2024
Exploring the Effectiveness and Consistency of Task Selection in Intermediate-Task Transfer Learning
Pin-Jie Lin
Miaoran Zhang
Marius Mosbach
Dietrich Klakow
44
0
0
23 Jul 2024
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget
Vikash Sehwag
Xianghao Kong
Jingtao Li
Michael Spranger
Lingjuan Lyu
DiffM
90
11
0
22 Jul 2024
Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability
Zhuoyan Xu
Zhenmei Shi
Yingyu Liang
CoGe
LRM
81
38
0
22 Jul 2024
Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning
Xiangyang Qu
Jing Yu
Keke Gai
Jiamin Zhuang
Yuanmin Tang
Gang Xiong
Gaopeng Gou
Qi Wu
102
3
0
22 Jul 2024
Building Machines that Learn and Think with People
Katherine M. Collins
Ilia Sucholutsky
Umang Bhatt
Kartik Chandra
Lionel Wong
...
Mark K. Ho
Vikash K. Mansinghka
Adrian Weller
Joshua B. Tenenbaum
Thomas Griffiths
138
39
0
22 Jul 2024
MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory for Long Sequences Training
Cheng Luo
Jiawei Zhao
Zhuoming Chen
Beidi Chen
A. Anandkumar
99
4
0
22 Jul 2024
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models
Amir Mohammad Karimi Mamaghan
Samuele Papa
Karl Henrik Johansson
Stefan Bauer
Andrea Dittadi
OCL
180
9
0
22 Jul 2024
VideoGameBunny: Towards vision assistants for video games
Mohammad Reza Taesiri
Cor-Paul Bezemer
VLM
MLLM
81
2
0
21 Jul 2024
Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives
D. Hagos
Rick Battle
Danda B. Rawat
LM&MA
OffRL
116
28
0
20 Jul 2024
Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks
Jiayu Lin
Guanrong Chen
Bojun Jin
Chenyang Li
Shutong Jia
...
R. Xu
Long Zhang
Jiuxin Cao
Ting Jin
Zhongyu Wei
84
1
0
20 Jul 2024
Adversarial Databases Improve Success in Retrieval-based Large Language Models
Sean Wu
Michael Koo
Li Yo Kao
Andy Black
L. Blum
Fabien Scalzo
Ira Kurtz
RALM
53
0
0
19 Jul 2024
Foundation Models for Autonomous Robots in Unstructured Environments
Hossein Naderi
Alireza Shojaei
Lifu Huang
LM&Ro
99
0
0
19 Jul 2024
LAPIS: Language Model-Augmented Police Investigation System
Heedou Kim
Dain Kim
Jiwoo Lee
Chanwoong Yoon
Donghee Choi
Mogan Gim
Jaewoo Kang
RALM
75
2
0
19 Jul 2024
Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text Generation: A State-of-the-Art Investigation
Joy Mahapatra
Utpal Garain
92
10
0
19 Jul 2024
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by Direct Preference Optimization
Md Sultan al Nahian
R. Kavuluru
MedIm
AI4CE
61
0
0
19 Jul 2024
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
Chaofan Tao
Qian Liu
Longxu Dou
Niklas Muennighoff
Zhongwei Wan
Ping Luo
Min Lin
Ngai Wong
PILM
132
54
0
18 Jul 2024
Reconstruct the Pruned Model without Any Retraining
Pingjie Wang
Ziqing Fan
Shengchao Hu
Zhe Chen
Yanfeng Wang
Yu Wang
91
2
0
18 Jul 2024
LLM-Empowered State Representation for Reinforcement Learning
Boyuan Wang
Yun Qu
Yuhang Jiang
Jianzhun Shao
Chang-rui Liu
Wenming Yang
Xiangyang Ji
91
14
0
18 Jul 2024
Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems
Tamer Abuelsaad
Deepak Akkil
Prasenjit Dey
Ashish Jagmohan
Aditya Vempaty
Ravi Kokku
100
28
0
17 Jul 2024