Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.02311
Cited By
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PaLM: Scaling Language Modeling with Pathways"
50 / 4,245 papers shown
Title
Reconstruct the Pruned Model without Any Retraining
Pingjie Wang
Ziqing Fan
Shengchao Hu
Zhe Chen
Yanfeng Wang
Yu Wang
53
1
0
18 Jul 2024
LLM-Empowered State Representation for Reinforcement Learning
Boyuan Wang
Yun Qu
Yuhang Jiang
Jianzhun Shao
Chang-rui Liu
Wenming Yang
Xiangyang Ji
45
7
0
18 Jul 2024
Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems
Tamer Abuelsaad
Deepak Akkil
Prasenjit Dey
Ashish Jagmohan
Aditya Vempaty
Ravi Kokku
53
23
0
17 Jul 2024
Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions
Jinsung Yoon
Raj Sinha
Sercan O. Arik
Tomas Pfister
24
1
0
17 Jul 2024
EchoSight: Advancing Visual-Language Models with Wiki Knowledge
Yibin Yan
Weidi Xie
RALM
37
10
0
17 Jul 2024
Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning
Yanting Miao
William Loh
Suraj Kothawade
Pascal Poupart
Abdullah Rashwan
Yeqing Li
EGVM
55
1
0
16 Jul 2024
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces
Zehan Wang
Ziang Zhang
Hang Zhang
Luping Liu
Rongjie Huang
Xize Cheng
Hengshuang Zhao
Zhou Zhao
51
9
0
16 Jul 2024
XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach
Truong Thanh Hung Nguyen
Phuc Truong Loc Nguyen
Hung Cao
32
3
0
16 Jul 2024
RobotKeyframing: Learning Locomotion with High-Level Objectives via Mixture of Dense and Sparse Rewards
Fatemeh Zargarbashi
Jin Cheng
Dongho Kang
Robert Sumner
Stelian Coros
118
8
0
16 Jul 2024
The Oscars of AI Theater: A Survey on Role-Playing with Language Models
Nuo Chen
Yan Wang
Yang Deng
Jia Li
42
16
0
16 Jul 2024
Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights
Shunqi Mao
Chaoyi Zhang
Hang Su
Hwanjun Song
Igor Shalyminov
Weidong Cai
46
1
0
16 Jul 2024
Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
Jinrui Zhang
Teng Wang
Haigang Zhang
Ping Lu
Feng Zheng
MLLM
LRM
VLM
54
3
0
16 Jul 2024
Performance Evaluation of Lightweight Open-source Large Language Models in Pediatric Consultations: A Comparative Analysis
Qiuhong Wei
Ying Cui
Mengwei Ding
Yanqin Wang
Lingling Xiang
Zhengxiong Yao
Ceran Chen
Ying Long
Zhezhen Jin
Ximing Xu
ELM
LM&MA
AI4MH
49
0
0
16 Jul 2024
Evaluating Model Bias Requires Characterizing its Mistakes
Isabela Albuquerque
Jessica Schrouff
David Warde-Farley
Ali Taylan Cemgil
Sven Gowal
Olivia Wiles
58
2
0
15 Jul 2024
DeepGate3: Towards Scalable Circuit Representation Learning
Zhengyuan Shi
Ziyang Zheng
Sadaf Khan
Qiang Xu
Min Li
Qiang Xu
GNN
AI4CE
51
9
0
15 Jul 2024
CodeV: Empowering LLMs with HDL Generation through Multi-Level Summarization
Yang Zhao
Di Huang
Chongxiao Li
Pengwei Jin
Muxin Song
...
Rui Zhang
Xingui Hu
Yunji Chen
Qi Guo
Xing Hu
79
23
0
15 Jul 2024
MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs
Quang H. Nguyen
Duy C. Hoang
Juliette Decugis
Saurav Manchanda
Nitesh Chawla
Khoa D. Doan
Khoa D. Doan
52
8
0
15 Jul 2024
Representation Learning and Identity Adversarial Training for Facial Behavior Understanding
Mang Ning
A. A. Salah
Itir Onal Ertugrul
CVBM
87
4
0
15 Jul 2024
Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model
Xunyu Zhu
Jian Li
Can Ma
Weiping Wang
LRM
44
0
0
14 Jul 2024
Look Within, Why LLMs Hallucinate: A Causal Perspective
He Li
Haoang Chi
Mingyu Liu
Wenjing Yang
LRM
44
5
0
14 Jul 2024
Cross-Lingual Multi-Hop Knowledge Editing
Aditi Khandelwal
Harman Singh
Hengrui Gu
Tianlong Chen
Kaixiong Zhou
KELM
47
0
0
14 Jul 2024
Affordance-Guided Reinforcement Learning via Visual Prompting
Olivia Y. Lee
Annie Xie
Kuan Fang
Karl Pertsch
Chelsea Finn
OffRL
LM&Ro
76
9
0
14 Jul 2024
VLMPC: Vision-Language Model Predictive Control for Robotic Manipulation
Wentao Zhao
Jiaming Chen
Ziyu Meng
Donghui Mao
Ran Song
Wei Zhang
48
8
0
13 Jul 2024
MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts
Zhenpeng Su
Zijia Lin
Xue Bai
Xing Wu
Yizhe Xiong
...
Guangyuan Ma
Hui Chen
Guiguang Ding
Wei Zhou
Songlin Hu
MoE
38
5
0
13 Jul 2024
Open (Clinical) LLMs are Sensitive to Instruction Phrasings
Alberto Mario Ceballos Arroyo
Monica Munnangi
Jiuding Sun
Karen Y.C. Zhang
Denis Jered McInerney
Byron C. Wallace
Silvio Amir
LM&MA
26
8
0
12 Jul 2024
Mitigating Entity-Level Hallucination in Large Language Models
Weihang Su
Yichen Tang
Qingyao Ai
Changyue Wang
Zhijing Wu
Yiqun Liu
HILM
47
7
0
12 Jul 2024
Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees
Alexia Jolicoeur-Martineau
A. Baratin
Kisoo Kwon
Boris Knyazev
Yan Zhang
45
1
0
12 Jul 2024
A Survey on Symbolic Knowledge Distillation of Large Language Models
Kamal Acharya
Alvaro Velasquez
Haoze Song
SyDa
46
5
0
12 Jul 2024
Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing
Jun Zhu
Zihao Du
Haotian Xu
Fengbo Lan
Zilong Zheng
Bo Ma
Shengjie Wang
Tao Zhang
41
4
0
12 Jul 2024
Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs
Aobo Kong
Shiwan Zhao
Hao Chen
Qicheng Li
Yong Qin
Ruiqi Sun
Xin Zhou
Jiaming Zhou
Haoqin Sun
65
8
0
12 Jul 2024
IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
Shrestha Mohanty
Negar Arabzadeh
Andrea Tupini
Yuxuan Sun
Alexey Skrynnik
Artem Zholus
Marc-Alexandre Côté
Julia Kiseleva
48
0
0
12 Jul 2024
Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding with ChildPlay
Gonçalo Hora de Carvalho
Oscar Knap
R. Pollice
ReLM
ELM
LRM
39
1
0
12 Jul 2024
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
Wenshuo Peng
Kaipeng Zhang
Yue Yang
Hao Zhang
Ping Luo
VLM
34
2
0
11 Jul 2024
SEED-Story: Multimodal Long Story Generation with Large Language Model
Shuai Yang
Yuying Ge
Yang Li
Yukang Chen
Yixiao Ge
Ying Shan
Yingcong Chen
VGen
DiffM
83
27
0
11 Jul 2024
Converging Paradigms: The Synergy of Symbolic and Connectionist AI in LLM-Empowered Autonomous Agents
Haoyi Xiong
Zhiyuan Wang
Xuhong Li
Jiang Bian
Zeke Xie
Shahid Mumtaz
Laura E. Barnes
LLMAG
41
7
0
11 Jul 2024
FLAIR: Feeding via Long-horizon AcquIsition of Realistic dishes
Rajat Kumar Jenamani
Priya Sundaresan
Maram Sakr
Tapomayukh Bhattacharjee
Dorsa Sadigh
41
10
0
10 Jul 2024
CiteME: Can Language Models Accurately Cite Scientific Claims?
Ori Press
Andreas Hochlehnert
Ameya Prabhu
Vishaal Udandarao
Ofir Press
Matthias Bethge
52
13
0
10 Jul 2024
Interpretable Differential Diagnosis with Dual-Inference Large Language Models
Shuang Zhou
Sirui Ding
Jiashuo Wang
Mingquan Lin
Genevieve B. Melton
Rui Zhang
LM&MA
38
2
0
10 Jul 2024
Deconstructing What Makes a Good Optimizer for Language Models
Rosie Zhao
Depen Morwani
David Brandfonbrener
Nikhil Vyas
Sham Kakade
55
17
0
10 Jul 2024
Training on the Test Task Confounds Evaluation and Emergence
Ricardo Dominguez-Olmedo
Florian E. Dorner
Moritz Hardt
ELM
71
7
1
10 Jul 2024
Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators
Wentao Zhang
Junliang Guo
Tianyu He
Li Zhao
Linli Xu
Jiang Bian
47
3
0
10 Jul 2024
Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models
Jupinder Parmar
Sanjev Satheesh
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
56
29
0
09 Jul 2024
FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation
Liqun Ma
Mingjie Sun
Zhiqiang Shen
31
7
0
09 Jul 2024
Entropy Law: The Story Behind Data Compression and LLM Performance
Mingjia Yin
Chuhan Wu
Yufei Wang
Hao Wang
Wei Guo
Yasheng Wang
Yong Liu
Ruiming Tang
Defu Lian
Enhong Chen
44
19
0
09 Jul 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
45
43
0
09 Jul 2024
Prompting Techniques for Secure Code Generation: A Systematic Investigation
Catherine Tony
Nicolás E. Díaz Ferreyra
Markus Mutas
Salem Dhiff
Riccardo Scandariato
SILM
82
9
0
09 Jul 2024
AI-driven multi-omics integration for multi-scale predictive modeling of causal genotype-environment-phenotype relationships
You Wu
Lei Xie
AI4CE
28
10
0
08 Jul 2024
Data, Data Everywhere: A Guide for Pretraining Dataset Construction
Jupinder Parmar
Shrimai Prabhumoye
Joseph Jennings
Bo Liu
Aastha Jhunjhunwala
Zhilin Wang
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
55
6
0
08 Jul 2024
Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models
Jinliang Lu
Ziliang Pang
Min Xiao
Yaochen Zhu
Rui Xia
Jiajun Zhang
MoMe
59
18
0
08 Jul 2024
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages
Yinquan Lu
Wenhao Zhu
Lei Li
Yu Qiao
Fei Yuan
46
25
0
08 Jul 2024
Previous
1
2
3
...
16
17
18
...
83
84
85
Next