Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.02311
Cited By
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PaLM: Scaling Language Modeling with Pathways"
50 / 4,246 papers shown
Title
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation
Tong Wu
Guandao Yang
Zhibing Li
Kai Zhang
Ziwei Liu
Leonidas J. Guibas
Dahua Lin
Gordon Wetzstein
EGVM
VGen
35
89
0
08 Jan 2024
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Maciej Pióro
Kamil Ciebiera
Krystian Król
Jan Ludziejewski
Michał Krutul
Jakub Krajewski
Szymon Antoniak
Piotr Miłoś
Marek Cygan
Sebastian Jaszczur
MoE
Mamba
28
55
0
08 Jan 2024
Enhanced Automated Code Vulnerability Repair using Large Language Models
David de-Fitero-Dominguez
Eva García-López
Antonio Garcia-Cabot
J. Martínez-Herráiz
24
12
0
08 Jan 2024
ExTraCT -- Explainable Trajectory Corrections from language inputs using Textual description of features
J-Anne Yow
N. P. Garg
Manoj Ramanathan
Wei Tech Ang
38
5
0
08 Jan 2024
InFoBench: Evaluating Instruction Following Ability in Large Language Models
Yiwei Qin
Kaiqiang Song
Yebowen Hu
Wenlin Yao
Sangwoo Cho
Xiaoyang Wang
Xuansheng Wu
Fei Liu
Pengfei Liu
Dong Yu
ELM
36
42
0
07 Jan 2024
Denoising Vision Transformers
Jiawei Yang
Katie Z Luo
Jie Li
Kilian Q. Weinberger
Yonglong Tian
Yue Wang
DiffM
35
13
0
05 Jan 2024
Can Large Language Models Understand Molecules?
Seyedeh Shaghayegh Sadeghi
Alan Bui
Ali Forooghi
Jianguo Lu
A. Ngom
AI4CE
26
9
0
05 Jan 2024
MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
Renjie Pi
Tianyang Han
Jianshu Zhang
Yueqi Xie
Rui Pan
Qing Lian
Hanze Dong
Jipeng Zhang
Tong Zhang
AAML
36
61
0
05 Jan 2024
XUAT-Copilot: Multi-Agent Collaborative System for Automated User Acceptance Testing with Large Language Model
Zhitao Wang
Wei Wang
Zirao Li
Long Wang
Can Yi
Xinjie Xu
Luyang Cao
Hanjing Su
Shouzhi Chen
Jun Zhou
ALM
LLMAG
37
8
0
05 Jan 2024
AST-T5: Structure-Aware Pretraining for Code Generation and Understanding
Linyuan Gong
Mostafa Elhoushi
Alvin Cheung
39
14
0
05 Jan 2024
LLaMA Pro: Progressive LLaMA with Block Expansion
Chengyue Wu
Yukang Gan
Yixiao Ge
Zeyu Lu
Jiahao Wang
Ye Feng
Ying Shan
Ping Luo
CLL
37
61
0
04 Jan 2024
TinyLlama: An Open-Source Small Language Model
Peiyuan Zhang
Guangtao Zeng
Tianduo Wang
Wei Lu
ALM
LRM
72
364
0
04 Jan 2024
Data-Centric Foundation Models in Computational Healthcare: A Survey
Yunkun Zhang
Jin Gao
Zheling Tan
Lingfeng Zhou
Kexin Ding
Mu Zhou
Shaoting Zhang
Dequan Wang
AI4CE
56
22
0
04 Jan 2024
Re-evaluating the Memory-balanced Pipeline Parallelism: BPipe
Mincong Huang
Chao Wang
Chi Ma
Yineng Zhang
Peng Zhang
Lei Yu
33
1
0
04 Jan 2024
Understanding LLMs: A Comprehensive Overview from Training to Inference
Yi-Hsueh Liu
Haoyang He
Tianle Han
Xu-Yao Zhang
Mengyuan Liu
...
Xintao Hu
Tuo Zhang
Ning Qiang
Tianming Liu
Bao Ge
SyDa
45
66
0
04 Jan 2024
Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives
Wenqi Zhang
Yongliang Shen
Linjuan Wu
Qiuying Peng
Jun Wang
Yueting Zhuang
Weiming Lu
LRM
LLMAG
50
53
0
04 Jan 2024
Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers
Aleksandar Stanić
Sergi Caelles
Michael Tschannen
LRM
VLM
27
9
0
03 Jan 2024
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity
Andrew Lee
Xiaoyan Bai
Itamar Pres
Martin Wattenberg
Jonathan K. Kummerfeld
Rada Mihalcea
77
104
0
03 Jan 2024
Navigating Uncertainty: Optimizing API Dependency for Hallucination Reduction in Closed-Book Question Answering
Pierre Erbacher
Louis Falissard
Vincent Guigue
Laure Soulier
HILM
RALM
32
4
0
03 Jan 2024
Few-shot Adaptation of Multi-modal Foundation Models: A Survey
Fan Liu
Tianshu Zhang
Wenwen Dai
Wenwen Cai
Wenwen Cai Xiaocong Zhou
Delong Chen
VLM
OffRL
36
25
0
03 Jan 2024
GOAT-Bench: Safety Insights to Large Multimodal Models through Meme-Based Social Abuse
Hongzhan Lin
Ziyang Luo
Bo Wang
Ruichao Yang
Jing Ma
50
25
0
03 Jan 2024
BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving
Tao Tang
Dafeng Wei
Zhengyu Jia
Tian Gao
Changwei Cai
...
Yixing Zhao
Fu Liu
Xiaodan Liang
Xianpeng Lang
Yang Wang
41
7
0
02 Jan 2024
Taking the Next Step with Generative Artificial Intelligence: The Transformative Role of Multimodal Large Language Models in Science Education
Arne Bewersdorff
Christian Hartmann
Marie Hornberger
Kathrin Seßler
Maria Bannert
Enkelejda Kasneci
Gjergji Kasneci
Xiaoming Zhai
Claudia Nerdel
41
30
0
01 Jan 2024
Fine-tuning and Utilization Methods of Domain-specific LLMs
CheonSu Jeong
29
45
0
01 Jan 2024
DocLLM: A layout-aware generative language model for multimodal document understanding
Dongsheng Wang
Natraj Raman
Mathieu Sibue
Zhiqiang Ma
Petr Babkin
Simerjot Kaur
Yulong Pei
Armineh Nourbakhsh
Xiaomo Liu
VLM
27
54
0
31 Dec 2023
State of What Art? A Call for Multi-Prompt LLM Evaluation
Moran Mizrahi
Guy Kaplan
Daniel Malkin
Rotem Dror
Dafna Shahaf
Gabriel Stanovsky
ELM
60
129
0
31 Dec 2023
HSC-GPT: A Large Language Model for Human Settlements Construction
Ran Chen
Xueqi Yao
Xuhui Jiang
Zhengqi Han
Jingze Guo
...
Chumin Liu
Jing Zhao
Zeke Lian
Jingjing Zhang
Keke Li
36
1
0
31 Dec 2023
keqing: knowledge-based question answering is a nature chain-of-thought mentor of LLM
Chaojie Wang
Yishi Xu
Zhong Peng
Chenxi Zhang
Bo Chen
Xinrun Wang
Lei Feng
Bo An
81
18
0
31 Dec 2023
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit
Yao Wan
Yang He
Zhangqian Bi
Jianguo Zhang
Hongyu Zhang
Yulei Sui
Guandong Xu
Hai Jin
Philip S. Yu
53
21
0
30 Dec 2023
The Art of Defending: A Systematic Evaluation and Analysis of LLM Defense Strategies on Safety and Over-Defensiveness
Neeraj Varshney
Pavel Dolin
Agastya Seth
Chitta Baral
AAML
ELM
33
48
0
30 Dec 2023
Open-TI: Open Traffic Intelligence with Augmented Language Model
Longchao Da
Kuanru Liou
Tiejin Chen
Xuesong Zhou
Xiangyong Luo
Yezhou Yang
Hua Wei
54
23
0
30 Dec 2023
Olapa-MCoT: Enhancing the Chinese Mathematical Reasoning Capability of LLMs
Shaojie Zhu
Zhaobin Wang
Chengxiang Zhuo
Hui Lu
Bo Hu
Zang Li
LRM
35
0
0
29 Dec 2023
Enhancing Quantitative Reasoning Skills of Large Language Models through Dimension Perception
Yuncheng Huang
Qi He
Jiaqing Liang
Sihang Jiang
Yanghua Xiao
Yunwen Chen
LRM
70
2
0
29 Dec 2023
Differentially Private Low-Rank Adaptation of Large Language Model Using Federated Learning
Xiao-Yang Liu
Rongyi Zhu
Daochen Zha
Jiechao Gao
Shan Zhong
Matt White
Meikang Qiu
29
16
0
29 Dec 2023
MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining
Jacob P. Portes
Alex Trott
Sam Havens
Daniel King
Abhinav Venigalla
Moin Nadeem
Nikhil Sardana
D. Khudia
Jonathan Frankle
34
17
0
29 Dec 2023
SentinelLMs: Encrypted Input Adaptation and Fine-tuning of Language Models for Private and Secure Inference
Abhijit Mishra
Mingda Li
S. Deo
SILM
23
2
0
28 Dec 2023
Fast Inference of Mixture-of-Experts Language Models with Offloading
Artyom Eliseev
Denis Mazur
MoE
19
43
0
28 Dec 2023
Structured Packing in LLM Training Improves Long Context Utilization
Konrad Staniszewski
Szymon Tworkowski
Sebastian Jaszczur
Yu Zhao
Henryk Michalewski
Lukasz Kuciñski
Piotr Milo's
43
13
0
28 Dec 2023
DrugAssist: A Large Language Model for Molecule Optimization
Geyan Ye
Xibao Cai
Houtim Lai
Xing Wang
Junhong Huang
Longyue Wang
Wei Liu
Xian Zeng
66
26
0
28 Dec 2023
Prompt Expansion for Adaptive Text-to-Image Generation
Siddhartha Datta
Alexander Ku
Deepak Ramachandran
Peter Anderson
DiffM
52
9
0
27 Dec 2023
Rethinking Tabular Data Understanding with Large Language Models
Tianyang Liu
Fei Wang
Muhao Chen
ReLM
LMTD
LRM
37
14
0
27 Dec 2023
PanGu-
π
π
π
: Enhancing Language Model Architectures via Nonlinearity Compensation
Yunhe Wang
Hanting Chen
Yehui Tang
Tianyu Guo
Kai Han
...
Qinghua Xu
Qun Liu
Jun Yao
Chao Xu
Dacheng Tao
73
17
0
27 Dec 2023
Preference as Reward, Maximum Preference Optimization with Importance Sampling
Zaifan Jiang
Xing Huang
Chao Wei
36
2
0
27 Dec 2023
Supervised Knowledge Makes Large Language Models Better In-context Learners
Linyi Yang
Shuibai Zhang
Zhuohao Yu
Guangsheng Bao
Yidong Wang
...
Ruochen Xu
Weirong Ye
Xing Xie
Weizhu Chen
Yue Zhang
44
15
0
26 Dec 2023
ChartBench: A Benchmark for Complex Visual Reasoning in Charts
Zhengzhuo Xu
Sinan Du
Yiyan Qi
Chengjin Xu
Chun Yuan
Jian Guo
50
37
0
26 Dec 2023
Large Language Models are Not Stable Recommender Systems
Tianhui Ma
Yuan Cheng
Hengshu Zhu
Hui Xiong
37
13
0
25 Dec 2023
PersianLLaMA: Towards Building First Persian Large Language Model
Mohammad Amin Abbasi
A. Ghafouri
Mahdi Firouzmandi
Hassan Naderi
B. Minaei-Bidgoli
32
9
0
25 Dec 2023
EcomGPT-CT: Continual Pre-training of E-commerce Large Language Models with Semi-structured Data
Shirong Ma
Shen Huang
Shulin Huang
Xiaobin Wang
Yangning Li
Hai-Tao Zheng
Pengjun Xie
Fei Huang
Yong-jia Jiang
53
6
0
25 Dec 2023
IQAGPT: Image Quality Assessment with Vision-language and ChatGPT Models
Zhihao Chen
Bin Hu
Chuang Niu
Tao Chen
Yuxin Li
Hongming Shan
Ge Wang
LM&MA
MLLM
37
4
0
25 Dec 2023
Making Large Language Models A Better Foundation For Dense Retrieval
Chaofan Li
Zheng Liu
Shitao Xiao
Yingxia Shao
RALM
45
39
0
24 Dec 2023
Previous
1
2
3
...
37
38
39
...
83
84
85
Next