Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.02311
Cited By
v1
v2
v3
v4
v5 (latest)
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"PaLM: Scaling Language Modeling with Pathways"
50 / 4,332 papers shown
Title
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve
Amey Agrawal
Nitin Kedia
Ashish Panwar
Jayashree Mohan
Nipun Kwatra
Bhargav S. Gulavani
Alexey Tumanov
Ramachandran Ramjee
106
187
0
04 Mar 2024
To Generate or to Retrieve? On the Effectiveness of Artificial Contexts for Medical Open-Domain Question Answering
Giacomo Frisoni
Alessio Cocchieri
Alex Presepi
Gianluca Moro
Zaiqiao Meng
RALM
MedIm
118
17
0
04 Mar 2024
Fostering the Ecosystem of Open Neural Encoders for Portuguese with Albertina PT* Family
Rodrigo Santos
João Rodrigues
Luís Gomes
Joao Silva
António Branco
Henrique Lopes Cardoso
T. Osório
Bernardo Leite
82
8
0
04 Mar 2024
Fine Tuning vs. Retrieval Augmented Generation for Less Popular Knowledge
Heydar Soudani
Evangelos Kanoulas
Faegheh Hasibi
81
39
0
03 Mar 2024
VBART: The Turkish LLM
Meliksah Turker
Mehmet Erdi Ari
Aydin Han
VLM
55
4
0
02 Mar 2024
Accelerating Greedy Coordinate Gradient via Probe Sampling
Yiran Zhao
Wenyue Zheng
Tianle Cai
Xuan Long Do
Kenji Kawaguchi
Anirudh Goyal
Michael Shieh
98
2
0
02 Mar 2024
Predictions from language models for multiple-choice tasks are not robust under variation of scoring methods
Polina Tsvilodub
Hening Wang
Sharon Grosch
Michael Franke
84
9
0
01 Mar 2024
Never-Ending Behavior-Cloning Agent for Robotic Manipulation
Wenqi Liang
Gan Sun
Qian He
Yu Ren
Jiahua Dong
Yang Cong
LM&Ro
90
1
0
01 Mar 2024
Gender Bias in Large Language Models across Multiple Languages
Jinman Zhao
Yitian Ding
Chen Jia
Yining Wang
Zifan Qian
76
32
0
01 Mar 2024
LLM-Ensemble: Optimal Large Language Model Ensemble Method for E-commerce Product Attribute Value Extraction
Chenhao Fang
Xiaohan Li
Zezhong Fan
Jianpeng Xu
Kaushiki Nag
Evren Körpeoglu
Sushant Kumar
Kannan Achan
75
43
0
29 Feb 2024
Query-OPT: Optimizing Inference of Large Language Models via Multi-Query Instructions in Meeting Summarization
Md Tahmid Rahman Laskar
Elena Khasanova
Xue-Yong Fu
Cheng-Hsiung Chen
TN ShashiBhushan
68
2
0
29 Feb 2024
OpenMedLM: Prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models
Jenish Maharjan
A. Garikipati
N. Singh
Leo Cyrus
Mayank Sharma
M. Ciobanu
G. Barnes
R. Thapa
Q. Mao
R. Das
LM&MA
ELM
86
31
0
29 Feb 2024
Exploring the Potential of Large Language Models for Improving Digital Forensic Investigation Efficiency
Akila Wickramasekara
Frank Breitinger
Mark Scanlon
162
10
0
29 Feb 2024
WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Jiantao Qiu
Haijun Lv
Zhenjiang Jin
Rui Wang
Wenchang Ning
...
Zhongying Tu
Lin Dahua
Yu Qiao
Hang Yan
Conghui He
87
7
0
29 Feb 2024
RiNALMo: General-Purpose RNA Language Models Can Generalize Well on Structure Prediction Tasks
Rafael Josip Penić
Tin Vlasic
Roland G. Huber
Yue Wan
M. Šikić
AI4CE
61
35
0
29 Feb 2024
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model
Hao-Ran Cheng
Erjia Xiao
Jindong Gu
Le Yang
Jinhao Duan
Jize Zhang
Jiahang Cao
Kaidi Xu
Renjing Xu
109
9
0
29 Feb 2024
Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models
Xin Li
Yunfei Wu
Xinghua Jiang
Zhihao Guo
Ming Gong
Haoyu Cao
Yinsong Liu
Deqiang Jiang
Xing Sun
VLM
112
18
0
29 Feb 2024
AdaMergeX: Cross-Lingual Transfer with Large Language Models via Adaptive Adapter Merging
Yiran Zhao
Wenxuan Zhang
Huiming Wang
Kenji Kawaguchi
Lidong Bing
MoMe
104
23
0
29 Feb 2024
How do Large Language Models Handle Multilingualism?
Yiran Zhao
Wenxuan Zhang
Guizhen Chen
Kenji Kawaguchi
Lidong Bing
LRM
108
81
0
29 Feb 2024
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning
Xupeng Miao
Gabriele Oliaro
Xinhao Cheng
Vineeth Kada
Ruohan Gao
...
April Yang
Yingcheng Wang
Mengdi Wu
Colin Unger
Zhihao Jia
MoE
195
11
0
29 Feb 2024
Learning to Compress Prompt in Natural Language Formats
Yu-Neng Chuang
Tianwei Xing
Chia-Yuan Chang
Zirui Liu
Xun Chen
Helen Zhou
89
20
0
28 Feb 2024
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
Haoxiang Wang
Yong Lin
Wei Xiong
Rui Yang
Shizhe Diao
Shuang Qiu
Han Zhao
Tong Zhang
147
89
0
28 Feb 2024
A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist
Wentao Zhang
Lingxuan Zhao
Haochong Xia
Shuo Sun
Jiaze Sun
...
Yilei Zhao
Xinyu Cai
Longtao Zheng
Xinrun Wang
Bo An
AIFin
117
57
0
28 Feb 2024
Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge in English-Centric Large Language Models
Ercong Nie
Shuzhou Yuan
Bolei Ma
Helmut Schmid
Michael Farber
Frauke Kreuter
Hinrich Schütze
ReLM
127
7
0
28 Feb 2024
Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the Key?
Qineng Wang
Zihao Wang
Ying Su
Hanghang Tong
Yangqiu Song
LLMAG
LRM
126
79
0
28 Feb 2024
Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners
Shanu Vashishtha
Abhinav Prakash
Lalitesh Morishetti
Kaushiki Nag
Yokila Arora
Sushant Kumar
Kannan Achan
DiffM
56
5
0
28 Feb 2024
No Token Left Behind: Reliable KV Cache Compression via Importance-Aware Mixed Precision Quantization
J. Yang
Byeongwook Kim
Jeongin Bae
Beomseok Kwon
Gunho Park
Eunho Yang
S. Kwon
Dongsoo Lee
MQ
186
53
0
28 Feb 2024
Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension
Fan Yin
Jayanth Srinivasa
Kai-Wei Chang
HILM
119
26
0
28 Feb 2024
Do Large Language Models Mirror Cognitive Language Processing?
Yuqi Ren
Renren Jin
Tongxuan Zhang
Deyi Xiong
156
6
0
28 Feb 2024
All in an Aggregated Image for In-Image Learning
Lei Wang
Wanyu Xu
Zhiqiang Hu
Yihuai Lan
Shan Dong
Hao Wang
Roy Ka-wei Lee
Ee-Peng Lim
VLM
101
1
0
28 Feb 2024
ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training
Le Zhuo
Zewen Chi
Minghao Xu
Heyan Huang
Heqi Zheng
Conghui He
Xian-Ling Mao
Wentao Zhang
184
13
0
28 Feb 2024
Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction
Koki Maeda
Shuhei Kurita
Taiki Miyanishi
Naoaki Okazaki
61
2
0
28 Feb 2024
Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions
Hanjie Chen
Zhouxiang Fang
Yash Singla
Mark Dredze
ELM
AI4MH
145
43
0
28 Feb 2024
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
Vincent Fortuin
301
22
0
28 Feb 2024
Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems
Zhenting Qi
Hanlin Zhang
Eric Xing
Sham Kakade
Hima Lakkaraju
SILM
133
25
0
27 Feb 2024
Towards Optimal Learning of Language Models
Yuxian Gu
Li Dong
Y. Hao
Qingxiu Dong
Minlie Huang
Furu Wei
106
7
0
27 Feb 2024
Case-Based or Rule-Based: How Do Transformers Do the Math?
Yi Hu
Xiaojuan Tang
Haotong Yang
Muhan Zhang
LRM
114
25
0
27 Feb 2024
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
Wenqi Zhang
Ke Tang
Hai Wu
Mengna Wang
Yongliang Shen
Guiyang Hou
Zeqi Tan
Peng Li
Yueting Zhuang
Weiming Lu
LLMAG
105
48
0
27 Feb 2024
Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning
Zhaoxun Ju
Chao Yang
Hongbo Wang
Yu Qiao
Gang Hua
LM&Ro
141
4
0
27 Feb 2024
Determinants of LLM-assisted Decision-Making
Eva Eigner
Thorsten Händler
109
49
0
27 Feb 2024
KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark
Seongbo Jang
Seonghyeon Lee
Hwanjo Yu
ELM
78
0
0
27 Feb 2024
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning
Debrup Das
Debopriyo Banerjee
Somak Aditya
Ashish Kulkarni
ReLM
LRM
82
15
0
27 Feb 2024
Measuring Vision-Language STEM Skills of Neural Models
Jianhao Shen
Ye Yuan
Srbuhi Mirzoyan
Ming Zhang
Chenguang Wang
VLM
143
12
0
27 Feb 2024
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Biao Zhang
Zhongtao Liu
Colin Cherry
Orhan Firat
LRM
119
160
0
27 Feb 2024
Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models
Xinran Zhao
Hongming Zhang
Xiaoman Pan
Wenlin Yao
Dong Yu
Tongshuang Wu
Jianshu Chen
HILM
LRM
71
8
0
27 Feb 2024
Understanding Subjectivity through the Lens of Motivational Context in Model-Generated Image Satisfaction
Senjuti Dutta
Sherol Chen
Sunny Mak
Amnah Ahmad
Katherine M. Collins
Alena Butryna
Deepak Ramachandran
Krishnamurthy Dvijotham
Ellie Pavlick
Ravi Rajakumar
EGVM
64
1
0
27 Feb 2024
Re-Ex: Revising after Explanation Reduces the Factual Errors in LLM Responses
Juyeon Kim
Jeongeun Lee
Yoonho Chang
Chanyeol Choi
Junseong Kim
Jy-yong Sohn
KELM
LRM
170
2
0
27 Feb 2024
SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition
Shuangrui Ding
Zihan Liu
Xiao-wen Dong
Pan Zhang
Rui Qian
Junhao Huang
Conghui He
Jiaqi Wang
Jiaqi Wang
134
23
0
27 Feb 2024
Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding
Benjamin Bergner
Andrii Skliar
Amelie Royer
Tijmen Blankevoort
Yuki Markus Asano
B. Bejnordi
134
7
0
26 Feb 2024
Nemotron-4 15B Technical Report
Jupinder Parmar
Shrimai Prabhumoye
Pritam Gundecha
M. Patwary
Sandeep Subramanian
...
Ashwath Aithal
Oleksii Kuchaiev
Mohammad Shoeybi
Jonathan Cohen
Bryan Catanzaro
108
23
0
26 Feb 2024
Previous
1
2
3
...
32
33
34
...
85
86
87
Next