Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.10403
Cited By
PaLM 2 Technical Report
17 May 2023
Rohan Anil
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
Alexandre Passos
Siamak Shakeri
Emanuel Taropa
Paige Bailey
Z. Chen
Eric Chu
J. Clark
Laurent El Shafey
Yanping Huang
Kathy Meier-Hellstern
Gaurav Mishra
Erica Moreira
Mark Omernick
Kevin Robinson
Sebastian Ruder
Yi Tay
Kefan Xiao
Yuanzhong Xu
Yujing Zhang
Gustavo Hernández Ábrego
Junwhan Ahn
Jacob Austin
P. Barham
Jan A. Botha
James Bradbury
Siddhartha Brahma
K. Brooks
Michele Catasta
Yongzhou Cheng
Colin Cherry
Christopher A. Choquette-Choo
Aakanksha Chowdhery
Clément Crepy
Shachi Dave
Mostafa Dehghani
Sunipa Dev
Jacob Devlin
Mark Díaz
Nan Du
Ethan Dyer
Vladimir Feinberg
Fan Feng
Vlad Fienber
Markus Freitag
Xavier Garcia
Sebastian Gehrmann
Lucas González
Guy Gur-Ari
Steven Hand
Hadi Hashemi
Le Hou
Joshua Howland
A. Hu
Jeffrey Hui
Jeremy Hurwitz
Michael Isard
Abe Ittycheriah
Matthew Jagielski
W. Jia
Kathleen Kenealy
M. Krikun
Sneha Kudugunta
Chang Lan
Katherine Lee
Benjamin Lee
Eric Li
Mu-Li Li
Wei Li
Yaguang Li
Jun Yu Li
Hyeontaek Lim
Han Lin
Zhong-Zhong Liu
Frederick Liu
Marcello Maggioni
Aroma Mahendru
Joshua Maynez
Vedant Misra
Maysam Moussalem
Zachary Nado
John Nham
Eric Ni
A. Nystrom
Alicia Parrish
Marie Pellat
M. Polacek
Oleksandr Polozov
Reiner Pope
Siyuan Qiao
Emily Reif
Bryan Richter
Parker Riley
Alex Castro-Ros
Aurko Roy
Brennan Saeta
Rajkumar Samuel
Renee Shelby
Ambrose Slone
D. Smilkov
David R. So
Daniela Sohn
Simon Tokumine
Dasha Valter
Vijay Vasudevan
Kiran Vodrahalli
Xuezhi Wang
Pidong Wang
Zirui Wang
Tao Wang
John Wieting
Yuhuai Wu
Ke Xu
Yunhan Xu
L. Xue
Pengcheng Yin
Jiahui Yu
Qiaoling Zhang
Steven Zheng
Ce Zheng
Wei Zhou
Denny Zhou
Slav Petrov
Yonghui Wu
ReLM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PaLM 2 Technical Report"
50 / 845 papers shown
Title
LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History
Akash Gupta
Ivaxi Sheth
Vyas Raina
Mark J. F. Gales
Mario Fritz
43
4
0
28 Feb 2024
Lemur: Log Parsing with Entropy Sampling and Chain-of-Thought Merging
Wei Zhang
Jian Yang
Anjie Le
Zehan Li
Shuangyong Song
Xianfu Cheng
Tieqiao Zheng
Shi Xu
64
14
0
28 Feb 2024
Towards Optimal Learning of Language Models
Yuxian Gu
Li Dong
Y. Hao
Qingxiu Dong
Minlie Huang
Furu Wei
36
7
0
27 Feb 2024
Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation Models
Yunpeng Huang
Yaonan Gu
Jingwei Xu
Zhihong Zhu
Zhaorun Chen
Xiaoxing Ma
40
3
0
27 Feb 2024
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Biao Zhang
Zhongtao Liu
Colin Cherry
Orhan Firat
LRM
63
126
0
27 Feb 2024
Predicting Sustainable Development Goals Using Course Descriptions -- from LLMs to Conventional Foundation Models
Lev Kharlashkin
Melany Macias
Leo Huovinen
Mika Hämäläinen
29
2
0
26 Feb 2024
GraphWiz: An Instruction-Following Language Model for Graph Problems
Nuo Chen
Yuhan Li
Jianheng Tang
Jia Li
43
28
0
25 Feb 2024
Chimera: A Lossless Decoding Method for Accelerating Large Language Models Inference by Fusing all Tokens
Ziqian Zeng
Jiahong Yu
Qianshi Pang
Zihao Wang
Huiping Zhuang
Cen Chen
Xiaofeng Zou
38
4
0
24 Feb 2024
ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition
Lu Ye
Ze Tao
Yong Huang
Yang Li
34
26
0
23 Feb 2024
Biomedical Entity Linking as Multiple Choice Question Answering
Zhenxi Lin
Ziheng Zhang
Xian Wu
Yefeng Zheng
33
2
0
23 Feb 2024
Linear Transformers are Versatile In-Context Learners
Max Vladymyrov
J. Oswald
Mark Sandler
Rong Ge
34
13
0
21 Feb 2024
Towards Building Multilingual Language Model for Medicine
Pengcheng Qiu
Chaoyi Wu
Xiaoman Zhang
Weixiong Lin
Haicheng Wang
Ya-Qin Zhang
Yanfeng Wang
Weidi Xie
LM&MA
ELM
33
65
0
21 Feb 2024
Preserving Near-Optimal Gradient Sparsification Cost for Scalable Distributed Deep Learning
Daegun Yoon
Sangyoon Oh
43
0
0
21 Feb 2024
User-LLM: Efficient LLM Contextualization with User Embeddings
Lin Ning
Luyang Liu
Jiaxing Wu
Neo Wu
D. Berlowitz
Sushant Prakash
Bradley Green
S. O’Banion
Jun Xie
55
33
0
21 Feb 2024
Towards audio language modeling -- an overview
Haibin Wu
Xuanjun Chen
Yi-Cheng Lin
Kai-Wei Chang
Ho-Lam Chung
Alexander H. Liu
Hung-yi Lee
AuLLM
35
28
0
20 Feb 2024
VideoPrism: A Foundational Visual Encoder for Video Understanding
Long Zhao
N. B. Gundavarapu
Liangzhe Yuan
Hao Zhou
Shen Yan
...
Huisheng Wang
Hartwig Adam
Mikhail Sirotenko
Ting Liu
Boqing Gong
VGen
43
29
0
20 Feb 2024
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Zhiyuan Li
Hong Liu
Denny Zhou
Tengyu Ma
LRM
AI4CE
28
96
0
20 Feb 2024
Talk Through It: End User Directed Manipulation Learning
Carl Winge
Adam Imdieke
Bahaa Aldeeb
Dongyeop Kang
Karthik Desingh
LM&Ro
46
1
0
19 Feb 2024
Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability
Xue-Qing Qian
Yu Wang
Simian Luo
Yinda Zhang
Ying Tai
...
Xiangyang Xue
Bo Zhao
Tiejun Huang
Yunsheng Wu
Yanwei Fu
29
6
0
19 Feb 2024
Amplifying Training Data Exposure through Fine-Tuning with Pseudo-Labeled Memberships
Myung Gyo Oh
Hong Eun Ahn
L. Park
T.-H. Kwon
MIALM
AAML
34
0
0
19 Feb 2024
InMD-X: Large Language Models for Internal Medicine Doctors
Hansle Gwon
Imjin Ahn
Hyoje Jung
Byeolhee Kim
Young-Hak Kim
Tae Joon Jun
LM&MA
25
1
0
19 Feb 2024
An Empirical Evaluation of LLMs for Solving Offensive Security Challenges
Minghao Shao
Boyuan Chen
Sofija Jancheska
Brendan Dolan-Gavitt
Siddharth Garg
Ramesh Karri
Muhammad Shafique
30
25
0
19 Feb 2024
Utilizing BERT for Information Retrieval: Survey, Applications, Resources, and Challenges
Jiajia Wang
Jimmy X. Huang
Xinhui Tu
Junmei Wang
Angela J. Huang
Md Tahmid Rahman Laskar
Amran Bhuiyan
34
28
0
18 Feb 2024
Efficient Multimodal Learning from Data-centric Perspective
Muyang He
Yexin Liu
Boya Wu
Jianhao Yuan
Yueze Wang
Tiejun Huang
Bo-Lu Zhao
MLLM
38
84
0
18 Feb 2024
AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition
Zhaorun Chen
Zhuokai Zhao
Zhihong Zhu
Ruiqi Zhang
Xiang Li
Bhiksha Raj
Huaxiu Yao
LRM
25
25
0
18 Feb 2024
Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Jacky Liang
Fei Xia
Wenhao Yu
Andy Zeng
Montse Gonzalez Arenas
...
N. Heess
Kanishka Rao
Nik Stewart
Jie Tan
Carolina Parada
LM&Ro
61
34
0
18 Feb 2024
PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter
Junfei Xiao
Zheng Xu
Alan L. Yuille
Shen Yan
Boyu Wang
33
3
0
16 Feb 2024
Chain-of-Thought Reasoning Without Prompting
Xuezhi Wang
Denny Zhou
ReLM
LRM
152
101
0
15 Feb 2024
SwissNYF: Tool Grounded LLM Agents for Black Box Setting
Somnath Sendhil Kumar
Dhruv Jain
Eshaan Agarwal
Raunak Pandey
LLMAG
29
0
0
15 Feb 2024
Chain-of-Planned-Behaviour Workflow Elicits Few-Shot Mobility Generation in LLMs
Chenyang Shao
Fengli Xu
Bingbing Fan
Jingtao Ding
Yuan Yuan
Meng Wang
Yong Li
LRM
24
6
0
15 Feb 2024
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
Kuang-Huei Lee
Xinyun Chen
Hiroki Furuta
John F. Canny
Ian S. Fischer
RALM
55
29
0
15 Feb 2024
Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference
Harry Dong
Xinyu Yang
Zhenyu (Allen) Zhang
Zhangyang Wang
Yuejie Chi
Beidi Chen
32
49
0
14 Feb 2024
Instruction Backdoor Attacks Against Customized LLMs
Rui Zhang
Hongwei Li
Rui Wen
Wenbo Jiang
Yuan Zhang
Michael Backes
Yun Shen
Yang Zhang
AAML
SILM
30
24
0
14 Feb 2024
Premise Order Matters in Reasoning with Large Language Models
Xinyun Chen
Ryan A. Chi
Xuezhi Wang
Denny Zhou
ReLM
LRM
44
26
0
14 Feb 2024
Learning How To Ask: Cycle-Consistency Refines Prompts in Multimodal Foundation Models
Maurice Diesendruck
Jianzhe Lin
Shima Imani
Gayathri Mahalingam
Mingyang Xu
Jie Zhao
17
1
0
13 Feb 2024
Grounding LLMs For Robot Task Planning Using Closed-loop State Feedback
V. Bhat
Ali Umut Kaypak
Prashanth Krishnamurthy
Ramesh Karri
Farshad Khorrami
LM&Ro
66
13
0
13 Feb 2024
Unsupervised Evaluation of Code LLMs with Round-Trip Correctness
Miltiadis Allamanis
Sheena Panthaplackel
Pengcheng Yin
ALM
OffRL
LRM
43
9
0
13 Feb 2024
BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models
Haotian Sun
Yuchen Zhuang
Wei Wei
Chao Zhang
Bo Dai
18
3
0
13 Feb 2024
PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models
Wei Zou
Runpeng Geng
Binghui Wang
Jinyuan Jia
SILM
39
45
1
12 Feb 2024
Natural Language Reinforcement Learning
Xidong Feng
Bo Liu
Mengyue Yang
Ziyan Wang
Girish A. Koushiks
Yali Du
Ying Wen
Jun Wang
OffRL
35
3
0
11 Feb 2024
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Huaiyuan Ying
Shuo Zhang
Linyang Li
Zhejian Zhou
Yunfan Shao
...
Hang Yan
Xipeng Qiu
Jiayu Wang
Kai-xiang Chen
Dahua Lin
ReLM
LRM
34
70
0
09 Feb 2024
FL-NAS: Towards Fairness of NAS for Resource Constrained Devices via Large Language Models
Ruiyang Qin
Yuting Hu
Zheyu Yan
Jinjun Xiong
Ahmed Abbasi
Yiyu Shi
32
5
0
09 Feb 2024
Large Language Models: A Survey
Shervin Minaee
Tomáš Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM
LM&MA
ELM
134
371
0
09 Feb 2024
Let Your Graph Do the Talking: Encoding Structured Data for LLMs
Bryan Perozzi
Bahare Fatemi
Dustin Zelle
Anton Tsitsulin
Mehran Kazemi
Rami Al-Rfou
Jonathan J. Halcrow
GNN
39
55
0
08 Feb 2024
Generalized Preference Optimization: A Unified Approach to Offline Alignment
Yunhao Tang
Z. Guo
Zeyu Zheng
Daniele Calandriello
Rémi Munos
Mark Rowland
Pierre Harvey Richemond
Michal Valko
Bernardo Avila-Pires
Bilal Piot
32
88
0
08 Feb 2024
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents
Yuxi Wei
Zi Wang
Yifan Lu
Chenxin Xu
Chang-rui Liu
Hao Zhao
Siheng Chen
Yanfeng Wang
VGen
65
59
0
08 Feb 2024
Real-World Robot Applications of Foundation Models: A Review
Kento Kawaharazuka
T. Matsushima
Andrew Gambardella
Jiaxian Guo
Chris Paxton
Andy Zeng
OffRL
VLM
LM&Ro
51
45
0
08 Feb 2024
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
Chen Chen
Ruizhe Li
Yuchen Hu
Sabato Marco Siniscalchi
Pin-Yu Chen
Ensiong Chng
Chao-Han Huck Yang
33
19
0
08 Feb 2024
In-Context Principle Learning from Mistakes
Tianjun Zhang
Aman Madaan
Luyu Gao
Steven Zheng
Swaroop Mishra
Yiming Yang
Niket Tandon
Uri Alon
KELM
ReLM
33
23
0
08 Feb 2024
Do Large Code Models Understand Programming Concepts? Counterfactual Analysis for Code Predicates
Ashish Hooda
Mihai Christodorescu
Miltos Allamanis
Aaron Wilson
Kassem Fawaz
Somesh Jha
ELM
38
5
0
08 Feb 2024
Previous
1
2
3
...
7
8
9
...
15
16
17
Next