2204.02311
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Papers citing
"PaLM: Scaling Language Modeling with Pathways"
50 / 4,245 papers shown
Title
Improved Baselines for Data-efficient Perceptual Augmentation of LLMs
Théophane Vallaeys
Mustafa Shukor
Matthieu Cord
Jakob Verbeek
59
12
0
20 Mar 2024
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Wenqiao Zhang
Tianwei Lin
Jiang Liu
Fangxun Shu
Haoyuan Li
...
Zheqi Lv
Hao Jiang
Juncheng Li
Siliang Tang
Yueting Zhuang
VLM
MLLM
41
4
0
20 Mar 2024
Clinical information extraction for Low-resource languages with Few-shot learning using Pre-trained language models and Prompting
Phillip Richter-Pechanski
Philipp Wiesenbach
Dominic M. Schwab
Christina Kiriakou
Nicolas Geis
Christoph Dieterich
Anette Frank
37
4
0
20 Mar 2024
Community Needs and Assets: A Computational Analysis of Community Conversations
Towhid Chowdhury
Naveen Sharma
Ashiqur R. KhudaBukhsh
36
0
0
20 Mar 2024
A Study of Vulnerability Repair in JavaScript Programs with Large Language Models
Tan Khang Le
Saba Alimadadi
Steven Y. Ko
56
5
0
19 Mar 2024
HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
Fucai Ke
Zhixi Cai
Simindokht Jahangard
Weiqing Wang
P. D. Haghighi
Hamid Rezatofighi
LRM
53
10
0
19 Mar 2024
Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs
Md Ashiqur Rahman
Robert Joseph George
Mogab Elleithy
Daniel Leibovici
Zong-Yi Li
...
Julius Berner
Raymond A. Yeh
Jean Kossaifi
Kamyar Azizzadenesheli
A. Anandkumar
AI4CE
54
21
0
19 Mar 2024
CrossTune: Black-Box Few-Shot Classification with Label Enhancement
Danqing Luo
Chen Zhang
Yan Zhang
Haizhou Li
32
2
0
19 Mar 2024
An Empirical Study of Speech Language Models for Prompt-Conditioned Speech Synthesis
Yifan Peng
Ilia Kulikov
Yilin Yang
Sravya Popuri
Hui Lu
Changhan Wang
Hongyu Gong
38
1
0
19 Mar 2024
RankPrompt: Step-by-Step Comparisons Make Language Models Better Reasoners
Chi Hu
Yuan Ge
Xiangnan Ma
Hang Cao
Qiang Li
Yonghua Yang
Tong Xiao
Jingbo Zhu
ReLM
ELM
LRM
ALM
45
9
0
19 Mar 2024
Leveraging Large Language Models to Detect npm Malicious Packages
Nusrat Zahan
Philipp Burckhardt
Mikola Lysenko
Feross Aboukhadijeh
Laurie A. Williams
45
2
0
18 Mar 2024
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control
Enshen Zhou
Yiran Qin
Zhen-fei Yin
Yuzhou Huang
Ruimao Zhang
Lu Sheng
Yu Qiao
Jing Shao
LM&Ro
AI4CE
50
34
0
18 Mar 2024
EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
Abhaysinh Zala
Jaemin Cho
Han Lin
Jaehong Yoon
Mohit Bansal
46
13
0
18 Mar 2024
CICLe: Conformal In-Context Learning for Largescale Multi-Class Food Risk Classification
Korbinian Randl
John Pavlopoulos
Aron Henriksson
Tony Lindgren
44
3
0
18 Mar 2024
Agent3D-Zero: An Agent for Zero-shot 3D Understanding
Sha Zhang
Di Huang
Jiajun Deng
Shixiang Tang
Wanli Ouyang
Tong He
Yanyong Zhang
VGen
46
14
0
18 Mar 2024
Towards Understanding the Relationship between In-context Learning and Compositional Generalization
Sungjun Han
Sebastian Padó
CoGe
34
2
0
18 Mar 2024
Embedded Named Entity Recognition using Probing Classifiers
Nicholas Popovic
Michael Färber
45
1
0
18 Mar 2024
Linguacodus: A Synergistic Framework for Transformative Code Generation in Machine Learning Pipelines
Ekaterina Trofimova
Emil Sataev
Andrey E. Ustyuzhanin
42
0
0
18 Mar 2024
A Novel Paradigm Boosting Translation Capabilities of Large Language Models
Jiaxin Guo
Hao Yang
Zongyao Li
Daimeng Wei
Hengchao Shang
Xiaoyu Chen
47
7
0
18 Mar 2024
SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant
Guohao Sun
Can Qin
Jiamian Wang
Zeyuan Chen
Ran Xu
Zhiqiang Tao
MLLM
VLM
LRM
37
9
0
17 Mar 2024
Decoding Continuous Character-based Language from Non-invasive Brain Recordings
Cenyuan Zhang
Xiaoqing Zheng
Ruicheng Yin
Shujie Geng
Jianhan Xu
...
Changze Lv
Zixuan Ling
Xuanjing Huang
Miao Cao
Jianfeng Feng
46
0
0
17 Mar 2024
SelfIE: Self-Interpretation of Large Language Model Embeddings
Haozhe Chen
Carl Vondrick
Chengzhi Mao
27
20
0
16 Mar 2024
Discovering Latent Themes in Social Media Messaging: A Machine-in-the-Loop Approach Integrating LLMs
Tunazzina Islam
Dan Goldwasser
67
5
0
15 Mar 2024
EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models
Rocktim Jyoti Das
Simeon Emilov Hristov
Haonan Li
Dimitar Iliyanov Dimitrov
Ivan Koychev
Preslav Nakov
CoGe
ELM
77
14
0
15 Mar 2024
TriSum: Learning Summarization Ability from Large Language Models with Structured Rationale
Pengcheng Jiang
Cao Xiao
Zifeng Wang
Parminder Bhatia
Jimeng Sun
Jiawei Han
LRM
32
10
0
15 Mar 2024
Read between the lines -- Functionality Extraction From READMEs
Prince Kumar
Srikanth G. Tamilselvam
Dinesh Garg
27
0
0
15 Mar 2024
The Whole is Better than the Sum: Using Aggregated Demonstrations in In-Context Learning for Sequential Recommendation
Lei Wang
Ee-Peng Lim
RALM
27
5
0
15 Mar 2024
DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models
Weihang Su
Yichen Tang
Qingyao Ai
Zhijing Wu
Yiqun Liu
3DV
RALM
AI4TS
SyDa
61
19
0
15 Mar 2024
ExeGPT: Constraint-Aware Resource Scheduling for LLM Inference
Hyungjun Oh
Kihong Kim
Jaemin Kim
Sungkyun Kim
Junyeol Lee
Du-Seong Chang
Jiwon Seo
41
28
0
15 Mar 2024
Don't Half-listen: Capturing Key-part Information in Continual Instruction Tuning
Yongquan He
Xuancheng Huang
Peng Zhang
CLL
ALM
75
5
0
15 Mar 2024
Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models
Chaoqun Liu
Wenxuan Zhang
Yiran Zhao
Anh Tuan Luu
Lidong Bing
LRM
43
10
0
15 Mar 2024
Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training
Yanlai Yang
Matt Jones
Michael C. Mozer
Mengye Ren
70
1
0
14 Mar 2024
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Brandon McKinzie
Zhe Gan
J. Fauconnier
Sam Dodge
Bowen Zhang
...
Zirui Wang
Ruoming Pang
Peter Grasch
Alexander Toshev
Yinfei Yang
MLLM
43
189
0
14 Mar 2024
OpenGraph: Open-Vocabulary Hierarchical 3D Graph Representation in Large-Scale Outdoor Environments
Yinan Deng
Jiahui Wang
Jingyu Zhao
Xinyu Tian
Guangyan Chen
Yi Yang
Yufeng Yue
3DV
40
13
0
14 Mar 2024
GiT: Towards Generalist Vision Transformer through Universal Language Interface
Haiyang Wang
Hao Tang
Li Jiang
Shaoshuai Shi
Muhammad Ferjad Naeem
Hongsheng Li
Bernt Schiele
Liwei Wang
VLM
51
10
0
14 Mar 2024
BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences
Sun Ao
Weilin Zhao
Xu Han
Cheng Yang
Zhiyuan Liu
Chuan Shi
Maosong Sun
GNN
40
8
0
14 Mar 2024
Unveiling the Generalization Power of Fine-Tuned Large Language Models
Haoran Yang
Yumeng Zhang
Jiaqi Xu
Hongyuan Lu
Pheng Ann Heng
Wai Lam
50
30
0
14 Mar 2024
USimAgent: Large Language Models for Simulating Search Users
Erhan Zhang
Xingzhu Wang
Peiyuan Gong
Yankai Lin
Jiaxin Mao
LLMAG
38
17
0
14 Mar 2024
UniCode: Learning a Unified Codebook for Multimodal Large Language Models
Sipeng Zheng
Bohan Zhou
Yicheng Feng
Ye Wang
Zongqing Lu
VLM
MLLM
46
7
0
14 Mar 2024
Bugs in Large Language Models Generated Code: An Empirical Study
Florian Tambon
Arghavan Moradi Dakhel
Amin Nikanjam
Foutse Khomh
Michel C. Desmarais
G. Antoniol
ELM
42
34
0
13 Mar 2024
Detecting Hallucination and Coverage Errors in Retrieval Augmented Generation for Controversial Topics
Tyler A. Chang
Katrin Tomanek
Jessica Hoffmann
Nithum Thain
Erin van Liemt
Kathleen Meier-Hellstern
Lucas Dixon
46
7
0
13 Mar 2024
Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization
Renjie Pi
Tianyang Han
Wei Xiong
Jipeng Zhang
Runtao Liu
Rui Pan
Tong Zhang
MLLM
55
34
0
13 Mar 2024
Bifurcated Attention: Accelerating Massively Parallel Decoding with Shared Prefixes in LLMs
Ben Athiwaratkun
Sujan Kumar Gonugondla
Sanjay Krishna Gouda
Haifeng Qian
Hantian Ding
...
Liangfu Chen
Parminder Bhatia
Ramesh Nallapati
Sudipta Sengupta
Bing Xiang
59
4
0
13 Mar 2024
MedInsight: A Multi-Source Context Augmentation Framework for Generating Patient-Centric Medical Responses using Large Language Models
Subash Neupane
Shaswata Mitra
Sudip Mittal
Noorbakhsh Amiri Golilarz
Shahram Rahimi
Amin Amirlatifi
70
3
0
13 Mar 2024
Language models scale reliably with over-training and on downstream tasks
S. Gadre
Georgios Smyrnis
Vaishaal Shankar
Suchin Gururangan
Mitchell Wortsman
...
Y. Carmon
Achal Dave
Reinhard Heckel
Niklas Muennighoff
Ludwig Schmidt
ALM
ELM
LRM
108
42
0
13 Mar 2024
From human experts to machines: An LLM supported approach to ontology and knowledge graph construction
Vamsi Krishna Kommineni
B. König-Ries
Sheeba Samuel
41
31
0
13 Mar 2024
Knowledge Conflicts for LLMs: A Survey
Rongwu Xu
Zehan Qi
Zhijiang Guo
Cunxiang Wang
Hongru Wang
Yue Zhang
Wei Xu
208
96
0
13 Mar 2024
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models
Ning Ding
Yulin Chen
Ganqu Cui
Xingtai Lv
Weilin Zhao
Ruobing Xie
Bowen Zhou
Zhiyuan Liu
Maosong Sun
ALM
MoMe
AI4CE
43
7
0
13 Mar 2024
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation
Minbin Huang
Yanxin Long
Xinchi Deng
Ruihang Chu
Jiangfeng Xiong
Xiaodan Liang
Hong Cheng
Qinglin Lu
Wei Liu
MLLM
EGVM
65
8
0
13 Mar 2024
Mechanics of Next Token Prediction with Self-Attention
Yingcong Li
Yixiao Huang
M. E. Ildiz
A. S. Rawat
Samet Oymak
42
28
0
12 Mar 2024