Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.02311
Cited By
PaLM: Scaling Language Modeling with Pathways
5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PaLM: Scaling Language Modeling with Pathways"
50 / 4,245 papers shown
Title
NOLO: Navigate Only Look Once
Mengyu Bu
Shuhao Gu
Yang Feng
EgoV
61
1
0
02 Aug 2024
Semantic Skill Grounding for Embodied Instruction-Following in Cross-Domain Environments
Sangwoo Shin
Takehiro Matsuoka
Youngsoo Jang
Moontae Lee
Kazuya Yoshida
52
0
0
02 Aug 2024
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs
Leo Micklem
Yan-Bin Shen
Wenjing Luo
Yan Zhang
Hao Liang
...
Weipeng Chen
Bin Cui
Blair Thornton
Wentao Zhang
Zenan Zhou
ELM
84
16
0
02 Aug 2024
AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language Models
Daqin Luo
Chengjian Feng
Yuxuan Nong
Yiqing Shen
42
5
0
01 Aug 2024
Intermittent Semi-working Mask: A New Masking Paradigm for LLMs
Mingcong Lu
Jiangcai Zhu
Wang Hao
Zheng Li
Shusheng Zhang
Kailai Shao
Chao Chen
Nan Li
Feng Wang
Xin Lu
48
0
0
01 Aug 2024
GalleryGPT: Analyzing Paintings with Large Multimodal Models
Yi Bin
Wenhao Shi
Yujuan Ding
Zhiqiang Hu
Zheng Wang
Yang Yang
See-Kiong Ng
H. Shen
MLLM
43
11
0
01 Aug 2024
What comes after transformers? -- A selective survey connecting ideas in deep learning
Johannes Schneider
AI4CE
56
2
0
01 Aug 2024
Memorization Capacity for Additive Fine-Tuning with Small ReLU Networks
Jy-yong Sohn
Dohyun Kwon
Seoyeon An
Kangwook Lee
54
0
0
01 Aug 2024
Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?
Richard Ren
Steven Basart
Adam Khoja
Alice Gatti
Long Phan
...
Alexander Pan
Gabriel Mukobi
Ryan H. Kim
Stephen Fitz
Dan Hendrycks
ELM
31
22
0
31 Jul 2024
TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities
Ming Zhang
Caishuang Huang
Yilong Wu
Shichun Liu
Huiyuan Zheng
...
Jun Zhao
Junjie Ye
Qi Zhang
Tao Gui
Xuanjing Huang
62
2
0
31 Jul 2024
Evaluating SAM2's Role in Camouflaged Object Detection: From SAM to SAM2
Lv Tang
Bo Li
VLM
40
7
0
31 Jul 2024
Data Contamination Report from the 2024 CONDA Shared Task
Oscar Sainz
Iker García-Ferrero
Alon Jacovi
Jonas Hanselle
Yanai Elazar
...
Yu-Min Tseng
Vishaal Udandarao
Zengzhi Wang
Ruijie Xu
Jinglin Yang
53
5
0
31 Jul 2024
KemenkeuGPT: Leveraging a Large Language Model on Indonesia's Government Financial Data and Regulations to Enhance Decision Making
Kuo Wang
Pingping Zhang
AIFin
24
1
0
31 Jul 2024
Autonomous Improvement of Instruction Following Skills via Foundation Models
Zhiyuan Zhou
P. Atreya
Abraham Lee
Homer Walke
Oier Mees
Sergey Levine
37
11
0
30 Jul 2024
Prompt2DeModel: Declarative Neuro-Symbolic Modeling with Natural Language
Hossein Rajaby Faghihi
Aliakbar Nafar
Andrzej Uszok
Hamid Karimian
Parisa Kordjamshidi
39
0
0
30 Jul 2024
AI-Assisted Generation of Difficult Math Questions
Vedant Shah
Dingli Yu
Kaifeng Lyu
Simon Park
Nan Rosemary Ke
...
Yoshua Bengio
Sanjeev Arora
Anirudh Goyal
Sanjeev Arora
Anirudh Goyal
53
16
0
30 Jul 2024
Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
Ekaterina Iakovleva
Fabio Pizzati
Philip Torr
Stéphane Lathuiliere
DiffM
41
0
0
29 Jul 2024
Efficient Training of Large Language Models on Distributed Infrastructures: A Survey
Jiangfei Duan
Shuo Zhang
Zerui Wang
Lijuan Jiang
Wenwen Qu
...
Dahua Lin
Yonggang Wen
Xin Jin
Tianwei Zhang
Peng Sun
73
8
0
29 Jul 2024
Revolutionizing Urban Safety Perception Assessments: Integrating Multimodal Large Language Models with Street View Images
Jiaxin Zhanga
Yunqin Lia
Tomohiro Fukudab
Bowen Wang
43
1
0
29 Jul 2024
Logic Distillation: Learning from Code Function by Function for Planning and Decision-making
Dong Chen
Shilin Zhang
Fei Gao
Yueting Zhuang
Siliang Tang
Qidong Liu
Mingliang Xu
LRM
35
0
0
28 Jul 2024
NVC-1B: A Large Neural Video Coding Model
Xihua Sheng
Chuanbo Tang
Li Li
Dong Liu
Feng Wu
3DV
VLM
52
3
0
28 Jul 2024
Stochastic Parrots or ICU Experts? Large Language Models in Critical Care Medicine: A Scoping Review
Tongyue Shi
Jun Ma
Zihan Yu
Haowei Xu
Minqi Xiong
Meirong Xiao
Yilin Li
Huiying Zhao
Guilan Kong
55
1
0
27 Jul 2024
Optimizing Numerical Estimation and Operational Efficiency in the Legal Domain through Large Language Models
Jia-Hong Huang
Chao-Chun Yang
Yixian Shen
A. M. Pacces
Evangelos Kanoulas
ELM
AILaw
62
6
0
26 Jul 2024
The power of Prompts: Evaluating and Mitigating Gender Bias in MT with LLMs
Aleix Sant
Carlos Escolano
Audrey Mash
Francesca de Luca Fornaciari
Maite Melero
38
4
0
26 Jul 2024
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Tianduo Wang
Shichen Li
Wei Lu
LRM
AI4CE
53
14
1
25 Jul 2024
Difficulty Estimation and Simplification of French Text Using LLMs
Henri Jamet
Yash Raj Shrestha
Michalis Vlachos
35
2
0
25 Jul 2024
Ontology of Belief Diversity: A Community-Based Epistemological Approach
Tyler Fischella
Erin van Liemt
Qiuyi
Qiuyi Zhang
32
0
0
25 Jul 2024
Know Your Limits: A Survey of Abstention in Large Language Models
Bingbing Wen
Jihan Yao
Shangbin Feng
Chenjun Xu
Yulia Tsvetkov
Bill Howe
Lucy Lu Wang
61
11
0
25 Jul 2024
u-
μ
\mu
μ
P: The Unit-Scaled Maximal Update Parametrization
Charlie Blake
C. Eichenberg
Josef Dean
Lukas Balles
Luke Y. Prince
Bjorn Deiseroth
Andres Felipe Cruz Salinas
Carlo Luschi
Samuel Weinbach
Douglas Orr
61
10
0
24 Jul 2024
V
I
L
A
2
VILA^2
V
I
L
A
2
: VILA Augmented VILA
Yunhao Fang
Ligeng Zhu
Yao Lu
Yan Wang
Pavlo Molchanov
Jang Hyun Cho
Marco Pavone
Song Han
Hongxu Yin
VLM
49
7
0
24 Jul 2024
Exploring Domain Robust Lightweight Reward Models based on Router Mechanism
Hyuk Namgoong
Jeesu Jung
Sangkeun Jung
Yoonhyung Roh
49
1
0
24 Jul 2024
Bailicai: A Domain-Optimized Retrieval-Augmented Generation Framework for Medical Applications
Cui Long
Yongbin Liu
Chunping Ouyang
Ying Yu
49
4
0
24 Jul 2024
Course-Correction: Safety Alignment Using Synthetic Preferences
Rongwu Xu
Yishuo Cai
Zhenhong Zhou
Renjie Gu
Haiqin Weng
Yan Liu
Tianwei Zhang
Wei Xu
Han Qiu
42
4
0
23 Jul 2024
Enhancing LLM's Cognition via Structurization
Kai-Chun Liu
Zhihang Fu
Chao Chen
Wei Zhang
Rongxin Jiang
Fan Zhou
Yao-Shen Chen
Yue-bo Wu
Jieping Ye
65
1
0
23 Jul 2024
Exploring the Effectiveness and Consistency of Task Selection in Intermediate-Task Transfer Learning
Pin-Jie Lin
Miaoran Zhang
Marius Mosbach
Dietrich Klakow
38
0
0
23 Jul 2024
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget
Vikash Sehwag
Xianghao Kong
Jingtao Li
Michael Spranger
Lingjuan Lyu
DiffM
47
9
0
22 Jul 2024
Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability
Zhuoyan Xu
Zhenmei Shi
Yingyu Liang
CoGe
LRM
40
28
0
22 Jul 2024
Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning
Xiangyang Qu
Jing Yu
Keke Gai
Jiamin Zhuang
Yuanmin Tang
Gang Xiong
Gaopeng Gou
Qi Wu
54
3
0
22 Jul 2024
Building Machines that Learn and Think with People
Katherine M. Collins
Ilia Sucholutsky
Umang Bhatt
Kartik Chandra
Lionel Wong
...
Mark K. Ho
Vikash K. Mansinghka
Adrian Weller
Joshua B. Tenenbaum
Thomas Griffiths
59
30
0
22 Jul 2024
MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory for Long Sequences Training
Cheng Luo
Jiawei Zhao
Zhuoming Chen
Beidi Chen
A. Anandkumar
37
3
0
22 Jul 2024
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models
Amir Mohammad Karimi Mamaghan
Samuele Papa
Karl Henrik Johansson
Stefan Bauer
Andrea Dittadi
OCL
53
5
0
22 Jul 2024
VideoGameBunny: Towards vision assistants for video games
Mohammad Reza Taesiri
Cor-Paul Bezemer
VLM
MLLM
35
2
0
21 Jul 2024
Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives
D. Hagos
Rick Battle
Danda B. Rawat
LM&MA
OffRL
50
23
0
20 Jul 2024
Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks
Jiayu Lin
Guanrong Chen
Bojun Jin
Chenyang Li
Shutong Jia
...
R. Xu
Long Zhang
Jiuxin Cao
Ting Jin
Zhongyu Wei
44
1
0
20 Jul 2024
Adversarial Databases Improve Success in Retrieval-based Large Language Models
Sean Wu
Michael Koo
Li Yo Kao
Andy Black
L. Blum
Fabien Scalzo
Ira Kurtz
RALM
38
0
0
19 Jul 2024
Foundation Models for Autonomous Robots in Unstructured Environments
Hossein Naderi
Alireza Shojaei
Lifu Huang
LM&Ro
54
0
0
19 Jul 2024
LAPIS: Language Model-Augmented Police Investigation System
Heedou Kim
Dain Kim
Jiwoo Lee
Chanwoong Yoon
Donghee Choi
Mogan Gim
Jaewoo Kang
RALM
35
1
0
19 Jul 2024
Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text Generation: A State-of-the-Art Investigation
Joy Mahapatra
Utpal Garain
47
8
0
19 Jul 2024
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by Direct Preference Optimization
Md Sultan al Nahian
R. Kavuluru
MedIm
AI4CE
46
0
0
19 Jul 2024
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
Chaofan Tao
Qian Liu
Longxu Dou
Niklas Muennighoff
Zhongwei Wan
Ping Luo
Min Lin
Ngai Wong
PILM
60
47
0
18 Jul 2024
Previous
1
2
3
...
15
16
17
...
83
84
85
Next