ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.02311
  4. Cited By
PaLM: Scaling Language Modeling with Pathways

PaLM: Scaling Language Modeling with Pathways

5 April 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
Adam Roberts
P. Barham
Hyung Won Chung
Charles Sutton
Sebastian Gehrmann
Parker Schuh
Kensen Shi
Sasha Tsvyashchenko
Joshua Maynez
Abhishek Rao
Parker Barnes
Yi Tay
Noam M. Shazeer
Vinodkumar Prabhakaran
Emily Reif
Nan Du
Ben Hutchinson
Reiner Pope
James Bradbury
Jacob Austin
Michael Isard
Guy Gur-Ari
Pengcheng Yin
Toju Duke
Anselm Levskaya
Sanjay Ghemawat
Sunipa Dev
Henryk Michalewski
Xavier Garcia
Vedant Misra
Kevin Robinson
Liam Fedus
Denny Zhou
Daphne Ippolito
D. Luan
Hyeontaek Lim
Barret Zoph
A. Spiridonov
Ryan Sepassi
David Dohan
Shivani Agrawal
Mark Omernick
Andrew M. Dai
Thanumalayan Sankaranarayana Pillai
Marie Pellat
Aitor Lewkowycz
Erica Moreira
R. Child
Oleksandr Polozov
Katherine Lee
Zongwei Zhou
Xuezhi Wang
Brennan Saeta
Mark Díaz
Orhan Firat
Michele Catasta
Jason W. Wei
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
    PILM
    LRM
ArXivPDFHTML

Papers citing "PaLM: Scaling Language Modeling with Pathways"

50 / 4,245 papers shown
Title
Understanding Knowledge Drift in LLMs through Misinformation
Understanding Knowledge Drift in LLMs through Misinformation
Alina Fastowski
Gjergji Kasneci
KELM
43
2
0
11 Sep 2024
Pushing the Limits of Vision-Language Models in Remote Sensing without
  Human Annotations
Pushing the Limits of Vision-Language Models in Remote Sensing without Human Annotations
Keumgang Cha
Donggeun Yu
Junghoon Seo
VLM
34
0
0
11 Sep 2024
FreeRide: Harvesting Bubbles in Pipeline Parallelism
FreeRide: Harvesting Bubbles in Pipeline Parallelism
Jiashu Zhang
Zihan Pan
Molly
Xu
Khuzaima S. Daudjee
90
0
0
11 Sep 2024
RNR: Teaching Large Language Models to Follow Roles and Rules
RNR: Teaching Large Language Models to Follow Roles and Rules
Kuan-Chieh Wang
Alexander Bukharin
Haoming Jiang
Qingyu Yin
Zhengyang Wang
...
Chao Zhang
Bing Yin
Xian Li
Jianshu Chen
Shiyang Li
ALM
26
1
0
10 Sep 2024
MathGLM-Vision: Solving Mathematical Problems with Multi-Modal Large
  Language Model
MathGLM-Vision: Solving Mathematical Problems with Multi-Modal Large Language Model
Zhen Yang
Jinhao Chen
Zhengxiao Du
Wenmeng Yu
Weihan Wang
Wenyi Hong
Zhihuan Jiang
Bin Xu
Yuxiao Dong
Jie Tang
VLM
LRM
32
8
0
10 Sep 2024
What is the Role of Small Models in the LLM Era: A Survey
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
66
23
0
10 Sep 2024
Can OOD Object Detectors Learn from Foundation Models?
Can OOD Object Detectors Learn from Foundation Models?
Jiahui Liu
Xin Wen
Shizhen Zhao
Y. Chen
Xiaojuan Qi
OODD
48
2
0
08 Sep 2024
Expanding Expressivity in Transformer Models with MöbiusAttention
Expanding Expressivity in Transformer Models with MöbiusAttention
Anna-Maria Halacheva
M. Nayyeri
Steffen Staab
27
1
0
08 Sep 2024
POINTS: Improving Your Vision-language Model with Affordable Strategies
POINTS: Improving Your Vision-language Model with Affordable Strategies
Yuan Liu
Zhongyin Zhao
Ziyuan Zhuang
Le Tian
Xiao Zhou
Jie Zhou
VLM
43
5
0
07 Sep 2024
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation
Zhuoyan Luo
Fengyuan Shi
Yixiao Ge
Yujiu Yang
Limin Wang
Ying Shan
VLM
52
53
0
06 Sep 2024
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with
  High-Quality Data
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data
Yejie Wang
Keqing He
Dayuan Fu
Zhuoma Gongque
Heyang Xu
...
Muxi Diao
Jingang Wang
Hao Fei
Xunliang Cai
Weiran Xu
ALM
SyDa
48
3
0
05 Sep 2024
Hallucination Detection in LLMs: Fast and Memory-Efficient Finetuned
  Models
Hallucination Detection in LLMs: Fast and Memory-Efficient Finetuned Models
Gabriel Y. Arteaga
Thomas B. Schon
Nicolas Pielawski
38
7
0
04 Sep 2024
Diversify-verify-adapt: Efficient and Robust Retrieval-Augmented Ambiguous Question Answering
Diversify-verify-adapt: Efficient and Robust Retrieval-Augmented Ambiguous Question Answering
Yeonjun In
Sungchul Kim
Ryan A. Rossi
Md Mehrab Tanjim
Tong Yu
Ritwik Sinha
Chanyoung Park
37
2
0
04 Sep 2024
Leveraging Large Language Models for Solving Rare MIP Challenges
Leveraging Large Language Models for Solving Rare MIP Challenges
Teng Wang
Wing-Yin Yu
Ruifeng She
Wenhan Yang
Taijie Chen
Jianping Zhang
AI4CE
29
5
0
03 Sep 2024
Think Twice Before Recognizing: Large Multimodal Models for General
  Fine-grained Traffic Sign Recognition
Think Twice Before Recognizing: Large Multimodal Models for General Fine-grained Traffic Sign Recognition
Yaozong Gan
Guang Li
Ren Togo
Keisuke Maeda
Takahiro Ogawa
Miki Haseyama
57
1
0
03 Sep 2024
Towards General Industrial Intelligence: A Survey on IIoT-Enhanced
  Continual Large Models
Towards General Industrial Intelligence: A Survey on IIoT-Enhanced Continual Large Models
Jiao Chen
Jiayi He
Fangfang Chen
Zuohong Lv
Jianhua Tang
Weihua Li
Zuozhu Liu
Howard H. Yang
Guangjie Han
AI4CE
41
1
0
02 Sep 2024
LuWu: An End-to-End In-Network Out-of-Core Optimizer for 100B-Scale
  Model-in-Network Data-Parallel Training on Distributed GPUs
LuWu: An End-to-End In-Network Out-of-Core Optimizer for 100B-Scale Model-in-Network Data-Parallel Training on Distributed GPUs
Mo Sun
Zihan Yang
Changyue Liao
Yingtao Li
Fei Wu
Zeke Wang
65
1
0
02 Sep 2024
SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring
  Expression Segmentation
SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation
Yi-Chia Chen
Wei-Hua Li
Cheng Sun
Yu-Chiang Frank Wang
Chu-Song Chen
VLM
45
11
0
01 Sep 2024
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
Zanlin Ni
Yulin Wang
Renping Zhou
Rui Lu
Jiayi Guo
Jinyi Hu
Zhiyuan Liu
Yuan Yao
Gao Huang
50
7
0
31 Aug 2024
UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios
UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios
Baichuan Zhou
Haote Yang
Dairong Chen
Junyan Ye
Tianyi Bai
Jinhua Yu
Songyang Zhang
Dahua Lin
Conghui He
Weijia Li
VLM
58
4
0
30 Aug 2024
A Survey for Large Language Models in Biomedicine
A Survey for Large Language Models in Biomedicine
Chong Wang
Mengyao Li
Junjun He
Zhongruo Wang
Erfan Darzi
...
Yi Yu
Pietro Liò
Tianyun Wang
Yu Guang Wang
Yiqing Shen
LM&MA
48
9
0
29 Aug 2024
LoraMap: Harnessing the Power of LoRA Connections
LoraMap: Harnessing the Power of LoRA Connections
Hyeryun Park
Jeongwon Kwak
Dongsuk Jang
Sumin Park
Jinwook Choi
MoMe
38
0
0
29 Aug 2024
Hand1000: Generating Realistic Hands from Text with Only 1,000 Images
Hand1000: Generating Realistic Hands from Text with Only 1,000 Images
Haozhuo Zhang
B. Zhu
Yu Cao
Y. Hao
VLM
44
2
0
28 Aug 2024
A Statistical Framework for Data-dependent Retrieval-Augmented Models
A Statistical Framework for Data-dependent Retrieval-Augmented Models
Soumya Basu
A. S. Rawat
Manzil Zaheer
RALM
54
0
0
27 Aug 2024
Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language
  Instruction Tuning for Semiconductor Electron Micrograph Analysis
Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis
Sakhinana Sagar Srinivas
Chidaksh Ravuru
Geethan Sannidhi
Venkataramana Runkana
50
0
0
27 Aug 2024
Zero-Shot Visual Reasoning by Vision-Language Models: Benchmarking and
  Analysis
Zero-Shot Visual Reasoning by Vision-Language Models: Benchmarking and Analysis
Aishik Nagar
Shantanu Jaiswal
Cheston Tan
ReLM
LRM
31
7
0
27 Aug 2024
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and
  Deduplication by Introducing a Competitive Large Language Model Baseline
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline
Guosheng Dong
Zhuoran Zhang
Yiding Sun
Da Pan
Zheng Liang
...
Bingning Wang
Wentao Zhang
Jiaxin Mao
Zenan Zhou
Weipeng Chen
ALM
48
2
0
27 Aug 2024
A Survey of Large Language Models for European Languages
A Survey of Large Language Models for European Languages
Wazir Ali
S. Pyysalo
52
2
0
27 Aug 2024
Evidence-Enhanced Triplet Generation Framework for Hallucination
  Alleviation in Generative Question Answering
Evidence-Enhanced Triplet Generation Framework for Hallucination Alleviation in Generative Question Answering
Haowei Du
Huishuai Zhang
Dongyan Zhao
HILM
35
0
0
27 Aug 2024
Cross-Modal Learning for Chemistry Property Prediction: Large Language
  Models Meet Graph Machine Learning
Cross-Modal Learning for Chemistry Property Prediction: Large Language Models Meet Graph Machine Learning
Sakhinana Sagar Srinivas
Venkataramana Runkana
AI4CE
51
1
0
27 Aug 2024
HPT++: Hierarchically Prompting Vision-Language Models with
  Multi-Granularity Knowledge Generation and Improved Structure Modeling
HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure Modeling
Yubin Wang
Xinyang Jiang
De Cheng
Wenli Sun
Dongsheng Li
Cairong Zhao
VLM
51
0
0
27 Aug 2024
Measuring Human Contribution in AI-Assisted Content Generation
Measuring Human Contribution in AI-Assisted Content Generation
Yueqi Xie
Tao Qi
Jingwei Yi
Ryan Whalen
Junming Huang
Qian Ding
Yu Xie
Xing Xie
Fangzhao Wu
Fangzhao Wu
44
1
0
27 Aug 2024
Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep
  Learning
Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning
Wei An
Xiao Bi
Guanting Chen
Shanhuang Chen
Chengqi Deng
...
Chenggang Zhao
Yao Zhao
Shangyan Zhou
Shunfeng Zhou
Yuheng Zou
41
6
0
26 Aug 2024
Watermarking Techniques for Large Language Models: A Survey
Watermarking Techniques for Large Language Models: A Survey
Yuqing Liang
Jiancheng Xiao
Wensheng Gan
Philip S. Yu
OffRL
37
3
0
26 Aug 2024
Hierarchical Network Fusion for Multi-Modal Electron Micrograph
  Representation Learning with Foundational Large Language Models
Hierarchical Network Fusion for Multi-Modal Electron Micrograph Representation Learning with Foundational Large Language Models
Sakhinana Sagar Srinivas
Geethan Sannidhi
Venkataramana Runkana
45
0
0
24 Aug 2024
Utilizing Large Language Models for Named Entity Recognition in
  Traditional Chinese Medicine against COVID-19 Literature: Comparative Study
Utilizing Large Language Models for Named Entity Recognition in Traditional Chinese Medicine against COVID-19 Literature: Comparative Study
Xu Tong
N. Smirnova
Sharmila Upadhyaya
Ran Yu
Jack H. Culbert
Chao Sun
Wolfgang Otto
Philipp Mayr
AI4MH
34
0
0
24 Aug 2024
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to
  Small-Scale Local LLMs
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
Chansung Park
Juyong Jiang
Fan Wang
Sayak Paul
Jing Tang
39
2
0
24 Aug 2024
Understanding Defects in Generated Codes by Language Models
Understanding Defects in Generated Codes by Language Models
Ali Mohammadi Esfahani
N. Kahani
S. Ajila
35
1
0
23 Aug 2024
Foundational Model for Electron Micrograph Analysis: Instruction-Tuning
  Small-Scale Language-and-Vision Assistant for Enterprise Adoption
Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption
Sakhinana Sagar Srinivas
Chidaksh Ravuru
Geethan Sannidhi
Venkataramana Runkana
46
0
0
23 Aug 2024
In-Context Learning with Reinforcement Learning for Incomplete Utterance
  Rewriting
In-Context Learning with Reinforcement Learning for Incomplete Utterance Rewriting
Haowei Du
Dongyan Zhao
RALM
37
0
0
23 Aug 2024
Internal and External Knowledge Interactive Refinement Framework for
  Knowledge-Intensive Question Answering
Internal and External Knowledge Interactive Refinement Framework for Knowledge-Intensive Question Answering
Haowei Du
Dongyan Zhao
KELM
30
0
0
23 Aug 2024
Investigating LLM Applications in E-Commerce
Investigating LLM Applications in E-Commerce
Chester Palen-Michel
Ruixiang Wang
Yipeng Zhang
David Yu
Canran Xu
Zhe Wu
28
3
0
23 Aug 2024
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
Yi-Fan Zhang
Huanyu Zhang
Haochen Tian
Chaoyou Fu
Shuangqing Zhang
...
Qingsong Wen
Zhang Zhang
Liwen Wang
Rong Jin
Tieniu Tan
OffRL
69
36
0
23 Aug 2024
Show-o: One Single Transformer to Unify Multimodal Understanding and
  Generation
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation
Jinheng Xie
Weijia Mao
Zechen Bai
David Junhao Zhang
Weihao Wang
Kevin Qinghong Lin
Yuchao Gu
Zhijie Chen
Zhenheng Yang
Mike Zheng Shou
59
166
0
22 Aug 2024
Fine-tuning Smaller Language Models for Question Answering over
  Financial Documents
Fine-tuning Smaller Language Models for Question Answering over Financial Documents
Karmvir Singh Phogat
Sai Akhil Puranam
Sridhar Dasaratha
Chetan Harsha
Shashishekar Ramakrishna
LRM
39
2
0
22 Aug 2024
D-RMGPT: Robot-assisted collaborative tasks driven by large multimodal
  models
D-RMGPT: Robot-assisted collaborative tasks driven by large multimodal models
Matteo Forlini
Mihail Babcinschi
Giacomo Palmieri
Pedro Neto
57
1
0
21 Aug 2024
LARR: Large Language Model Aided Real-time Scene Recommendation with
  Semantic Understanding
LARR: Large Language Model Aided Real-time Scene Recommendation with Semantic Understanding
Zhizhong Wan
Bin Yin
Junjie Xie
Fei Jiang
Xiang Li
Wei Lin
3DV
51
5
0
21 Aug 2024
EmbodiedSAM: Online Segment Any 3D Thing in Real Time
EmbodiedSAM: Online Segment Any 3D Thing in Real Time
Xiuwei Xu
Huangxing Chen
Linqing Zhao
Ziwei Wang
Jie Zhou
Jiwen Lu
47
15
0
21 Aug 2024
Benchmarking Large Language Models for Math Reasoning Tasks
Benchmarking Large Language Models for Math Reasoning Tasks
Kathrin Seßler
Yao Rong
Emek Gözlüklü
Enkelejda Kasneci
LRM
38
3
0
20 Aug 2024
CodeJudge-Eval: Can Large Language Models be Good Judges in Code
  Understanding?
CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding?
Yuwei Zhao
Ziyang Luo
Yuchen Tian
Hongzhan Lin
Weixiang Yan
Annan Li
Jing Ma
ELM
ALM
LRM
50
8
0
20 Aug 2024
Previous
123...131415...838485
Next