2206.07682
Emergent Abilities of Large Language Models
15 June 2022
Jason W. Wei
Yi Tay
Rishi Bommasani
Colin Raffel
Barret Zoph
Sebastian Borgeaud
Dani Yogatama
Maarten Bosma
Denny Zhou
Donald Metzler
Ed H. Chi
Tatsunori Hashimoto
Oriol Vinyals
Percy Liang
J. Dean
W. Fedus
ELM
ReLM
LRM
Papers citing
"Emergent Abilities of Large Language Models"
50 / 1,644 papers shown
Title
Large Language Models as Psychological Simulators: A Methodological Guide
Zhicheng Lin
LLMAG
6
1
0
20 Jun 2025
Self-Critique-Guided Curiosity Refinement: Enhancing Honesty and Helpfulness in Large Language Models via In-Context Learning
Duc Hieu Ho
Chenglin Fan
HILM
LRM
6
0
0
19 Jun 2025
Unveiling the Learning Mind of Language Models: A Cognitive Framework and Empirical Study
Zhengyu Hu
Jianxun Lian
Zheyuan Xiao
Seraphina Zhang
Tianfu Wang
Nicholas Jing Yuan
Xing Xie
Hui Xiong
ELM
LRM
7
0
0
16 Jun 2025
Position: Pause Recycling LoRAs and Prioritize Mechanisms to Uncover Limits and Effectiveness
Mei-Yen Chen
Thi Thu Uyen Hoang
Michael Hahn
M. Sarfraz
MoMe
7
0
0
16 Jun 2025
Distinct Computations Emerge From Compositional Curricula in In-Context Learning
Jin Hwa Lee
Andrew Kyle Lampinen
Aaditya K. Singh
Andrew Saxe
13
0
0
16 Jun 2025
Complexity Scaling Laws for Neural Models using Combinatorial Optimization
Lowell Weissman
Michael Krumdick
A. Lynn Abbott
16
0
0
15 Jun 2025
EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction
Hsi-Che Lin
Yu-Chu Yu
Kai-Po Chang
Y. Wang
64
0
0
13 Jun 2025
Discovering Hierarchical Latent Capabilities of Language Models via Causal Representation Learning
Jikai Jin
Vasilis Syrgkanis
Sham Kakade
Hanlin Zhang
ELM
103
1
0
12 Jun 2025
BF-Max: an Efficient Bit Flipping Decoder with Predictable Decoding Failure Rate
Alessio Baldelli
Marco Baldi
F. Chiaraluce
Paolo Santini
107
0
0
11 Jun 2025
A Survey on Large Language Models for Mathematical Reasoning
Peng-Yuan Wang
Tian-Shuo Liu
Chenyang Wang
Yi-Di Wang
Shu Yan
...
Xu-Hui Liu
Xin-Wei Chen
Jia-Cheng Xu
Ziniu Li
Yang Yu
LRM
16
0
0
10 Jun 2025
Large Language Models and Emergence: A Complex Systems Perspective
D. Krakauer
John W. Krakauer
Melanie Mitchell
18
0
0
10 Jun 2025
Olica: Efficient Structured Pruning of Large Language Models without Retraining
Jiujun He
Huazhen Lin
19
0
0
10 Jun 2025
Extrapolation by Association: Length Generalization Transfer in Transformers
Ziyang Cai
Nayoung Lee
Avi Schwarzschild
Samet Oymak
Dimitris Papailiopoulos
27
0
0
10 Jun 2025
Mimicking or Reasoning: Rethinking Multi-Modal In-Context Learning in Vision-Language Models
Chengyue Huang
Yuchen Zhu
Sichen Zhu
Jingyun Xiao
Moises Andrade
Shivang Chopra
Z. Kira
ReLM
VLM
LRM
10
0
0
09 Jun 2025
MiniCPM4: Ultra-Efficient LLMs on End Devices
MiniCPM Team
Chaojun Xiao
Yuxuan Li
Xu Han
Yuzhuo Bai
...
Zhiyuan Liu
Guoyang Zeng
Chao Jia
Dahai Li
Maosong Sun
MLLM
19
0
0
09 Jun 2025
Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations
Zhiyu Xue
Reza Abbasi-Asl
Ramtin Pedarsani
AAML
13
0
0
08 Jun 2025
Transferring Features Across Language Models With Model Stitching
Alan Chen
Jack Merullo
Alessandro Stolfo
Ellie Pavlick
9
0
0
07 Jun 2025
Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks
D. Kunin
Giovanni Luca Marchetti
F. Chen
Dhruva Karkada
James B. Simon
M. DeWeese
Surya Ganguli
Nina Miolane
16
0
0
06 Jun 2025
Contextually Guided Transformers via Low-Rank Adaptation
A. Zhmoginov
Jihwan Lee
Max Vladymyrov
Mark Sandler
OffRL
48
0
0
06 Jun 2025
RecGPT: A Foundation Model for Sequential Recommendation
Yangqin Jiang
Xubin Ren
Lianghao Xia
Da Luo
Kangyi Lin
Chao Huang
LRM
84
0
0
06 Jun 2025
LLM-based phoneme-to-grapheme for phoneme-based speech recognition
Te Ma
Min Bi
Saierdaer Yusuyin
Hao Huang
Zhijian Ou
164
0
0
05 Jun 2025
Neural Network Reprogrammability: A Unified Theme on Model Reprogramming, Prompt Tuning, and Prompt Instruction
Zesheng Ye
C. Cai
Ruijiang Dong
Jianzhong Qi
Lei Feng
Pin-Yu Chen
Feng Liu
181
0
0
05 Jun 2025
Explainer-guided Targeted Adversarial Attacks against Binary Code Similarity Detection Models
Mingjie Chen
Tiancheng Zhu
Mingxue Zhang
Yiling He
Minghao Lin
Penghui Li
Kui Ren
AAML
17
1
0
05 Jun 2025
An Exploratory Framework for Future SETI Applications: Detecting Generative Reactivity via Language Models
Po-Chieh Yu
37
0
0
03 Jun 2025
PC-MoE: Memory-Efficient and Privacy-Preserving Collaborative Training for Mixture-of-Experts LLMs
Ze Yu Zhang
Bolin Ding
Bryan Kian Hsiang Low
MoE
67
0
0
03 Jun 2025
Transformers as Multi-task Learners: Decoupling Features in Hidden Markov Models
Yifan Hao
Chenlu Ye
Chi Han
Tong Zhang
44
0
0
02 Jun 2025
Fodor and Pylyshyn's Legacy - Still No Human-like Systematic Compositionality in Neural Networks
Tim Woydt
Moritz Willig
Antonia Wüst
Lukas Helff
Wolfgang Stammer
Constantin Rothkopf
Kristian Kersting
51
1
0
02 Jun 2025
From Street Views to Urban Science: Discovering Road Safety Factors with Multimodal Large Language Models
Yihong Tang
Ao Qu
Xujing Yu
Weipeng Deng
Jun Ma
Jinhua Zhao
Lijun Sun
23
0
0
02 Jun 2025
The World As Large Language Models See It: Exploring the reliability of LLMs in representing geographical features
Omid Reza Abbasi
Franz Welscher
Georg Weinberger
Johannes Scholz
26
1
0
30 May 2025
Diversity of Transformer Layers: One Aspect of Parameter Scaling Laws
Hidetaka Kamigaito
Ying Zhang
Jingun Kwon
Katsuhiko Hayashi
Manabu Okumura
Taro Watanabe
MoE
33
1
0
29 May 2025
SGD as Free Energy Minimization: A Thermodynamic View on Neural Network Training
Ildus Sadrtdinov
Ivan Klimov
E. Lobacheva
Dmitry Vetrov
20
0
0
29 May 2025
Cross-Task Experiential Learning on LLM-based Multi-Agent Collaboration
Yilong Li
Chen Qian
Yu Xia
Ruijie Shi
Yufan Dang
...
Ye Tian
Xuantang Xiong
Lei Han
Zhiyuan Liu
Maosong Sun
LLMAG
72
0
0
29 May 2025
BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning
Jianyang Gu
Samuel Stevens
Elizabeth G. Campolongo
Matthew J. Thompson
Net Zhang
...
Daniel Rubenstein
Hilmar Lapp
T. Berger-Wolf
Wei-Lun Chao
Yu-Chuan Su
VLM
46
2
0
29 May 2025
Domain-Aware Tensor Network Structure Search
Giorgos Iacovides
Wuyang Zhou
Chao Li
Qibin Zhao
Danilo Mandic
22
0
0
29 May 2025
Neither Stochastic Parroting nor AGI: LLMs Solve Tasks through Context-Directed Extrapolation from Training Data Priors
Harish Tayyar Madabushi
Melissa Torgbi
C. Bonial
64
0
0
29 May 2025
Scalable Complexity Control Facilitates Reasoning Ability of LLMs
Liangkai Hang
Junjie Yao
Zhiwei Bai
Tianyi Chen
Yang Chen
...
Feiyu Xiong
Y. Zhang
Weinan E
Hongkang Yang
Zhi-hai Xu
LRM
39
0
0
29 May 2025
Fast and Cost-effective Speculative Edge-Cloud Decoding with Early Exits
Yeshwanth Venkatesha
Souvik Kundu
Priyadarshini Panda
42
0
0
27 May 2025
Can Large Language Models Predict Audio Effects Parameters from Natural Language?
Seungheon Doh
Junghyun Koo
Marco A. Martínez-Ramírez
Wei-Hsiang Liao
Juhan Nam
Yuki Mitsufuji
32
0
0
27 May 2025
Test-Time Learning for Large Language Models
Jinwu Hu
Zhitian Zhang
Guohao Chen
Xutao Wen
Chao Shuai
Wei Luo
Bin Xiao
Yuanqing Li
Mingkui Tan
47
0
0
27 May 2025
Respond to Change with Constancy: Instruction-tuning with LLM for Non-I.I.D. Network Traffic Classification
Xinjie Lin
Gang Xiong
Gaopeng Gou
Wenqi Dong
Jing Yu
Zhen Li
W. Xia
26
0
0
27 May 2025
Assessment of L2 Oral Proficiency using Speech Large Language Models
Rao Ma
Mengjie Qian
Siyuan Tang
Stefano Bannò
Kate Knill
Mark Gales
AuLLM
52
0
0
27 May 2025
Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation
Yuhao Wang
Ruiyang Ren
Yucheng Wang
Wayne Xin Zhao
Jing Liu
Hua Wu
Haifeng Wang
RALM
OffRL
75
0
0
27 May 2025
In-Context Brush: Zero-shot Customized Subject Insertion with Context-Aware Latent Space Manipulation
Yu Xu
Fan Tang
You Wu
Lin Gao
Oliver Deussen
Hongbin Yan
Jintao Li
Juan Cao
Tong-Yee Lee
DiffM
39
0
0
26 May 2025
Grammars of Formal Uncertainty: When to Trust LLMs in Automated Reasoning Tasks
Debargha Ganguly
Vikash Singh
Sreehari Sankar
Biyao Zhang
Xuecen Zhang
Srinivasan Iyengar
Xiaotian Han
Amit Sharma
Shivkumar Kalyanaraman
Vipin Chaudhary
46
0
0
26 May 2025
The Birth of Knowledge: Emergent Features across Time, Space, and Scale in Large Language Models
Shashata Sawmya
Micah Adler
Nir Shavit
MILM
26
0
0
26 May 2025
Chemical classification program synthesis using generative artificial intelligence
Christopher J. Mungall
Adnan Malik
Daniel R. Korn
Justin T Reese
Noel M. O'Boyle
Janna Hastings
67
0
0
24 May 2025
μ-MoE: Test-Time Pruning as Micro-Grained Mixture-of-Experts
T. Koike-Akino
Jing Liu
Ye Wang
MoE
26
0
0
24 May 2025
Synthesizing and Adapting Error Correction Data for Mobile Large Language Model Applications
Yanxiang Zhang
Zheng Xu
Shanshan Wu
Yuanbo Zhang
Daniel Ramage
KELM
32
0
0
24 May 2025
BTC-LLM: Efficient Sub-1-Bit LLM Quantization via Learnable Transformation and Binary Codebook
Hao Gu
Lujun Li
Zheyu Wang
B. Liu
Qiyuan Zhu
Sirui Han
Yike Guo
MQ
7
0
0
24 May 2025
Multilingual Question Answering in Low-Resource Settings: A Dzongkha-English Benchmark for Foundation Models
Md. Tanzib Hosain
Rajan Das Gupta
Md. Kishor Morol
10
0
0
24 May 2025