ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.07682
  4. Cited By
Emergent Abilities of Large Language Models

Emergent Abilities of Large Language Models

15 June 2022
Jason W. Wei
Yi Tay
Rishi Bommasani
Colin Raffel
Barret Zoph
Sebastian Borgeaud
Dani Yogatama
Maarten Bosma
Denny Zhou
Donald Metzler
Ed H. Chi
Tatsunori Hashimoto
Oriol Vinyals
Percy Liang
J. Dean
W. Fedus
    ELM
    ReLM
    LRM
ArXivPDFHTML

Papers citing "Emergent Abilities of Large Language Models"

50 / 1,588 papers shown
Title
Beyond Imitation: Learning Key Reasoning Steps from Dual
  Chain-of-Thoughts in Reasoning Distillation
Beyond Imitation: Learning Key Reasoning Steps from Dual Chain-of-Thoughts in Reasoning Distillation
Chengwei Dai
Kun Li
Wei Zhou
Song Hu
LRM
58
6
0
30 May 2024
Why Larger Language Models Do In-context Learning Differently?
Why Larger Language Models Do In-context Learning Differently?
Zhenmei Shi
Junyi Wei
Zhuoyan Xu
Yingyu Liang
37
23
0
30 May 2024
Cracking the Code of Juxtaposition: Can AI Models Understand the
  Humorous Contradictions
Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions
Zhe Hu
Tuo Liang
Jing Li
Yiren Lu
Yunlai Zhou
Yiran Qiao
Jing Ma
Yu Yin
64
3
0
29 May 2024
Large Brain Model for Learning Generic Representations with Tremendous
  EEG Data in BCI
Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI
Wei-Bang Jiang
Li-Ming Zhao
Bao-Liang Lu
60
71
0
29 May 2024
Are PPO-ed Language Models Hackable?
Are PPO-ed Language Models Hackable?
Suraj Anand
David Getzen
37
0
0
28 May 2024
Scaling Laws and Compute-Optimal Training Beyond Fixed Training
  Durations
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations
Alexander Hägele
Elie Bakouch
Atli Kosson
Loubna Ben Allal
Leandro von Werra
Martin Jaggi
46
37
0
28 May 2024
Towards a theory of how the structure of language is acquired by deep
  neural networks
Towards a theory of how the structure of language is acquired by deep neural networks
Francesco Cagnetta
Matthieu Wyart
41
9
0
28 May 2024
FinerCut: Finer-grained Interpretable Layer Pruning for Large Language
  Models
FinerCut: Finer-grained Interpretable Layer Pruning for Large Language Models
Yang Zhang
Yawei Li
Xinpeng Wang
Qianli Shen
Barbara Plank
Bernd Bischl
Mina Rezaei
Kenji Kawaguchi
68
10
0
28 May 2024
Self-Guiding Exploration for Combinatorial Problems
Self-Guiding Exploration for Combinatorial Problems
Zangir Iklassov
Yali Du
Farkhad Akimov
Martin Takáč
LRM
32
3
0
28 May 2024
Metaheuristics and Large Language Models Join Forces: Toward an Integrated Optimization Approach
Metaheuristics and Large Language Models Join Forces: Toward an Integrated Optimization Approach
Camilo Chacón Sartori
Christian Blum
Filippo Bistaffa
Guillem Rodríguez Corominas
AIFin
61
4
0
28 May 2024
Phase Transitions in the Output Distribution of Large Language Models
Phase Transitions in the Output Distribution of Large Language Models
Julian Arnold
Flemming Holtorf
Frank Schafer
Niels Lörch
51
1
0
27 May 2024
Saturn: Sample-efficient Generative Molecular Design using Memory
  Manipulation
Saturn: Sample-efficient Generative Molecular Design using Memory Manipulation
Jeff Guo
Philippe Schwaller
Mamba
61
7
0
27 May 2024
Assessing Empathy in Large Language Models with Real-World
  Physician-Patient Interactions
Assessing Empathy in Large Language Models with Real-World Physician-Patient Interactions
Man Luo
Christopher J. Warren
Lu Cheng
Haidar M Abdul-Muhsin
Imon Banerjee
LM&MA
AI4MH
42
10
0
26 May 2024
The global landscape of academic guidelines for generative AI and Large Language Models
The global landscape of academic guidelines for generative AI and Large Language Models
Junfeng Jiao
S. Afroogh
Kevin Chen
David Atkinson
Amit Dhurandhar
92
4
0
26 May 2024
SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large
  Language Models
SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models
Xudong Lu
Aojun Zhou
Yuhui Xu
Renrui Zhang
Peng Gao
Hongsheng Li
45
7
0
25 May 2024
Incremental Comprehension of Garden-Path Sentences by Large Language
  Models: Semantic Interpretation, Syntactic Re-Analysis, and Attention
Incremental Comprehension of Garden-Path Sentences by Large Language Models: Semantic Interpretation, Syntactic Re-Analysis, and Attention
Andrew Li
Xianle Feng
Siddhant Narang
Austin Peng
Tianle Cai
Raj Sanjay Shah
Sashank Varma
LRM
46
6
0
25 May 2024
Unsupervised Meta-Learning via In-Context Learning
Unsupervised Meta-Learning via In-Context Learning
Anna Vettoruzzo
Lorenzo Braccaioli
Joaquin Vanschoren
M. Nowaczyk
SSL
78
0
0
25 May 2024
Open-Vocabulary SAM3D: Understand Any 3D Scene
Open-Vocabulary SAM3D: Understand Any 3D Scene
Hanchen Tai
Qingdong He
Jiangning Zhang
Yijie Qian
Zhenyu Zhang
Xiaobin Hu
Yabiao Wang
Yong Liu
VLM
71
1
0
24 May 2024
Learning Beyond Pattern Matching? Assaying Mathematical Understanding in
  LLMs
Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs
Siyuan Guo
Aniket Didolkar
Nan Rosemary Ke
Anirudh Goyal
Ferenc Huszár
Bernhard Schölkopf
59
4
0
24 May 2024
From Frege to chatGPT: Compositionality in language, cognition, and deep
  neural networks
From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks
Jacob Russin
Sam Whitman McGrath
Danielle J. Williams
Lotem Elber-Dorozko
AI4CE
110
3
0
24 May 2024
Quantifying the Gain in Weak-to-Strong Generalization
Quantifying the Gain in Weak-to-Strong Generalization
Moses Charikar
Chirag Pabbaraju
Kirankumar Shiragur
ELM
53
19
0
24 May 2024
AstroPT: Scaling Large Observation Models for Astronomy
AstroPT: Scaling Large Observation Models for Astronomy
Michael J. Smith
Ryan J. Roberts
E. Angeloudi
M. Huertas-Company
61
1
0
23 May 2024
Lessons from the Trenches on Reproducible Evaluation of Language Models
Lessons from the Trenches on Reproducible Evaluation of Language Models
Stella Biderman
Hailey Schoelkopf
Lintang Sutawika
Leo Gao
J. Tow
...
Xiangru Tang
Kevin A. Wang
Genta Indra Winata
Franccois Yvon
Andy Zou
ELM
ALM
143
55
3
23 May 2024
Large language models can be zero-shot anomaly detectors for time
  series?
Large language models can be zero-shot anomaly detectors for time series?
Sarah Alnegheimish
Linh Nguyen
Laure Berti-Equille
K. Veeramachaneni
AI4TS
45
13
0
23 May 2024
MultiCast: Zero-Shot Multivariate Time Series Forecasting Using LLMs
MultiCast: Zero-Shot Multivariate Time Series Forecasting Using LLMs
Georgios Chatzigeorgakidis
Konstantinos Lentzos
Dimitrios Skoutas
AI4TS
48
3
0
23 May 2024
Mitigating Quantization Errors Due to Activation Spikes in GLU-Based
  LLMs
Mitigating Quantization Errors Due to Activation Spikes in GLU-Based LLMs
Jaewoo Yang
Hayun Kim
Younghoon Kim
52
12
0
23 May 2024
Explainable Few-shot Knowledge Tracing
Explainable Few-shot Knowledge Tracing
Haoxuan Li
Jifan Yu
Y. Ouyang
Zhuang Liu
Wenge Rong
Juan-Zi Li
Zhang Xiong
48
1
0
23 May 2024
Can Large Language Models Create New Knowledge for Spatial Reasoning
  Tasks?
Can Large Language Models Create New Knowledge for Spatial Reasoning Tasks?
Thomas Greatrix
Roger Whitaker
Liam Turner
Walter Colombo
LRM
52
1
0
23 May 2024
Focus Anywhere for Fine-grained Multi-page Document Understanding
Focus Anywhere for Fine-grained Multi-page Document Understanding
Chenglong Liu
Haoran Wei
Jinyue Chen
Lingyu Kong
Zheng Ge
Zining Zhu
Liang Zhao
Jian‐Yuan Sun
Chunrui Han
Xiangyu Zhang
46
22
0
23 May 2024
Large Language Models' Detection of Political Orientation in Newspapers
Large Language Models' Detection of Political Orientation in Newspapers
Alessio Buscemi
Daniele Proverbio
40
3
0
23 May 2024
How Do Transformers "Do" Physics? Investigating the Simple Harmonic
  Oscillator
How Do Transformers "Do" Physics? Investigating the Simple Harmonic Oscillator
Subhash Kantamneni
Ziming Liu
Max Tegmark
26
2
0
23 May 2024
Implicit In-context Learning
Implicit In-context Learning
Zhuowei Li
Zihao Xu
Ligong Han
Yunhe Gao
Song Wen
Di Liu
Hao Wang
Dimitris N. Metaxas
67
2
0
23 May 2024
Carbon Connect: An Ecosystem for Sustainable Computing
Carbon Connect: An Ecosystem for Sustainable Computing
Benjamin C. Lee
David Brooks
Arthur van Benthem
Udit Gupta
G. Hills
...
Emma Strubell
Gu-Yeon Wei
Adam Wierman
Yuan Yao
Minlan Yu
30
2
0
22 May 2024
Do Language Models Enjoy Their Own Stories? Prompting Large Language
  Models for Automatic Story Evaluation
Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation
Cyril Chhun
Fabian M. Suchanek
Chloé Clavel
LRM
49
15
0
22 May 2024
A Survey of Robotic Language Grounding: Tradeoffs between Symbols and
  Embeddings
A Survey of Robotic Language Grounding: Tradeoffs between Symbols and Embeddings
Vanya Cohen
J. Liu
Raymond J. Mooney
Stefanie Tellex
David Watkins
LM&Ro
62
12
0
21 May 2024
Securing the Future of GenAI: Policy and Technology
Securing the Future of GenAI: Policy and Technology
Mihai Christodorescu
Craven
Soheil Feizi
Neil Zhenqiang Gong
Mia Hoffmann
...
Jessica Newman
Emelia Probasco
Yanjun Qi
Khawaja Shams
Turek
SILM
82
3
0
21 May 2024
EchoPT: A Pretrained Transformer Architecture that Predicts 2D In-Air
  Sonar Images for Mobile Robotics
EchoPT: A Pretrained Transformer Architecture that Predicts 2D In-Air Sonar Images for Mobile Robotics
Jan Steckel
W. Jansen
Nico Huebel
MDE
48
0
0
21 May 2024
Adapting Large Multimodal Models to Distribution Shifts: The Role of
  In-Context Learning
Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning
Guanglin Zhou
Zhongyi Han
Shiming Chen
Erdun Gao
Liming Zhu
Salman Khan
Xin Gao
Lina Yao
VLM
66
4
0
20 May 2024
Quantifying In-Context Reasoning Effects and Memorization Effects in
  LLMs
Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs
Siyu Lou
Yuntian Chen
Xiaodan Liang
Liang Lin
Quanshi Zhang
74
2
0
20 May 2024
Asymptotic theory of in-context learning by linear attention
Asymptotic theory of in-context learning by linear attention
Yue M. Lu
Mary I. Letey
Jacob A. Zavatone-Veth
Anindita Maiti
Cengiz Pehlevan
42
11
0
20 May 2024
Large Language Models are Biased Reinforcement Learners
Large Language Models are Biased Reinforcement Learners
William M. Hayes
Nicolas Yax
Stefano Palminteri
OffRL
50
1
0
19 May 2024
Mitigating Interpretation Bias in Rock Records with Large Language
  Models: Insights from Paleoenvironmental Analysis
Mitigating Interpretation Bias in Rock Records with Large Language Models: Insights from Paleoenvironmental Analysis
Luoqi Wang
Haipeng Li
Linshu Hu
Jiarui Cai
Zhenhong Du
AI4CE
45
0
0
17 May 2024
Function Extrapolation with Neural Networks and Its Application for
  Manifolds
Function Extrapolation with Neural Networks and Its Application for Manifolds
Guy Hay
N. Sharon
57
1
0
17 May 2024
Rethinking ChatGPT's Success: Usability and Cognitive Behaviors Enabled
  by Auto-regressive LLMs' Prompting
Rethinking ChatGPT's Success: Usability and Cognitive Behaviors Enabled by Auto-regressive LLMs' Prompting
Xinzhe Li
Ming Liu
59
0
0
17 May 2024
Can formal argumentative reasoning enhance LLMs performances?
Can formal argumentative reasoning enhance LLMs performances?
Federico Castagna
I. Sassoon
Simon Parsons
LRM
LLMAG
30
2
0
16 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks
  via Multi-modal Large Language Models
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
42
14
0
16 May 2024
What is it for a Machine Learning Model to Have a Capability?
What is it for a Machine Learning Model to Have a Capability?
Jacqueline Harding
Nathaniel Sharadin
ELM
45
3
0
14 May 2024
Hearing Touch: Audio-Visual Pretraining for Contact-Rich Manipulation
Hearing Touch: Audio-Visual Pretraining for Contact-Rich Manipulation
Jared Mejia
Victoria Dean
Tess Hellebrekers
Abhinav Gupta
62
13
0
14 May 2024
Improving Transformers with Dynamically Composable Multi-Head Attention
Improving Transformers with Dynamically Composable Multi-Head Attention
Da Xiao
Qingye Meng
Shengping Li
Xingyuan Yuan
40
3
0
14 May 2024
When Large Language Models Meet Optical Networks: Paving the Way for
  Automation
When Large Language Models Meet Optical Networks: Paving the Way for Automation
Danshi Wang
Yidi Wang
Xiaotian Jiang
Yao Zhang
Yue Pang
Min Zhang
42
5
0
14 May 2024
Previous
123...91011...303132
Next