2206.07682
Cited By
Emergent Abilities of Large Language Models
15 June 2022
Jason W. Wei
Yi Tay
Rishi Bommasani
Colin Raffel
Barret Zoph
Sebastian Borgeaud
Dani Yogatama
Maarten Bosma
Denny Zhou
Donald Metzler
Ed H. Chi
Tatsunori Hashimoto
Oriol Vinyals
Percy Liang
J. Dean
W. Fedus
ELM
ReLM
LRM
Papers citing
"Emergent Abilities of Large Language Models"
50 / 1,573 papers shown
Title
The Role of Deductive and Inductive Reasoning in Large Language Models
Chengkun Cai
Xu Zhao
Haoliang Liu
Zhongyu Jiang
Tianfang Zhang
Zongkai Wu
Lei Li
Jenq-Neng Hwang
Lei Li
LRM
37
2
0
03 Oct 2024
Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets
Yuandong Tian
57
0
0
02 Oct 2024
Quantifying Generalization Complexity for Large Language Models
Zhenting Qi
Hongyin Luo
Xuliang Huang
Zhuokai Zhao
Yibo Jiang
Xiangjun Fan
Himabindu Lakkaraju
James Glass
LRM
ELM
31
5
0
02 Oct 2024
CreDes: Causal Reasoning Enhancement and Dual-End Searching for Solving Long-Range Reasoning Problems using LLMs
Kangsheng Wang
Xiao Zhang
Hao Liu
Songde Han
Huimin Ma
Tianyu Hu
LRM
51
5
0
02 Oct 2024
Sparse Autoencoders Reveal Temporal Difference Learning in Large Language Models
Can Demircan
Tankred Saanum
A. Jagadish
Marcel Binz
Eric Schulz
35
1
0
02 Oct 2024
Towards Inference-time Category-wise Safety Steering for Large Language Models
Amrita Bhattacharjee
Shaona Ghosh
Traian Rebedea
Christopher Parisien
LLMSV
34
4
0
02 Oct 2024
Geometric Signatures of Compositionality Across a Language Model's Lifetime
Jin Hwa Lee
Thomas Jiralerspong
Lei Yu
Yoshua Bengio
Emily Cheng
CoGe
84
0
0
02 Oct 2024
Positional Attention: Expressivity and Learnability of Algorithmic Computation
Artur Back de Luca
George Giapitzakis
Shenghao Yang
Petar Veličković
K. Fountoulakis
46
0
0
02 Oct 2024
U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models
Tung-Yu Wu
Pei-Yu Lo
ReLM
LRM
46
2
0
02 Oct 2024
Endless Jailbreaks with Bijection Learning
Brian R. Y. Huang
Maximilian Li
Leonard Tang
AAML
81
5
0
02 Oct 2024
ReXplain: Translating Radiology into Patient-Friendly Video Reports
Luyang Luo
Jenanan Vairavamurthy
Xiaoman Zhang
Abhinav Kumar
Ramon R. Ter-Oganesyan
Siyang Song
Dan Shilo
Rydhwana Hossain
Mike Moritz
Pranav Rajpurkar
MedIm
LM&MA
28
3
0
01 Oct 2024
Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models
Luohe Shi
Yao Yao
Zuchao Li
Lefei Zhang
Hai Zhao
29
0
0
30 Sep 2024
Scaling Optimal LR Across Token Horizons
Johan Bjorck
Alon Benhaim
Vishrav Chaudhary
Furu Wei
Xia Song
54
4
0
30 Sep 2024
Beyond Single Concept Vector: Modeling Concept Subspace in LLMs with Gaussian Distribution
Haiyan Zhao
Heng Zhao
Bo Shen
Ali Payani
Fan Yang
Mengnan Du
59
2
0
30 Sep 2024
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reduction
Zhenmei Shi
Yifei Ming
Xuan-Phi Nguyen
Yingyu Liang
Shafiq Joty
81
27
0
25 Sep 2024
Data-Centric AI Governance: Addressing the Limitations of Model-Focused Policies
Ritwik Gupta
Leah Walker
Rodolfo Corona
Stephanie Fu
Suzanne Petryk
Janet Napolitano
Trevor Darrell
Andrew W. Reddie
ELM
40
3
0
25 Sep 2024
Analyzing Probabilistic Methods for Evaluating Agent Capabilities
Axel Højmark
Govind Pimpale
Arjun Panickssery
Marius Hobbhahn
Jérémy Scheurer
22
4
0
24 Sep 2024
VLM's Eye Examination: Instruct and Inspect Visual Competency of Vision Language Models
Nam Hyeon-Woo
Moon Ye-Bin
Wonseok Choi
Lee Hyun
Tae-Hyun Oh
CoGe
28
3
0
23 Sep 2024
Parse Trees Guided LLM Prompt Compression
Wenhao Mao
Chengbin Hou
Tianyu Zhang
Xinyu Lin
Ke Tang
Hairong Lv
26
0
0
23 Sep 2024
Co-occurrence is not Factual Association in Language Models
Xiao Zhang
Miao Li
Ji Wu
KELM
68
2
0
21 Sep 2024
End-Cloud Collaboration Framework for Advanced AI Customer Service in E-commerce
Liangyu Teng
Yang Liu
Jing Liu
Liang Song
36
2
0
20 Sep 2024
Visualizationary: Automating Design Feedback for Visualization Designers using LLMs
Sungbok Shin
Sanghyun Hong
Niklas Elmqvist
32
0
0
19 Sep 2024
Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning
Jiaxin Wen
Jian Guan
Hongning Wang
Wei Wu
Minlie Huang
ReLM
OffRL
LRM
31
7
0
19 Sep 2024
Small Language Models are Equation Reasoners
Bumjun Kim
Kunha Lee
Juyeon Kim
Sangam Lee
ReLM
LRM
29
2
0
19 Sep 2024
ProSLM : A Prolog Synergized Language Model for explainable Domain Specific Knowledge Based Question Answering
Priyesh Vakharia
Abigail Kufeldt
Max Meyers
Ian Lane
Leilani H. Gilpin
34
0
0
17 Sep 2024
The Midas Touch: Triggering the Capability of LLMs for RM-API Misuse Detection
Yi Yang
Jinghua Liu
Kai Chen
Miaoqian Lin
28
0
0
14 Sep 2024
Larger Language Models Don't Care How You Think: Why Chain-of-Thought Prompting Fails in Subjective Tasks
Georgios Chochlakis
Niyantha Maruthu Pandiyan
Kristina Lerman
Shrikanth Narayanan
ReLM
KELM
LRM
45
4
0
10 Sep 2024
LLMs Will Always Hallucinate, and We Need to Live With This
Sourav Banerjee
Ayushi Agarwal
Saloni Singla
HILM
LRM
26
35
0
09 Sep 2024
Improving Pretraining Data Using Perplexity Correlations
Tristan Thrush
Christopher Potts
Tatsunori Hashimoto
32
17
0
09 Sep 2024
Achieving Peak Performance for Large Language Models: A Systematic Review
Z. R. K. Rostam
Sándor Szénási
Gábor Kertész
37
3
0
07 Sep 2024
An overview of domain-specific foundation model: key technologies, applications and challenges
Haolong Chen
Hanzhi Chen
Zijian Zhao
Kaifeng Han
Guangxu Zhu
Yichen Zhao
Ying Du
Wei Xu
Qingjiang Shi
ALM
VLM
61
4
0
06 Sep 2024
LUK: Empowering Log Understanding with Expert Knowledge from Large Language Models
Lipeng Ma
Weidong Yang
Sihang Jiang
Ben Fei
Mingjie Zhou
Shuhao Li
Bo Xu
Yanghua Xiao
66
0
0
03 Sep 2024
Interpreting and Improving Large Language Models in Arithmetic Calculation
Wei Zhang
Chaoqun Wan
Yonggang Zhang
Yiu-ming Cheung
Xinmei Tian
Xu Shen
Jieping Ye
LRM
29
18
0
03 Sep 2024
H-ARC: A Robust Estimate of Human Performance on the Abstraction and Reasoning Corpus Benchmark
Solim LeGris
Wai Keen Vong
Brenden Lake
Todd M. Gureckis
LRM
42
8
0
02 Sep 2024
Beyond Efficiency: Molecular Data Pruning for Enhanced Generalization
Dingshuo Chen
Zhixun Li
Yuyan Ni
Guibin Zhang
Ding Wang
Qiang Liu
Shu Wu
Jeffrey Xu Yu
Liang Wang
49
4
0
02 Sep 2024
Agentic Society: Merging skeleton from real world and texture from Large Language Model
Yuqi Bai
Kun Sun
Huishi Yin
35
1
0
02 Sep 2024
LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
Jiayi Gui
Yiming Liu
Jiale Cheng
Xiaotao Gu
Xiao-Yang Liu
Hongning Wang
Yuxiao Dong
Jie Tang
Minlie Huang
ELM
LLMAG
LRM
37
2
0
28 Aug 2024
Zero-Shot Visual Reasoning by Vision-Language Models: Benchmarking and Analysis
Aishik Nagar
Shantanu Jaiswal
Cheston Tan
ReLM
LRM
23
7
0
27 Aug 2024
IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question Answering
Ruosen Li
Barry Wang
Ruochen Li
Xinya Du
ELM
33
5
0
24 Aug 2024
Image Segmentation in Foundation Model Era: A Survey
Tianfei Zhou
Fei Zhang
Boyu Chang
Wenguan Wang
Ye Yuan
E. Konukoglu
Daniel Cremers
VLM
42
4
0
23 Aug 2024
SQL-GEN: Bridging the Dialect Gap for Text-to-SQL Via Synthetic Data And Model Merging
Mohammadreza Pourreza
Ruoxi Sun
Hailong Li
Lesly Miculicich
Tomas Pfister
Sercan Ö. Arik
MoMe
37
5
0
22 Aug 2024
Cell-ontology guided transcriptome foundation model
Xinyu Yuan
Zhihao Zhan
Zuobai Zhang
Manqi Zhou
Jianan Zhao
Boyu Han
Yue Li
Jian Tang
47
1
0
22 Aug 2024
Personality Alignment of Large Language Models
Minjun Zhu
Linyi Yang
Yue Zhang
ALM
67
5
0
21 Aug 2024
Information-Theoretic Progress Measures reveal Grokking is an Emergent Phase Transition
Kenzo Clauw
S. Stramaglia
Daniele Marinazzo
50
3
0
16 Aug 2024
Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs
Jinming Liu
Yuntao Wei
Junyan Lin
Shengyang Zhao
Heming Sun
Zhibo Chen
Wenjun Zeng
Xin Jin
51
2
0
16 Aug 2024
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
Rafael Rafailov
Kyle Hatch
Anikait Singh
Laura Smith
Aviral Kumar
...
Victor Kolev
Philip J. Ball
Jiajun Wu
Chelsea Finn
Sergey Levine
OffRL
34
3
0
15 Aug 2024
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability
Jiri Hron
Laura J. Culp
Gamaleldin F. Elsayed
Rosanne Liu
Ben Adlam
...
T. Warkentin
Lechao Xiao
Kelvin Xu
Jasper Snoek
Simon Kornblith
45
1
0
14 Aug 2024
Do GPT Language Models Suffer From Split Personality Disorder? The Advent Of Substrate-Free Psychometrics
P. Romero
Stephen Fitz
T. Nakatsuma
25
10
0
14 Aug 2024
Connecting Dreams with Visual Brainstorming Instruction
Yasheng Sun
Bohan Li
Mingchen Zhuge
Deng-Ping Fan
Salman Khan
F. Khan
Hideki Koike
DiffM
42
0
0
14 Aug 2024
Can Large Language Models Reason? A Characterization via 3-SAT
Rishi Hazra
Gabriele Venturato
Pedro Zuidberg Dos Martires
Luc de Raedt
ELM
ReLM
LRM
35
4
0
13 Aug 2024