2310.02207
Language Models Represent Space and Time
3 October 2023
Wes Gurnee
Max Tegmark
Papers citing
"Language Models Represent Space and Time"
50 / 119 papers shown
Title
Active Testing of Large Language Model via Multi-Stage Sampling
Yuheng Huang
Jiayang Song
Qiang Hu
Felix Juefei-Xu
Lei Ma
29
2
0
07 Aug 2024
Logistic Regression makes small LLMs strong and explainable "tens-of-shot" classifiers
Marcus Buckmann
Edward Hill
37
2
0
06 Aug 2024
Cluster-norm for Unsupervised Probing of Knowledge
Walter Laurito
Sharan Maiya
Grégoire Dhimoïla
Owen Yeung
Kaarel Hänni
31
2
0
26 Jul 2024
Compositional Structures in Neural Embedding and Interaction Decompositions
Matthew Trager
Alessandro Achille
Pramuditha Perera
L. Zancato
Stefano Soatto
CoGe
32
0
0
12 Jul 2024
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models
Daking Rai
Yilun Zhou
Shi Feng
Abulhair Saparov
Ziyu Yao
82
19
0
02 Jul 2024
Monitoring Latent World States in Language Models with Propositional Probes
Jiahai Feng
Stuart Russell
Jacob Steinhardt
HILM
46
6
0
27 Jun 2024
The Remarkable Robustness of LLMs: Stages of Inference?
Vedang Lad
Wes Gurnee
Max Tegmark
38
33
0
27 Jun 2024
Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space
Core Francisco Park
Maya Okawa
Andrew Lee
Ekdeep Singh Lubana
Hidenori Tanaka
62
7
0
27 Jun 2024
STBench: Assessing the Ability of Large Language Models in Spatio-Temporal Analysis
Wenbin Li
Di Yao
Ruibo Zhao
Wenjie Chen
Zijie Xu
Chengxue Luo
Chang Gong
Quanliang Jing
Haining Tan
Jingping Bi
42
3
0
27 Jun 2024
Towards Open-World Grasping with Large Vision-Language Models
Georgios Tziafas
H. Kasaei
LM&Ro
LRM
34
12
0
26 Jun 2024
Brittle Minds, Fixable Activations: Understanding Belief Representations in Language Models
Matteo Bortoletto
Constantin Ruhdorfer
Lei Shi
Andreas Bulling
AI4MH
LRM
46
5
0
25 Jun 2024
CityGPT: Empowering Urban Spatial Cognition of Large Language Models
Jie Feng
Yuwei Du
Tianhui Liu
Siqi Guo
Yuming Lin
Yong Li
45
13
0
20 Jun 2024
CityBench: Evaluating the Capabilities of Large Language Model as World Model
Jie Feng
Jun Zhang
Junbo Yan
Xin Zhang
Tianjian Ouyang
Tianhui Liu
Yuwei Du
Siqi Guo
Yong Li
ELM
56
0
0
20 Jun 2024
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
Bahare Fatemi
Mehran Kazemi
Anton Tsitsulin
Karishma Malkan
Jinyeong Yim
John Palowitch
Sungyong Seo
Jonathan J. Halcrow
Bryan Perozzi
LRM
43
26
0
13 Jun 2024
Legend: Leveraging Representation Engineering to Annotate Safety Margin for Preference Datasets
Duanyu Feng
Bowen Qin
Chen Huang
Youcheng Huang
Zheng-Wei Zhang
Wenqiang Lei
44
2
0
12 Jun 2024
AI Sandbagging: Language Models can Strategically Underperform on Evaluations
Teun van der Weij
Felix Hofstätter
Ollie Jaffe
Samuel F. Brown
Francis Rhys Ward
ELM
45
23
0
11 Jun 2024
Feature contamination: Neural networks learn uncorrelated features and fail to generalize
Tianren Zhang
Chujie Zhao
Guanyu Chen
Yizhou Jiang
Feng Chen
OOD
MLT
OODD
77
3
0
05 Jun 2024
The Geometry of Categorical and Hierarchical Concepts in Large Language Models
Kiho Park
Yo Joong Choe
Yibo Jiang
Victor Veitch
50
26
0
03 Jun 2024
The Embodied World Model Based on LLM with Visual Information and Prediction-Oriented Prompts
Wakana Haijima
Kou Nakakubo
Masahiro Suzuki
Yutaka Matsuo
28
1
0
02 Jun 2024
From Neurons to Neutrons: A Case Study in Interpretability
O. Kitouni
Niklas Nolte
Víctor Samuel Pérez-Díaz
S. Trifinopoulos
Mike Williams
MILM
19
1
0
27 May 2024
Survival of the Fittest Representation: A Case Study with Modular Addition
Xiaoman Delores Ding
Zifan Carl Guo
Eric J. Michaud
Ziming Liu
Max Tegmark
48
3
0
27 May 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
65
75
0
27 May 2024
Phase Transitions in the Output Distribution of Large Language Models
Julian Arnold
Flemming Holtorf
Frank Schafer
Niels Lörch
41
1
0
27 May 2024
No Two Devils Alike: Unveiling Distinct Mechanisms of Fine-tuning Attacks
Chak Tou Leong
Yi Cheng
Kaishuai Xu
Jian Wang
Hanlin Wang
Wenjie Li
AAML
51
17
0
25 May 2024
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models
Bernal Jiménez Gutiérrez
Yiheng Shu
Yu Gu
Michihiro Yasunaga
Yu-Chuan Su
RALM
CLL
68
33
0
23 May 2024
How Do Transformers "Do" Physics? Investigating the Simple Harmonic Oscillator
Subhash Kantamneni
Ziming Liu
Max Tegmark
14
2
0
23 May 2024
A Causal Explainable Guardrails for Large Language Models
Zhixuan Chu
Yan Wang
Longfei Li
Zhibo Wang
Zhan Qin
Kui Ren
LLMSV
49
7
0
07 May 2024
More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model Trustworthiness
Aaron Jiaxun Li
Satyapriya Krishna
Himabindu Lakkaraju
33
3
0
29 Apr 2024
PatentGPT: A Large Language Model for Intellectual Property
Zilong Bai
Ruiji Zhang
Linqing Chen
Qijun Cai
Yuan Zhong
...
Fu Bian
Xiaolong Gu
Lisha Zhang
Weilei Wang
Changyang Tu
43
4
0
28 Apr 2024
Evaluation of Geographical Distortions in Language Models: A Crucial Step Towards Equitable Representations
R. Decoupes
R. Interdonato
Mathieu Roche
M. Teisseire
S. Valentin
24
1
0
26 Apr 2024
Mechanistic Interpretability for AI Safety -- A Review
Leonard Bereska
E. Gavves
AI4CE
40
112
0
22 Apr 2024
Reflectance Estimation for Proximity Sensing by Vision-Language Models: Utilizing Distributional Semantics for Low-Level Cognition in Robotics
Masashi Osada
G. A. G. Ricardez
Yosuke Suzuki
Tadahiro Taniguchi
26
2
0
11 Apr 2024
Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?
Mingyu Jin
Qinkai Yu
Jingyuan Huang
Qingcheng Zeng
Zhenting Wang
...
Yanda Meng
Kaize Ding
Fan Yang
Jundong Li
Yongfeng Zhang
50
0
0
10 Apr 2024
Exploring the True Potential: Evaluating the Black-box Optimization Capability of Large Language Models
Beichen Huang
Xingyu Wu
Yu Zhou
Jibin Wu
Liang Feng
Ran Cheng
Kay Chen Tan
55
12
0
09 Apr 2024
Where to Move Next: Zero-shot Generalization of LLMs for Next POI Recommendation
Shanshan Feng
Haoming Lyu
Caishun Chen
Y. Ong
LRM
38
8
0
02 Apr 2024
The Unreasonable Ineffectiveness of the Deeper Layers
Andrey Gromov
Kushal Tirumala
Hassan Shapourian
Paolo Glorioso
Daniel A. Roberts
43
79
0
26 Mar 2024
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?
Qinyu Zhao
Ming Xu
Kartik Gupta
Akshay Asthana
Liang Zheng
Stephen Gould
29
7
0
14 Mar 2024
The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models
Carlo Nicolini
Jacopo Staiano
Bruno Lepri
Raffaele Marino
MoE
26
1
0
13 Mar 2024
Language Models Represent Beliefs of Self and Others
Wentao Zhu
Zhining Zhang
Yizhou Wang
MILM
LRM
44
8
0
28 Feb 2024
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
56
17
0
28 Feb 2024
Eight Methods to Evaluate Robust Unlearning in LLMs
Aengus Lynch
Phillip Guo
Aidan Ewart
Stephen Casper
Dylan Hadfield-Menell
ELM
MU
40
57
0
26 Feb 2024
Robust agents learn causal world models
Jonathan G. Richens
Tom Everitt
OOD
119
36
0
16 Feb 2024
Opening the AI black box: program synthesis via mechanistic interpretability
Eric J. Michaud
Isaac Liao
Vedang Lad
Ziming Liu
Anish Mudide
Chloe Loughridge
Zifan Carl Guo
Tara Rezaei Kheirkhah
Mateja Vukelić
Max Tegmark
23
12
0
07 Feb 2024
Position: Stop Making Unscientific AGI Performance Claims
Patrick Altmeyer
Andrew M. Demetriou
Antony Bartlett
Cynthia C. S. Liem
29
3
0
06 Feb 2024
Large Language Models for Time Series: A Survey
Xiyuan Zhang
Ranak Roy Chowdhury
Rajesh K. Gupta
Jingbo Shang
AI4TS
82
55
0
02 Feb 2024
LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law
Toni J. B. Liu
Nicolas Boullé
Raphael Sarfati
Christopher Earls
AI4TS
25
12
0
01 Feb 2024
Towards Urban General Intelligence: A Review and Outlook of Urban Foundation Models
Weijiao Zhang
Jindong Han
Zhao Xu
Hang Ni
Hao Liu
Hui Xiong
AI4CE
77
15
0
30 Jan 2024
Black-Box Access is Insufficient for Rigorous AI Audits
Stephen Casper
Carson Ezell
Charlotte Siegmann
Noam Kolt
Taylor Lynn Curtis
...
Michael Gerovitch
David Bau
Max Tegmark
David M. Krueger
Dylan Hadfield-Menell
AAML
34
78
0
25 Jan 2024
Universal Neurons in GPT2 Language Models
Wes Gurnee
Theo Horsley
Zifan Carl Guo
Tara Rezaei Kheirkhah
Qinyi Sun
Will Hathaway
Neel Nanda
Dimitris Bertsimas
MILM
99
37
0
22 Jan 2024
Has Your Pretrained Model Improved? A Multi-head Posterior Based Approach
Prince Osei Aboagye
Yan Zheng
Junpeng Wang
Uday Singh Saini
Xin Dai
...
Yujie Fan
Zhongfang Zhuang
Shubham Jain
Liang Wang
Wei Zhang
24
0
0
02 Jan 2024