Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1905.05950
Cited By
v1
v2 (latest)
BERT Rediscovers the Classical NLP Pipeline
15 May 2019
Ian Tenney
Dipanjan Das
Ellie Pavlick
MILM
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT Rediscovers the Classical NLP Pipeline"
50 / 821 papers shown
Title
Mechanisms vs. Outcomes: Probing for Syntax Fails to Explain Performance on Targeted Syntactic Evaluations
Ananth Agarwal
Jasper Jian
Christopher D. Manning
Shikhar Murty
17
0
0
20 Jun 2025
Predicting New Research Directions in Materials Science using Large Language Models and Concept Graphs
Thomas Marwitz
Alexander Colsmann
Ben Breitung
Christoph Brabec
Christoph Kirchlechner
...
Michael Hirtz
Pavel A. Levkin
Yolita M. Eggeler
Tobias Schlöder
Pascal Friederich
AI4CE
31
0
0
20 Jun 2025
Large Language Models as Psychological Simulators: A Methodological Guide
Zhicheng Lin
LLMAG
30
1
0
20 Jun 2025
Under the Shadow of Babel: How Language Shapes Reasoning in LLMs
Chenxi Wang
Y. Zhang
Lang Gao
Zixiang Xu
Zirui Song
Yanbo Wang
Xiuying Chen
17
0
0
19 Jun 2025
Knee-Deep in C-RASP: A Transformer Depth Hierarchy
Andy Yang
Michaël Cadilhac
David Chiang
20
0
0
19 Jun 2025
Targeted Lexical Injection: Unlocking Latent Cross-Lingual Alignment in Lugha-Llama via Early-Layer LoRA Fine-Tuning
Stanley Ngugi
35
0
0
18 Jun 2025
Detecting High-Stakes Interactions with Activation Probes
Alex McKenzie
Urja Pawar
Phil Blandfort
William Bankes
David M. Krueger
Ekdeep Singh Lubana
Dmitrii Krasheninnikov
147
0
0
12 Jun 2025
Beyond Benchmarks: A Novel Framework for Domain-Specific LLM Evaluation and Knowledge Mapping
Nitin Sharma
Thomas Wolfers
Çağatay Yıldız
ALM
24
0
0
09 Jun 2025
PrunePEFT: Iterative Hybrid Pruning for Parameter-Efficient Fine-tuning of LLMs
Tongzhou Yu
Zhuhao Zhang
Guanghui Zhu
Shen Jiang
Meikang Qiu
Yihua Huang
36
0
0
09 Jun 2025
Behavioural vs. Representational Systematicity in End-to-End Models: An Opinionated Survey
Ivan Vegner
Sydelle de Souza
Valentin Forch
Martha Lewis
Leonidas A.A. Doumas
48
0
0
04 Jun 2025
Mechanistic Decomposition of Sentence Representations
Matthieu Tehenan
Vikram Natarajan
Jonathan Michala
Milton Lin
Juri Opitz
31
0
0
04 Jun 2025
Adaptive Task Vectors for Large Language Models
Joonseong Kang
Soojeong Lee
Subeen Park
Sumin Park
Taero Kim
Jihee Kim
Ryunyi Lee
Kyungwoo Song
29
0
0
03 Jun 2025
Model Internal Sleuthing: Finding Lexical Identity and Inflectional Morphology in Modern Language Models
Michael Li
Nishant Subramani
KELM
17
0
0
02 Jun 2025
Child-Directed Language Does Not Consistently Boost Syntax Learning in Language Models
Francesca Padovani
Jaap Jumelet
Yevgen Matusevych
Arianna Bisazza
ALM
48
1
0
29 May 2025
Do Large Language Models Think Like the Brain? Sentence-Level Evidence from fMRI and Hierarchical Embeddings
Yu-Zhou Lei
Xingyang Ge
Yi Zhang
Yiming Yang
Bolei Ma
42
0
0
28 May 2025
SAEs Are Good for Steering -- If You Select the Right Features
Dana Arad
Aaron Mueller
Yonatan Belinkov
LLMSV
55
0
0
26 May 2025
Grokking ExPLAIND: Unifying Model, Data, and Training Attribution to Study Model Behavior
Florian Eichin
Yupei Du
Philipp Mondorf
Barbara Plank
Michael A. Hedderich
FAtt
152
0
0
26 May 2025
Anchored Diffusion Language Model
Litu Rout
Constantine Caramanis
Sanjay Shakkottai
72
0
0
24 May 2025
Multi-Scale Probabilistic Generation Theory: A Hierarchical Framework for Interpreting Large Language Models
Yukin Zhang
Qi Dong
114
0
0
23 May 2025
Mitigating Hallucinations via Inter-Layer Consistency Aggregation in Large Vision-Language Models
Kai Tang
Jinhao You
Xiuqi Ge
Hanze Li
Yichen Guo
Xiande Huang
MLLM
173
0
0
18 May 2025
K
K
K
-MSHC: Unmasking Minimally Sufficient Head Circuits in Large Language Models with Experiments on Syntactic Classification Tasks
Pratim Chowdhary
Peter Chin
Deepernab Chakrabarty
112
0
0
18 May 2025
Designing and Contextualising Probes for African Languages
Wisdom Aduah
Francois Meyer
119
0
0
15 May 2025
Attention on Multiword Expressions: A Multilingual Study of BERT-based Models with Regard to Idiomaticity and Microsyntax
Iuliia Zaitova
Vitalii Hirak
Badr M. Abdullah
Dietrich Klakow
Bernd Möbius
T. Avgustinova
120
0
0
09 May 2025
Efficient Shapley Value-based Non-Uniform Pruning of Large Language Models
Chuan Sun
Han Yu
Lizhen Cui
Xiaoxiao Li
446
3
0
03 May 2025
Bi-directional Model Cascading with Proxy Confidence
David Warren
Mark Dras
83
0
0
27 Apr 2025
Exploring Compositional Generalization (in ReCOGS_pos) by Transformers using Restricted Access Sequence Processing (RASP)
William Bruns
108
0
0
21 Apr 2025
Deep Learning with Pretrained Ínternal World' Layers: A Gemma 3-Based Modular Architecture for Wildfire Prediction
Ayoub Jadouli
Chaker El Amrani
KELM
AI4TS
148
0
0
20 Apr 2025
Signatures of human-like processing in Transformer forward passes
Jennifer Hu
Michael A. Lepori
Michael Franke
AI4CE
433
0
0
18 Apr 2025
Linguistic Interpretability of Transformer-based Language Models: a systematic review
Miguel López-Otal
Jorge Gracia
Jordi Bernad
Carlos Bobed
Lucía Pitarch-Ballesteros
Emma Anglés-Herrero
VLM
108
1
0
09 Apr 2025
The Zero Body Problem: Probing LLM Use of Sensory Language
Rebecca M. M. Hicke
Sil Hamilton
David M. Mimno
92
0
0
08 Apr 2025
Few Dimensions are Enough: Fine-tuning BERT with Selected Dimensions Revealed Its Redundant Nature
Shion Fukuhata
Yoshinobu Kano
122
0
0
07 Apr 2025
On the Effectiveness and Generalization of Race Representations for Debiasing High-Stakes Decisions
Dang Nguyen
Chenhao Tan
76
1
0
07 Apr 2025
Layers at Similar Depths Generate Similar Activations Across LLM Architectures
Christopher Wolfram
Aaron Schein
100
2
0
03 Apr 2025
Language Models at the Syntax-Semantics Interface: A Case Study of the Long-Distance Binding of Chinese Reflexive ziji
Xiulin Yang
112
1
0
02 Apr 2025
BiPVL-Seg: Bidirectional Progressive Vision-Language Fusion with Global-Local Alignment for Medical Image Segmentation
Rafi Ibn Sultan
Hui Zhu
Chengyin Li
Dongxiao Zhu
94
0
0
30 Mar 2025
Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models
Zhanke Zhou
Zhaocheng Zhu
Xuan Li
Mikhail Galkin
Xiao Feng
Sanmi Koyejo
Jian Tang
Bo Han
LRM
173
6
0
28 Mar 2025
Efficient Knowledge Distillation via Curriculum Extraction
Shivam Gupta
Sushrut Karmalkar
114
0
0
21 Mar 2025
Aligned Probing: Relating Toxic Behavior and Model Internals
Andreas Waldis
Vagrant Gautam
Anne Lauscher
Dietrich Klakow
Iryna Gurevych
72
1
0
17 Mar 2025
Using the Tools of Cognitive Science to Understand Large Language Models at Different Levels of Analysis
Alexander Ku
Declan Campbell
Xuechunzi Bai
Jiayi Geng
Ryan Liu
...
Ilia Sucholutsky
Veniamin Veselovsky
Liyi Zhang
Jian-Qiao Zhu
Thomas L. Griffiths
ELM
154
4
0
17 Mar 2025
Efficient Safety Alignment of Large Language Models via Preference Re-ranking and Representation-based Reward Modeling
Qiyuan Deng
X. Bai
Kehai Chen
Yaowei Wang
Liqiang Nie
Min Zhang
OffRL
121
0
0
13 Mar 2025
Odysseus Navigates the Sirens' Song: Dynamic Focus Decoding for Factual and Diverse Open-Ended Text Generation
Wen Luo
Feifan Song
Wei Li
Guangyue Peng
Shaohang Wei
Houfeng Wang
AI4CE
96
0
0
11 Mar 2025
Syntactic Learnability of Echo State Neural Language Models at Scale
Ryo Ueda
Tatsuki Kuribayashi
Shunsuke Kando
Kentaro Inui
97
0
0
03 Mar 2025
Transformer Meets Twicing: Harnessing Unattended Residual Information
Laziz U. Abdullaev
Tan M. Nguyen
144
3
0
02 Mar 2025
A Close Look at Decomposition-based XAI-Methods for Transformer Language Models
L. Arras
Bruno Puri
Patrick Kahardipraja
Sebastian Lapuschkin
Wojciech Samek
98
3
0
21 Feb 2025
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
KELM
3DV
190
2
0
21 Feb 2025
Mechanistic Interpretability of Emotion Inference in Large Language Models
Ala Nekouvaght Tak
Amin Banayeeanzade
Anahita Bolourani
Mina Kian
Robin Jia
Jonathan Gratch
110
0
0
08 Feb 2025
Large Language Models Are Human-Like Internally
Tatsuki Kuribayashi
Yohei Oseki
Souhaib Ben Taieb
Kentaro Inui
Timothy Baldwin
187
5
0
03 Feb 2025
Discovering Chunks in Neural Embeddings for Interpretability
Shuchen Wu
Stephan Alaniz
Eric Schulz
Zeynep Akata
103
0
0
03 Feb 2025
Efficient Language Modeling for Low-Resource Settings with Hybrid RNN-Transformer Architectures
Gabriel Lindenmaier
Sean Papay
Sebastian Padó
143
0
0
02 Feb 2025
FinchGPT: a Transformer based language model for birdsong analysis
Kosei Kobayashi
Kosuke Matsuzaki
Masaya Taniguchi
Keisuke Sakaguchi
Kentaro Inui
Kentaro Abe
99
1
0
01 Feb 2025
1
2
3
4
...
15
16
17
Next