Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1905.05950
Cited By
v1
v2 (latest)
BERT Rediscovers the Classical NLP Pipeline
15 May 2019
Ian Tenney
Dipanjan Das
Ellie Pavlick
MILM
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT Rediscovers the Classical NLP Pipeline"
50 / 821 papers shown
Title
MicroCam: Leveraging Smartphone Microscope Camera for Context-Aware Contact Surface Sensing
Yongquan Hu
Hui-Shyong Yeo
Mingyue Yuan
Haoran Fan
Don Samitha Elvitigala
Wen Hu
Aaron Quigley
59
3
0
22 Jul 2024
Validating Mechanistic Interpretations: An Axiomatic Approach
Nils Palumbo
Ravi Mangal
Zifan Wang
Saranya Vijayakumar
Corina S. Pasareanu
Somesh Jha
106
1
0
18 Jul 2024
TokenSHAP: Interpreting Large Language Models with Monte Carlo Shapley Value Estimation
Roni Goldshmidt
Miriam Horovicz
LLMAG
58
14
0
14 Jul 2024
Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations
Matthias Lindemann
Alexander Koller
Ivan Titov
AI4CE
NAI
73
4
0
05 Jul 2024
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models
Daking Rai
Yilun Zhou
Shi Feng
Abulhair Saparov
Ziyu Yao
192
33
0
02 Jul 2024
IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons
Dan Shi
Renren Jin
Tianhao Shen
Weilong Dong
Xinwei Wu
Deyi Xiong
103
11
0
26 Jun 2024
Are there identifiable structural parts in the sentence embedding whole?
Vivi Nastase
Paola Merlo
65
3
0
24 Jun 2024
A Primal-Dual Framework for Transformers and Neural Networks
Tan M. Nguyen
Tam Nguyen
Nhat Ho
Andrea L. Bertozzi
Richard G. Baraniuk
Stanley J. Osher
ViT
70
14
0
19 Jun 2024
Elliptical Attention
Stefan K. Nielsen
Laziz U. Abdullaev
R. Teo
Tan M. Nguyen
87
4
0
19 Jun 2024
Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis
R. Teo
Tan M. Nguyen
91
4
0
19 Jun 2024
Who's asking? User personas and the mechanics of latent misalignment
Asma Ghandeharioun
Ann Yuan
Marius Guerard
Emily Reif
Michael A. Lepori
Lucas Dixon
LLMSV
98
8
0
17 Jun 2024
InternalInspector
I
2
I^2
I
2
: Robust Confidence Estimation in LLMs through Internal States
Mohammad Beigi
Ying Shen
Runing Yang
Zihao Lin
Qifan Wang
Ankith Mohan
Jianfeng He
Ming Jin
Chang-Tien Lu
Lifu Huang
HILM
78
10
0
17 Jun 2024
PrivacyRestore: Privacy-Preserving Inference in Large Language Models via Privacy Removal and Restoration
Huiping Zhuang
Jianwei Wang
Zhengdong Lu
Huiping Zhuang
Haoran Li
Huiping Zhuang
Cen Chen
RALM
KELM
127
8
0
03 Jun 2024
KGLink: A column type annotation method that combines knowledge graph and pre-trained language model
Yubo Wang
Hao Xin
Lei Chen
LMTD
112
3
0
01 Jun 2024
Towards a theory of how the structure of language is acquired by deep neural networks
Francesco Cagnetta
Matthieu Wyart
81
10
0
28 May 2024
Exploring Activation Patterns of Parameters in Language Models
Yudong Wang
Damai Dai
Zhifang Sui
54
2
0
28 May 2024
Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
Suraj Anand
Michael A. Lepori
Jack Merullo
Ellie Pavlick
CLL
122
8
0
28 May 2024
InversionView: A General-Purpose Method for Reading Information from Neural Activations
Xinting Huang
Madhur Panwar
Navin Goyal
Michael Hahn
98
5
0
27 May 2024
Adaptive Activation Steering: A Tuning-Free LLM Truthfulness Improvement Method for Diverse Hallucinations Categories
Tianlong Wang
Xianfeng Jiao
Yifan He
Zhongzhi Chen
Yinghao Zhu
Xu Chu
Junyi Gao
Yasha Wang
Liantao Ma
LLMSV
139
15
0
26 May 2024
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models
Peng Wang
Zexi Li
Ningyu Zhang
Ziwen Xu
Yunzhi Yao
Yong Jiang
Pengjun Xie
Fei Huang
Huajun Chen
KELM
CLL
124
34
0
23 May 2024
Multiple Realizability and the Rise of Deep Learning
Sam Whitman McGrath
Jacob Russin
AI4CE
95
2
0
21 May 2024
Do language models capture implied discourse meanings? An investigation with exhaustivity implicatures of Korean morphology
Hagyeong Shin
Sean Trott
57
0
0
15 May 2024
A Systematic Analysis on the Temporal Generalization of Language Models in Social Media
Asahi Ushio
Jose Camacho-Collados
48
0
0
15 May 2024
α
α
α
VIL: Learning to Leverage Auxiliary Tasks for Multitask Learning
Rafael Kourdis
Gabriel Gordon-Hall
P. Gorinski
42
0
0
13 May 2024
Natural Language Processing RELIES on Linguistics
Juri Opitz
Shira Wein
Nathan Schneider
AI4CE
165
8
0
09 May 2024
Interpretability Needs a New Paradigm
Andreas Madsen
Himabindu Lakkaraju
Siva Reddy
Sarath Chandar
72
3
0
08 May 2024
A Causal Explainable Guardrails for Large Language Models
Zhixuan Chu
Yan Wang
Longfei Li
Peng Kuang
Zhan Qin
Kui Ren
LLMSV
97
9
0
07 May 2024
What does the Knowledge Neuron Thesis Have to do with Knowledge?
Jingcheng Niu
Andrew Liu
Zining Zhu
Gerald Penn
115
38
0
03 May 2024
Analyzing Narrative Processing in Large Language Models (LLMs): Using GPT4 to test BERT
Patrick Krauss
Jannik Hösch
C. Metzner
Andreas K. Maier
Peter Uhrig
Achim Schilling
66
3
0
03 May 2024
SPAFIT: Stratified Progressive Adaptation Fine-tuning for Pre-trained Large Language Models
Samir Arora
Liangliang Wang
34
0
0
30 Apr 2024
Mechanistic Interpretability for AI Safety -- A Review
Leonard Bereska
E. Gavves
AI4CE
137
158
0
22 Apr 2024
Intrusion Detection at Scale with the Assistance of a Command-line Language Model
Jiongliang Lin
Yiwen Guo
Hao Chen
28
2
0
20 Apr 2024
Bridging Vision and Language Spaces with Assignment Prediction
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
VLM
97
7
0
15 Apr 2024
Large language models and linguistic intentionality
J. Grindrod
76
6
0
15 Apr 2024
A Large-Scale Evaluation of Speech Foundation Models
Shu-Wen Yang
Heng-Jui Chang
Zili Huang
Andy T. Liu
Cheng-I Jeff Lai
...
Kushal Lakhotia
Shang-Wen Li
Abdelrahman Mohamed
Shinji Watanabe
Hung-yi Lee
101
27
0
15 Apr 2024
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers
Longwei Zou
Qingyang Wang
Han Zhao
Jiangang Kong
Yi Yang
Yangdong Deng
98
0
0
10 Apr 2024
A Morphology-Based Investigation of Positional Encodings
Poulami Ghosh
Shikhar Vashishth
Raj Dabre
Pushpak Bhattacharyya
75
2
0
06 Apr 2024
SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers
Junghyun Koo
Gordon Wichern
François Germain
Sameer Khurana
Jonathan Le Roux
101
5
0
02 Apr 2024
Context Quality Matters in Training Fusion-in-Decoder for Extractive Open-Domain Question Answering
Kosuke Akimoto
Kunihiro Takeoka
Masafumi Oyamada
73
1
0
21 Mar 2024
Knowledge Conflicts for LLMs: A Survey
Rongwu Xu
Zehan Qi
Zhijiang Guo
Cunxiang Wang
Hongru Wang
Yue Zhang
Wei Xu
302
122
0
13 Mar 2024
How to Understand Named Entities: Using Common Sense for News Captioning
Ning Xu
Yanhui Wang
Tingting Zhang
Hongshuo Tian
Mohan Kankanhalli
An-An Liu
63
0
0
11 Mar 2024
Code-Mixed Probes Show How Pre-Trained Models Generalise On Code-Switched Text
Frances Adriana Laureano De Leon
Harish Tayyar Madabushi
Mark Lee
63
4
0
07 Mar 2024
EEE-QA: Exploring Effective and Efficient Question-Answer Representations
Zhanghao Hu
Yijun Yang
Junjie Xu
Yifu Qiu
Pinzhen Chen
72
0
0
04 Mar 2024
Topic Aware Probing: From Sentence Length Prediction to Idiom Identification how reliant are Neural Language Models on Topic?
Vasudevan Nedumpozhimana
John D. Kelleher
62
1
0
04 Mar 2024
Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps
Giuseppe Attanasio
Beatrice Savoldi
Dennis Fucci
Dirk Hovy
92
9
0
28 Feb 2024
RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations
Jing-ling Huang
Zhengxuan Wu
Christopher Potts
Mor Geva
Atticus Geiger
130
35
0
27 Feb 2024
What Do Language Models Hear? Probing for Auditory Representations in Language Models
Jerry Ngo
Yoon Kim
AuLLM
MILM
61
8
0
26 Feb 2024
The Hidden Space of Transformer Language Adapters
Jesujoba Oluwadara Alabi
Marius Mosbach
Matan Eyal
Dietrich Klakow
Mor Geva
109
10
1
20 Feb 2024
When Only Time Will Tell: Interpreting How Transformers Process Local Ambiguities Through the Lens of Restart-Incrementality
Brielen Madureira
Patrick Kahardipraja
David Schlangen
81
2
0
20 Feb 2024
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Zhiyuan Li
Hong Liu
Denny Zhou
Tengyu Ma
LRM
AI4CE
103
133
0
20 Feb 2024
Previous
1
2
3
4
5
6
...
15
16
17
Next