BERT Rediscovers the Classical NLP Pipeline

15 May 2019

Papers citing "BERT Rediscovers the Classical NLP Pipeline"

50 / 821 papers shown

MicroCam: Leveraging Smartphone Microscope Camera for Context-Aware Contact Surface Sensing Yongquan Hu Hui-Shyong Yeo Mingyue Yuan Haoran Fan Don Samitha Elvitigala Wen Hu Aaron Quigley 59 3 0 22 Jul 2024
Validating Mechanistic Interpretations: An Axiomatic Approach Nils Palumbo Ravi Mangal Zifan Wang Saranya Vijayakumar Corina S. Pasareanu Somesh Jha 106 1 0 18 Jul 2024
TokenSHAP: Interpreting Large Language Models with Monte Carlo Shapley Value Estimation Roni Goldshmidt Miriam Horovicz LLMAG 58 14 0 14 Jul 2024
Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations Matthias Lindemann Alexander Koller Ivan Titov AI4CE NAI 73 4 0 05 Jul 2024
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models Daking Rai Yilun Zhou Shi Feng Abulhair Saparov Ziyu Yao 192 33 0 02 Jul 2024
IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons Dan Shi Renren Jin Tianhao Shen Weilong Dong Xinwei Wu Deyi Xiong 103 11 0 26 Jun 2024
Are there identifiable structural parts in the sentence embedding whole? Vivi Nastase Paola Merlo 65 3 0 24 Jun 2024
A Primal-Dual Framework for Transformers and Neural Networks Tan M. Nguyen Tam Nguyen Nhat Ho Andrea L. Bertozzi Richard G. Baraniuk Stanley J. Osher ViT 70 14 0 19 Jun 2024
Elliptical Attention Stefan K. Nielsen Laziz U. Abdullaev R. Teo Tan M. Nguyen 87 4 0 19 Jun 2024
Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis R. Teo Tan M. Nguyen 91 4 0 19 Jun 2024
Who's asking? User personas and the mechanics of latent misalignment Asma Ghandeharioun Ann Yuan Marius Guerard Emily Reif Michael A. Lepori Lucas Dixon LLMSV 98 8 0 17 Jun 2024
InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States Mohammad Beigi Ying Shen Runing Yang Zihao Lin Qifan Wang Ankith Mohan Jianfeng He Ming Jin Chang-Tien Lu Lifu Huang HILM 78 10 0 17 Jun 2024
PrivacyRestore: Privacy-Preserving Inference in Large Language Models via Privacy Removal and Restoration Huiping Zhuang Jianwei Wang Zhengdong Lu Huiping Zhuang Haoran Li Huiping Zhuang Cen Chen RALM KELM 127 8 0 03 Jun 2024
KGLink: A column type annotation method that combines knowledge graph and pre-trained language model Yubo Wang Hao Xin Lei Chen LMTD 112 3 0 01 Jun 2024
Towards a theory of how the structure of language is acquired by deep neural networks Francesco Cagnetta Matthieu Wyart 81 10 0 28 May 2024
Exploring Activation Patterns of Parameters in Language Models Yudong Wang Damai Dai Zhifang Sui 54 2 0 28 May 2024
Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting Suraj Anand Michael A. Lepori Jack Merullo Ellie Pavlick CLL 122 8 0 28 May 2024
InversionView: A General-Purpose Method for Reading Information from Neural Activations Xinting Huang Madhur Panwar Navin Goyal Michael Hahn 98 5 0 27 May 2024
Adaptive Activation Steering: A Tuning-Free LLM Truthfulness Improvement Method for Diverse Hallucinations Categories Tianlong Wang Xianfeng Jiao Yifan He Zhongzhi Chen Yinghao Zhu Xu Chu Junyi Gao Yasha Wang Liantao Ma LLMSV 139 15 0 26 May 2024
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models Peng Wang Zexi Li Ningyu Zhang Ziwen Xu Yunzhi Yao Yong Jiang Pengjun Xie Fei Huang Huajun Chen KELM CLL 124 34 0 23 May 2024
Multiple Realizability and the Rise of Deep Learning Sam Whitman McGrath Jacob Russin AI4CE 95 2 0 21 May 2024
Do language models capture implied discourse meanings? An investigation with exhaustivity implicatures of Korean morphology Hagyeong Shin Sean Trott 57 0 0 15 May 2024
A Systematic Analysis on the Temporal Generalization of Language Models in Social Media Asahi Ushio Jose Camacho-Collados 48 0 0 15 May 2024
αVIL: Learning to Leverage Auxiliary Tasks for Multitask Learning Rafael Kourdis Gabriel Gordon-Hall P. Gorinski 42 0 0 13 May 2024
Natural Language Processing RELIES on Linguistics Juri Opitz Shira Wein Nathan Schneider AI4CE 165 8 0 09 May 2024
Interpretability Needs a New Paradigm Andreas Madsen Himabindu Lakkaraju Siva Reddy Sarath Chandar 72 3 0 08 May 2024
A Causal Explainable Guardrails for Large Language Models Zhixuan Chu Yan Wang Longfei Li Peng Kuang Zhan Qin Kui Ren LLMSV 97 9 0 07 May 2024
What does the Knowledge Neuron Thesis Have to do with Knowledge? Jingcheng Niu Andrew Liu Zining Zhu Gerald Penn 115 38 0 03 May 2024
Analyzing Narrative Processing in Large Language Models (LLMs): Using GPT4 to test BERT Patrick Krauss Jannik Hösch C. Metzner Andreas K. Maier Peter Uhrig Achim Schilling 66 3 0 03 May 2024
SPAFIT: Stratified Progressive Adaptation Fine-tuning for Pre-trained Large Language Models Samir Arora Liangliang Wang 34 0 0 30 Apr 2024
Mechanistic Interpretability for AI Safety -- A Review Leonard Bereska E. Gavves AI4CE 137 158 0 22 Apr 2024
Intrusion Detection at Scale with the Assistance of a Command-line Language Model Jiongliang Lin Yiwen Guo Hao Chen 28 2 0 20 Apr 2024
Bridging Vision and Language Spaces with Assignment Prediction Jungin Park Jiyoung Lee Kwanghoon Sohn VLM 97 7 0 15 Apr 2024
Large language models and linguistic intentionality J. Grindrod 76 6 0 15 Apr 2024
A Large-Scale Evaluation of Speech Foundation Models Shu-Wen Yang Heng-Jui Chang Zili Huang Andy T. Liu Cheng-I Jeff Lai ... Kushal Lakhotia Shang-Wen Li Abdelrahman Mohamed Shinji Watanabe Hung-yi Lee 101 27 0 15 Apr 2024
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers Longwei Zou Qingyang Wang Han Zhao Jiangang Kong Yi Yang Yangdong Deng 98 0 0 10 Apr 2024
A Morphology-Based Investigation of Positional Encodings Poulami Ghosh Shikhar Vashishth Raj Dabre Pushpak Bhattacharyya 75 2 0 06 Apr 2024
SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers Junghyun Koo Gordon Wichern François Germain Sameer Khurana Jonathan Le Roux 101 5 0 02 Apr 2024
Context Quality Matters in Training Fusion-in-Decoder for Extractive Open-Domain Question Answering Kosuke Akimoto Kunihiro Takeoka Masafumi Oyamada 73 1 0 21 Mar 2024
Knowledge Conflicts for LLMs: A Survey Rongwu Xu Zehan Qi Zhijiang Guo Cunxiang Wang Hongru Wang Yue Zhang Wei Xu 302 122 0 13 Mar 2024
How to Understand Named Entities: Using Common Sense for News Captioning Ning Xu Yanhui Wang Tingting Zhang Hongshuo Tian Mohan Kankanhalli An-An Liu 63 0 0 11 Mar 2024
Code-Mixed Probes Show How Pre-Trained Models Generalise On Code-Switched Text Frances Adriana Laureano De Leon Harish Tayyar Madabushi Mark Lee 63 4 0 07 Mar 2024
EEE-QA: Exploring Effective and Efficient Question-Answer Representations Zhanghao Hu Yijun Yang Junjie Xu Yifu Qiu Pinzhen Chen 72 0 0 04 Mar 2024
Topic Aware Probing: From Sentence Length Prediction to Idiom Identification how reliant are Neural Language Models on Topic? Vasudevan Nedumpozhimana John D. Kelleher 62 1 0 04 Mar 2024
Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps Giuseppe Attanasio Beatrice Savoldi Dennis Fucci Dirk Hovy 92 9 0 28 Feb 2024
RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations Jing-ling Huang Zhengxuan Wu Christopher Potts Mor Geva Atticus Geiger 130 35 0 27 Feb 2024
What Do Language Models Hear? Probing for Auditory Representations in Language Models Jerry Ngo Yoon Kim AuLLM MILM 61 8 0 26 Feb 2024
The Hidden Space of Transformer Language Adapters Jesujoba Oluwadara Alabi Marius Mosbach Matan Eyal Dietrich Klakow Mor Geva 109 10 1 20 Feb 2024
When Only Time Will Tell: Interpreting How Transformers Process Local Ambiguities Through the Lens of Restart-Incrementality Brielen Madureira Patrick Kahardipraja David Schlangen 81 2 0 20 Feb 2024
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems Zhiyuan Li Hong Liu Denny Zhou Tengyu Ma LRM AI4CE 103 133 0 20 Feb 2024