v1v2 (latest)

BERT Rediscovers the Classical NLP Pipeline

15 May 2019

Papers citing "BERT Rediscovers the Classical NLP Pipeline"

50 / 821 papers shown

Title
The Architectural Bottleneck Principle Tiago Pimentel Josef Valvoda Niklas Stoehr Ryan Cotterell 54 5 0 11 Nov 2022
A Comprehensive Survey of Transformers for Computer Vision Sonain Jamil Md. Jalil Piran Oh-Jin Kwon ViT 78 54 0 11 Nov 2022
SocioProbe: What, When, and Where Language Models Learn about Sociodemographics Anne Lauscher Federico Bianchi Samuel R. Bowman Dirk Hovy 91 7 0 08 Nov 2022
Third-Party Aligner for Neural Word Alignments Jinpeng Zhang C. Dong Xiangyu Duan Yuqi Zhang Hao Fei 67 0 0 08 Nov 2022
COPEN: Probing Conceptual Knowledge in Pre-trained Language Models Hao Peng Xiaozhi Wang Shengding Hu Hailong Jin Lei Hou Juanzi Li Zhiyuan Liu Qun Liu 89 25 0 08 Nov 2022
Logographic Information Aids Learning Better Representations for Natural Language Inference Zijian Jin Duygu Ataman 58 1 0 03 Nov 2022
BECTRA: Transducer-based End-to-End ASR with BERT-Enhanced Encoder Yosuke Higuchi Tetsuji Ogawa Tetsunori Kobayashi Shinji Watanabe 169 13 0 02 Nov 2022
A Law of Data Separation in Deep Learning Hangfeng He Weijie J. Su OOD 105 42 0 31 Oct 2022
BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model Yosuke Higuchi Brian Yan Siddhant Arora Tetsuji Ogawa Tetsunori Kobayashi Shinji Watanabe 118 26 0 29 Oct 2022
Debiasing Masks: A New Framework for Shortcut Mitigation in NLU Johannes Mario Meissner Saku Sugawara Akiko Aizawa AAML 61 16 0 28 Oct 2022
Controlled Text Reduction Aviv Slobodkin Paul Roit Eran Hirsch Ori Ernst Ido Dagan 73 10 0 24 Oct 2022
Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task Kenneth Li Aspen K. Hopkins David Bau Fernanda Viégas Hanspeter Pfister Martin Wattenberg MILM 180 297 0 24 Oct 2022
Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs Maarten Sap Ronan Le Bras Daniel Fried Yejin Choi 101 232 0 24 Oct 2022
Structural generalization is hard for sequence-to-sequence models Yuekun Yao Alexander Koller 88 22 0 24 Oct 2022
On the Transformation of Latent Space in Fine-Tuned NLP Models Nadir Durrani Hassan Sajjad Fahim Dalvi Firoj Alam 128 19 0 23 Oct 2022
EntityCS: Improving Zero-Shot Cross-lingual Transfer with Entity-Centric Code Switching Chenxi Whitehouse Fenia Christopoulou Ignacio Iacobacci 108 9 0 22 Oct 2022
What do Large Language Models Learn beyond Language? Avinash Madasu Shashank Srivastava LRM AI4CE 73 5 0 21 Oct 2022
Probing with Noise: Unpicking the Warp and Weft of Embeddings Filip Klubicka John D. Kelleher 68 4 0 21 Oct 2022
Spectral Probing Max Müller-Eberstein Rob van der Goot Barbara Plank 52 2 0 21 Oct 2022
Syntax-guided Localized Self-attention by Constituency Syntactic Distance Shengyuan Hou Jushi Kai Haotian Xue Bingyu Zhu Bo Yuan Longtao Huang Xinbing Wang Zhouhan Lin 17 4 0 21 Oct 2022
SLING: Sino Linguistic Evaluation of Large Language Models Yixiao Song Kalpesh Krishna R. Bhatt Mohit Iyyer 83 10 0 21 Oct 2022
Evidence > Intuition: Transferability Estimation for Encoder Selection Elisa Bassignana Max Müller-Eberstein Mike Zhang Barbara Plank 65 8 0 20 Oct 2022
Enhancing Out-of-Distribution Detection in Natural Language Understanding via Implicit Layer Ensemble Hyunsoo Cho Choonghyun Park Jaewoo Kang Kang Min Yoo Taeuk Kim Sang-goo Lee OODD 119 8 0 20 Oct 2022
Automatic Document Selection for Efficient Encoder Pretraining Yukun Feng Patrick Xia Benjamin Van Durme João Sedoc 114 11 0 20 Oct 2022
Transformers Learn Shortcuts to Automata Bingbin Liu Jordan T. Ash Surbhi Goel A. Krishnamurthy Cyril Zhang OffRL LRM 161 178 0 19 Oct 2022
Hidden State Variability of Pretrained Language Models Can Guide Computation Reduction for Transfer Learning Shuo Xie Jiahao Qiu Ankita Pasad Li Du Qing Qu Hongyuan Mei 87 16 0 18 Oct 2022
Post-hoc analysis of Arabic transformer models Ahmed Abdelali Nadir Durrani Fahim Dalvi Hassan Sajjad 43 1 0 18 Oct 2022
Predicting Fine-Tuning Performance with Probing Zining Zhu Soroosh Shahtalebi Frank Rudzicz 64 10 0 13 Oct 2022
On the Explainability of Natural Language Processing Deep Models Julia El Zini M. Awad 65 88 0 13 Oct 2022
Empowering the Fact-checkers! Automatic Identification of Claim Spans on Twitter Megha Sundriyal Atharva Kulkarni Vaibhav Pulastya Md. Shad Akhtar Tanmoy Chakraborty MedIm 71 19 0 10 Oct 2022
Breaking BERT: Evaluating and Optimizing Sparsified Attention Siddhartha Brahma Polina Zablotskaia David M. Mimno 37 1 0 07 Oct 2022
Probing of Quantitative Values in Abstractive Summarization Models Nathan M. White 76 0 0 03 Oct 2022
Downstream Datasets Make Surprisingly Good Pretraining Corpora Kundan Krishna Saurabh Garg Jeffrey P. Bigham Zachary Chase Lipton 108 33 0 28 Sep 2022
Causal Proxy Models for Concept-Based Model Explanations Zhengxuan Wu Karel DÓosterlinck Atticus Geiger Amir Zur Christopher Potts MILM 132 37 0 28 Sep 2022
Fast-FNet: Accelerating Transformer Encoder Models via Efficient Fourier Layers Nurullah Sevim Ege Ozan Özyedek Furkan Şahinuç Aykut Koç 95 12 0 26 Sep 2022
ImmunoLingo: Linguistics-based formalization of the antibody language Mai Ha Vu Philippe A. Robert Rahmad Akbar B. Swiatczak G. K. Sandve Dag Trygve Tryslew Haug Victor Greiff AI4CE 103 8 0 26 Sep 2022
Towards Faithful Model Explanation in NLP: A Survey Qing Lyu Marianna Apidianaki Chris Callison-Burch XAI 237 121 0 22 Sep 2022
Unsupervised Lexical Substitution with Decontextualised Embeddings Takashi Wada Timothy Baldwin Yuji Matsumoto Jey Han Lau 145 7 0 17 Sep 2022
Negation, Coordination, and Quantifiers in Contextualized Language Models A. Kalouli Rita Sevastjanova C. Beck Maribel Romero 88 12 0 16 Sep 2022
Revisiting the Practical Effectiveness of Constituency Parse Extraction from Pre-trained Language Models Taeuk Kim 132 1 0 15 Sep 2022
Analyzing Transformers in Embedding Space Guy Dar Mor Geva Ankit Gupta Jonathan Berant 83 93 0 06 Sep 2022
Why Do Neural Language Models Still Need Commonsense Knowledge to Handle Semantic Variations in Question Answering? Sunjae Kwon Cheongwoong Kang Jiyeon Han Jaesik Choi 59 0 0 01 Sep 2022
OOD-Probe: A Neural Interpretation of Out-of-Domain Generalization Zining Zhu Soroosh Shahtalebi Frank Rudzicz 95 5 0 25 Aug 2022
On Reality and the Limits of Language Data: Aligning LLMs with Human Norms Nigel Collier Fangyu Liu Ehsan Shareghi 48 3 0 25 Aug 2022
Interpreting Embedding Spaces by Conceptualization Adi Simhi Shaul Markovitch 95 7 0 22 Aug 2022
A Syntax Aware BERT for Identifying Well-Formed Queries in a Curriculum Framework Avinash Madasu Anvesh Rao Vijjini 32 0 0 21 Aug 2022
An Interpretability Evaluation Benchmark for Pre-trained Language Models Ya-Ming Shen Lijie Wang Ying-Cong Chen Xinyan Xiao Jing Liu Hua Wu 79 4 0 28 Jul 2022
The Birth of Bias: A case study on the evolution of gender bias in an English language model Oskar van der Wal Jaap Jumelet K. Schulz Willem H. Zuidema 121 16 0 21 Jul 2022
BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval Wenqiao Zhang Jiannan Guo Meng Li Haochen Shi Shengyu Zhang Juncheng Li Siliang Tang Yueting Zhuang 88 6 0 09 Jul 2022
Probing via Prompting Jiaoda Li Ryan Cotterell Mrinmaya Sachan 109 13 0 04 Jul 2022