BERT Rediscovers the Classical NLP Pipeline

15 May 2019
Ian Tenney
Dipanjan Das
Ellie Pavlick
    MILM, SSeg

Papers citing "BERT Rediscovers the Classical NLP Pipeline"

50 / 821 papers shown
Title
The Architectural Bottleneck Principle
Tiago Pimentel
Josef Valvoda
Niklas Stoehr
Ryan Cotterell
54
5
0
11 Nov 2022
A Comprehensive Survey of Transformers for Computer Vision
Sonain Jamil
Md. Jalil Piran
Oh-Jin Kwon
ViT
78
54
0
11 Nov 2022
SocioProbe: What, When, and Where Language Models Learn about Sociodemographics
Anne Lauscher
Federico Bianchi
Samuel R. Bowman
Dirk Hovy
91
7
0
08 Nov 2022
Third-Party Aligner for Neural Word Alignments
Jinpeng Zhang
C. Dong
Xiangyu Duan
Yuqi Zhang
Hao Fei
67
0
0
08 Nov 2022
COPEN: Probing Conceptual Knowledge in Pre-trained Language Models
Hao Peng
Xiaozhi Wang
Shengding Hu
Hailong Jin
Lei Hou
Juanzi Li
Zhiyuan Liu
Qun Liu
89
25
0
08 Nov 2022
Logographic Information Aids Learning Better Representations for Natural Language Inference
Zijian Jin
Duygu Ataman
58
1
0
03 Nov 2022
BECTRA: Transducer-based End-to-End ASR with BERT-Enhanced Encoder
Yosuke Higuchi
Tetsuji Ogawa
Tetsunori Kobayashi
Shinji Watanabe
169
13
0
02 Nov 2022
A Law of Data Separation in Deep Learning
Hangfeng He
Weijie J. Su
OOD
105
42
0
31 Oct 2022
BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model
Yosuke Higuchi
Brian Yan
Siddhant Arora
Tetsuji Ogawa
Tetsunori Kobayashi
Shinji Watanabe
118
26
0
29 Oct 2022
Debiasing Masks: A New Framework for Shortcut Mitigation in NLU
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
AAML
61
16
0
28 Oct 2022
Controlled Text Reduction
Aviv Slobodkin
Paul Roit
Eran Hirsch
Ori Ernst
Ido Dagan
73
10
0
24 Oct 2022
Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task
Kenneth Li
Aspen K. Hopkins
David Bau
Fernanda Viégas
Hanspeter Pfister
Martin Wattenberg
MILM
180
297
0
24 Oct 2022
Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs
Maarten Sap
Ronan Le Bras
Daniel Fried
Yejin Choi
101
232
0
24 Oct 2022
Structural generalization is hard for sequence-to-sequence models
Yuekun Yao
Alexander Koller
88
22
0
24 Oct 2022
On the Transformation of Latent Space in Fine-Tuned NLP Models
Nadir Durrani
Hassan Sajjad
Fahim Dalvi
Firoj Alam
128
19
0
23 Oct 2022
EntityCS: Improving Zero-Shot Cross-lingual Transfer with Entity-Centric Code Switching
Chenxi Whitehouse
Fenia Christopoulou
Ignacio Iacobacci
108
9
0
22 Oct 2022
What do Large Language Models Learn beyond Language?
Avinash Madasu
Shashank Srivastava
LRM, AI4CE
73
5
0
21 Oct 2022
Probing with Noise: Unpicking the Warp and Weft of Embeddings
Filip Klubicka
John D. Kelleher
68
4
0
21 Oct 2022
Spectral Probing
Max Müller-Eberstein
Rob van der Goot
Barbara Plank
52
2
0
21 Oct 2022
Syntax-guided Localized Self-attention by Constituency Syntactic Distance
Shengyuan Hou
Jushi Kai
Haotian Xue
Bingyu Zhu
Bo Yuan
Longtao Huang
Xinbing Wang
Zhouhan Lin
17
4
0
21 Oct 2022
SLING: Sino Linguistic Evaluation of Large Language Models
Yixiao Song
Kalpesh Krishna
R. Bhatt
Mohit Iyyer
83
10
0
21 Oct 2022
Evidence > Intuition: Transferability Estimation for Encoder Selection
Elisa Bassignana
Max Müller-Eberstein
Mike Zhang
Barbara Plank
65
8
0
20 Oct 2022
Enhancing Out-of-Distribution Detection in Natural Language Understanding via Implicit Layer Ensemble
Hyunsoo Cho
Choonghyun Park
Jaewoo Kang
Kang Min Yoo
Taeuk Kim
Sang-goo Lee
OODD
119
8
0
20 Oct 2022
Automatic Document Selection for Efficient Encoder Pretraining
Yukun Feng
Patrick Xia
Benjamin Van Durme
João Sedoc
114
11
0
20 Oct 2022
Transformers Learn Shortcuts to Automata
Bingbin Liu
Jordan T. Ash
Surbhi Goel
A. Krishnamurthy
Cyril Zhang
OffRL, LRM
161
178
0
19 Oct 2022
Hidden State Variability of Pretrained Language Models Can Guide Computation Reduction for Transfer Learning
Shuo Xie
Jiahao Qiu
Ankita Pasad
Li Du
Qing Qu
Hongyuan Mei
87
16
0
18 Oct 2022
Post-hoc analysis of Arabic transformer models
Ahmed Abdelali
Nadir Durrani
Fahim Dalvi
Hassan Sajjad
43
1
0
18 Oct 2022
Predicting Fine-Tuning Performance with Probing
Zining Zhu
Soroosh Shahtalebi
Frank Rudzicz
64
10
0
13 Oct 2022
On the Explainability of Natural Language Processing Deep Models
Julia El Zini
M. Awad
65
88
0
13 Oct 2022
Empowering the Fact-checkers! Automatic Identification of Claim Spans on Twitter
Megha Sundriyal
Atharva Kulkarni
Vaibhav Pulastya
Md. Shad Akhtar
Tanmoy Chakraborty
MedIm
71
19
0
10 Oct 2022
Breaking BERT: Evaluating and Optimizing Sparsified Attention
Siddhartha Brahma
Polina Zablotskaia
David M. Mimno
37
1
0
07 Oct 2022
Probing of Quantitative Values in Abstractive Summarization Models
Nathan M. White
76
0
0
03 Oct 2022
Downstream Datasets Make Surprisingly Good Pretraining Corpora
Kundan Krishna
Saurabh Garg
Jeffrey P. Bigham
Zachary Chase Lipton
108
33
0
28 Sep 2022
Causal Proxy Models for Concept-Based Model Explanations
Zhengxuan Wu
Karel D'Oosterlinck
Atticus Geiger
Amir Zur
Christopher Potts
MILM
132
37
0
28 Sep 2022
Fast-FNet: Accelerating Transformer Encoder Models via Efficient Fourier Layers
Nurullah Sevim
Ege Ozan Özyedek
Furkan Şahinuç
Aykut Koç
95
12
0
26 Sep 2022
ImmunoLingo: Linguistics-based formalization of the antibody language
Mai Ha Vu
Philippe A. Robert
Rahmad Akbar
B. Swiatczak
G. K. Sandve
Dag Trygve Tryslew Haug
Victor Greiff
AI4CE
103
8
0
26 Sep 2022
Towards Faithful Model Explanation in NLP: A Survey
Qing Lyu
Marianna Apidianaki
Chris Callison-Burch
XAI
237
121
0
22 Sep 2022
Unsupervised Lexical Substitution with Decontextualised Embeddings
Takashi Wada
Timothy Baldwin
Yuji Matsumoto
Jey Han Lau
145
7
0
17 Sep 2022
Negation, Coordination, and Quantifiers in Contextualized Language Models
A. Kalouli
Rita Sevastjanova
C. Beck
Maribel Romero
88
12
0
16 Sep 2022
Revisiting the Practical Effectiveness of Constituency Parse Extraction from Pre-trained Language Models
Taeuk Kim
132
1
0
15 Sep 2022
Analyzing Transformers in Embedding Space
Guy Dar
Mor Geva
Ankit Gupta
Jonathan Berant
83
93
0
06 Sep 2022
Why Do Neural Language Models Still Need Commonsense Knowledge to Handle Semantic Variations in Question Answering?
Sunjae Kwon
Cheongwoong Kang
Jiyeon Han
Jaesik Choi
59
0
0
01 Sep 2022
OOD-Probe: A Neural Interpretation of Out-of-Domain Generalization
Zining Zhu
Soroosh Shahtalebi
Frank Rudzicz
95
5
0
25 Aug 2022
On Reality and the Limits of Language Data: Aligning LLMs with Human Norms
Nigel Collier
Fangyu Liu
Ehsan Shareghi
48
3
0
25 Aug 2022
Interpreting Embedding Spaces by Conceptualization
Adi Simhi
Shaul Markovitch
95
7
0
22 Aug 2022
A Syntax Aware BERT for Identifying Well-Formed Queries in a Curriculum Framework
Avinash Madasu
Anvesh Rao Vijjini
32
0
0
21 Aug 2022
An Interpretability Evaluation Benchmark for Pre-trained Language Models
Ya-Ming Shen
Lijie Wang
Ying-Cong Chen
Xinyan Xiao
Jing Liu
Hua Wu
79
4
0
28 Jul 2022
The Birth of Bias: A case study on the evolution of gender bias in an English language model
Oskar van der Wal
Jaap Jumelet
K. Schulz
Willem H. Zuidema
121
16
0
21 Jul 2022
BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval
Wenqiao Zhang
Jiannan Guo
Meng Li
Haochen Shi
Shengyu Zhang
Juncheng Li
Siliang Tang
Yueting Zhuang
88
6
0
09 Jul 2022
Probing via Prompting
Jiaoda Li
Ryan Cotterell
Mrinmaya Sachan
109
13
0
04 Jul 2022