Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.05365
Cited By
v1
v2 (latest)
Deep contextualized word representations
15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep contextualized word representations"
50 / 4,508 papers shown
Title
DOCENT: Learning Self-Supervised Entity Representations from Large Document Collections
Yury Zemlyanskiy
Sudeep Gandhe
Ruining He
Bhargav Kanagal
Anirudh Ravula
Juraj Gottweis
Fei Sha
Ilya Eckstein
SSL
62
11
0
26 Feb 2021
PharmKE: Knowledge Extraction Platform for Pharmaceutical Texts using Transfer Learning
Nasi Jofche
Kostadin Mishev
Riste Stojanov
Milos Jovanovik
D. Trajanov
56
18
0
25 Feb 2021
Automated essay scoring using efficient transformer-based language models
C. Ormerod
Akanksha Malhotra
Amir Jafari
61
31
0
25 Feb 2021
Investigating the Limitations of Transformers with Simple Arithmetic Tasks
Rodrigo Nogueira
Zhiying Jiang
Jimmy J. Li
LRM
133
130
0
25 Feb 2021
BERT-based Acronym Disambiguation with Multiple Training Strategies
Chunguang Pan
Bingyan Song
Shengguang Wang
Zhipeng Luo
93
18
0
25 Feb 2021
Re-Evaluating GermEval17 Using German Pre-Trained Language Models
Yi Men
A. Corvonato
C. Heumann
VLM
91
6
0
24 Feb 2021
Multi-Task Attentive Residual Networks for Argument Mining
Andrea Galassi
Marco Lippi
Paolo Torroni
HAI
92
24
0
24 Feb 2021
Neural ranking models for document retrieval
M. Trabelsi
Zhiyu Zoey Chen
Brian D. Davison
J. Heflin
FedML
88
29
0
23 Feb 2021
Parallelizing Legendre Memory Unit Training
Narsimha Chilkuri
C. Eliasmith
104
39
0
22 Feb 2021
Domain Adaptation in Dialogue Systems using Transfer and Meta-Learning
Rui Ribeiro
A. Abad
J. Lopes
OffRL
37
1
0
22 Feb 2021
Position Information in Transformers: An Overview
Philipp Dufter
Martin Schmitt
Hinrich Schütze
114
149
0
22 Feb 2021
RUBERT: A Bilingual Roman Urdu BERT Using Cross Lingual Transfer Learning
Usama Khalid
M. O. Beg
Muhammad Umair Arshad
66
11
0
22 Feb 2021
Bilingual Language Modeling, A transfer learning technique for Roman Urdu
Usama Khalid
M. O. Beg
Muhammad Umair Arshad
46
3
0
22 Feb 2021
Using Prior Knowledge to Guide BERT's Attention in Semantic Textual Matching Tasks
Tingyu Xia
Yue Wang
Yuan Tian
Yi-Ju Chang
65
51
0
22 Feb 2021
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
Jun Chen
Han Guo
Kai Yi
Boyang Albert Li
Mohamed Elhoseiny
VLM
166
227
0
20 Feb 2021
Learning Dynamic BERT via Trainable Gate Variables and a Bi-modal Regularizer
Seohyeong Jeong
Nojun Kwak
43
0
0
19 Feb 2021
MUDES: Multilingual Detection of Offensive Spans
Tharindu Ranasinghe
Marcos Zampieri
83
41
0
18 Feb 2021
A Systematic Review of Natural Language Processing Applied to Radiology Reports
Arlene Casey
Emma Davidson
Michael Poon
Hang Dong
Daniel Duma
...
Víctor Suárez-Paniagua
Richard Tobin
William Whiteley
Honghan Wu
Beatrice Alex
AI4CE
46
150
0
18 Feb 2021
Training Large-Scale News Recommenders with Pretrained Language Models in the Loop
Shitao Xiao
Zheng Liu
Yingxia Shao
Tao Di
Xing Xie
VLM
AIFin
199
42
0
18 Feb 2021
Transferability of Neural Network Clinical De-identification Systems
Kahyun Lee
Nicholas J. Dobbins
Bridget T. McInnes
Meliha Yetisgen
Özlem Uzuner
OOD
61
5
0
17 Feb 2021
A Context-Enhanced De-identification System
Kahyun Lee
M. Kayaalp
Sam Henry
Özlem Uzuner
68
3
0
17 Feb 2021
COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
Yu Meng
Chenyan Xiong
Payal Bajaj
Saurabh Tiwary
Paul N. Bennett
Jiawei Han
Xia Song
195
206
0
16 Feb 2021
NoiseQA: Challenge Set Evaluation for User-Centric Question Answering
Abhilasha Ravichander
Siddharth Dalmia
Maria Ryskina
Florian Metze
Eduard H. Hovy
A. Black
ELM
66
32
0
16 Feb 2021
Large-Context Conversational Representation Learning: Self-Supervised Learning for Conversational Documents
Ryo Masumura
Naoki Makishima
Mana Ihori
Akihiko Takashima
Tomohiro Tanaka
Shota Orihashi
SSL
54
1
0
16 Feb 2021
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Ye Bai
Jiangyan Yi
J. Tao
Zhengkun Tian
Zhengqi Wen
Shuai Zhang
RALM
94
52
0
15 Feb 2021
MAPGN: MAsked Pointer-Generator Network for sequence-to-sequence pre-training
Mana Ihori
Naoki Makishima
Tomohiro Tanaka
Akihiko Takashima
Shota Orihashi
Ryo Masumura
SSL
59
5
0
15 Feb 2021
CATE: Computation-aware Neural Architecture Encoding with Transformers
Shen Yan
Kaiqiang Song
Z. Feng
Mi Zhang
91
28
0
14 Feb 2021
Exploring Classic and Neural Lexical Translation Models for Information Retrieval: Interpretability, Effectiveness, and Efficiency Benefits
Leonid Boytsov
Zico Kolter
58
11
0
12 Feb 2021
A Little Pretraining Goes a Long Way: A Case Study on Dependency Parsing Task for Low-resource Morphologically Rich Languages
Jivnesh Sandhan
Amrith Krishna
Ashim Gupta
Laxmidhar Behera
Pawan Goyal
54
9
0
12 Feb 2021
Transformer Language Models with LSTM-based Cross-utterance Information Representation
G. Sun
Chuxu Zhang
P. Woodland
116
32
0
12 Feb 2021
Neural Inverse Text Normalization
Monica Sunkara
Chaitanya P. Shivade
S. Bodapati
Katrin Kirchhoff
95
32
0
12 Feb 2021
Text Compression-aided Transformer Encoding
Z. Li
Zhuosheng Zhang
Hai Zhao
Rui Wang
Kehai Chen
Masao Utiyama
Eiichiro Sumita
AI4CE
71
45
0
11 Feb 2021
Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation
Renjie Zheng
Junkun Chen
Mingbo Ma
Liang Huang
157
69
0
10 Feb 2021
Customizing Contextualized Language Models forLegal Document Reviews
Shohreh Shaghaghian
Luna Feng
Feng
Borna Jafarpour
Nicolai Pogrebnyakov
AILaw
119
19
0
10 Feb 2021
Towards More Fine-grained and Reliable NLP Performance Prediction
Zihuiwen Ye
Pengfei Liu
Jinlan Fu
Graham Neubig
96
33
0
10 Feb 2021
Multi-turn Dialogue Reading Comprehension with Pivot Turns and Knowledge
Zhuosheng Zhang
Junlong Li
Hai Zhao
84
24
0
10 Feb 2021
Biomedical Question Answering: A Survey of Approaches and Challenges
Qiao Jin
Zheng Yuan
Guangzhi Xiong
Qian Yu
Huaiyuan Ying
Chuanqi Tan
Mosha Chen
Songfang Huang
Xiaozhong Liu
Sheng Yu
110
104
0
10 Feb 2021
The Singleton Fallacy: Why Current Critiques of Language Models Miss the Point
Magnus Sahlgren
F. Carlsson
66
28
0
08 Feb 2021
Nyströmformer: A Nyström-Based Algorithm for Approximating Self-Attention
Yunyang Xiong
Zhanpeng Zeng
Rudrasis Chakraborty
Mingxing Tan
G. Fung
Yin Li
Vikas Singh
160
526
0
07 Feb 2021
Unsupervised Sentence-embeddings by Manifold Approximation and Projection
Subhradeep Kayal
45
6
0
07 Feb 2021
Does He Wink or Does He Nod? A Challenging Benchmark for Evaluating Word Understanding of Language Models
Lutfi Kerem Senel
Hinrich Schütze
50
5
0
06 Feb 2021
Generalized Zero-shot Intent Detection via Commonsense Knowledge
A.B. Siddique
Fuad Jamour
Luxun Xu
Vagelis Hristidis
118
32
0
04 Feb 2021
Chord Embeddings: Analyzing What They Capture and Their Role for Next Chord Prediction and Artist Attribute Prediction
Allison Lahnala
Gauri Kambhatla
Jiajun Peng
Matthew Whitehead
Gillian Minnehan
Eric Guldan
Jonathan K. Kummerfeld
Anil cCamci
Rada Mihalcea
36
2
0
04 Feb 2021
Hierarchical Multi-head Attentive Network for Evidence-aware Fake News Detection
Nguyen Vo
Kyumin Lee
EgoV
75
44
0
04 Feb 2021
Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Prashanth Gurunath Shivakumar
P. Georgiou
Shrikanth Narayanan
35
1
0
03 Feb 2021
Focusing Knowledge-based Graph Argument Mining via Topic Modeling
Patricia B. Abels
Zahra Ahmadi
Sophie Burkhardt
Benjamin Schiller
Iryna Gurevych
Stefan Kramer
119
6
0
03 Feb 2021
General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework
Yucheng Zhao
Dacheng Yin
Chong Luo
Zhiyuan Zhao
Chuanxin Tang
Wenjun Zeng
Zhengjun Zha
SSL
59
6
0
03 Feb 2021
HeBERT & HebEMO: a Hebrew BERT Model and a Tool for Polarity Analysis and Emotion Recognition
Avihay Chriqui
I. Yahav
78
37
0
03 Feb 2021
AutoFreeze: Automatically Freezing Model Blocks to Accelerate Fine-tuning
Yuhan Liu
Saurabh Agarwal
Shivaram Venkataraman
OffRL
89
56
0
02 Feb 2021
Neural Data Augmentation via Example Extrapolation
Kenton Lee
Kelvin Guu
Luheng He
Timothy Dozat
Hyung Won Chung
80
72
0
02 Feb 2021
Previous
1
2
3
...
39
40
41
...
89
90
91
Next