Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.05365
Cited By
v1
v2 (latest)
Deep contextualized word representations
15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep contextualized word representations"
50 / 4,508 papers shown
Title
What do you mean, BERT? Assessing BERT as a Distributional Semantics Model
Timothee Mickus
Denis Paperno
Mathieu Constant
Kees van Deemter
86
46
0
13 Nov 2019
Mark my Word: A Sequence-to-Sequence Approach to Definition Modeling
Timothee Mickus
Denis Paperno
Mathieu Constant
59
31
0
13 Nov 2019
Unsupervised Pre-training for Natural Language Generation: A Literature Review
Yuanxin Liu
Zheng Lin
SSL
AI4CE
45
3
0
13 Nov 2019
Word-level Lexical Normalisation using Context-Dependent Embeddings
Michael Stewart
Wei Liu
R. Cardell-Oliver
13
4
0
13 Nov 2019
KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation
Xiaozhi Wang
Tianyu Gao
Zhaocheng Zhu
Zhengyan Zhang
Zhiyuan Liu
Juan-Zi Li
Jian Tang
199
675
0
13 Nov 2019
Robustness to Capitalization Errors in Named Entity Recognition
S. Bodapati
Hyokun Yun
Yaser Al-Onaizan
76
18
0
13 Nov 2019
SMILES Transformer: Pre-trained Molecular Fingerprint for Low Data Drug Discovery
Shion Honda
Shoi Shi
H. Ueda
MedIm
83
179
0
12 Nov 2019
A Pre-training Based Personalized Dialogue Generation Model with Persona-sparse Data
Yinhe Zheng
Rongsheng Zhang
Xiaoxi Mao
Minlie Huang
65
161
0
12 Nov 2019
Learning Multi-Sense Word Distributions using Approximate Kullback-Leibler Divergence
P. Jayashree
Ballijepalli Shreya
P. K. Srijith
40
2
0
12 Nov 2019
Attending to Entities for Better Text Understanding
Pengxiang Cheng
K. Erk
LRM
75
38
0
11 Nov 2019
Deep Contextualized Self-training for Low Resource Dependency Parsing
Guy Rotman
Roi Reichart
112
50
0
11 Nov 2019
TANDA: Transfer and Adapt Pre-Trained Transformer Models for Answer Sentence Selection
Siddhant Garg
Thuy Vu
Alessandro Moschitti
106
216
0
11 Nov 2019
Word Sense Disambiguation using Knowledge-based Word Similarity
Sunjae Kwon
Dongsuk Oh
Youngjoong Ko
35
0
0
11 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAI
AI4TS
124
338
0
10 Nov 2019
TENER: Adapting Transformer Encoder for Named Entity Recognition
Hang Yan
Bocao Deng
Xiaonan Li
Xipeng Qiu
99
276
0
10 Nov 2019
Contract Discovery: Dataset and a Few-Shot Semantic Retrieval Challenge with Competitive Baselines
Łukasz Borchmann
Dawid Wisniewski
Andrzej Gretkowski
Izabela Kosmala
Dawid Jurkiewicz
Lukasz Szalkiewicz
Gabriela Pałka
Karol Kaczmarek
Agnieszka Kaliska
Filip Graliñski
AILaw
50
0
0
10 Nov 2019
A Bilingual Generative Transformer for Semantic Sentence Embedding
John Wieting
Graham Neubig
Taylor Berg-Kirkpatrick
82
29
0
10 Nov 2019
CamemBERT: a Tasty French Language Model
Louis Martin
Benjamin Muller
Pedro Ortiz Suarez
Yoann Dupont
Laurent Romary
Eric Villemonte de la Clergerie
Djamé Seddah
Benoît Sagot
162
981
0
10 Nov 2019
Learning to Few-Shot Learn Across Diverse Natural Language Classification Tasks
Trapit Bansal
Rishikesh Jha
Andrew McCallum
SSL
118
121
0
10 Nov 2019
Increasing Robustness to Spurious Correlations using Forgettable Examples
Yadollah Yaghoobzadeh
Soroush Mehri
Remi Tachet
Timothy J. Hazen
Alessandro Sordoni
OOD
63
18
0
10 Nov 2019
r/Fakeddit: A New Multimodal Benchmark Dataset for Fine-grained Fake News Detection
Kai Nakamura
Sharon Levy
Wenjie Wang
101
125
0
10 Nov 2019
Distilling Knowledge Learned in BERT for Text Generation
Yen-Chun Chen
Zhe Gan
Yu Cheng
Jingzhou Liu
Jingjing Liu
82
28
0
10 Nov 2019
Generalizing Natural Language Analysis through Span-relation Representations
Zhengbao Jiang
Wenyuan Xu
Jun Araki
Graham Neubig
91
60
0
10 Nov 2019
PoD: Positional Dependency-Based Word Embedding for Aspect Term Extraction
Yichun Yin
Chenguang Wang
Ming Zhang
70
19
0
09 Nov 2019
Multi-Sentence Argument Linking
Seth Ebner
Patrick Xia
Ryan Culkin
Kyle Rawlins
Benjamin Van Durme
HAI
123
163
0
09 Nov 2019
Sentence Meta-Embeddings for Unsupervised Semantic Textual Similarity
Nina Poerner
Ulli Waltinger
Hinrich Schütze
AI4TS
191
20
0
09 Nov 2019
ConveRT: Efficient and Accurate Conversational Representations from Transformers
Matthew Henderson
I. Casanueva
Nikola Mrkvsić
Pei-hao Su
Tsung-Hsien
Ivan Vulić
123
200
0
09 Nov 2019
MKD: a Multi-Task Knowledge Distillation Approach for Pretrained Language Models
Linqing Liu
Haiquan Wang
Jimmy J. Lin
R. Socher
Caiming Xiong
65
21
0
09 Nov 2019
How Decoding Strategies Affect the Verifiability of Generated Text
Luca Massarelli
Fabio Petroni
Aleksandra Piktus
Myle Ott
Tim Rocktaschel
Vassilis Plachouras
Fabrizio Silvestri
Sebastian Riedel
140
50
0
09 Nov 2019
Graph-to-Graph Transformer for Transition-based Dependency Parsing
Alireza Mohammadshahi
James Henderson
AI4CE
48
26
0
08 Nov 2019
SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization
Haoming Jiang
Pengcheng He
Weizhu Chen
Xiaodong Liu
Jianfeng Gao
T. Zhao
143
563
0
08 Nov 2019
SEPT: Improving Scientific Named Entity Recognition with Span Representation
Tan Yan
Heyan Huang
Xian-Ling Mao
MedIm
23
0
0
08 Nov 2019
Negated and Misprimed Probes for Pretrained Language Models: Birds Can Talk, But Cannot Fly
Nora Kassner
Hinrich Schütze
84
325
0
08 Nov 2019
Towards automatic extractive text summarization of A-133 Single Audit reports with machine learning
Vivian T. Chou
LeAnna Kent
Joel A. Góngora
Samuel Ballerini
Carl D. Hoover
22
3
0
08 Nov 2019
Not Enough Data? Deep Learning to the Rescue!
Ateret Anaby-Tavor
Boaz Carmeli
Esther Goldbraich
Amir Kantor
George Kour
Segev Shlomov
N. Tepper
Naama Zwerdling
120
371
0
08 Nov 2019
Relation Adversarial Network for Low Resource Knowledge Graph Completion
Ningyu Zhang
Shumin Deng
Zhanlin Sun
Jiaoayan Chen
Wei Zhang
Huajun Chen
125
72
0
08 Nov 2019
What Would Elsa Do? Freezing Layers During Transformer Fine-Tuning
Jaejun Lee
Raphael Tang
Jimmy J. Lin
69
127
0
08 Nov 2019
Reducing Sentiment Bias in Language Models via Counterfactual Evaluation
Po-Sen Huang
Huan Zhang
Ray Jiang
Robert Stanforth
Johannes Welbl
Jack W. Rae
Vishal Maini
Dani Yogatama
Pushmeet Kohli
120
217
0
08 Nov 2019
Ruminating Word Representations with Random Noised Masker
Hwiyeol Jo
Byoung-Tak Zhang
78
0
0
08 Nov 2019
Contrastive Multi-document Question Generation
W. Cho
Yizhe Zhang
Sudha Rao
Asli Celikyilmaz
Chenyan Xiong
Jianfeng Gao
Mengdi Wang
Bill Dolan
SyDa
121
28
0
08 Nov 2019
Neural Graph Embedding Methods for Natural Language Processing
Shikhar Vashishth
GNN
74
9
0
08 Nov 2019
Blockwise Self-Attention for Long Document Understanding
J. Qiu
Hao Ma
Omer Levy
Scott Yih
Sinong Wang
Jie Tang
124
254
0
07 Nov 2019
Probing Contextualized Sentence Representations with Visual Awareness
Zhuosheng Zhang
Rui Wang
Kehai Chen
Masao Utiyama
Eiichiro Sumita
Hai Zhao
82
2
0
07 Nov 2019
How Can BERT Help Lexical Semantics Tasks?
Yile Wang
Leyang Cui
Yue Zhang
SSeg
36
11
0
07 Nov 2019
The LIG system for the English-Czech Text Translation Task of IWSLT 2019
Loïc Vial
Benjamin Lecouteux
D. Schwab
Hang Le
Laurent Besacier
17
3
0
07 Nov 2019
Dice Loss for Data-imbalanced NLP Tasks
Xiaoya Li
Xiaofei Sun
Yuxian Meng
Junjun Liang
Leilei Gan
Jiwei Li
125
594
0
07 Nov 2019
Dependency and Span, Cross-Style Semantic Role Labeling on PropBank and NomBank
Z. Li
Hai Zhao
Junru Zhou
Kevin Parnow
Shexia He
29
5
0
07 Nov 2019
Explicit Pairwise Word Interaction Modeling Improves Pretrained Transformers for English Semantic Similarity Tasks
Yinan Zhang
Raphael Tang
Jimmy J. Lin
31
5
0
07 Nov 2019
Multi-domain Dialogue State Tracking as Dynamic Knowledge Graph Enhanced Question Answering
Li Zhou
Kevin Small
59
85
0
07 Nov 2019
Enhancing Pre-trained Chinese Character Representation with Word-aligned Attention
Yanzeng Li
Yu Bowen
Mengge Xue
Tingwen Liu
84
27
0
07 Nov 2019
Previous
1
2
3
...
66
67
68
...
89
90
91
Next