Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.05365
Cited By
v1
v2 (latest)
Deep contextualized word representations
15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep contextualized word representations"
50 / 4,508 papers shown
Title
Explicitly Modeling Syntax in Language Models with Incremental Parsing and a Dynamic Oracle
Songlin Yang
Shawn Tan
Alessandro Sordoni
Siva Reddy
Rameswar Panda
78
5
0
21 Oct 2020
A Simple and Efficient Multi-Task Learning Approach for Conditioned Dialogue Generation
Yan Zeng
J. Nie
71
5
0
21 Oct 2020
NeuSpell: A Neural Spelling Correction Toolkit
Sai Muralidhar Jayanthi
Danish Pruthi
Graham Neubig
KELM
LRM
104
67
0
21 Oct 2020
Neural Networks for Entity Matching: A Survey
Nils Barlaug
J. Gulla
171
96
0
21 Oct 2020
LT3 at SemEval-2020 Task 9: Cross-lingual Embeddings for Sentiment Analysis of Hinglish Social Media Text
Pranaydeep Singh
Els Lefever
45
6
0
21 Oct 2020
German's Next Language Model
Branden Chan
Stefan Schweter
Timo Möller
120
277
0
21 Oct 2020
Modeling Content and Context with Deep Relational Learning
Maria Leonor Pacheco
Dan Goldwasser
NAI
107
34
0
20 Oct 2020
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters
Hicham El Boukkouri
Olivier Ferret
Thomas Lavergne
Hiroshi Noji
Pierre Zweigenbaum
Junichi Tsujii
167
162
0
20 Oct 2020
UmlsBERT: Clinical Domain Knowledge Augmentation of Contextual Embeddings Using the Unified Medical Language System Metathesaurus
George Michalopoulos
Yuanxin Wang
H. Kaka
Helen H. Chen
Alexander Wong
147
127
0
20 Oct 2020
Bi-directional Cognitive Thinking Network for Machine Reading Comprehension
Wei Peng
Yue Hu
Luxi Xing
Yuqiang Xie
Jing Yu
Yajing Sun
Xiangpeng Wei
82
7
0
20 Oct 2020
Text Classification of Manifestos and COVID-19 Press Briefings using BERT and Convolutional Neural Networks
Kakia Chatsiou
71
10
0
20 Oct 2020
Performance of Transfer Learning Model vs. Traditional Neural Network in Low System Resource Environment
W. Hui
11
1
0
20 Oct 2020
Explainable Automated Fact-Checking for Public Health Claims
Neema Kotonya
Francesca Toni
273
263
0
19 Oct 2020
Online Active Model Selection for Pre-trained Classifiers
Mohammad Reza Karimi
Nezihe Merve Gürel
Bojan Karlavs
Johannes Rausch
Ce Zhang
Andreas Krause
85
22
0
19 Oct 2020
Diving Deep into Context-Aware Neural Machine Translation
Jingjing Huo
Christian Herold
Yingbo Gao
Leonard Dahlmann
Shahram Khadivi
Hermann Ney
120
24
0
19 Oct 2020
The RELX Dataset and Matching the Multilingual Blanks for Cross-Lingual Relation Classification
Abdullatif Köksal
Arzucan Özgür
100
19
0
19 Oct 2020
Global Attention for Name Tagging
Boliang Zhang
Spencer Whitehead
Lifu Huang
Heng Ji
92
17
0
19 Oct 2020
Towards Interpreting BERT for Reading Comprehension Based QA
Sahana Ramnath
Preksha Nema
Deep Sahni
Mitesh M. Khapra
110
30
0
18 Oct 2020
Mixed-Lingual Pre-training for Cross-lingual Summarization
Ruochen Xu
Chenguang Zhu
Yu Shi
Michael Zeng
Xuedong Huang
76
26
0
18 Oct 2020
Question Answering over Knowledge Base using Language Model Embeddings
Japa Sai Sharath
Banafsheh Rekabdar
44
11
0
17 Oct 2020
Hierarchical Multitask Learning Approach for BERT
Çagla Aksoy
Alper Ahmetoglu
Tunga Güngör
SSL
68
6
0
17 Oct 2020
Multi-task Learning of Negation and Speculation for Targeted Sentiment Classification
Andrew Moore
Jeremy Barnes
78
9
0
16 Oct 2020
Training Flexible Depth Model by Multi-Task Learning for Neural Machine Translation
Qiang Wang
Tong Xiao
Jingbo Zhu
47
2
0
16 Oct 2020
Unsupervised Extractive Summarization by Pre-training Hierarchical Transformers
Shusheng Xu
Xingxing Zhang
Yi Wu
Furu Wei
Ming Zhou
118
45
0
16 Oct 2020
Coarse-to-Fine Pre-training for Named Entity Recognition
Mengge Xue
Yu Bowen
Zhenyu Zhang
Tingwen Liu
Yue Zhang
Bin Wang
68
54
0
16 Oct 2020
Unsupervised Natural Language Inference via Decoupled Multimodal Contrastive Learning
Wanyun Cui
Guangyu Zheng
Wei Wang
SSL
62
21
0
16 Oct 2020
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering
Yingqi Qu
Yuchen Ding
Jing Liu
Kai Liu
Ruiyang Ren
Xin Zhao
Daxiang Dong
Hua Wu
Haifeng Wang
RALM
OffRL
291
618
0
16 Oct 2020
Inferring symmetry in natural language
Chelsea Tanchip
Lei Yu
Aotao Xu
Yang Xu
56
5
0
16 Oct 2020
Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach
Yue Yu
Simiao Zuo
Haoming Jiang
Wendi Ren
T. Zhao
Chao Zhang
AI4MH
96
133
0
15 Oct 2020
A Transformer Based Pitch Sequence Autoencoder with MIDI Augmentation
Mingshuo Ding
Yi Ma
42
1
0
15 Oct 2020
Does Chinese BERT Encode Word Structure?
Yile Wang
Leyang Cui
Yue Zhang
86
6
0
15 Oct 2020
TopicBERT for Energy Efficient Document Classification
Yatin Chaudhary
Pankaj Gupta
Khushbu Saxena
Vivek Kulkarni
Thomas Runkler
Hinrich Schütze
75
21
0
15 Oct 2020
From Language to Language-ish: How Brain-Like is an LSTM's Representation of Nonsensical Language Stimuli?
Maryam Hashemzadeh
Greta Kaufeld
Martha White
Andrea E. Martin
Alona Fyshe
MILM
39
6
0
14 Oct 2020
Positioning yourself in the maze of Neural Text Generation: A Task-Agnostic Survey
Khyathi Chandu
A. Black
82
0
0
14 Oct 2020
Text Classification Using Label Names Only: A Language Model Self-Training Approach
Yu Meng
Yunyi Zhang
Jiaxin Huang
Chenyan Xiong
Heng Ji
Chao Zhang
Jiawei Han
VLM
102
74
0
14 Oct 2020
Summarize, Outline, and Elaborate: Long-Text Generation via Hierarchical Supervision from Extractive Summaries
Xiaofei Sun
Zijun Sun
Yuxian Meng
Jiwei Li
Chun Fan
63
21
0
14 Oct 2020
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search
Gyuwan Kim
Kyunghyun Cho
100
98
0
14 Oct 2020
Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision
Hao Tan
Joey Tianyi Zhou
CLIP
100
121
0
14 Oct 2020
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy J. Lin
Rodrigo Nogueira
Andrew Yates
VLM
421
628
0
13 Oct 2020
RuSemShift: a dataset of historical lexical semantic change in Russian
J. Rodina
Andrey Kutuzov
73
34
0
13 Oct 2020
CAPT: Contrastive Pre-Training for Learning Denoised Sequence Representations
Fuli Luo
Pengcheng Yang
Shicheng Li
Xuancheng Ren
Xu Sun
VLM
SSL
73
16
0
13 Oct 2020
BRUMS at SemEval-2020 Task 3: Contextualised Embeddings for Predicting the (Graded) Effect of Context in Word Similarity
Hansi Hettiarachchi
Tharindu Ranasinghe
53
14
0
13 Oct 2020
Incorporating BERT into Parallel Sequence Decoding with Adapters
Junliang Guo
Zhirui Zhang
Linli Xu
Hao-Ran Wei
Boxing Chen
Enhong Chen
132
69
0
13 Oct 2020
Corruption Is Not All Bad: Incorporating Discourse Structure into Pre-training via Corruption for Essay Scoring
Farjana Sultana Mim
Naoya Inoue
Paul Reisert
Hiroki Ouchi
Kentaro Inui
60
7
0
13 Oct 2020
BERT-EMD: Many-to-Many Layer Mapping for BERT Compression with Earth Mover's Distance
Jianquan Li
Xiaokang Liu
Honghong Zhao
Ruifeng Xu
Min Yang
Yaohong Jin
131
54
0
13 Oct 2020
Model Selection for Cross-Lingual Transfer
Yang Chen
Alan Ritter
89
12
0
13 Oct 2020
Improving Self-supervised Pre-training via a Fully-Explored Masked Language Model
Ming Zheng
Dinghan Shen
Yelong Shen
Weizhu Chen
Lin Xiao
SSL
64
4
0
12 Oct 2020
Measuring and Reducing Gendered Correlations in Pre-trained Models
Kellie Webster
Xuezhi Wang
Ian Tenney
Alex Beutel
Emily Pitler
Ellie Pavlick
Jilin Chen
Ed Chi
Slav Petrov
FaML
135
260
0
12 Oct 2020
Improving Text Generation with Student-Forcing Optimal Transport
Guoyin Wang
Chunyuan Li
Jianqiao Li
Hao Fu
Yuh-Chen Lin
...
Ruiyi Zhang
Wenlin Wang
Dinghan Shen
Qian Yang
Lawrence Carin
OT
86
18
0
12 Oct 2020
PECOS: Prediction for Enormous and Correlated Output Spaces
Hsiang-Fu Yu
Kai Zhong
Jiong Zhang
Wei-Cheng Chang
Inderjit S. Dhillon
132
85
0
12 Oct 2020
Previous
1
2
3
...
46
47
48
...
89
90
91
Next