1802.05365
Deep contextualized word representations
15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
Papers citing
"Deep contextualized word representations"
50 / 4,508 papers shown
Title
Thieves on Sesame Street! Model Extraction of BERT-based APIs
Kalpesh Krishna
Gaurav Singh Tomar
Ankur P. Parikh
Nicolas Papernot
Mohit Iyyer
MIACV
MLAU
160
201
0
27 Oct 2019
FineText: Text Classification via Attention-based Language Model Fine-tuning
Yunzhe Tao
Saurabh Gupta
Satyapriya Krishna
Xiong Zhou
Orchid Majumder
Vineet Khare
37
3
0
25 Oct 2019
Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks
Alexandros Kastanos
Anton Ragni
Mark Gales
55
14
0
25 Oct 2019
Evaluation of Sentence Representations in Polish
Slawomir Dadas
Michal Perelkiewicz
Rafal Poswiata
184
16
0
25 Oct 2019
DENS: A Dataset for Multi-class Emotion Analysis
Chen Cecilia Liu
Muhammad Osama
Anderson de Andrade
AI4CE
75
37
0
25 Oct 2019
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
165
374
0
25 Oct 2019
Generating a Common Question from Multiple Documents using Multi-source Encoder-Decoder Models
W. Cho
Yizhe Zhang
Sudha Rao
Chris Brockett
Sungjin Lee
82
7
0
25 Oct 2019
A Unified MRC Framework for Named Entity Recognition
Xiaoya Li
Jingrong Feng
Yuxian Meng
Qinghong Han
Leilei Gan
Jiwei Li
164
640
0
25 Oct 2019
Heterogeneous Graph Learning for Visual Commonsense Reasoning
Weijiang Yu
Jingwen Zhou
Weihao Yu
Xiaodan Liang
Nong Xiao
LRM
79
47
0
25 Oct 2019
QASC: A Dataset for Question Answering via Sentence Composition
Tushar Khot
Peter Clark
Michal Guerquin
Peter Alexander Jansen
Ashish Sabharwal
CoGe
100
330
0
25 Oct 2019
Multi-Document Summarization with Determinantal Point Processes and Contextualized Representations
Sangwoo Cho
Chen Li
Dong Yu
H. Foroosh
Fei Liu
66
17
0
24 Oct 2019
Predicting In-game Actions from Interviews of NBA Players
Nadav Oved
Amir Feder
Roi Reichart
79
2
0
24 Oct 2019
Domain adversarial learning for emotion recognition
Zheng Lian
J. Tao
Bin Liu
Jian Huang
46
7
0
24 Oct 2019
Conversational Emotion Analysis via Attention Mechanisms
Zheng Lian
J. Tao
Bin Liu
Jian Huang
58
27
0
24 Oct 2019
Syntax-Enhanced Self-Attention-Based Semantic Role Labeling
Yue Zhang
Rui Wang
Luo Si
60
20
0
24 Oct 2019
Low-Resource Sequence Labeling via Unsupervised Multilingual Contextualized Representations
Zuyi Bao
Rui Huang
Chen Li
Kenny Q. Zhu
49
4
0
24 Oct 2019
Relation Module for Non-answerable Prediction on Question Answering
Kevin Huang
Yun Tang
Jing-ling Huang
Xiaodong He
Bowen Zhou
72
6
0
23 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
978
20,462
0
23 Oct 2019
Generative Pre-Training for Speech with Autoregressive Predictive Coding
Yu-An Chung
James R. Glass
SSL
105
174
0
23 Oct 2019
Healthcare NER Models Using Language Model Pretraining
A. Tarcar
Aashis Tiwari
V. Dhaimodker
Penjo Rebelo
Rahul Desai
Dattaraj J. Rao
75
29
0
23 Oct 2019
Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Dongwei Jiang
Xiaoning Lei
Wubo Li
Ne Luo
Yuxuan Hu
Wei Zou
Xiangang Li
91
99
0
22 Oct 2019
IPOD: An Industrial and Professional Occupations Dataset and its Applications to Occupational Data Mining and Analysis
Junhua Liu
Yung Chuen Ng
Kristin L. Wood
Kwan Hui Lim
68
6
0
22 Oct 2019
Fine-grained Fact Verification with Kernel Graph Attention Network
Zhenghao Liu
Chenyan Xiong
Maosong Sun
Zhiyuan Liu
125
225
0
22 Oct 2019
Composite Neural Network: Theory and Application to PM2.5 Prediction
M. Yang
Meng Chang Chen
PINN
48
9
0
22 Oct 2019
You May Not Need Order in Time Series Forecasting
Yunkai Zhang
Qiao Jiang
Shurui Li
Xiaoyong Jin
Xueying Ma
Xifeng Yan
AI4TS
28
3
0
21 Oct 2019
Domain-agnostic Question-Answering with Adversarial Training
Seanie Lee
Donggyu Kim
Jangwon Park
OOD
53
72
0
21 Oct 2019
A Neural Entity Coreference Resolution Review
Nikolaos Stylianou
I. Vlahavas
104
40
0
21 Oct 2019
Diversify Your Datasets: Analyzing Generalization via Controlled Variance in Adversarial Datasets
Ohad Rozen
Vered Shwartz
Roee Aharoni
Ido Dagan
AAML
95
38
0
21 Oct 2019
Findings of the NLP4IF-2019 Shared Task on Fine-Grained Propaganda Detection
Giovanni Da San Martino
Alberto Barrón-Cedeño
Preslav Nakov
136
82
0
20 Oct 2019
PT-CoDE: Pre-trained Context-Dependent Encoder for Utterance-level Emotion Recognition
Wenxiang Jiao
Michael R. Lyu
Irwin King
39
11
0
20 Oct 2019
Improving Sequence Modeling Ability of Recurrent Neural Networks via Sememes
Yujia Qin
Fanchao Qi
Sicong Ouyang
Zhiyuan Liu
Cheng Yang
Yasheng Wang
Qun Liu
Maosong Sun
60
5
0
20 Oct 2019
XL-Editor: Post-editing Sentences with XLNet
Yong-Siang Shih
Wei-Cheng Chang
Yiming Yang
KELM
82
11
0
19 Oct 2019
Keyphrase Extraction from Scholarly Articles as Sequence Labeling using Contextualized Embeddings
Dhruva Sahrawat
Debanjan Mahata
Mayank Kulkarni
Haimin Zhang
Rakesh Gosangi
Amanda Stent
Agniv Sharma
Yaman Kumar Singla
R. Shah
Roger Zimmermann
38
30
0
19 Oct 2019
An Improved Historical Embedding without Alignment
Xiaofei Xu
Ke Deng
Fei Hu
Li Li
AI4TS
31
0
0
19 Oct 2019
Towards Learning Cross-Modal Perception-Trace Models
Achim Rettinger
Viktoria Bogdanova
Philipp Niemann
14
0
0
18 Oct 2019
Using Local Knowledge Graph Construction to Scale Seq2Seq Models to Multi-Document Inputs
Angela Fan
Claire Gardent
Chloé Braud
Antoine Bordes
85
102
0
18 Oct 2019
Model Compression with Two-stage Multi-teacher Knowledge Distillation for Web Question Answering System
Ze Yang
Linjun Shou
Ming Gong
Wutao Lin
Daxin Jiang
69
94
0
18 Oct 2019
A Mutual Information Maximization Perspective of Language Representation Learning
Lingpeng Kong
Cyprien de Masson d'Autume
Wang Ling
Lei Yu
Zihang Dai
Dani Yogatama
SSL
297
167
0
18 Oct 2019
Theoretical Investigation of Composite Neural Network
M. Yang
Meng Chang Chen
PINN
39
3
0
18 Oct 2019
Estimator Vectors: OOV Word Embeddings based on Subword and Context Clue Estimates
R. Patel
C. Domeniconi
46
5
0
18 Oct 2019
Question Classification with Deep Contextualized Transformer
Haozheng Luo
Ningwei Liu
Charles Feng
88
2
0
17 Oct 2019
Cross-lingual Parsing with Polyglot Training and Multi-treebank Learning: A Faroese Case Study
James Barry
Joachim Wagner
Jennifer Foster
21
5
0
17 Oct 2019
Keyphrase Extraction from Disaster-related Tweets
Jishnu Ray Chowdhury
Cornelia Caragea
Doina Caragea
63
39
0
17 Oct 2019
BIG MOOD: Relating Transformers to Explicit Commonsense Knowledge
Jeff Da
34
0
0
17 Oct 2019
Memory-Augmented Recurrent Networks for Dialogue Coherence
David Donahue
Yuanliang Meng
Anna Rumshisky
33
0
0
16 Oct 2019
Evolution of transfer learning in natural language processing
Aditya Malte
Pratik Ratadiya
63
54
0
16 Oct 2019
A Probabilistic Framework for Learning Domain Specific Hierarchical Word Embeddings
Lahari Poddar
György Szarvas
Lea Frermann
34
0
0
16 Oct 2019
BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Model Performance
Timo Schick
Hinrich Schütze
83
50
0
16 Oct 2019
Analyzing the Forgetting Problem in the Pretrain-Finetuning of Dialogue Response Models
Tianxing He
Jun Liu
Kyunghyun Cho
Myle Ott
Bing-Quan Liu
James R. Glass
Fuchun Peng
CLL
100
9
0
16 Oct 2019
Context Matters: Recovering Human Semantic Structure from Machine Learning Analysis of Large-Scale Text Corpora
M. C. Iordan
Tyler Giallanza
C. Ellis
Nicole M. Beckage
Jonathan Cohen
48
10
0
15 Oct 2019