Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.05365
Cited By
v1
v2 (latest)
Deep contextualized word representations
15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep contextualized word representations"
50 / 4,508 papers shown
Title
Cisco at AAAI-CAD21 shared task: Predicting Emphasis in Presentation Slides using Contextualized Embeddings
Sreyan Ghosh
Sonal Kumar
H. Jalan
Hemant Yadav
R. Shah
88
2
0
10 Jan 2021
BERT & Family Eat Word Salad: Experiments with Text Understanding
Ashim Gupta
Giorgi Kvernadze
Vivek Srikumar
267
73
0
10 Jan 2021
Learning Better Sentence Representation with Syntax Information
Chen Yang
44
1
0
09 Jan 2021
Misspelling Correction with Pre-trained Contextual Language Model
Yifei Hu
X. Jing
Youlim Ko
Julia Taylor Rayz
KELM
100
28
0
08 Jan 2021
Ask2Transformers: Zero-Shot Domain labelling with Pre-trained Language Models
Oscar Sainz
German Rigau
VLM
73
22
0
07 Jan 2021
Simplified DOM Trees for Transferable Attribute Extraction from the Web
Yichao Zhou
Ying Sheng
N. Vo
Nick Edmonds
Sandeep Tata
193
29
0
07 Jan 2021
Read, Retrospect, Select: An MRC Framework to Short Text Entity Linking
Yingjie Gu
Xiaoye Qu
Zhefeng Wang
Baoxing Huai
Nicholas Jing Yuan
Xiaolin Gui
85
30
0
07 Jan 2021
Exploring Text-transformers in AAAI 2021 Shared Task: COVID-19 Fake News Detection in English
Xiangyang Li
Yu Xia
Xiang Long
Zheng Li
Sujian Li
261
37
0
07 Jan 2021
Outline to Story: Fine-grained Controllable Story Generation from Cascaded Events
Le Fang
Tao Zeng
Chao-Ning Liu
Liefeng Bo
Wen Dong
Changyou Chen
101
12
0
04 Jan 2021
Coreference Resolution: Are the eliminated spans totally worthless?
Xin Tan
Longyin Zhang
Guodong Zhou
48
0
0
04 Jan 2021
Few-Shot Question Answering by Pretraining Span Selection
Ori Ram
Yuval Kirstain
Jonathan Berant
Amir Globerson
Omer Levy
135
98
0
02 Jan 2021
Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
Wangchunshu Zhou
Tao Ge
Canwen Xu
Ke Xu
Furu Wei
LRM
92
16
0
02 Jan 2021
End-to-end Semantic Role Labeling with Neural Transition-based Model
Hao Fei
Meishan Zhang
Bobo Li
Donghong Ji
OffRL
65
37
0
02 Jan 2021
Multitask Learning for Class-Imbalanced Discourse Classification
Alexander Spangher
Jonathan May
Sz-Rung Shiang
Lingjia Deng
90
5
0
02 Jan 2021
Learning to Emphasize: Dataset and Shared Task Models for Selecting Emphasis in Presentation Slides
Amirreza Shirani
Gia-Lac Tran
Hieu Trinh
Franck Dernoncourt
Nedim Lipka
P. Asente
J. Echevarria
Thamar Solorio
311
1
0
02 Jan 2021
What all do audio transformer models hear? Probing Acoustic Representations for Language Delivery and its Structure
Jui Shah
Yaman Kumar Singla
Changyou Chen
R. Shah
121
81
0
02 Jan 2021
Modeling Fine-Grained Entity Types with Box Embeddings
Yasumasa Onoe
Michael Boratko
Andrew McCallum
Greg Durrett
OCL
100
67
0
02 Jan 2021
Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers
Machel Reid
Edison Marrese-Taylor
Y. Matsuo
MoE
116
48
0
01 Jan 2021
How Do Your Biomedical Named Entity Recognition Models Generalize to Novel Entities?
Hyunjae Kim
Jaewoo Kang
AI4CE
175
21
0
01 Jan 2021
Sensei: Self-Supervised Sensor Name Segmentation
Jiaman Wu
Dezhi Hong
Rajesh K. Gupta
Jingbo Shang
34
1
0
01 Jan 2021
WARP: Word-level Adversarial ReProgramming
Karen Hambardzumyan
Hrant Khachatrian
Jonathan May
AAML
367
354
0
01 Jan 2021
Controlled Analyses of Social Biases in Wikipedia Bios
Anjalie Field
Chan Young Park
Kevin Z. Lin
Yulia Tsvetkov
99
27
0
31 Dec 2020
Understanding Politics via Contextualized Discourse Processing
Rajkumar Pujari
Dan Goldwasser
74
20
0
31 Dec 2020
How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models
Phillip Rust
Jonas Pfeiffer
Ivan Vulić
Sebastian Ruder
Iryna Gurevych
185
256
0
31 Dec 2020
Neural Machine Translation: A Review of Methods, Resources, and Tools
Zhixing Tan
Shuo Wang
Zonghan Yang
Gang Chen
Xuancheng Huang
Maosong Sun
Yang Liu
3DV
AI4TS
99
110
0
31 Dec 2020
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
Wei-Ning Hsu
David Harwath
Christopher Song
James R. Glass
CLIP
92
67
0
31 Dec 2020
Deriving Contextualised Semantic Features from BERT (and Other Transformer Model) Embeddings
Jacob Turton
D. Vinson
Robert Smith
46
25
0
30 Dec 2020
SemGloVe: Semantic Co-occurrences for GloVe from BERT
Leilei Gan
Zhiyang Teng
Yue Zhang
Linchao Zhu
Leilei Gan
Yi Yang
75
17
0
30 Dec 2020
Accurate Word Representations with Universal Visual Guidance
Zhuosheng Zhang
Haojie Yu
Hai Zhao
Rui Wang
Masao Utiyama
62
0
0
30 Dec 2020
Few-Shot Named Entity Recognition: A Comprehensive Study
Jiaxin Huang
Chunyuan Li
K. Subudhi
Damien Jose
S. Balakrishnan
Weizhu Chen
Baolin Peng
Jianfeng Gao
Jiawei Han
95
80
0
29 Dec 2020
Transformer Feed-Forward Layers Are Key-Value Memories
Mor Geva
R. Schuster
Jonathan Berant
Omer Levy
KELM
253
851
0
29 Dec 2020
Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning
Xuebo Liu
Longyue Wang
Derek F. Wong
Liang Ding
Lidia S. Chao
Zhaopeng Tu
AI4CE
68
35
0
29 Dec 2020
CMV-BERT: Contrastive multi-vocab pretraining of BERT
Wei-wei Zhu
Daniel Cheung
SSL
VLM
88
0
0
29 Dec 2020
RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems
Baolin Peng
Chunyuan Li
Zhu Zhang
Chenguang Zhu
Jinchao Li
Jianfeng Gao
74
50
0
29 Dec 2020
BURT: BERT-inspired Universal Representation from Learning Meaningful Segment
Yian Li
Hai Zhao
SSL
56
0
0
28 Dec 2020
ALP-KD: Attention-Based Layer Projection for Knowledge Distillation
Peyman Passban
Yimeng Wu
Mehdi Rezagholizadeh
Qun Liu
92
124
0
27 Dec 2020
Adaptive Convolution for Semantic Role Labeling
Kashif Munir
Hai Zhao
Z. Li
34
12
0
27 Dec 2020
SG-Net: Syntax Guided Transformer for Language Representation
Zhuosheng Zhang
Yuwei Wu
Junru Zhou
Sufeng Duan
Hai Zhao
Rui Wang
129
38
0
27 Dec 2020
ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic
Muhammad Abdul-Mageed
AbdelRahim Elmadany
El Moatez Billah Nagoudi
VLM
144
467
0
27 Dec 2020
Multi-Channel Sequential Behavior Networks for User Modeling in Online Advertising
Iyad Batal
Akshay Soni
28
0
0
27 Dec 2020
Cross-lingual Universal Dependency Parsing Only from One Monolingual Treebank
Kailai Sun
Z. Li
Hai Zhao
67
0
0
24 Dec 2020
WEmbSim: A Simple yet Effective Metric for Image Captioning
Naeha Sharif
Lyndon White
Bennamoun
Wei Liu
Syed Afaq Ali Shah
60
1
0
24 Dec 2020
Learning Dense Representations of Phrases at Scale
Jinhyuk Lee
Mujeen Sung
Jaewoo Kang
Danqi Chen
RALM
DML
NAI
80
122
0
23 Dec 2020
Automated Lay Language Summarization of Biomedical Scientific Reviews
Yue Guo
Weijian Qiu
Yizhong Wang
T. Cohen
115
79
0
23 Dec 2020
Simple-QE: Better Automatic Quality Estimation for Text Simplification
Reno Kriz
Marianna Apidianaki
Chris Callison-Burch
80
12
0
22 Dec 2020
Few-Shot Text Generation with Pattern-Exploiting Training
Timo Schick
Hinrich Schütze
140
148
0
22 Dec 2020
Undivided Attention: Are Intermediate Layers Necessary for BERT?
S. N. Sridhar
Anthony Sarah
85
15
0
22 Dec 2020
Improved Biomedical Word Embeddings in the Transformer Era
Jiho Noh
Ramakanth Kavuluru
MedIm
171
17
0
22 Dec 2020
Social NCE: Contrastive Learning of Socially-aware Motion Representations
Yuejiang Liu
Qi Yan
Alexandre Alahi
142
103
0
21 Dec 2020
An End-to-End Document-Level Neural Discourse Parser Exploiting Multi-Granularity Representations
Ke Shi
Zhengyuan Liu
Nancy F. Chen
46
7
0
21 Dec 2020
Previous
1
2
3
...
41
42
43
...
89
90
91
Next