Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.05365
Cited By
v1
v2 (latest)
Deep contextualized word representations
15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep contextualized word representations"
50 / 4,508 papers shown
Title
GRET: Global Representation Enhanced Transformer
Rongxiang Weng
Hao-Ran Wei
Shujian Huang
Heng Yu
Lidong Bing
Weihua Luo
Jiajun Chen
74
9
0
24 Feb 2020
ScopeIt: Scoping Task Relevant Sentences in Documents
Vishwas Suryanarayanan
Barun Patra
P. Bhattacharya
C. Fufa
Charles Lee
47
4
0
23 Feb 2020
Efficient Sentence Embedding via Semantic Subspace Analysis
Bin Wang
Fenxiao Chen
Yun Cheng Wang
C.-C. Jay Kuo
65
10
0
22 Feb 2020
Modelling Latent Skills for Multitask Language Generation
Kris Cao
Dani Yogatama
46
3
0
21 Feb 2020
Measuring Social Biases in Grounded Vision and Language Embeddings
Candace Ross
Boris Katz
Andrei Barbu
110
65
0
20 Feb 2020
The Fluidity of Concept Representations in Human Brain Signals
E. Hendrikx
Lisa Beinborn
33
0
0
20 Feb 2020
Contextual Lensing of Universal Sentence Representations
J. Kiros
57
5
0
20 Feb 2020
Federated pretraining and fine tuning of BERT using clinical notes from multiple silos
Dianbo Liu
Timothy A. Miller
AI4MH
71
36
0
20 Feb 2020
Compressing BERT: Studying the Effects of Weight Pruning on Transfer Learning
Mitchell A. Gordon
Kevin Duh
Nicholas Andrews
VLM
105
343
0
19 Feb 2020
CoLES: Contrastive Learning for Event Sequences with Self-Supervision
Dmitrii Babaev
Ivan Kireev
Nikita Ovsov
Maria Ivanova
Gleb Gusev
Ivan Nazarov
Alexander Tuzhilin
SSL
AI4TS
63
27
0
19 Feb 2020
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
Zhangyin Feng
Daya Guo
Duyu Tang
Nan Duan
Xiaocheng Feng
...
Linjun Shou
Bing Qin
Ting Liu
Daxin Jiang
Ming Zhou
285
2,727
0
19 Feb 2020
A Systematic Comparison of Architectures for Document-Level Sentiment Classification
Jeremy Barnes
Vinit Ravishankar
Lilja Ovrelid
Erik Velldal
17
0
0
19 Feb 2020
The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding
Xiaodong Liu
Yu Wang
Jianshu Ji
Hao Cheng
Xueyun Zhu
...
Pengcheng He
Weizhu Chen
Hoifung Poon
Guihong Cao
Jianfeng Gao
AI4CE
77
61
0
19 Feb 2020
Learning by Semantic Similarity Makes Abstractive Summarization Better
Wonjin Yoon
Yoonsun Yeo
Minbyul Jeong
Bong-Jun Yi
Jaewoo Kang
164
16
0
18 Feb 2020
Gradient-Based Adversarial Training on Transformer Networks for Detecting Check-Worthy Factual Claims
Kevin Meng
Damian Jimenez
Fatma Arslan
J. Devasier
Daniel Obembe
Chengkai Li
77
16
0
18 Feb 2020
Hierarchical Transformer Network for Utterance-level Emotion Recognition
Qingbiao Li
Chunhua Wu
K. Zheng
Zhe Wang
63
23
0
18 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM
AI4TS
AI4CE
122
140
0
18 Feb 2020
Annotating and Extracting Synthesis Process of All-Solid-State Batteries from Scientific Literature
Fusataka Kuniyoshi
Kohei Makino
Jun Ozawa
Makoto Miwa
67
34
0
18 Feb 2020
From English To Foreign Languages: Transferring Pre-trained Language Models
Ke M. Tran
58
52
0
18 Feb 2020
A Financial Service Chatbot based on Deep Bidirectional Transformers
S. Yu
Yuxin Chen
Hussain Zaidi
73
35
0
17 Feb 2020
Convergence of End-to-End Training in Deep Unsupervised Contrastive Learning
Zixin Wen
SSL
79
3
0
17 Feb 2020
Incorporating BERT into Neural Machine Translation
Jinhua Zhu
Yingce Xia
Lijun Wu
Di He
Tao Qin
Wen-gang Zhou
Houqiang Li
Tie-Yan Liu
FedML
AIMat
50
360
0
17 Feb 2020
SBERT-WK: A Sentence Embedding Method by Dissecting BERT-based Word Models
Bin Wang
C.-C. Jay Kuo
69
156
0
16 Feb 2020
Text-based Question Answering from Information Retrieval and Deep Neural Network Perspectives: A Survey
Zahra Abbasiyantaeb
S. Momtazi
RALM
95
74
0
16 Feb 2020
Automated Labelling using an Attention model for Radiology reports of MRI scans (ALARM)
D. Wood
J. Lynch
S. Kafiabadi
Emily Guilhem
A. A. Busaidi
...
Keena Patel
Gareth J. Barker
S. Ourselin
James H. Cole
Thomas C Booth
MedIm
63
42
0
16 Feb 2020
Multi-Scale Representation Learning for Spatial Feature Distributions using Grid Cells
Gengchen Mai
K. Janowicz
Bo Yan
Rui Zhu
Ling Cai
Ni Lao
SSL
148
125
0
16 Feb 2020
Deeper Task-Specificity Improves Joint Entity and Relation Extraction
Phil Crone
28
14
0
15 Feb 2020
TwinBERT: Distilling Knowledge to Twin-Structured BERT Models for Efficient Retrieval
Wenhao Lu
Jian Jiao
Ruofei Zhang
67
50
0
14 Feb 2020
Transformer on a Diet
Chenguang Wang
Zihao Ye
Aston Zhang
Zheng Zhang
Alex Smola
93
8
0
14 Feb 2020
Combining Visual and Textual Features for Semantic Segmentation of Historical Newspapers
Raphaël Barman
Maud Ehrmann
Simon Clematide
S. Oliveira
F. Kaplan
74
40
0
14 Feb 2020
Dialogue history integration into end-to-end signal-to-concept spoken language understanding systems
N. Tomashenko
C. Raymond
Antoine Caubrière
R. Mori
Yannick Esteve
94
15
0
14 Feb 2020
Pre-Training for Query Rewriting in A Spoken Language Understanding System
Zheng Chen
Xing Fan
Yuan Ling
Lambert Mathias
Chenlei Guo
54
23
0
13 Feb 2020
Utilizing BERT Intermediate Layers for Aspect Based Sentiment Analysis and Natural Language Inference
Youwei Song
Jiahai Wang
Zhiwei Liang
Zhiyue Liu
Tao Jiang
81
77
0
12 Feb 2020
Explaining Explanations: Axiomatic Feature Interactions for Deep Networks
Joseph D. Janizek
Pascal Sturmfels
Su-In Lee
FAtt
90
149
0
10 Feb 2020
Adversarial Filters of Dataset Biases
Ronan Le Bras
Swabha Swayamdipta
Chandra Bhagavatula
Rowan Zellers
Matthew E. Peters
Ashish Sabharwal
Yejin Choi
185
223
0
10 Feb 2020
Exploring Chemical Space using Natural Language Processing Methodologies for Drug Discovery
Hakime Öztürk
Arzucan Özgür
P. Schwaller
Teodoro Laino
Elif Özkirimli
98
122
0
10 Feb 2020
Localized Flood DetectionWith Minimal Labeled Social Media Data Using Transfer Learning
Neha Singh
Nirmalya Roy
A. Gangopadhyay
81
6
0
10 Feb 2020
How Much Knowledge Can You Pack Into the Parameters of a Language Model?
Adam Roberts
Colin Raffel
Noam M. Shazeer
KELM
185
898
0
10 Feb 2020
REALM: Retrieval-Augmented Language Model Pre-Training
Kelvin Guu
Kenton Lee
Zora Tung
Panupong Pasupat
Ming-Wei Chang
RALM
211
2,133
0
10 Feb 2020
Multilingual Alignment of Contextual Word Representations
Steven Cao
Nikita Kitaev
Dan Klein
166
194
0
10 Feb 2020
Geometric Dataset Distances via Optimal Transport
David Alvarez-Melis
Nicolò Fusi
OT
154
205
0
07 Feb 2020
Incorporating Visual Semantics into Sentence Representations within a Grounded Space
Patrick Bordes
Éloi Zablocki
Laure Soulier
Benjamin Piwowarski
Patrick Gallinari
55
26
0
07 Feb 2020
MA-DST: Multi-Attention Based Scalable Dialog State Tracking
Adarsh Kumar
Peter Ku
Anuj Kumar Goyal
A. Metallinou
Dilek Z. Hakkani-Tür
80
59
0
07 Feb 2020
Aligning the Pretraining and Finetuning Objectives of Language Models
Nuo Wang Pierse
Jing Lu
AI4CE
42
2
0
05 Feb 2020
K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters
Ruize Wang
Duyu Tang
Nan Duan
Zhongyu Wei
Xuanjing Huang
Jianshu Ji
Guihong Cao
Daxin Jiang
Ming Zhou
KELM
177
557
0
05 Feb 2020
Parsing as Pretraining
David Vilares
Michalina Strzyz
Anders Søgaard
Carlos Gómez-Rodríguez
90
32
0
05 Feb 2020
Syntactically Look-Ahead Attention Network for Sentence Compression
Hidetaka Kamigaito
Manabu Okumura
84
20
0
04 Feb 2020
Dynamic Parameter Allocation in Parameter Servers
Alexander Renz-Wieland
Rainer Gemulla
Steffen Zeuch
Volker Markl
59
18
0
03 Feb 2020
Fine-Tuning BERT for Schema-Guided Zero-Shot Dialogue State Tracking
Yu-Ping Ruan
Zhenhua Ling
Jia-Chen Gu
Quan Liu
73
20
0
01 Feb 2020
Pretrained Transformers for Simple Question Answering over Knowledge Graphs
Denis Lukovnikov
Asja Fischer
Jens Lehmann
GNN
AI4MH
96
58
0
31 Jan 2020
Previous
1
2
3
...
62
63
64
...
89
90
91
Next