Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,555 papers shown
Title
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion
Suyoun Kim
Siddharth Dalmia
Florian Metze
94
23
0
27 Jun 2019
Eliciting Knowledge from Experts:Automatic Transcript Parsing for Cognitive Task Analysis
Junyi Du
He Jiang
Jiaming Shen
Xiang Ren
58
3
0
26 Jun 2019
Determining Relative Argument Specificity and Stance for Complex Argumentative Structures
Esin Durmus
Faisal Ladhak
Claire Cardie
54
18
0
26 Jun 2019
Enhancing PIO Element Detection in Medical Text Using Contextualized Embedding
H. Mezaoui
A. Gontcharov
Isuru Gunasekara
21
5
0
26 Jun 2019
Good Secretaries, Bad Truck Drivers? Occupational Gender Stereotypes in Sentiment Analysis
J. Bhaskaran
Isha Bhallamudi
66
47
0
24 Jun 2019
Is It Worth the Attention? A Comparative Evaluation of Attention Layers for Argument Unit Segmentation
Maximilian Spliethover
Jonas Klaff
Hendrik Heuer
54
10
0
24 Jun 2019
Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation
Daniel Loureiro
A. Jorge
85
138
0
24 Jun 2019
LIAAD at SemDeep-5 Challenge: Word-in-Context (WiC)
Daniel Loureiro
A. Jorge
65
17
0
24 Jun 2019
Classification and Clustering of Arguments with Contextualized Word Embeddings
Nils Reimers
Benjamin Schiller
Tilman Beck
Johannes Daxenberger
Christian Stab
Iryna Gurevych
91
171
0
24 Jun 2019
EQuANt (Enhanced Question Answer Network)
Franccois-Xavier Aubet
D. Danks
Yuchen Zhu
56
3
0
24 Jun 2019
Evaluating the Supervised and Zero-shot Performance of Multi-lingual Translation Models
Chris Hokamp
John Glover
D. Ghalandari
60
14
0
24 Jun 2019
Alchemy: A Quantum Chemistry Dataset for Benchmarking AI Models
Guangyong Chen
Pengfei Chen
Chang-Yu Hsieh
Chee-Kong Lee
B. Liao
...
J. Qiu
Qiming Sun
Jie Tang
R. Zemel
Shengyu Zhang
79
76
0
22 Jun 2019
Identification of Tasks, Datasets, Evaluation Metrics, and Numeric Scores for Scientific Leaderboards Construction
Yufang Hou
Charles Jochim
Martin Gleize
Francesca Bonin
Debasis Ganguly
71
95
0
21 Jun 2019
Deep Leakage from Gradients
Ligeng Zhu
Zhijian Liu
Song Han
FedML
114
2,249
0
21 Jun 2019
Graph Star Net for Generalized Multi-Task Learning
H. Lu
Seth H. Huang
Tian Ye
Xiuyan Guo
GNN
85
46
0
21 Jun 2019
Informative Image Captioning with External Sources of Information
Sanqiang Zhao
Piyush Sharma
Tomer Levinboim
Radu Soricut
65
46
0
20 Jun 2019
Few-Shot Sequence Labeling with Label Dependency Transfer and Pair-wise Embedding
Yutai Hou
Zhihan Zhou
Yijia Liu
Ning Wang
Wanxiang Che
Han Liu
Ting Liu
71
9
0
20 Jun 2019
Generating Empathetic Responses by Looking Ahead the User's Sentiment
Jamin Shin
Peng Xu
Andrea Madotto
Pascale Fung
55
48
0
20 Jun 2019
Multi-Grained Named Entity Recognition
Congying Xia
Chenwei Zhang
Tao Yang
Yaliang Li
Nan Du
Xian Wu
Wei Fan
Fenglong Ma
Philip Yu
86
87
0
20 Jun 2019
SMILES-X: autonomous molecular compounds characterization for small datasets without descriptors
G. Lambard
Ekaterina Gracheva
71
21
0
20 Jun 2019
Learning Compressed Sentence Representations for On-Device Text Processing
Dinghan Shen
Pengyu Cheng
Dhanasekar Sundararaman
Xinyuan Zhang
Qian Yang
Meng Tang
Asli Celikyilmaz
Lawrence Carin
67
23
0
19 Jun 2019
REflex: Flexible Framework for Relation Extraction in Multiple Domains
Geeticka Chauhan
Matthew B. A. McDermott
Peter Szolovits
44
13
0
19 Jun 2019
SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly Representative Neural Architectures
Hsin-Pai Cheng
Tunhou Zhang
Yukun Yang
Feng Yan
Shiyu Li
Harris Teague
H. Li
Yiran Chen
77
11
0
19 Jun 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
497
8,472
0
19 Jun 2019
Evaluating Protein Transfer Learning with TAPE
Roshan Rao
Nicholas Bhattacharya
Neil Thomas
Yan Duan
Xi Chen
John F. Canny
Pieter Abbeel
Yun S. Song
SSL
110
813
0
19 Jun 2019
Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction
Christoph Alt
Marc Hübner
Leonhard Hennig
77
122
0
19 Jun 2019
Surf at MEDIQA 2019: Improving Performance of Natural Language Inference in the Clinical Domain by Adopting Pre-trained Language Model
Jiin Nam
Seunghyun Yoon
Kyomin Jung
LM&MA
39
3
0
19 Jun 2019
Improving Sentiment Analysis with Multi-task Learning of Negation
Jeremy Barnes
Erik Velldal
Lilja Øvrelid
70
36
0
18 Jun 2019
Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability
Antoine Caubrière
N. Tomashenko
Antoine Laurent
Emmanuel Morin
Nathalie Camelin
Yannick Esteve
52
54
0
18 Jun 2019
Towards Robust Named Entity Recognition for Historic German
Stefan Schweter
Johannes Baiter
53
23
0
18 Jun 2019
Transfer Learning for Causal Sentence Detection
Manolis Kyriakakis
Ion Androutsopoulos
Joan Ginés i Ametllé
Artur Saudabayev
52
25
0
18 Jun 2019
Zero-Shot Entity Linking by Reading Entity Descriptions
Lajanugen Logeswaran
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
Jacob Devlin
Honglak Lee
VLM
91
257
0
18 Jun 2019
Measuring Bias in Contextualized Word Representations
Keita Kurita
Nidhi Vyas
Ayush Pareek
A. Black
Yulia Tsvetkov
121
454
0
18 Jun 2019
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models
Wei Fang
Yu-An Chung
James R. Glass
61
27
0
17 Jun 2019
Coherent and Controllable Outfit Generation
Kedan Li
Chen Liu
David A. Forsyth
72
15
0
17 Jun 2019
Open Domain Event Extraction Using Neural Latent Variable Models
Xiao Liu
Heyan Huang
Yue Zhang
BDL
DRL
65
57
0
17 Jun 2019
ParNet: Position-aware Aggregated Relation Network for Image-Text matching
Yaxian Xia
Lun Huang
Wenmin Wang
Xiao-Yong Wei
Jie Chen
128
1
0
17 Jun 2019
MixUp as Directional Adversarial Training
Guillaume P. Archambault
Yongyi Mao
Hongyu Guo
Richong Zhang
AAML
58
23
0
17 Jun 2019
Understanding Natural Language Instructions for Fetching Daily Objects Using GAN-Based Multimodal Target-Source Classification
A. Magassouba
K. Sugiura
Anh Trinh Quoc
Hisashi Kawai
70
34
0
17 Jun 2019
Meta-learning Pseudo-differential Operators with Deep Neural Networks
Jordi Feliu-Fabà
Yuwei Fan
Lexing Ying
66
40
0
16 Jun 2019
Theoretical Limitations of Self-Attention in Neural Sequence Models
Michael Hahn
96
276
0
16 Jun 2019
One Epoch Is All You Need
Aran Komatsuzaki
78
51
0
16 Jun 2019
Multi-Hop Paragraph Retrieval for Open-Domain Question Answering
Yair Feldman
Ran El-Yaniv
RALM
97
100
0
15 Jun 2019
Context is Key: Grammatical Error Detection with Contextual Word Representations
Samuel J. Bell
H. Yannakoudakis
Marek Rei
82
43
0
15 Jun 2019
Can neural networks understand monotonicity reasoning?
Hitomi Yanaka
K. Mineshima
D. Bekki
Kentaro Inui
Satoshi Sekine
Lasha Abzianidze
Johan Bos
LRM
67
81
0
15 Jun 2019
High-Performance Deep Learning via a Single Building Block
E. Georganas
K. Banerjee
Dhiraj D. Kalamkar
Sasikanth Avancha
Anand Venkat
Michael J. Anderson
G. Henry
Hans Pabst
A. Heinecke
43
12
0
15 Jun 2019
Scalable Syntax-Aware Language Models Using Knowledge Distillation
A. Kuncoro
Chris Dyer
Laura Rimell
S. Clark
Phil Blunsom
146
26
0
14 Jun 2019
"My Way of Telling a Story": Persona based Grounded Story Generation
Shrimai Prabhumoye
Khyathi Chandu
Ruslan Salakhutdinov
A. Black
82
35
0
14 Jun 2019
Comparison of Diverse Decoding Methods from Conditional Language Models
Daphne Ippolito
Reno Kriz
M. Kustikova
João Sedoc
Chris Callison-Burch
AI4CE
100
114
0
14 Jun 2019
Augmenting Neural Networks with First-order Logic
Tao Li
Vivek Srikumar
66
109
0
14 Jun 2019
Previous
1
2
3
...
459
460
461
...
470
471
472
Next