Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 19,786 papers shown
Title
Self-Supervised Dialogue Learning
Jiawei Wu
Xin Eric Wang
William Yang Wang
SSL
19
58
0
30 Jun 2019
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting
Shiyang Li
Xiaoyong Jin
Yao Xuan
Xiyou Zhou
Wenhu Chen
Yu Wang
Xifeng Yan
AI4TS
26
1,391
0
29 Jun 2019
Deep Gamblers: Learning to Abstain with Portfolio Theory
Liu Ziyin
Zhikang T. Wang
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
Masahito Ueda
43
110
0
29 Jun 2019
GPT-based Generation for Classical Chinese Poetry
Yi-Lun Liao
Yasheng Wang
Qun Liu
Xin Jiang
29
40
0
29 Jun 2019
Relating Simple Sentence Representations in Deep Neural Networks and the Brain
Sharmistha Jat
Hao Tang
Partha P. Talukdar
Tom Michael Mitchell
22
21
0
27 Jun 2019
Good Secretaries, Bad Truck Drivers? Occupational Gender Stereotypes in Sentiment Analysis
J. Bhaskaran
Isha Bhallamudi
27
47
0
24 Jun 2019
Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation
Daniel Loureiro
A. Jorge
24
138
0
24 Jun 2019
LIAAD at SemDeep-5 Challenge: Word-in-Context (WiC)
Daniel Loureiro
A. Jorge
22
17
0
24 Jun 2019
Classification and Clustering of Arguments with Contextualized Word Embeddings
Nils Reimers
Benjamin Schiller
Tilman Beck
Johannes Daxenberger
Christian Stab
Iryna Gurevych
22
166
0
24 Jun 2019
EQuANt (Enhanced Question Answer Network)
Franccois-Xavier Aubet
D. Danks
Yuchen Zhu
26
3
0
24 Jun 2019
Evaluating the Supervised and Zero-shot Performance of Multi-lingual Translation Models
Chris Hokamp
John Glover
D. Ghalandari
26
14
0
24 Jun 2019
Deep Leakage from Gradients
Ligeng Zhu
Zhijian Liu
Song Han
FedML
43
2,176
0
21 Jun 2019
Graph Star Net for Generalized Multi-Task Learning
H. Lu
Seth H. Huang
Tian Ye
Xiuyan Guo
GNN
38
46
0
21 Jun 2019
SMILES-X: autonomous molecular compounds characterization for small datasets without descriptors
G. Lambard
Ekaterina Gracheva
27
21
0
20 Jun 2019
Learning Compressed Sentence Representations for On-Device Text Processing
Dinghan Shen
Pengyu Cheng
Dhanasekar Sundararaman
Xinyuan Zhang
Qian Yang
Meng Tang
Asli Celikyilmaz
Lawrence Carin
23
22
0
19 Jun 2019
SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly Representative Neural Architectures
Hsin-Pai Cheng
Tunhou Zhang
Yukun Yang
Feng Yan
Shiyu Li
Harris Teague
H. Li
Yiran Chen
25
11
0
19 Jun 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
129
8,361
0
19 Jun 2019
Evaluating Protein Transfer Learning with TAPE
Roshan Rao
Nicholas Bhattacharya
Neil Thomas
Yan Duan
Xi Chen
John F. Canny
Pieter Abbeel
Yun S. Song
SSL
61
786
0
19 Jun 2019
Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction
Christoph Alt
Marc Hübner
Leonhard Hennig
20
119
0
19 Jun 2019
Improving Sentiment Analysis with Multi-task Learning of Negation
Jeremy Barnes
Erik Velldal
Lilja Øvrelid
26
36
0
18 Jun 2019
Zero-Shot Entity Linking by Reading Entity Descriptions
Lajanugen Logeswaran
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
Jacob Devlin
Honglak Lee
VLM
22
252
0
18 Jun 2019
Measuring Bias in Contextualized Word Representations
Keita Kurita
Nidhi Vyas
Ayush Pareek
A. Black
Yulia Tsvetkov
63
448
0
18 Jun 2019
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models
Wei Fang
Yu-An Chung
James R. Glass
26
27
0
17 Jun 2019
Coherent and Controllable Outfit Generation
Kedan Li
Chen Liu
David A. Forsyth
56
15
0
17 Jun 2019
Open Domain Event Extraction Using Neural Latent Variable Models
Xiao Liu
Heyan Huang
Yue Zhang
BDL
DRL
27
57
0
17 Jun 2019
ParNet: Position-aware Aggregated Relation Network for Image-Text matching
Yaxian Xia
Lun Huang
Wenmin Wang
Xiao-Yong Wei
Jie Chen
52
1
0
17 Jun 2019
Meta-learning Pseudo-differential Operators with Deep Neural Networks
Jordi Feliu-Fabà
Yuwei Fan
Lexing Ying
24
39
0
16 Jun 2019
One Epoch Is All You Need
Aran Komatsuzaki
32
50
0
16 Jun 2019
Multi-Hop Paragraph Retrieval for Open-Domain Question Answering
Yair Feldman
Ran El-Yaniv
RALM
32
100
0
15 Jun 2019
Context is Key: Grammatical Error Detection with Contextual Word Representations
Samuel J. Bell
H. Yannakoudakis
Marek Rei
37
41
0
15 Jun 2019
Can neural networks understand monotonicity reasoning?
Hitomi Yanaka
K. Mineshima
D. Bekki
Kentaro Inui
Satoshi Sekine
Lasha Abzianidze
Johan Bos
LRM
41
80
0
15 Jun 2019
Scalable Syntax-Aware Language Models Using Knowledge Distillation
A. Kuncoro
Chris Dyer
Laura Rimell
S. Clark
Phil Blunsom
60
26
0
14 Jun 2019
"My Way of Telling a Story": Persona based Grounded Story Generation
Shrimai Prabhumoye
Khyathi Chandu
Ruslan Salakhutdinov
A. Black
32
35
0
14 Jun 2019
Augmenting Neural Networks with First-order Logic
Tao Li
Vivek Srikumar
21
109
0
14 Jun 2019
A Simple and Effective Approach to Automatic Post-Editing with Transfer Learning
Gonçalo M. Correia
André F. T. Martins
19
42
0
14 Jun 2019
DocRED: A Large-Scale Document-Level Relation Extraction Dataset
Yuan Yao
Deming Ye
Peng Li
Xu Han
Yankai Lin
Zhenghao Liu
Zhiyuan Liu
Lixin Huang
Jie Zhou
Maosong Sun
22
448
0
14 Jun 2019
Learning to Ask Unanswerable Questions for Machine Reading Comprehension
Haichao Zhu
Li Dong
Furu Wei
Wenhui Wang
Bing Qin
Ting Liu
RALM
26
31
0
14 Jun 2019
Image Captioning: Transforming Objects into Words
Simão Herdade
Armin Kappeler
K. Boakye
Joao Soares
ViT
62
465
0
14 Jun 2019
Sentiment analysis is not solved! Assessing and probing sentiment classification
Jeremy Barnes
Lilja Øvrelid
Erik Velldal
24
32
0
13 Jun 2019
Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index
Minjoon Seo
Jinhyuk Lee
Tom Kwiatkowski
Ankur P. Parikh
Ali Farhadi
Hannaneh Hajishirzi
RALM
26
154
0
13 Jun 2019
Learning Video Representations using Contrastive Bidirectional Transformer
Chen Sun
Fabien Baradel
Kevin Patrick Murphy
Cordelia Schmid
SSL
ViT
52
133
0
13 Jun 2019
2D Attentional Irregular Scene Text Recognizer
Pengyuan Lyu
Zhicheng Yang
Xinhang Leng
Xiaojun Wu
Ruiyu Li
Xiaoyong Shen
3DV
53
50
0
13 Jun 2019
Proactive Human-Machine Conversation with Explicit Conversation Goals
Wenquan Wu
Zhen Guo
Xiangyang Zhou
Hua Wu
Xiyuan Zhang
Rongzhong Lian
Haifeng Wang
38
195
0
13 Jun 2019
Lattice Transformer for Speech Translation
Pei Zhang
Boxing Chen
Niyu Ge
Kai Fan
39
48
0
13 Jun 2019
Transfer Learning in Biomedical Natural Language Processing: An Evaluation of BERT and ELMo on Ten Benchmarking Datasets
Yifan Peng
Shankai Yan
Zhiyong Lu
LM&MA
AI4MH
40
834
0
13 Jun 2019
Synthetic QA Corpora Generation with Roundtrip Consistency
Chris Alberti
D. Andor
Emily Pitler
Jacob Devlin
Michael Collins
SyDa
41
245
0
12 Jun 2019
Neural Arabic Question Answering
Hussein Mozannar
Karl El Hajal
Elie Maamary
Hazem M. Hajj
28
135
0
12 Jun 2019
Representation Learning for Words and Entities
Pushpendre Rastogi
SSL
44
0
0
12 Jun 2019
Explore, Propose, and Assemble: An Interpretable Model for Multi-Hop Reading Comprehension
Yichen Jiang
Nitish Joshi
Yen-Chun Chen
Joey Tianyi Zhou
RALM
29
39
0
12 Jun 2019
Toward Interpretable Music Tagging with Self-Attention
Minz Won
Sanghyuk Chun
Xavier Serra
ViT
21
81
0
12 Jun 2019
Previous
1
2
3
...
388
389
390
...
394
395
396
Next