Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,511 papers shown
Title
What do you learn from context? Probing for sentence structure in contextualized word representations
Ian Tenney
Patrick Xia
Berlin Chen
Alex Jinpeng Wang
Adam Poliak
...
Najoung Kim
Benjamin Van Durme
Samuel R. Bowman
Dipanjan Das
Ellie Pavlick
245
867
0
15 May 2019
A Surprisingly Robust Trick for Winograd Schema Challenge
Vid Kocijan
Ana-Maria Cretu
Oana-Maria Camburu
Yordan Yordanov
Thomas Lukasiewicz
88
101
0
15 May 2019
Selection Bias Explorations and Debias Methods for Natural Language Sentence Matching Datasets
Guanhua Zhang
Bing Bai
Jian Liang
Kun Bai
Shiyu Chang
Mo Yu
Conghui Zhu
Tiejun Zhao
79
27
0
15 May 2019
Behavior Sequence Transformer for E-commerce Recommendation in Alibaba
Qiwei Chen
Huan Zhao
Wei Li
Pipei Huang
Wenwu Ou
60
392
0
15 May 2019
BERT Rediscovers the Classical NLP Pipeline
Ian Tenney
Dipanjan Das
Ellie Pavlick
MILM
SSeg
240
1,487
0
15 May 2019
Sense Vocabulary Compression through the Semantic Knowledge of WordNet for Neural Word Sense Disambiguation
Loïc Vial
Benjamin Lecouteux
D. Schwab
70
92
0
14 May 2019
Style Transformer: Unpaired Text Style Transfer without Disentangled Latent Representation
Ning Dai
Jianze Liang
Xipeng Qiu
Xuanjing Huang
DRL
121
204
0
14 May 2019
Strong and Simple Baselines for Multimodal Utterance Embeddings
Paul Pu Liang
Y. Lim
Yao-Hung Hubert Tsai
Ruslan Salakhutdinov
Louis-Philippe Morency
SSL
66
30
0
14 May 2019
How to Fine-Tune BERT for Text Classification?
Chi Sun
Xipeng Qiu
Yige Xu
Xuanjing Huang
130
1,532
0
14 May 2019
Imputing Missing Events in Continuous-Time Event Streams
Hongyuan Mei
Guanghui Qin
Jason Eisner
AI4TS
80
41
0
14 May 2019
Entity-Relation Extraction as Multi-Turn Question Answering
Xiaoya Li
Fan Yin
Zijun Sun
Xiayu Li
Arianna Yuan
Duo Chai
Mingxin Zhou
Jiwei Li
95
348
0
14 May 2019
PatentBERT: Patent Classification with Fine-Tuning a pre-trained BERT Model
Jieh-Sheng Lee
J. Hsiang
56
94
0
14 May 2019
Improving Neural Conversational Models with Entropy-Based Data Filtering
Richard Csaky
Patrik Purgai
Gábor Recski
103
58
0
14 May 2019
Cognitive Graph for Multi-Hop Reading Comprehension at Scale
Ming Ding
Chang Zhou
Qibin Chen
Hongxia Yang
Jie Tang
106
226
0
14 May 2019
A Review of Keyphrase Extraction
Eirini Papagiannopoulou
Grigorios Tsoumakas
66
169
0
13 May 2019
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Yi Ren
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
95
102
0
13 May 2019
Synchronous Bidirectional Neural Machine Translation
Long Zhou
Jiajun Zhang
Chengqing Zong
123
106
0
13 May 2019
Challenges in Building Intelligent Open-domain Dialog Systems
Minlie Huang
Xiaoyan Zhu
Jianfeng Gao
VLM
152
316
0
13 May 2019
A logical-based corpus for cross-lingual evaluation
Felipe Salvatore
Marcelo Finger
R. Hirata
80
1
0
10 May 2019
Deep Unsupervised Cardinality Estimation
Zongheng Yang
Eric Liang
Amog Kamsetty
Chenggang Wu
Yan Duan
Peter Chen
Pieter Abbeel
J. M. Hellerstein
S. Krishnan
Ion Stoica
96
208
0
10 May 2019
Language Modeling with Deep Transformers
Kazuki Irie
Albert Zeyer
Ralf Schluter
Hermann Ney
KELM
111
176
0
10 May 2019
Survey on Evaluation Methods for Dialogue Systems
Jan Deriu
Álvaro Rodrigo
Arantxa Otegi
Guillermo Echegoyen
S. Rosset
Eneko Agirre
Mark Cieliebak
118
285
0
10 May 2019
Improving Discrete Latent Representations With Differentiable Approximation Bridges
Jason Ramapuram
Russ Webb
DRL
40
9
0
09 May 2019
Deep Closest Point: Learning Representations for Point Cloud Registration
Yue Wang
Justin Solomon
3DPC
74
854
0
08 May 2019
Generative Model with Dynamic Linear Flow
Huadong Liao
Jiawei He
Kun-xian Shu
DRL
49
5
0
08 May 2019
MetaPred: Meta-Learning for Clinical Risk Prediction with Limited Patient Electronic Health Records
Xi Sheryl Zhang
Fengyi Tang
H. H. Dodge
Jiayu Zhou
Fei Wang
54
110
0
08 May 2019
Unified Language Model Pre-training for Natural Language Understanding and Generation
Li Dong
Nan Yang
Wenhui Wang
Furu Wei
Xiaodong Liu
Yu Wang
Jianfeng Gao
M. Zhou
H. Hon
ELM
AI4CE
250
1,562
0
08 May 2019
Emotion Recognition in Conversation: Research Challenges, Datasets, and Recent Advances
Soujanya Poria
Navonil Majumder
Rada Mihalcea
Eduard H. Hovy
78
363
0
08 May 2019
FAQ Retrieval using Query-Question Similarity and BERT-Based Query-Answer Relevance
Wataru Sakata
Tomohide Shibata
Ribeka Tanaka
Sadao Kurohashi
RALM
67
108
0
08 May 2019
MASS: Masked Sequence to Sequence Pre-training for Language Generation
Kaitao Song
Xu Tan
Tao Qin
Jianfeng Lu
Tie-Yan Liu
160
967
0
07 May 2019
Show, Price and Negotiate: A Negotiator with Online Value Look-Ahead
Amin Parvaneh
Ehsan Abbasnejad
Qi Wu
Javen Qinfeng Shi
Anton van den Hengel
OffRL
71
5
0
07 May 2019
Neural Architecture Refinement: A Practical Way for Avoiding Overfitting in NAS
Yangzhou Jiang
Cong Zhao
Zeyang Dou
Lei Pang
59
5
0
07 May 2019
Taming Pretrained Transformers for Extreme Multi-label Text Classification
Wei-Cheng Chang
Hsiang-Fu Yu
Kai Zhong
Yiming Yang
Inderjit Dhillon
75
20
0
07 May 2019
Investigating the Successes and Failures of BERT for Passage Re-Ranking
Harshith Padigela
Hamed Zamani
W. Bruce Croft
75
47
0
05 May 2019
Towards More Realistic Human-Robot Conversation: A Seq2Seq-based Body Gesture Interaction System
Minjie Hua
Fuyuan Shi
Yibing Nan
Kai Wang
Hao Chen
Kai Wang
55
10
0
05 May 2019
Learning to Denoise Distantly-Labeled Data for Entity Typing
Yasumasa Onoe
Greg Durrett
93
58
0
04 May 2019
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems
Alex Jinpeng Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
431
2,331
0
02 May 2019
ASER: A Large-scale Eventuality Knowledge Graph
Hongming Zhang
Xin Liu
Haojie Pan
Yangqiu Song
C. Leung
SLR
107
163
0
01 May 2019
Deep Learning for Audio Signal Processing
Hendrik Purwins
Yue Liu
Tuomas Virtanen
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
123
599
0
30 Apr 2019
Very Deep Self-Attention Networks for End-to-End Speech Recognition
Ngoc-Quan Pham
T. Nguyen
Jan Niehues
Markus Müller
Sebastian Stüker
A. Waibel
91
161
0
30 Apr 2019
Segmentation is All You Need
Zehua Cheng
Yuxiang Wu
Zhenghua Xu
Thomas Lukasiewicz
Weiyan Wang
72
20
0
30 Apr 2019
Enabling Robots to Understand Incomplete Natural Language Instructions Using Commonsense Reasoning
Haonan Chen
Hao Tan
Alan Kuntz
Joey Tianyi Zhou
Ron Alterovitz
LM&Ro
LRM
76
45
0
29 Apr 2019
Unsupervised Data Augmentation for Consistency Training
Qizhe Xie
Zihang Dai
Eduard H. Hovy
Minh-Thang Luong
Quoc V. Le
163
2,337
0
29 Apr 2019
Softmax Optimizations for Intel Xeon Processor-based Platforms
Jacek Czaja
Michal Gallus
Tomasz Patejko
Jian Tang Intel Corporation
35
3
0
28 Apr 2019
Towards Efficient Model Compression via Learned Global Ranking
Ting-Wu Chin
Ruizhou Ding
Cha Zhang
Diana Marculescu
83
172
0
28 Apr 2019
Improved Conditional VRNNs for Video Prediction
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
DRL
157
164
0
27 Apr 2019
Contextualized Word Embeddings Enhanced Event Temporal Relation Extraction for Story Understanding
Rujun Han
Mengyue Liang
Bashar Alhafni
Nanyun Peng
AI4TS
NAI
58
21
0
26 Apr 2019
Transformers with convolutional context for ASR
Abdel-rahman Mohamed
Dmytro Okhonko
Luke Zettlemoyer
69
169
0
26 Apr 2019
TVQA+: Spatio-Temporal Grounding for Video Question Answering
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
77
230
0
25 Apr 2019
Probing What Different NLP Tasks Teach Machines about Function Word Comprehension
Najoung Kim
Roma Patel
Adam Poliak
Alex Jinpeng Wang
Patrick Xia
...
Alexis Ross
Tal Linzen
Benjamin Van Durme
Samuel R. Bowman
Ellie Pavlick
80
107
0
25 Apr 2019
Previous
1
2
3
...
463
464
465
...
469
470
471
Next