Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 18,720 papers shown
Title
Consistent Dialogue Generation with Self-supervised Feature Learning
Yizhe Zhang
Xiang Gao
Sungjin Lee
Chris Brockett
Michel Galley
Jianfeng Gao
W. Dolan
30
28
0
13 Mar 2019
Maybe Deep Neural Networks are the Best Choice for Modeling Source Code
Rafael-Michael Karampatsis
Charles Sutton
32
54
0
13 Mar 2019
ALOHA: Auxiliary Loss Optimization for Hypothesis Augmentation
Ethan M. Rudd
Felipe N. Ducau
Cody Wild
Konstantin Berlin
Richard E. Harang
AAML
24
29
0
13 Mar 2019
ETNLP: a visual-aided systematic approach to select pre-trained embeddings for a downstream task
Xuan-Son Vu
Thanh Tien Vu
Son N. Tran
Lili Jiang
26
6
0
11 Mar 2019
HLT@SUDA at SemEval 2019 Task 1: UCCA Graph Parsing as Constituent Tree Parsing
Wei Jiang
Zhenghua Li
Yu Zhang
Min Zhang
27
20
0
11 Mar 2019
Fast Prototyping a Dialogue Comprehension System for Nurse-Patient Conversations on Symptom Monitoring
Zhengyuan Liu
Jia Hui Hazel Lim
Nur Farah Ain Binte Sahimi
Shao Chuen Tong
Sharon Ong
...
M. Macdonald
Savitha Ramasamy
Pavitra Krishnaswamy
W. Chow
Nancy F. Chen
23
24
0
08 Mar 2019
Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State
Richard Futrell
Ethan Gotlieb Wilcox
Takashi Morita
Peng Qian
Miguel Ballesteros
R. Levy
MILM
42
191
0
08 Mar 2019
Learning to Speak and Act in a Fantasy Text Adventure Game
Jack Urbanek
Angela Fan
Siddharth Karamcheti
Saachi Jain
Samuel Humeau
Emily Dinan
Tim Rocktaschel
Douwe Kiela
Arthur Szlam
Jason Weston
LLMAG
33
205
0
07 Mar 2019
Predicting Research Trends From Arxiv
Steffen Eger
Chao Li
Florian Netzer
Iryna Gurevych
21
7
0
07 Mar 2019
Hierarchical Autoregressive Image Models with Auxiliary Decoders
J. Fauw
Sander Dieleman
Karen Simonyan
GAN
30
37
0
06 Mar 2019
SemEval-2019 Task 1: Cross-lingual Semantic Parsing with UCCA
Daniel Hershcovich
Zohar Aizenbud
Leshem Choshen
Elior Sulem
A. Rappoport
Omri Abend
30
36
0
06 Mar 2019
SECNLP: A Survey of Embeddings in Clinical Natural Language Processing
Katikapalli Subramanyam Kalyan
S. Sangeetha
17
81
0
04 Mar 2019
Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data
Wei Zhao
Liang Wang
Kewei Shen
Ruoyu Jia
Jingming Liu
25
210
0
01 Mar 2019
Infer Your Enemies and Know Yourself, Learning in Real-Time Bidding with Partially Observable Opponents
Manxing Du
Alexander I. Cowen-Rivers
Ying Wen
Phu Sakulwongtana
Jun Wang
M. Brorsson
R. State
16
1
0
28 Feb 2019
Representation Learning for Recommender Systems with Application to the Scientific Literature
Robin Brochier
19
5
0
28 Feb 2019
Link Prediction with Mutual Attention for Text-Attributed Networks
Robin Brochier
Adrien Guille
Julien Velcin
22
12
0
28 Feb 2019
Better, Faster, Stronger Sequence Tagging Constituent Parsers
David Vilares
Mostafa Abdou
Anders Søgaard
35
23
0
28 Feb 2019
BERT for Joint Intent Classification and Slot Filling
Qian Chen
Zhu Zhuo
Wen Wang
VLM
31
547
0
28 Feb 2019
Still a Pain in the Neck: Evaluating Text Representations on Lexical Composition
Vered Shwartz
Ido Dagan
CoGe
27
79
0
27 Feb 2019
An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models
Alexandra Chronopoulou
Christos Baziotis
Alexandros Potamianos
CLL
31
130
0
27 Feb 2019
Multi-Task Learning with Contextualized Word Representations for Extented Named Entity Recognition
Thai-Hoang Pham
Khai Mai
M. T. Nguyen
Nguyen Tuan Duc
Danushka Bollegala
Ryohei Sasano
Satoshi Sekine
26
4
0
26 Feb 2019
Cross-Lingual Alignment of Contextual Word Embeddings, with Applications to Zero-shot Dependency Parsing
Tal Schuster
Ori Ram
Regina Barzilay
Amir Globerson
41
207
0
25 Feb 2019
Attentional Encoder Network for Targeted Sentiment Classification
Youwei Song
Jiahai Wang
Tao Jiang
Zhiyue Liu
Yanghui Rao
14
275
0
25 Feb 2019
Pretraining-Based Natural Language Generation for Text Summarization
Haoyu Zhang
Jianjun Xu
Ji Wang
27
208
0
25 Feb 2019
A Theoretical Analysis of Contrastive Unsupervised Representation Learning
Sanjeev Arora
H. Khandeparkar
M. Khodak
Orestis Plevrakis
Nikunj Saunshi
SSL
68
765
0
25 Feb 2019
Star-Transformer
Qipeng Guo
Xipeng Qiu
Pengfei Liu
Yunfan Shao
Xiangyang Xue
Zheng Zhang
27
262
0
25 Feb 2019
Text Analysis in Adversarial Settings: Does Deception Leave a Stylistic Trace?
Tommi Gröndahl
Nadarajah Asokan
30
24
0
24 Feb 2019
Enhancing Clinical Concept Extraction with Contextual Embeddings
Yuqi Si
Jingqi Wang
Hua Xu
Kirk Roberts
AI4MH
29
286
0
22 Feb 2019
Improving Multilingual Sentence Embedding using Bi-directional Dual Encoder with Additive Margin Softmax
Yinfei Yang
Gustavo Hernández Ábrego
Steve Yuan
Mandy Guo
Qinlan Shen
Daniel Cer
Yun-hsuan Sung
B. Strope
R. Kurzweil
52
115
0
22 Feb 2019
Breaking the Softmax Bottleneck via Learnable Monotonic Pointwise Non-linearities
O. Ganea
Sylvain Gelly
Gary Bécigneul
Aliaksei Severyn
29
18
0
21 Feb 2019
Audio-Linguistic Embeddings for Spoken Sentences
Albert Haque
Michelle Guo
Prateek Verma
Li Fei-Fei
28
51
0
20 Feb 2019
Wide Neural Networks of Any Depth Evolve as Linear Models Under Gradient Descent
Jaehoon Lee
Lechao Xiao
S. Schoenholz
Yasaman Bahri
Roman Novak
Jascha Narain Sohl-Dickstein
Jeffrey Pennington
57
1,080
0
18 Feb 2019
Using Embeddings to Correct for Unobserved Confounding in Networks
Victor Veitch
Yixin Wang
David M. Blei
CML
23
56
0
11 Feb 2019
End-to-End Open-Domain Question Answering with BERTserini
Wei Yang
Yuqing Xie
Aileen Lin
Xingyu Li
Luchen Tan
Kun Xiong
Ming Li
Jimmy J. Lin
RALM
52
492
0
05 Feb 2019
The Referential Reader: A Recurrent Entity Network for Anaphora Resolution
Fei Liu
Luke Zettlemoyer
Jacob Eisenstein
40
16
0
05 Feb 2019
Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference
R. Thomas McCoy
Ellie Pavlick
Tal Linzen
69
1,217
0
04 Feb 2019
Improving Question Answering with External Knowledge
Xiaoman Pan
Kai Sun
Dian Yu
Jianshu Chen
Heng Ji
Claire Cardie
Dong Yu
KELM
19
66
0
03 Feb 2019
tax2vec: Constructing Interpretable Features from Taxonomies for Short Text Classification
Blaž Škrlj
Matej Martinc
Jan Kralj
Nada Lavrac
Senja Pollak
27
44
0
01 Feb 2019
The Second Conversational Intelligence Challenge (ConvAI2)
Emily Dinan
V. Logacheva
Valentin Malykh
Alexander H. Miller
Kurt Shuster
...
Alexander I. Rudnicky
Jason Williams
Joelle Pineau
Andrey Kravchenko
Jason Weston
DRL
40
361
0
31 Jan 2019
Multi-Task Deep Neural Networks for Natural Language Understanding
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
AI4CE
60
1,263
0
31 Jan 2019
A large-scale crowdsourced analysis of abuse against women journalists and politicians on Twitter
Laure Delisle
Freddie Kalaitzis
Krzysztof Majewski
A. D. Berker
M. Marin
Julien Cornebise
11
29
0
31 Jan 2019
Learning and Evaluating General Linguistic Intelligence
Dani Yogatama
Cyprien de Masson dÁutume
Jerome T. Connor
Tomás Kociský
Mike Chrzanowski
...
Angeliki Lazaridou
Wang Ling
Lei Yu
Chris Dyer
Phil Blunsom
ELM
AI4CE
33
209
0
31 Jan 2019
EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks
Jason W. Wei
Kai Zou
37
1,920
0
31 Jan 2019
Glyce: Glyph-vectors for Chinese Character Representations
Yuxian Meng
Wei Wu
Fei Wang
Xiaoya Li
Ping Nie
J. Mei
Muyu Li
Qinghong Han
Xiaofei Sun
Jiwei Li
VLM
27
190
0
29 Jan 2019
Evaluating Word Embedding Models: Methods and Experimental Results
Bin Wang
Angela Wang
Fenxiao Chen
Yun Cheng Wang
C.-C. Jay Kuo
ELM
27
260
0
28 Jan 2019
Stiffness: A New Perspective on Generalization in Neural Networks
Stanislav Fort
Pawel Krzysztof Nowak
Stanislaw Jastrzebski
S. Narayanan
27
94
0
28 Jan 2019
Dual Co-Matching Network for Multi-choice Reading Comprehension
Shuailiang Zhang
Zhao Hai
Yuwei Wu
Zhuosheng Zhang
Xi Zhou
Xiaoping Zhou
39
131
0
27 Jan 2019
Deep Learning on Small Datasets without Pre-Training using Cosine Loss
Björn Barz
Joachim Denzler
27
130
0
25 Jan 2019
A BERT Baseline for the Natural Questions
Chris Alberti
Kenton Lee
Michael Collins
ELM
AI4MH
25
127
0
24 Jan 2019
Large-Batch Training for LSTM and Beyond
Yang You
Jonathan Hseu
Chris Ying
J. Demmel
Kurt Keutzer
Cho-Jui Hsieh
33
89
0
24 Jan 2019
Previous
1
2
3
...
372
373
374
375
Next