Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 19,786 papers shown
Title
ETNLP: a visual-aided systematic approach to select pre-trained embeddings for a downstream task
Xuan-Son Vu
Thanh Tien Vu
Son N. Tran
Lili Jiang
31
6
0
11 Mar 2019
HLT@SUDA at SemEval 2019 Task 1: UCCA Graph Parsing as Constituent Tree Parsing
Wei Jiang
Zhenghua Li
Yu Zhang
Min Zhang
32
20
0
11 Mar 2019
Fast Prototyping a Dialogue Comprehension System for Nurse-Patient Conversations on Symptom Monitoring
Zhengyuan Liu
Jia Hui Hazel Lim
Nur Farah Ain Binte Sahimi
Shao Chuen Tong
Sharon Ong
...
M. Macdonald
Savitha Ramasamy
Pavitra Krishnaswamy
W. Chow
Nancy F. Chen
41
24
0
08 Mar 2019
Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State
Richard Futrell
Ethan Gotlieb Wilcox
Takashi Morita
Peng Qian
Miguel Ballesteros
R. Levy
MILM
64
192
0
08 Mar 2019
Learning to Speak and Act in a Fantasy Text Adventure Game
Jack Urbanek
Angela Fan
Siddharth Karamcheti
Saachi Jain
Samuel Humeau
Emily Dinan
Tim Rocktaschel
Douwe Kiela
Arthur Szlam
Jason Weston
LLMAG
45
206
0
07 Mar 2019
Predicting Research Trends From Arxiv
Steffen Eger
Chao Li
Florian Netzer
Iryna Gurevych
21
7
0
07 Mar 2019
Hierarchical Autoregressive Image Models with Auxiliary Decoders
J. Fauw
Sander Dieleman
Karen Simonyan
GAN
35
37
0
06 Mar 2019
SemEval-2019 Task 1: Cross-lingual Semantic Parsing with UCCA
Daniel Hershcovich
Zohar Aizenbud
Leshem Choshen
Elior Sulem
A. Rappoport
Omri Abend
43
36
0
06 Mar 2019
SECNLP: A Survey of Embeddings in Clinical Natural Language Processing
Katikapalli Subramanyam Kalyan
S. Sangeetha
22
81
0
04 Mar 2019
DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs
Dheeru Dua
Yizhong Wang
Pradeep Dasigi
Gabriel Stanovsky
Sameer Singh
Matt Gardner
AIMat
37
929
0
01 Mar 2019
Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data
Wei Zhao
Liang Wang
Kewei Shen
Ruoyu Jia
Jingming Liu
41
210
0
01 Mar 2019
Infer Your Enemies and Know Yourself, Learning in Real-Time Bidding with Partially Observable Opponents
Manxing Du
Alexander I. Cowen-Rivers
Ying Wen
Phu Sakulwongtana
Jun Wang
M. Brorsson
R. State
16
1
0
28 Feb 2019
Representation Learning for Recommender Systems with Application to the Scientific Literature
Robin Brochier
19
5
0
28 Feb 2019
Link Prediction with Mutual Attention for Text-Attributed Networks
Robin Brochier
Adrien Guille
Julien Velcin
22
12
0
28 Feb 2019
Better, Faster, Stronger Sequence Tagging Constituent Parsers
David Vilares
Mostafa Abdou
Anders Søgaard
46
23
0
28 Feb 2019
BERT for Joint Intent Classification and Slot Filling
Qian Chen
Zhu Zhuo
Wen Wang
VLM
31
548
0
28 Feb 2019
Still a Pain in the Neck: Evaluating Text Representations on Lexical Composition
Vered Shwartz
Ido Dagan
CoGe
27
79
0
27 Feb 2019
An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models
Alexandra Chronopoulou
Christos Baziotis
Alexandros Potamianos
CLL
31
130
0
27 Feb 2019
Multi-Task Learning with Contextualized Word Representations for Extented Named Entity Recognition
Thai-Hoang Pham
Khai Mai
M. T. Nguyen
Nguyen Tuan Duc
Danushka Bollegala
Ryohei Sasano
Satoshi Sekine
40
4
0
26 Feb 2019
Cross-Lingual Alignment of Contextual Word Embeddings, with Applications to Zero-shot Dependency Parsing
Tal Schuster
Ori Ram
Regina Barzilay
Amir Globerson
44
208
0
25 Feb 2019
Attentional Encoder Network for Targeted Sentiment Classification
Youwei Song
Jiahai Wang
Tao Jiang
Zhiyue Liu
Yanghui Rao
14
275
0
25 Feb 2019
Pretraining-Based Natural Language Generation for Text Summarization
Haoyu Zhang
Jianjun Xu
Ji Wang
27
208
0
25 Feb 2019
A Theoretical Analysis of Contrastive Unsupervised Representation Learning
Sanjeev Arora
H. Khandeparkar
M. Khodak
Orestis Plevrakis
Nikunj Saunshi
SSL
68
765
0
25 Feb 2019
Star-Transformer
Qipeng Guo
Xipeng Qiu
Pengfei Liu
Yunfan Shao
Xiangyang Xue
Zheng Zhang
32
262
0
25 Feb 2019
Text Analysis in Adversarial Settings: Does Deception Leave a Stylistic Trace?
Tommi Gröndahl
Nadarajah Asokan
44
24
0
24 Feb 2019
Enhancing Clinical Concept Extraction with Contextual Embeddings
Yuqi Si
Jingqi Wang
Hua Xu
Kirk Roberts
AI4MH
29
286
0
22 Feb 2019
Improving Multilingual Sentence Embedding using Bi-directional Dual Encoder with Additive Margin Softmax
Yinfei Yang
Gustavo Hernández Ábrego
Steve Yuan
Mandy Guo
Qinlan Shen
Daniel Cer
Yun-hsuan Sung
B. Strope
R. Kurzweil
52
116
0
22 Feb 2019
Latent Translation: Crossing Modalities by Bridging Generative Models
Yingtao Tian
Jesse Engel
DRL
23
15
0
21 Feb 2019
Breaking the Softmax Bottleneck via Learnable Monotonic Pointwise Non-linearities
O. Ganea
Sylvain Gelly
Gary Bécigneul
Aliaksei Severyn
34
18
0
21 Feb 2019
Audio-Linguistic Embeddings for Spoken Sentences
Albert Haque
Michelle Guo
Prateek Verma
Li Fei-Fei
28
51
0
20 Feb 2019
Wide Neural Networks of Any Depth Evolve as Linear Models Under Gradient Descent
Jaehoon Lee
Lechao Xiao
S. Schoenholz
Yasaman Bahri
Roman Novak
Jascha Narain Sohl-Dickstein
Jeffrey Pennington
57
1,081
0
18 Feb 2019
Using Embeddings to Correct for Unobserved Confounding in Networks
Victor Veitch
Yixin Wang
David M. Blei
CML
28
57
0
11 Feb 2019
Improved Knowledge Distillation via Teacher Assistant
Seyed Iman Mirzadeh
Mehrdad Farajtabar
Ang Li
Nir Levine
Akihiro Matsukawa
H. Ghasemzadeh
55
1,070
0
09 Feb 2019
End-to-End Open-Domain Question Answering with BERTserini
Wei Yang
Yuqing Xie
Aileen Lin
Xingyu Li
Luchen Tan
Kun Xiong
Ming Li
Jimmy J. Lin
RALM
57
493
0
05 Feb 2019
The Referential Reader: A Recurrent Entity Network for Anaphora Resolution
Fei Liu
Luke Zettlemoyer
Jacob Eisenstein
40
16
0
05 Feb 2019
Attention in Natural Language Processing
Andrea Galassi
Marco Lippi
Paolo Torroni
GNN
36
470
0
04 Feb 2019
Extracting Multiple-Relations in One-Pass with Pre-Trained Transformers
Haoyu Wang
Ming Tan
Mo Yu
Shiyu Chang
Dakuo Wang
Kun Xu
Xiaoxiao Guo
Saloni Potdar
ViT
34
97
0
04 Feb 2019
Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference
R. Thomas McCoy
Ellie Pavlick
Tal Linzen
74
1,224
0
04 Feb 2019
Improving Question Answering with External Knowledge
Xiaoman Pan
Kai Sun
Dian Yu
Jianshu Chen
Heng Ji
Claire Cardie
Dong Yu
KELM
24
66
0
03 Feb 2019
tax2vec: Constructing Interpretable Features from Taxonomies for Short Text Classification
Blaž Škrlj
Matej Martinc
Jan Kralj
Nada Lavrac
Senja Pollak
44
44
0
01 Feb 2019
The Second Conversational Intelligence Challenge (ConvAI2)
Emily Dinan
V. Logacheva
Valentin Malykh
Alexander H. Miller
Kurt Shuster
...
Alexander I. Rudnicky
Jason Williams
Joelle Pineau
Andrey Kravchenko
Jason Weston
DRL
45
362
0
31 Jan 2019
Multi-Task Deep Neural Networks for Natural Language Understanding
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
AI4CE
65
1,263
0
31 Jan 2019
A large-scale crowdsourced analysis of abuse against women journalists and politicians on Twitter
Laure Delisle
Freddie Kalaitzis
Krzysztof Majewski
A. D. Berker
M. Marin
Julien Cornebise
19
29
0
31 Jan 2019
Learning and Evaluating General Linguistic Intelligence
Dani Yogatama
Cyprien de Masson dÁutume
Jerome T. Connor
Tomás Kociský
Mike Chrzanowski
...
Angeliki Lazaridou
Wang Ling
Lei Yu
Chris Dyer
Phil Blunsom
ELM
AI4CE
50
210
0
31 Jan 2019
EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks
Jason W. Wei
Kai Zou
37
1,922
0
31 Jan 2019
The Evolved Transformer
David R. So
Chen Liang
Quoc V. Le
ViT
38
461
0
30 Jan 2019
Glyce: Glyph-vectors for Chinese Character Representations
Yuxian Meng
Wei Wu
Fei Wang
Xiaoya Li
Ping Nie
J. Mei
Muyu Li
Qinghong Han
Xiaofei Sun
Jiwei Li
VLM
29
190
0
29 Jan 2019
Evaluating Word Embedding Models: Methods and Experimental Results
Bin Wang
Angela Wang
Fenxiao Chen
Yun Cheng Wang
C.-C. Jay Kuo
ELM
32
260
0
28 Jan 2019
Language Independent Sequence Labelling for Opinion Target Extraction
Rodrigo Agerri
German Rigau
17
22
0
28 Jan 2019
Stiffness: A New Perspective on Generalization in Neural Networks
Stanislav Fort
Pawel Krzysztof Nowak
Stanislaw Jastrzebski
S. Narayanan
52
94
0
28 Jan 2019
Previous
1
2
3
...
393
394
395
396
Next