Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 18,335 papers shown
Title
Cross-Lingual Ability of Multilingual BERT: An Empirical Study
Karthikeyan K
Zihan Wang
Stephen D. Mayhew
Dan Roth
LRM
36
333
0
17 Dec 2019
Multilingual is not enough: BERT for Finnish
Antti Virtanen
Jenna Kanerva
Rami Ilo
Jouni Luoma
Juhani Luotolahti
T. Salakoski
Filip Ginter
S. Pyysalo
36
277
0
15 Dec 2019
Knowledge-based Conversational Search
Svitlana Vakulenko
22
13
0
14 Dec 2019
Text as Environment: A Deep Reinforcement Learning Text Readability Assessment Model
Hamid Reza Mohammadi
S. H. Khasteh
Tahereh Firoozi
Taha Samavati
29
20
0
12 Dec 2019
FlauBERT: Unsupervised Language Model Pre-training for French
Hang Le
Loïc Vial
Jibril Frej
Vincent Segonne
Maximin Coavoux
Benjamin Lecouteux
A. Allauzen
Benoît Crabbé
Laurent Besacier
D. Schwab
AI4CE
54
395
0
11 Dec 2019
CoSimLex: A Resource for Evaluating Graded Word Similarity in Context
C. S. Armendariz
Matthew Purver
Matej Ulčar
Senja Pollak
Nikola Ljubesic
Marko Robnik-Šikonja
Mark Granroth-Wilding
Kristiina Vaik
29
35
0
11 Dec 2019
Improving Neural Protein-Protein Interaction Extraction with Knowledge Selection
Huiwei Zhou
Xuefei Li
Weihong Yao
Zhuang Liu
Shixian Ning
Chengkun Lang
Lei Du
24
7
0
11 Dec 2019
Adversarial Analysis of Natural Language Inference Systems
Tiffany Chien
Jugal Kalita
AAML
39
12
0
07 Dec 2019
Weak Supervision helps Emergence of Word-Object Alignment and improves Vision-Language Tasks
Corentin Kervadec
G. Antipov
M. Baccouche
Christian Wolf
21
15
0
06 Dec 2019
Self-Supervised Learning of Video-Induced Visual Invariances
Michael Tschannen
Josip Djolonga
Marvin Ritter
Aravindh Mahendran
Xiaohua Zhai
N. Houlsby
Sylvain Gelly
Mario Lucic
SSL
18
61
0
05 Dec 2019
Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline
Vishvak Murahari
Dhruv Batra
Devi Parikh
Abhishek Das
VLM
23
115
0
05 Dec 2019
Scratch that! An Evolution-based Adversarial Attack against Neural Networks
Malhar Jere
Loris Rossi
Briland Hitaj
Gabriela F. Cretu-Ciocarlie
Giacomo Boracchi
F. Koushanfar
AAML
19
18
0
05 Dec 2019
Plug and Play Language Models: A Simple Approach to Controlled Text Generation
Sumanth Dathathri
Andrea Madotto
Janice Lan
Jane Hung
Eric Frank
Piero Molino
J. Yosinski
Rosanne Liu
KELM
64
946
0
04 Dec 2019
An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering
Shayne Longpre
Yi Lu
Zhucheng Tu
Christopher DuBois
27
70
0
04 Dec 2019
AMUSED: A Multi-Stream Vector Representation Method for Use in Natural Dialogue
Gaurav Kumar
Rishabh Joshi
Jaspreet Singh
Promod Yenigalla
23
7
0
04 Dec 2019
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
33
313
0
04 Dec 2019
Scalable Bayesian Preference Learning for Crowds
Edwin Simpson
Iryna Gurevych
BDL
19
24
0
04 Dec 2019
Reading the Manual: Event Extraction as Definition Comprehension
Yunmo Chen
Tongfei Chen
Seth Ebner
Aaron Steven White
Benjamin Van Durme
30
63
0
03 Dec 2019
A Comparative Study of Pretrained Language Models on Thai Social Text Categorization
Thanapapas Horsuwan
Kasidis Kanwatchara
P. Vateekul
B. Kijsirikul
19
9
0
03 Dec 2019
An Annotated Dataset of Coreference in English Literature
David Bamman
Olivia Lewke
A. Mansoor
16
105
0
03 Dec 2019
Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection
Shubhi Tyagi
M. Nicolis
Jonas Rohnke
Thomas Drugman
Jaime Lorenzo-Trueba
32
32
0
02 Dec 2019
Solving Arithmetic Word Problems Automatically Using Transformer and Unambiguous Representations
Kaden Griffith
Jugal Kalita
18
16
0
02 Dec 2019
Multi-Scale Self-Attention for Text Classification
Qipeng Guo
Xipeng Qiu
Pengfei Liu
Xiangyang Xue
Zheng-Wei Zhang
ViT
13
62
0
02 Dec 2019
Knowledge Infused Learning (K-IL): Towards Deep Incorporation of Knowledge in Deep Learning
Ugur Kursuncu
Manas Gaur
A. Sheth
NAI
26
57
0
01 Dec 2019
Preserving Patient Privacy while Training a Predictive Model of In-hospital Mortality
Pulkit Sharma
Farah E. Shamout
David Clifton
25
25
0
01 Dec 2019
Automatic Creation of Text Corpora for Low-Resource Languages from the Internet: The Case of Swiss German
Lucy Linder
Michael Jungo
J. Hennebert
C. Musat
Andreas Fischer
22
15
0
30 Nov 2019
Deconstructing and reconstructing word embedding algorithms
Edward Newell
Kian Kenyon-Dean
Jackie C.K. Cheung
39
4
0
29 Nov 2019
An Iterative Polishing Framework based on Quality Aware Masked Language Model for Chinese Poetry Generation
Li-Ming Deng
Jie Wang
Hangming Liang
Hui-ping Chen
Zhiqiang Xie
Bojin Zhuang
Shaojun Wang
Jing Xiao
57
22
0
29 Nov 2019
Blockwisely Supervised Neural Architecture Search with Knowledge Distillation
Changlin Li
Jiefeng Peng
Liuchun Yuan
Guangrun Wang
Xiaodan Liang
Liang Lin
Xiaojun Chang
36
180
0
29 Nov 2019
Inducing Relational Knowledge from BERT
Zied Bouraoui
Jose Camacho-Collados
Steven Schockaert
34
166
0
28 Nov 2019
DeFINE: DEep Factorized INput Token Embeddings for Neural Sequence Modeling
Sachin Mehta
Rik Koncel-Kedziorski
Mohammad Rastegari
Hannaneh Hajishirzi
AI4TS
38
23
0
27 Nov 2019
FairyTED: A Fair Rating Predictor for TED Talk Data
Rupam Acharyya
Shouman Das
Ankani Chattoraj
Md. Iftekhar Tanveer
27
12
0
25 Nov 2019
Corpus Wide Argument Mining -- a Working Solution
L. Ein-Dor
Eyal Shnarch
Lena Dankin
Alon Halfon
Benjamin Sznajder
...
Leshem Choshen
Yufang Hou
Yonatan Bilu
R. Aharonov
Noam Slonim
22
62
0
25 Nov 2019
End-to-End Trainable Non-Collaborative Dialog System
Yu Li
Kun Qian
Weiyan Shi
Zhou Yu
21
45
0
25 Nov 2019
Who did They Respond to? Conversation Structure Modeling using Masked Hierarchical Transformer
Henghui Zhu
Feng Nan
Zhiguo Wang
Ramesh Nallapati
Bing Xiang
30
39
0
25 Nov 2019
Invenio: Discovering Hidden Relationships Between Tasks/Domains Using Structured Meta Learning
Sameeksha Katoch
Kowshik Thopalli
Jayaraman J. Thiagarajan
Pavan Turaga
A. Spanias
14
4
0
24 Nov 2019
Task-Oriented Dialog Systems that Consider Multiple Appropriate Responses under the Same Context
Yichi Zhang
Zhijian Ou
Zhou Yu
27
182
0
24 Nov 2019
A Transformer-based approach to Irony and Sarcasm detection
Rolandos Alexandros Potamias
Georgios Siolas
A. Stafylopatis
33
206
0
23 Nov 2019
Factorized Multimodal Transformer for Multimodal Sequential Learning
Amir Zadeh
Chengfeng Mao
Kelly Shi
Yiwei Zhang
Paul Pu Liang
Soujanya Poria
Louis-Philippe Morency
25
44
0
22 Nov 2019
WildMix Dataset and Spectro-Temporal Transformer Model for Monoaural Audio Source Separation
Amir Zadeh
Tianjun Ma
Soujanya Poria
Louis-Philippe Morency
19
8
0
21 Nov 2019
Separate and Attend in Personal Email Search
Yu Meng
Maryam Karimzadehgan
Honglei Zhuang
Donald Metzler
FedML
220
2
0
21 Nov 2019
Improving Conditioning in Context-Aware Sequence to Sequence Models
Xinyi Wang
Jason Weston
Michael Auli
Yacine Jernite
31
13
0
21 Nov 2019
Automatically Neutralizing Subjective Bias in Text
Reid Pryzant
Richard Diehl Martinez
Nathan Dass
Sadao Kurohashi
Dan Jurafsky
Diyi Yang
33
175
0
21 Nov 2019
Attention-Informed Mixed-Language Training for Zero-shot Cross-lingual Task-oriented Dialogue Systems
Zihan Liu
Genta Indra Winata
Zhaojiang Lin
Peng Xu
Pascale Fung
25
98
0
21 Nov 2019
Generating Interactive Worlds with Text
Angela Fan
Jack Urbanek
Pratik Ringshia
Emily Dinan
Emma Qian
...
Shrimai Prabhumoye
Douwe Kiela
Tim Rocktaschel
Arthur Szlam
Jason Weston
27
27
0
20 Nov 2019
Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization
Shiori Sagawa
Pang Wei Koh
Tatsunori B. Hashimoto
Percy Liang
OOD
16
1,200
0
20 Nov 2019
Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation
Junliang Guo
Xu Tan
Linli Xu
Tao Qin
Enhong Chen
Tie-Yan Liu
14
85
0
20 Nov 2019
DermGAN: Synthetic Generation of Clinical Skin Images with Pathology
Amirata Ghorbani
Vivek Natarajan
David Coz
Yuan Liu
GAN
MedIm
21
98
0
20 Nov 2019
Global Greedy Dependency Parsing
Z. Li
Zhao Hai
Kevin Parnow
31
31
0
20 Nov 2019
Towards Lingua Franca Named Entity Recognition with BERT
Taesun Moon
Parul Awasthy
Jian Ni
Radu Florian
19
29
0
19 Nov 2019
Previous
1
2
3
...
350
351
352
...
365
366
367
Next