arXiv:1810.04805
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 18,335 papers shown
Title
Self-supervised Learning from a Multi-view Perspective
Yao-Hung Hubert Tsai
Yue Wu
Ruslan Salakhutdinov
Louis-Philippe Morency
SSL
25
30
0
10 Jun 2020
Learning Functions to Study the Benefit of Multitask Learning
Gabriele Bettgenhauser
Michael A. Hedderich
Dietrich Klakow
18
4
0
09 Jun 2020
Knowledge Distillation: A Survey
Jianping Gou
B. Yu
Stephen J. Maybank
Dacheng Tao
VLM
31
2,857
0
09 Jun 2020
Modeling Label Semantics for Predicting Emotional Reactions
Radhika Gaonkar
Heeyoung Kwon
Mohaddeseh Bastan
Niranjan Balasubramanian
Nathanael Chambers
26
29
0
09 Jun 2020
Combination of abstractive and extractive approaches for summarization of long scientific texts
Vladislav Tretyak
Denis Stepanov
21
10
0
09 Jun 2020
Breaking the Limits of Remote Sensing by Simulation and Deep Learning for Flood and Debris Flow Mapping
Naoto Yokoya
Kazuki Yamanoi
Wei He
Gerald Baier
B. Adriano
H. Miura
S. Oishi
AI4CE
27
1
0
09 Jun 2020
Input-independent Attention Weights Are Expressive Enough: A Study of Attention in Self-supervised Audio Transformers
Tsung-Han Wu
Chun-Chen Hsieh
Yen-Hao Chen
Po-Han Chi
Hung-yi Lee
26
1
0
09 Jun 2020
Provable tradeoffs in adversarially robust classification
Yan Sun
Hamed Hassani
David Hong
Alexander Robey
23
53
0
09 Jun 2020
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines
Marius Mosbach
Maksym Andriushchenko
Dietrich Klakow
31
354
0
08 Jun 2020
Linformer: Self-Attention with Linear Complexity
Sinong Wang
Belinda Z. Li
Madian Khabsa
Han Fang
Hao Ma
108
1,655
0
08 Jun 2020
The Lipschitz Constant of Self-Attention
Hyunjik Kim
George Papamakarios
A. Mnih
16
135
0
08 Jun 2020
Graph-based Visual-Semantic Entanglement Network for Zero-shot Image Recognition
Yang Hu
Guihua Wen
Adriane P. Chapman
Pei Yang
Mingnan Luo
Yingxue Xu
Dan Dai
Wendy Hall
37
18
0
08 Jun 2020
Text Detection and Recognition in the Wild: A Review
Z. Raisi
Mohamed A. Naiel
Paul Fieguth
Steven Wardell
John S. Zelek
47
35
0
08 Jun 2020
Feature Interaction based Neural Network for Click-Through Rate Prediction
Dafang Zou
Leimin Zhang
Jiafa Mao
Weiguo Sheng
14
3
0
07 Jun 2020
Auxiliary Signal-Guided Knowledge Encoder-Decoder for Medical Report Generation
Mingjie Li
Fuyu Wang
Xiaojun Chang
Xiaodan Liang
MedIm
34
101
0
06 Jun 2020
UDPipe at EvaLatin 2020: Contextualized Embeddings and Treebank Embeddings
Milan Straka
Jana Straková
36
21
0
05 Jun 2020
Visual Transformers: Token-based Image Representation and Processing for Computer Vision
Bichen Wu
Chenfeng Xu
Xiaoliang Dai
Alvin Wan
Peizhao Zhang
Zhicheng Yan
Masayoshi Tomizuka
Joseph E. Gonzalez
Kurt Keutzer
Peter Vajda
ViT
39
548
0
05 Jun 2020
An Overview of Neural Network Compression
James O'Neill
AI4CE
52
98
0
05 Jun 2020
DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations
John Giorgi
Osvald Nitski
Bo Wang
Gary D. Bader
SSL
39
490
0
05 Jun 2020
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
Pengcheng He
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
AAML
69
2,631
0
05 Jun 2020
Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers
K. Choromanski
Valerii Likhosherstov
David Dohan
Xingyou Song
Andreea Gane
...
Peter Hawkins
Jared Davis
David Belanger
Lucy J. Colwell
Adrian Weller
41
84
0
05 Jun 2020
Unsupervised Translation of Programming Languages
Marie-Anne Lachaux
Baptiste Roziere
L. Chanussot
Guillaume Lample
45
409
0
05 Jun 2020
Sponge Examples: Energy-Latency Attacks on Neural Networks
Ilia Shumailov
Yiren Zhao
Daniel Bates
Nicolas Papernot
Robert D. Mullins
Ross J. Anderson
SILM
19
127
0
05 Jun 2020
Classification Aware Neural Topic Model and its Application on a New COVID-19 Disinformation Corpus
Xingyi Song
Johann Petrak
Ye Jiang
Iknoor Singh
Diana Maynard
Kalina Bontcheva
18
19
0
05 Jun 2020
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing
Zihang Dai
Guokun Lai
Yiming Yang
Quoc V. Le
48
230
0
05 Jun 2020
DeepVar: An End-to-End Deep Learning Approach for Genomic Variant Recognition in Biomedical Literature
Chaoran Cheng
Fei Tan
Zhi Wei
25
7
0
05 Jun 2020
The SOFC-Exp Corpus and Neural Approaches to Information Extraction in the Materials Science Domain
Annemarie Friedrich
Heike Adel
F. Tomazic
Johannes Hingerl
Renou Benteau
Anika Maruscyk
Lukas Lange
27
71
0
04 Jun 2020
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning
Sameer Khurana
Antoine Laurent
James R. Glass
SSL
19
12
0
04 Jun 2020
Emergent Multi-Agent Communication in the Deep Learning Era
Angeliki Lazaridou
Marco Baroni
AI4CE
48
197
0
03 Jun 2020
DiscSense: Automated Semantic Analysis of Discourse Markers
Damien Sileo
Tim Van de Cruys
Camille Pradel
Philippe Muller
14
7
0
02 Jun 2020
Toxicity Detection: Does Context Really Matter?
John Pavlopoulos
Jeffrey Scott Sorensen
Lucas Dixon
Nithum Thain
Ion Androutsopoulos
29
158
0
01 Jun 2020
A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions
Pengzhen Ren
Yun Xiao
Xiaojun Chang
Po-Yao (Bernie) Huang
Zhihui Li
Xiaojiang Chen
Xin Wang
AI4CE
79
655
0
01 Jun 2020
High-Fidelity Audio Generation and Representation Learning with Guided Adversarial Autoencoder
Kazi Nazmul Haque
R. Rana
Björn W Schuller
DRL
31
12
0
01 Jun 2020
A Survey on Transfer Learning in Natural Language Processing
Zaid Alyafeai
Maged S. Alshaibani
Irfan Ahmad
30
72
0
31 May 2020
Quantized Neural Networks: Characterization and Holistic Optimization
Yoonho Boo
Sungho Shin
Wonyong Sung
MQ
50
8
0
31 May 2020
Rethinking Assumptions in Deep Anomaly Detection
Lukas Ruff
Robert A. Vandermeulen
Billy Joe Franks
Klaus-Robert Muller
Marius Kloft
40
88
0
30 May 2020
IMUTube: Automatic Extraction of Virtual on-body Accelerometry from Video for Human Activity Recognition
Hyeokhyen Kwon
C. Tong
H. Haresamudram
Yan Gao
G. Abowd
Nicholas D. Lane
Thomas Ploetz
24
83
0
29 May 2020
Stance Prediction for Contemporary Issues: Data and Experiments
Marjan Hosseinia
Eduard Constantin Dragut
Arjun Mukherjee
30
28
0
29 May 2020
Analyzing COVID-19 on Online Social Media: Trends, Sentiments and Emotions
Xiaoya Li
Mingxin Zhou
Jiawei Wu
Arianna Yuan
Fei Wu
Jiwei Li
24
45
0
29 May 2020
SAFER: A Structure-free Approach for Certified Robustness to Adversarial Word Substitutions
Mao Ye
Chengyue Gong
Qiang Liu
AAML
23
96
0
29 May 2020
Noise Robust Named Entity Understanding for Voice Assistants
Deepak Muralidharan
Joel Ruben Antony Moniz
Sida Gao
Xiao Yang
Justine T. Kao
...
Kushal Tayal
Roger Zheng
Peter Grasch
Jason D. Williams
Lin Li
24
7
0
29 May 2020
On Incorporating Structural Information to improve Dialogue Response Generation
Nikita Moghe
Priyesh Vijayan
Balaraman Ravindran
Mitesh M. Khapra
22
6
0
28 May 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
140
40,394
0
28 May 2020
Unsupervised Dual Paraphrasing for Two-stage Semantic Parsing
Ruisheng Cao
Su Zhu
Chenyu Yang
Chen Liu
Rao Ma
Yanbin Zhao
Lu Chen
Kai Yu
34
46
0
27 May 2020
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
A. Kuncoro
Lingpeng Kong
Daniel Fried
Dani Yogatama
Laura Rimell
Chris Dyer
Phil Blunsom
31
33
0
27 May 2020
CausaLM: Causal Model Explanation Through Counterfactual Language Models
Amir Feder
Nadav Oved
Uri Shalit
Roi Reichart
CML
LRM
51
157
0
27 May 2020
Transition-based Semantic Dependency Parsing with Pointer Networks
Daniel Fernández-González
Carlos Gómez-Rodríguez
GNN
31
34
0
27 May 2020
General-Purpose User Embeddings based on Mobile App Usage
Junqi Zhang
Bing Bai
Ye Lin
Jian Liang
Kun Bai
Fei-Yue Wang
38
35
0
27 May 2020
Predict-then-Decide: A Predictive Approach for Wait or Answer Task in Dialogue Systems
Zehao Lin
Shaobo Cui
Guodun Li
Xiaoming Kang
Feng Ji
Feng-Lin Li
Zhongzhou Zhao
Haiqing Chen
Yin Zhang
34
1
0
27 May 2020
Comparing BERT against traditional machine learning text classification
Santiago González-Carvajal
E.C. Garrido-Merchán
VLM
30
230
0
26 May 2020