Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,799 papers shown
Title
Applying the Information Bottleneck Principle to Prosodic Representation Learning
Guangyan Zhang
Ying Qin
Daxin Tan
Tan Lee
79
4
0
05 Aug 2021
Decoupled Transformer for Scalable Inference in Open-domain Question Answering
Haytham ElFadeel
Stanislav Peshterliev
115
1
0
05 Aug 2021
Video Contrastive Learning with Global Context
Haofei Kuang
Yi Zhu
Zhi-Li Zhang
Xinyu Li
Joseph Tighe
Sören Schwertfeger
C. Stachniss
Mu Li
SSL
AI4TS
93
61
0
05 Aug 2021
Using a Collated Cybersecurity Dataset for Machine Learning and Artificial Intelligence
Erik Hemberg
Una-May O’Reilly
52
10
0
05 Aug 2021
EENLP: Cross-lingual Eastern European NLP Index
Alexey Tikhonov
Alex Malkhasov
A. Manoshin
George-Andrei Dima
Réka Cserháti
Md. Sadek Hossain Asif
Matt Sárdi
84
2
0
05 Aug 2021
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
Yiding Jiang
Bidisha Sharma
Maulik C. Madhavi
Haizhou Li
103
26
0
05 Aug 2021
Imperceptible Adversarial Examples by Spatial Chroma-Shift
A. Aydin
Deniz Sen
Berat Tuna Karli
Oguz Hanoglu
A. Temi̇zel
AAML
62
16
0
05 Aug 2021
Token Shift Transformer for Video Classification
Hao Zhang
Y. Hao
Chong-Wah Ngo
ViT
87
119
0
05 Aug 2021
Fast Convergence of DETR with Spatially Modulated Co-Attention
Peng Gao
Minghang Zheng
Xiaogang Wang
Jifeng Dai
Hongsheng Li
ViT
102
308
0
05 Aug 2021
TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding
Dailan He
Yusheng Zhao
Junyu Luo
Tianrui Hui
Shaofei Huang
Aixi Zhang
Si Liu
ViT
81
95
0
05 Aug 2021
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning
Xinzhi Dong
Chengjiang Long
Wenju Xu
Chunxia Xiao
ViT
152
68
0
05 Aug 2021
Understand me, if you refer to Aspect Knowledge: Knowledge-aware Gated Recurrent Memory Network
Bowen Xing
Ivor W. Tsang
46
16
0
05 Aug 2021
FMMformer: Efficient and Flexible Transformer via Decomposed Near-field and Far-field Attention
T. Nguyen
Vai Suliafu
Stanley J. Osher
Long Chen
Bao Wang
72
36
0
05 Aug 2021
Robust Transfer Learning with Pretrained Language Models through Adapters
Wenjuan Han
Bo Pang
Ying Nian Wu
74
56
0
05 Aug 2021
Automatic Detection of COVID-19 Vaccine Misinformation with Graph Link Prediction
Maxwell Weinzierl
S. Harabagiu
104
29
0
04 Aug 2021
Boosting Few-shot Semantic Segmentation with Transformers
Guolei Sun
Yun-Hai Liu
Christos Sakaridis
Luc Van Gool
ViT
63
9
0
04 Aug 2021
Curriculum learning for language modeling
Daniel Fernando Campos
59
33
0
04 Aug 2021
The Potential of Using Vision Videos for CrowdRE: Video Comments as a Source of Feedback
Oliver Karras
Eklekta Kristo
J. Klünder
20
8
0
04 Aug 2021
Question-controlled Text-aware Image Captioning
Anwen Hu
Shizhe Chen
Qin Jin
76
15
0
04 Aug 2021
Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification
Shengding Hu
Ning Ding
Huadong Wang
Zhiyuan Liu
Jingang Wang
Juan-Zi Li
Wei Wu
Maosong Sun
VLM
110
373
0
04 Aug 2021
Log-based Anomaly Detection Without Log Parsing
Van-Hoang Le
Hongyu Zhang
87
189
0
04 Aug 2021
How to Query Language Models?
Leonard Adolphs
Shehzaad Dhuliawala
Thomas Hofmann
KELM
86
15
0
04 Aug 2021
PARADISE: Exploiting Parallel Data for Multilingual Sequence-to-Sequence Pretraining
Machel Reid
Mikel Artetxe
VLM
105
28
0
04 Aug 2021
Quality Evaluation of the Low-Resource Synthetically Generated Code-Mixed Hinglish Text
Vivek Srivastava
M. Singh
69
12
0
04 Aug 2021
Controlled Text Generation as Continuous Optimization with Multiple Constraints
Sachin Kumar
Eric Malmi
Aliaksei Severyn
Yulia Tsvetkov
BDL
AI4CE
115
79
0
04 Aug 2021
Q-Pain: A Question Answering Dataset to Measure Social Bias in Pain Management
Cécile Logé
Emily L. Ross
D. Dadey
Saahil Jain
A. Saporta
A. Ng
Pranav Rajpurkar
144
23
0
03 Aug 2021
Improving Counterfactual Generation for Fair Hate Speech Detection
Aida Mostafazadeh Davani
Ali Omrani
Brendan Kennedy
M. Atari
Xiang Ren
Morteza Dehghani
70
11
0
03 Aug 2021
Linking Common Vulnerabilities and Exposures to the MITRE ATT&CK Framework: A Self-Distillation Approach
Benjamin Ampel
Sagar Samtani
Steven Ullman
Hsinchun Chen
70
39
0
03 Aug 2021
Vision Transformer with Progressive Sampling
Xiaoyu Yue
Shuyang Sun
Zhanghui Kuang
Meng Wei
Philip Torr
Wayne Zhang
Dahua Lin
ViT
94
85
0
03 Aug 2021
Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation
Zaid Khan
Y. Fu
81
140
0
03 Aug 2021
Grounding Representation Similarity with Statistical Testing
Frances Ding
Jean-Stanislas Denain
Jacob Steinhardt
87
30
0
03 Aug 2021
Large-Scale Differentially Private BERT
Rohan Anil
Badih Ghazi
Vineet Gupta
Ravi Kumar
Pasin Manurangsi
96
139
0
03 Aug 2021
ExBERT: An External Knowledge Enhanced BERT for Natural Language Inference
Amit Gajbhiye
Noura Al Moubayed
S. Bradley
65
10
0
03 Aug 2021
sarcasm detection and quantification in arabic tweets
Bashar Talafha
Muhy Eddin Za'ter
Samer Suleiman
M. Al-Ayyoub
M. Al-Kabi
46
10
0
03 Aug 2021
Cycle-Consistent Inverse GAN for Text-to-Image Synthesis
Hao Wang
Guosheng Lin
Guosheng Lin
Chunyan Miao
103
48
0
03 Aug 2021
Where do Models go Wrong? Parameter-Space Saliency Maps for Explainability
Roman Levin
Manli Shu
Eitan Borgnia
Furong Huang
Micah Goldblum
Tom Goldstein
FAtt
AAML
60
11
0
03 Aug 2021
Dialogue Summarization with Supporting Utterance Flow Modeling and Fact Regularization
Wang Chen
Pijian Li
Hou Pong Chan
Irwin King
HILM
AI4TS
61
10
0
03 Aug 2021
CanvasVAE: Learning to Generate Vector Graphic Documents
Kota Yamaguchi
GAN
65
65
0
03 Aug 2021
Representation learning for neural population activity with Neural Data Transformers
Joel Ye
C. Pandarinath
AI4TS
AI4CE
241
57
0
02 Aug 2021
Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and Expert-Annotated Twitter Dataset
A. Ils
Dan Liu
Daniela Grunow
Steffen Eger
85
9
0
02 Aug 2021
A Survey of Human-in-the-loop for Machine Learning
Xingjiao Wu
Luwei Xiao
Yixuan Sun
Junhang Zhang
Tianlong Ma
Liangbo He
SyDa
137
533
0
02 Aug 2021
Communication-Efficient Federated Learning via Predictive Coding
Kai Yue
Richeng Jin
Chau-Wai Wong
H. Dai
FedML
78
14
0
02 Aug 2021
Self-supervised Answer Retrieval on Clinical Notes
Paul Grundmann
Sebastian Arnold
Alexander Loser
RALM
MedIm
60
2
0
02 Aug 2021
Efficient Deep Feature Calibration for Cross-Modal Joint Embedding Learning
Zhongwei Xie
Ling Liu
Lin Li
Luo Zhong
29
2
0
02 Aug 2021
Transfer Learning for Mining Feature Requests and Bug Reports from Tweets and App Store Reviews
Pablo Restrepo Henao
Jannik Fischbach
Dominik Spies
Julian Frattini
Andreas Vogelsang
45
26
0
02 Aug 2021
From LSAT: The Progress and Challenges of Complex Reasoning
Siyuan Wang
Zhongkun Liu
Wanjun Zhong
Ming Zhou
Zhongyu Wei
Zhumin Chen
Nan Duan
ELM
95
46
0
02 Aug 2021
Semi-Supervising Learning, Transfer Learning, and Knowledge Distillation with SimCLR
Khoi Duc Minh Nguyen
Y. Nguyen
Bao Le
57
5
0
02 Aug 2021
Congested Crowd Instance Localization with Dilated Convolutional Swin Transformer
Junyuan Gao
Maoguo Gong
Xuelong Li
ViT
112
47
0
02 Aug 2021
Logic-Consistency Text Generation from Semantic Parses
Chang Shu
Yusen Zhang
Xiangyu Dong
Peng Shi
Tao Yu
Rui Zhang
110
34
0
02 Aug 2021
MuSiQue: Multihop Questions via Single-hop Question Composition
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
LRM
164
282
0
02 Aug 2021
Previous
1
2
3
...
315
316
317
...
474
475
476
Next