Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,694 papers shown
Title
Interactive query expansion for professional search applications
Tony Russell-Rose
Phil Gooch
Udo Kruschwitz
28
7
0
25 Jun 2021
Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
Hongwei Xue
Yupan Huang
Bei Liu
Houwen Peng
Jianlong Fu
Houqiang Li
Jiebo Luo
101
89
0
25 Jun 2021
A Picture May Be Worth a Hundred Words for Visual Question Answering
Yusuke Hirota
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
Ittetsu Taniguchi
Takao Onoye
ViT
35
4
0
25 Jun 2021
JNLP Team: Deep Learning Approaches for Legal Processing Tasks in COLIEE 2021
Nguyen Ha Thanh
Phuong Minh Nguyen
Thi-Hai-Yen Vuong
Quan Minh Bui
Chau Nguyen
Binh Dang
Vu Tran
Minh Le Nguyen
Ken Satoh
AILaw
53
16
0
25 Jun 2021
ParaLaw Nets -- Cross-lingual Sentence-level Pretraining for Legal Text Processing
Nguyen Ha Thanh
Vu Tran
Phuong Minh Nguyen
Thi-Hai-Yen Vuong
Quan Minh Bui
Chau Nguyen
Binh Dang
Minh Le Nguyen
Kenji Satoh
AILaw
52
10
0
25 Jun 2021
To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels
Yuning Chai
Pei Sun
Jiquan Ngiam
Weiyue Wang
Benjamin Caine
Vijay Vasudevan
Xiao Zhang
Drago Anguelov
3DPC
116
69
0
25 Jun 2021
Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature
Yu Wang
Jinchao Li
Tristan Naumann
Chenyan Xiong
Hao Cheng
...
Yang Qin
Eric Horvitz
Paul N. Bennett
Jianfeng Gao
Hoifung Poon
OOD
85
14
0
25 Jun 2021
VOGUE: Answer Verbalization through Multi-Task Learning
Endri Kacupaj
Shyamnath Premnadh
Kuldeep Singh
Jens Lehmann
M. Maleshkova
62
7
0
24 Jun 2021
Multitask Learning for Citation Purpose Classification
Alexander X. Oesterling
Angikar Ghosal
Haoyang Yu
Rui Xin
Yasa Baig
Lesia Semenova
Cynthia Rudin
31
7
0
24 Jun 2021
Towards Understanding and Mitigating Social Biases in Language Models
Paul Pu Liang
Chiyu Wu
Louis-Philippe Morency
Ruslan Salakhutdinov
118
399
0
24 Jun 2021
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data
Paul Pu Liang
Terrance Liu
Anna Cai
Michal Muszynski
Ryo Ishii
Nicholas B. Allen
Randy P. Auerbach
David Brent
Ruslan Salakhutdinov
Louis-Philippe Morency
89
18
0
24 Jun 2021
FitVid: Overfitting in Pixel-Level Video Prediction
Mohammad Babaeizadeh
M. Saffar
Suraj Nair
Sergey Levine
Chelsea Finn
D. Erhan
VLM
115
84
0
24 Jun 2021
VOLO: Vision Outlooker for Visual Recognition
Li-xin Yuan
Qibin Hou
Zihang Jiang
Jiashi Feng
Shuicheng Yan
ViT
141
328
0
24 Jun 2021
A Transformer-based Cross-modal Fusion Model with Adversarial Training for VQA Challenge 2021
Keda Lu
Bo Fang
Kuan-Yu Chen
ViT
47
2
0
24 Jun 2021
Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting
Haixu Wu
Jiehui Xu
Jianmin Wang
Mingsheng Long
AI4TS
146
2,384
0
24 Jun 2021
AIT-QA: Question Answering Dataset over Complex Tables in the Airline Industry
Yannis Katsis
Saneem A. Chemmengath
Vishwajeet Kumar
Samarth Bharadwaj
Mustafa Canim
...
A. Gliozzo
FeiFei Pan
Jaydeep Sen
Karthik Sankaranarayanan
Soumen Chakrabarti
LMTD
RALM
81
38
0
24 Jun 2021
MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
Guozhi Tang
Lele Xie
Lianwen Jin
Jiapeng Wang
Jingdong Chen
Zhen Xu
Qianying Wang
Yaqiang Wu
Hui Li
103
29
0
24 Jun 2021
Accelerating variational quantum algorithms with multiple quantum processors
Yuxuan Du
Yan Qian
Dacheng Tao
55
8
0
24 Jun 2021
Modeling Diagnostic Label Correlation for Automatic ICD Coding
Shang-Chi Tsai
Chaorui Huang
Yun-Nung Chen
65
16
0
24 Jun 2021
Where is the disease? Semi-supervised pseudo-normality synthesis from an abnormal image
Yuanqi Du
Quan Quan
Hu Han
S. Kevin Zhou
GAN
MedIm
52
3
0
24 Jun 2021
Label Disentanglement in Partition-based Extreme Multilabel Classification
Xuanqing Liu
Wei-Cheng Chang
Hsiang-Fu Yu
Cho-Jui Hsieh
Inderjit S. Dhillon
63
11
0
24 Jun 2021
An Automated Knowledge Mining and Document Classification System with Multi-model Transfer Learning
J. Chong
Zhiyuan Chen
Mei Shin Oh
28
2
0
24 Jun 2021
Discovering novel drug-supplement interactions using a dietary supplements knowledge graph generated from the biomedical literature
Dalton Schutte
J. Vasilakes
A. Bompelli
Yuqi Zhou
M. Fiszman
Hua Xu
H. Kilicoglu
J. Bishop
T. Adam
Rui Zhang
19
2
0
24 Jun 2021
An Efficient Group-based Search Engine Marketing System for E-Commerce
Cheng Jie
Da Xu
Zigeng Wang
Lu Wang
Wei Shen
61
1
0
24 Jun 2021
Fairness via Representation Neutralization
Mengnan Du
Subhabrata Mukherjee
Guanchu Wang
Ruixiang Tang
Ahmed Hassan Awadallah
Helen Zhou
90
81
0
23 Jun 2021
Charformer: Fast Character Transformers via Gradient-based Subword Tokenization
Yi Tay
Vinh Q. Tran
Sebastian Ruder
Jai Gupta
Hyung Won Chung
Dara Bahri
Zhen Qin
Simon Baumgartner
Cong Yu
Donald Metzler
175
162
0
23 Jun 2021
Extreme Multi-label Learning for Semantic Matching in Product Search
Wei-Cheng Chang
Daniel Jiang
Hsiang-Fu Yu
C. Teo
Jiong Zhang
...
Qie Hu
Nikhil Shandilya
Vyacheslav Ievgrafov
Japinder Singh
Inderjit S. Dhillon
90
60
0
23 Jun 2021
Clinical Named Entity Recognition using Contextualized Token Representations
Yichao Zhou
C. Ju
J. H. Caufield
Kevin J. Shih
Calvin Yu‐Chian Chen
Yizhou Sun
Kai-Wei Chang
Peipei Ping
Wei Wang
115
11
0
23 Jun 2021
Stable, Fast and Accurate: Kernelized Attention with Relative Positional Encoding
Shengjie Luo
Shanda Li
Tianle Cai
Di He
Dinglan Peng
Shuxin Zheng
Guolin Ke
Liwei Wang
Tie-Yan Liu
97
50
0
23 Jun 2021
BERT-based Multi-Task Model for Country and Province Level Modern Standard Arabic and Dialectal Arabic Identification
Abdellah El Mekki
Abdelkader El Mahdaouy
Kabil Essefar
Nabil El Mamoun
Ismail Berrada
A. Khoumsi
46
18
0
23 Jun 2021
Deep Multi-Task Model for Sarcasm Detection and Sentiment Analysis in Arabic Language
Abdelkader El Mahdaouy
Abdellah El Mekki
Kabil Essefar
Nabil El Mamoun
Ismail Berrada
A. Khoumsi
52
37
0
23 Jun 2021
From Canonical Correlation Analysis to Self-supervised Graph Neural Networks
Hengrui Zhang
Qitian Wu
Junchi Yan
David Wipf
Philip S. Yu
SSL
115
223
0
23 Jun 2021
Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data Transformations
Charaf Eddine Benarab
CLIP
VLM
52
2
0
23 Jun 2021
Mixtures of Deep Neural Experts for Automated Speech Scoring
Sara Papi
E. Trentin
R. Gretter
M. Matassoni
D. Falavigna
MoE
38
10
0
23 Jun 2021
Reinforcement Learning-based Dialogue Guided Event Extraction to Exploit Argument Relations
Qian Li
Hao Peng
Jianxin Li
Hongzhi Zhang
Yuanxing Ning
Lihong Wang
Philip S. Yu
Ziyi Wang
61
26
0
23 Jun 2021
Learning from Pseudo Lesion: A Self-supervised Framework for COVID-19 Diagnosis
Zhongliang Li
Zhihao Jin
Xuechen Li
Linlin Shen
SSL
MedIm
49
1
0
23 Jun 2021
APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores
Boyuan Feng
Yuke Wang
Tong Geng
Ang Li
Yufei Ding
MQ
74
37
0
23 Jun 2021
NodePiece: Compositional and Parameter-Efficient Representations of Large Knowledge Graphs
Mikhail Galkin
E. Denis
Jiapeng Wu
William L. Hamilton
OCL
83
90
0
23 Jun 2021
Probabilistic Attention for Interactive Segmentation
Prasad Gabbur
Manjot Bilkhu
J. Movellan
105
13
0
23 Jun 2021
Revisiting Deep Learning Models for Tabular Data
Yu. V. Gorishniy
Ivan Rubachev
Valentin Khrulkov
Artem Babenko
LMTD
157
784
0
22 Jun 2021
On Adversarial Robustness of Synthetic Code Generation
Mrinal Anand
Pratik Kayal
M. Singh
132
5
0
22 Jun 2021
Learn to Resolve Conversational Dependency: A Consistency Training Framework for Conversational Question Answering
Gangwoo Kim
Hyunjae Kim
Jungsoo Park
Jaewoo Kang
103
38
0
22 Jun 2021
SENT: Sentence-level Distant Relation Extraction via Negative Training
Ruotian Ma
Tao Gui
Linyang Li
Qi Zhang
Yaqian Zhou
Xuanjing Huang
53
32
0
22 Jun 2021
DocFormer: End-to-End Transformer for Document Understanding
Srikar Appalaraju
Bhavan A. Jasani
Bhargava Urala Kota
Yusheng Xie
R. Manmatha
ViT
121
281
0
22 Jun 2021
Key-Sparse Transformer for Multimodal Speech Emotion Recognition
Weidong Chen
Xiaofen Xing
Xiangmin Xu
Jichen Yang
Jianxin Pang
83
52
0
22 Jun 2021
Graph Routing between Capsules
Yang Li
Wei Zhao
Min Zhang
Suhang Wang
Steffen Eger
GNN
37
14
0
22 Jun 2021
BARTScore: Evaluating Generated Text as Text Generation
Weizhe Yuan
Graham Neubig
Pengfei Liu
222
851
0
22 Jun 2021
A Comprehensive Comparison of Pre-training Language Models
Tonglei Guo
VLM
ELM
102
3
0
22 Jun 2021
Incremental Deep Neural Network Learning using Classification Confidence Thresholding
Justin Leo
Jugal Kalita
CLL
65
18
0
21 Jun 2021
Hi-BEHRT: Hierarchical Transformer-based model for accurate prediction of clinical events using multimodal longitudinal electronic health records
Yikuan Li
M. Mamouei
G. Salimi-Khorshidi
Shishir Rao
A. Hassaine
D. Canoy
Thomas Lukasiewicz
K. Rahimi
98
83
0
21 Jun 2021
Previous
1
2
3
...
322
323
324
...
472
473
474
Next