Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,641 papers shown
Title
SemEval-2021 Task 9: Fact Verification and Evidence Finding for Tabular Data in Scientific Documents (SEM-TAB-FACTS)
N. Wang
Diwakar Mahajan
Marina Danilevsky
Sara Rosenthal
LMTD
93
56
0
28 May 2021
Linguistic Structures as Weak Supervision for Visual Scene Graph Generation
Keren Ye
Adriana Kovashka
69
54
0
28 May 2021
Changing the World by Changing the Data
Anna Rogers
76
73
0
28 May 2021
Knowledge Inheritance for Pre-trained Language Models
Yujia Qin
Yankai Lin
Jing Yi
Jiajie Zhang
Xu Han
...
Yusheng Su
Zhiyuan Liu
Peng Li
Maosong Sun
Jie Zhou
VLM
85
50
0
28 May 2021
Accelerating BERT Inference for Sequence Labeling via Early-Exit
Xiaonan Li
Yunfan Shao
Tianxiang Sun
Hang Yan
Xipeng Qiu
Xuanjing Huang
84
41
0
28 May 2021
Language Models Use Monotonicity to Assess NPI Licensing
Jaap Jumelet
Milica Denić
Jakub Szymanik
Dieuwke Hupkes
Shane Steinert-Threlkeld
KELM
74
29
0
28 May 2021
Early Exiting with Ensemble Internal Classifiers
Tianxiang Sun
Yunhua Zhou
Xiangyang Liu
Xinyu Zhang
Hao Jiang
Bo Zhao
Xuanjing Huang
Xipeng Qiu
70
31
0
28 May 2021
An Explanatory Query-Based Framework for Exploring Academic Expertise
O. Cocarascu
A. McLean
Paul French
Francesca Toni
37
0
0
28 May 2021
Data Augmentation for Text Generation Without Any Augmented Data
Wei Bi
Huayang Li
Jiacheng Huang
65
7
0
28 May 2021
Cross-Lingual Abstractive Summarization with Limited Parallel Resources
Yu Bai
Yang Gao
Heyan Huang
91
52
0
28 May 2021
Noised Consistency Training for Text Summarization
Jing Liu
Qianren Mao
Bang Liu
Hao Peng
Hongdong Zhu
Jianxin Li
34
1
0
28 May 2021
KVT: k-NN Attention for Boosting Vision Transformers
Pichao Wang
Xue Wang
F. Wang
Ming Lin
Shuning Chang
Hao Li
Rong Jin
ViT
135
108
0
28 May 2021
Alleviating the Knowledge-Language Inconsistency: A Study for Deep Commonsense Knowledge
Yi Zhang
Lei Li
Yunfang Wu
Qi Su
Xu Sun
54
4
0
28 May 2021
SciFive: a text-to-text transformer model for biomedical literature
Long Phan
J. Anibal
H. Tran
Shaurya Chanana
Erol Bahadroglu
Alec Peltekian
G. Altan-Bonnet
MedIm
89
151
0
28 May 2021
ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation
Vijit Malik
Rishabh Sanjay
S. Nigam
Kripabandhu Ghosh
S. Guha
Arnab Bhattacharya
Ashutosh Modi
ELM
AILaw
117
149
0
28 May 2021
Unsupervised Domain Adaptation of Object Detectors: A Survey
Poojan Oza
Vishwanath A. Sindagi
Vibashan Vs
Vishal M. Patel
OOD
ObjD
119
184
0
27 May 2021
Resilient and Adaptive Framework for Large Scale Android Malware Fingerprinting using Deep Learning and NLP Techniques
E. Karbab
M. Debbabi
AAML
38
4
0
27 May 2021
Leveraging Linguistic Coordination in Reranking N-Best Candidates For End-to-End Response Selection Using BERT
Mingzhi Yu
Diane Litman
31
2
0
27 May 2021
Inspecting the concept knowledge graph encoded by modern language models
Carlos Aspillaga
Marcelo Mendoza
Alvaro Soto
74
13
0
27 May 2021
Open-world Machine Learning: Applications, Challenges, and Opportunities
Jitendra Parmar
S. Chouhan
Vaskar Raychoudhury
S. Rathore
OffRL
106
96
0
27 May 2021
Recent advances and clinical applications of deep learning in medical image analysis
Xuxin Chen
Ximing Wang
Kecheng Zhang
K. Fung
T. Thai
K. Moore
Robert S. Mannel
Hong Liu
B. Zheng
Y. Qiu
OOD
138
616
0
27 May 2021
RAW-C: Relatedness of Ambiguous Words--in Context (A New Lexical Resource for English)
Sean Trott
Benjamin Bergen
137
20
0
27 May 2021
Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion
Yinyu Lan
Shizhu He
Xiangrong Zeng
Shengping Liu
Kang Liu
Jun Zhao
57
27
0
27 May 2021
Maria: A Visual Experience Powered Conversational Agent
Zujie Liang
Huang Hu
Can Xu
Chongyang Tao
Xiubo Geng
Yining Chen
Fan Liang
Daxin Jiang
91
32
0
27 May 2021
SSAN: Separable Self-Attention Network for Video Representation Learning
Xudong Guo
Xun Guo
Yan Lu
ViT
AI4TS
55
26
0
27 May 2021
Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat Violence
Andrew Halterman
Katherine A. Keith
Sheikh Muhammad Sarwar
Brendan O'Connor
79
29
0
27 May 2021
Contrastive Fine-tuning Improves Robustness for Neural Rankers
Xiaofei Ma
Cicero Nogueira dos Santos
Andrew O. Arnold
119
20
0
27 May 2021
Directed Acyclic Graph Network for Conversational Emotion Recognition
Weizhou Shen
Siyue Wu
Yunyi Yang
Xiaojun Quan
124
245
0
27 May 2021
Multi-Modal Semantic Inconsistency Detection in Social Media News Posts
S. McCrae
Kehan Wang
A. Zakhor
62
15
0
26 May 2021
BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition
Yinghao Li
Pranav Shetty
Lu Liu
Chao Zhang
Le Song
NoLa
84
35
0
26 May 2021
A Full-Stack Search Technique for Domain Optimized Deep Learning Accelerators
Dan Zhang
Safeen Huda
Ebrahim M. Songhori
Kartik Prabhu
Quoc V. Le
Anna Goldie
Azalia Mirhoseini
94
53
0
26 May 2021
Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading
Zhihan Zhou
Li-Qian Ma
Han Liu
AIFin
70
50
0
26 May 2021
Zero-shot Medical Entity Retrieval without Annotation: Learning From Rich Knowledge Graph Semantics
Luyang Kong
C. Winestock
Parminder Bhatia
AI4MH
130
7
0
26 May 2021
Language Model as an Annotator: Exploring DialoGPT for Dialogue Summarization
Xiachong Feng
Xiaocheng Feng
Libo Qin
Bing Qin
Ting Liu
VLM
71
93
0
26 May 2021
Sequence Parallelism: Long Sequence Training from System Perspective
Shenggui Li
Fuzhao Xue
Chaitanya Baranwal
Yongbin Li
Yang You
101
103
0
26 May 2021
Deception detection in text and its relation to the cultural dimension of individualism/collectivism
Katerina Papantoniou
P. Papadakos
Theodore Patkos
G. Flouris
Ion Androutsopoulos
Dimitris Plexousakis
96
7
0
26 May 2021
TreeBERT: A Tree-Based Pre-Trained Model for Programming Language
Xue Jiang
Zhuoran Zheng
Chen Lyu
Liang Li
Lei Lyu
85
91
0
26 May 2021
LMMS Reloaded: Transformer-based Sense Embeddings for Disambiguation and Beyond
Daniel Loureiro
A. Jorge
Jose Camacho-Collados
92
26
0
26 May 2021
The statistical advantage of automatic NLG metrics at the system level
Johnny Tian-Zheng Wei
Robin Jia
97
23
0
26 May 2021
Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger
Fanchao Qi
Mukai Li
Yangyi Chen
Zhengyan Zhang
Zhiyuan Liu
Yasheng Wang
Maosong Sun
SILM
106
235
0
26 May 2021
Unsupervised Pronoun Resolution via Masked Noun-Phrase Prediction
Minghan Shen
Pratyay Banerjee
Chitta Baral
SSL
65
5
0
26 May 2021
What data do we need for training an AV motion planner?
Long Chen
Lukas Platinsky
Stefanie Speichert
B. Osinski
Oliver Scheel
Yawei Ye
Hugo Grimmett
Luca Del Pero
Peter Ondruska
62
13
0
26 May 2021
Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking
Heng-Da Xu
Zhongli Li
Qingyu Zhou
Chao Li
Zizhen Wang
Yunbo Cao
Heyan Huang
Xian-Ling Mao
100
97
0
26 May 2021
SGPT: Semantic Graphs based Pre-training for Aspect-based Sentiment Analysis
Yong Qian
Zhongqing Wang
Rong Xiao
Chen Chen
Haihong Tang
54
6
0
26 May 2021
Database Workload Characterization with Query Plan Encoders
Debjyoti Paul
Jie Cao
Feifei Li
Vivek Srikumar
34
18
0
26 May 2021
NukeLM: Pre-Trained and Fine-Tuned Language Models for the Nuclear and Energy Domains
Lee Burke
K. Pazdernik
D. Fortin
Benjamin J. Wilson
Rustam Goychayev
J. Mattingly
37
3
0
25 May 2021
BASS: Boosting Abstractive Summarization with Unified Semantic Graph
Wenhao Wu
Wei Li
Xinyan Xiao
Jiachen Liu
Ziqiang Cao
Sujian Li
Hua Wu
Haifeng Wang
78
46
0
25 May 2021
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization
Chen Liang
Simiao Zuo
Minshuo Chen
Haoming Jiang
Xiaodong Liu
Pengcheng He
T. Zhao
Weizhu Chen
74
69
0
25 May 2021
Learning to Bridge Metric Spaces: Few-shot Joint Learning of Intent Detection and Slot Filling
Yutai Hou
Y. Lai
Cheng Chen
Wanxiang Che
Ting Liu
69
14
0
25 May 2021
Topic Modeling and Progression of American Digital News Media During the Onset of the COVID-19 Pandemic
Xiangpeng Wan
Michael C. Lucic
Hakim Ghazzai
Y. Massoud
31
5
0
25 May 2021
Previous
1
2
3
...
333
334
335
...
471
472
473
Next