1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,885 papers shown
Title
Harms of Gender Exclusivity and Challenges in Non-Binary Representation in Language Technologies
Sunipa Dev
Masoud Monajatipoor
Anaelia Ovalle
Arjun Subramonian
J. M. Phillips
Kai-Wei Chang
166
177
0
27 Aug 2021
EmoBERTa: Speaker-Aware Emotion Recognition in Conversation with RoBERTa
Taewoon Kim
Piek Vossen
103
102
0
26 Aug 2021
A New Sentence Ordering Method Using BERT Pretrained Model
Melika Golestani
S. Z. Razavi
Heshaam Faili
61
2
0
26 Aug 2021
Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive Text Summarization
Chujie Zheng
Kunpeng Zhang
Harry J. Wang
Ling Fan
Zhe Wang
60
7
0
26 Aug 2021
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
Muhammad Zubair Irshad
Niluthpol Chowdhury Mithun
Zachary Seymour
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
LM&Ro
84
51
0
26 Aug 2021
HAN: Higher-order Attention Network for Spoken Language Understanding
Dongsheng Chen
Zhiqi Huang
Yuexian Zou
54
1
0
26 Aug 2021
Similar Scenes arouse Similar Emotions: Parallel Data Augmentation for Stylized Image Captioning
Guodun Li
Yuchen Zhai
Zehao Lin
Yin Zhang
117
21
0
26 Aug 2021
A Survey on Automated Fact-Checking
Zhijiang Guo
Michael Schlichtkrull
Andreas Vlachos
144
498
0
26 Aug 2021
Alleviating Exposure Bias via Contrastive Learning for Abstractive Text Summarization
Shichao Sun
Wenjie Li
70
26
0
26 Aug 2021
Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts
Ashutosh Baheti
Maarten Sap
Alan Ritter
Mark O. Riedl
92
91
0
26 Aug 2021
SLIM: Explicit Slot-Intent Mapping with BERT for Joint Multi-Intent Detection and Slot Filling
Fengyu Cai
Wanhao Zhou
Fei Mi
Boi Faltings
70
19
0
26 Aug 2021
Data Augmentation for Low-Resource Named Entity Recognition Using Backtranslation
Usama Yaseen
Stefan Langer
MedIm
58
15
0
26 Aug 2021
Rethinking Why Intermediate-Task Fine-Tuning Works
Ting-Yun Chang
Chi-Jen Lu
LRM
98
30
0
26 Aug 2021
AR-BERT: Aspect-relation enhanced Aspect-level Sentiment Classification with Multi-modal Explanations
Sk Mainul Islam
Sourangshu Bhattacharya
62
12
0
26 Aug 2021
MCML: A Novel Memory-based Contrastive Meta-Learning Method for Few Shot Slot Tagging
Hongru Wang
Zezhong Wang
Gabriel Pui Cheong Fung
Kam-Fai Wong
OffRL
CLL
100
10
0
26 Aug 2021
Retrieval Augmented Code Generation and Summarization
Md. Rizwan Parvez
W. Ahmad
Saikat Chakraborty
Baishakhi Ray
Kai-Wei Chang
76
192
0
26 Aug 2021
LayoutReader: Pre-training of Text and Layout for Reading Order Detection
Zilong Wang
Yiheng Xu
Lei Cui
Jingbo Shang
Furu Wei
95
76
0
26 Aug 2021
Shifted Chunk Transformer for Spatio-Temporal Representational Learning
Xuefan Zha
Wentao Zhu
Tingxun Lv
Sen Yang
Ji Liu
AI4TS
ViT
92
27
0
26 Aug 2021
Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models
Yiming Cui
Weinan Zhang
Wanxiang Che
Ting Liu
Zhigang Chen
Shijin Wang
LRM
47
9
0
26 Aug 2021
Vision-Language Navigation: A Survey and Taxonomy
Wansen Wu
Tao Chang
Xinmeng Li
LM&Ro
81
24
0
26 Aug 2021
Auxiliary Task Update Decomposition: The Good, The Bad and The Neutral
Lucio Dery
Yann N. Dauphin
David Grangier
MoMe
79
29
0
25 Aug 2021
Lightweight Self-Attentive Sequential Recommendation
Yang Li
Tong Chen
Pengfei Zhang
Hongzhi Yin
HAI
AI4TS
81
109
0
25 Aug 2021
What do pre-trained code models know about code?
Anjan Karmakar
Romain Robbes
ELM
91
91
0
25 Aug 2021
Ontology-Enhanced Slot Filling
Yuhao Ding
Yik-Cheung Tam
33
0
0
25 Aug 2021
Exploring the Promises of Transformer-Based LMs for the Representation of Normative Claims in the Legal Domain
Reto Gubelmann
Peter Hongler
Siegfried Handschuh
AILaw
28
0
0
25 Aug 2021
Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training
Yuqing Song
Shizhe Chen
Qin Jin
Wei Luo
Jun Xie
Fei Huang
103
20
0
25 Aug 2021
A Framework for Learning Ante-hoc Explainable Models via Concepts
Anirban Sarkar
Deepak Vijaykeerthy
Anindya Sarkar
V. Balasubramanian
LRM
BDL
93
51
0
25 Aug 2021
Viola: A Topic Agnostic Generate-and-Rank Dialogue System
Hyundong Justin Cho
Basel Shbita
K. Shenoy
Shuai Liu
Nikhil Patel
Hitesh Pindikanti
Jennifer Lee
Jonathan May
66
2
0
25 Aug 2021
Social Norm Bias: Residual Harms of Fairness-Aware Algorithms
Myra Cheng
Maria De-Arteaga
Lester W. Mackey
Adam Tauman Kalai
FaML
111
9
0
25 Aug 2021
Using BERT Encoding and Sentence-Level Language Model for Sentence Ordering
Melika Golestani
S. Z. Razavi
Zeinab Borhanifard
Farnaz Tahmasebian
H. Faili
39
7
0
24 Aug 2021
Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts
Charangan Vasantharajan
Uthayasanker Thayasivam
57
39
0
24 Aug 2021
The Word is Mightier than the Label: Learning without Pointillistic Labels using Data Programming
Chufan Gao
Mononito Goswami
30
0
0
24 Aug 2021
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Zirui Wang
Jiahui Yu
Adams Wei Yu
Zihang Dai
Yulia Tsvetkov
Yuan Cao
VLM
MLLM
183
801
0
24 Aug 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation
Samuel Cahyawijaya
108
12
0
24 Aug 2021
Relation Extraction from Tables using Artificially Generated Metadata
Gaurav Singh
Siffi Singh
Joshua Wong
Amir Saffari
30
2
0
24 Aug 2021
Graph Neural Networks: Methods, Applications, and Opportunities
Lilapati Waikhom
Ripon Patgiri
GNN
100
42
0
24 Aug 2021
Are the Multilingual Models Better? Improving Czech Sentiment with Transformers
Pavel Přibáň
J. Steinberger
70
11
0
24 Aug 2021
Weakly Supervised Cross-platform Teenager Detection with Adversarial BERT
Peiling Yi
A. Zubiaga
44
1
0
24 Aug 2021
Prompt-Learning for Fine-Grained Entity Typing
Ning Ding
Yulin Chen
Xu Han
Guangwei Xu
Pengjun Xie
Haitao Zheng
Zhiyuan Liu
Juan-Zi Li
Hong-Gee Kim
95
159
0
24 Aug 2021
Detection of Criminal Texts for the Polish State Border Guard
Artur Nowakowski
K. Jassem
56
1
0
24 Aug 2021
Support-Set Based Cross-Supervision for Video Grounding
Xinpeng Ding
N. Wang
Shiwei Zhang
De Cheng
Xiaomeng Li
Ziyuan Huang
Mingqian Tang
Xinbo Gao
88
42
0
24 Aug 2021
sigmoidF1: A Smooth F1 Score Surrogate Loss for Multilabel Classification
Gabriel Bénédict
Vincent Koops
Daan Odijk
Maarten de Rijke
102
33
0
24 Aug 2021
Recurrent multiple shared layers in Depth for Neural Machine Translation
Guoliang Li
Yiyang Li
MoE
48
1
0
23 Aug 2021
Using Neighborhood Context to Improve Information Extraction from Visual Documents Captured on Mobile Phones
Kalpa Gunaratna
Vijay Srinivasan
Sandeep Nama
Hongxia Jin
61
5
0
23 Aug 2021
Legal Search in Case Law and Statute Law
Julien Rossi
Evangelos Kanoulas
AILaw
ELM
162
8
0
23 Aug 2021
High Performance GPU Code Generation for Matrix-Matrix Multiplication using MLIR: Some Early Results
Navdeep Katel
Vivek Khandelwal
Uday Bondhugula
41
7
0
23 Aug 2021
Event Extraction by Associating Event Types and Argument Roles
Qian Li
Shu Guo
Hongzhi Zhang
Jianxin Li
Shuaiyi Nie
Lihong Wang
Xiaohan Dong
Hao Peng
79
16
0
23 Aug 2021
TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment
Jianwei Yang
Yonatan Bisk
Jianfeng Gao
123
140
0
23 Aug 2021
Modeling Dynamics of Facial Behavior for Mental Health Assessment
Minh Tran
Ellen R. Bradley
Michelle Matvey
J. Woolley
M. Soleymani
CVBM
45
3
0
23 Aug 2021
Fluent: An AI Augmented Writing Tool for People who Stutter
Bhavya Ghai
Klaus Mueller
73
16
0
23 Aug 2021