Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,708 papers shown
Title
droidlet: modular, heterogenous, multi-modal agents
Anurag Pratik
Soumith Chintala
Kavya Srinet
Dhiraj Gandhi
Rebecca Qian
...
Anoushka Tiwari
Tucker Hart
Mary Williamson
Abhinav Gupta
Arthur Szlam
VLM
LM&Ro
57
3
0
25 Jan 2021
Curriculum Learning: A Survey
Petru Soviany
Radu Tudor Ionescu
Paolo Rota
N. Sebe
ODL
201
364
0
25 Jan 2021
Meta-Learning for Effective Multi-task and Multilingual Modelling
Ishan Tarunesh
Sushil Khyalia
Vishwajeet Kumar
Ganesh Ramakrishnan
Preethi Jyothi
81
16
0
25 Jan 2021
Transferable Interactiveness Knowledge for Human-Object Interaction Detection
Yong-Lu Li
Xinpeng Liu
Xiaoqian Wu
Xijie Huang
Liang Xu
Cewu Lu
82
5
0
25 Jan 2021
PAWLS: PDF Annotation With Labels and Structure
Mark Neumann
Zejiang Shen
Sam Skjonsberg
82
20
0
25 Jan 2021
TDMSci: A Specialized Corpus for Scientific Literature Entity Tagging of Tasks Datasets and Metrics
Yufang Hou
Charles Jochim
Martin Gleize
Francesca Bonin
Debasis Ganguly
76
48
0
25 Jan 2021
Learning From Revisions: Quality Assessment of Claims in Argumentation at Scale
Gabriella Skitalinskaya
Jonas Klaff
Henning Wachsmuth
66
29
0
25 Jan 2021
A Hybrid Approach to Measure Semantic Relatedness in Biomedical Concepts
Katikapalli Subramanyam Kalyan
S. Sangeetha
72
9
0
25 Jan 2021
Cross-lingual Visual Pre-training for Multimodal Machine Translation
Ozan Caglayan
Menekse Kuyu
Mustafa Sercan Amac
Pranava Madhyastha
Erkut Erdem
Aykut Erdem
Lucia Specia
VLM
82
46
0
25 Jan 2021
SpanEmo: Casting Multi-label Emotion Classification as Span-prediction
Hassan Alhuzali
Sophia Ananiadou
120
90
0
25 Jan 2021
Adversarial Text-to-Image Synthesis: A Review
Stanislav Frolov
Tobias Hinz
Federico Raue
Jörn Hees
Andreas Dengel
EGVM
88
178
0
25 Jan 2021
CHOLAN: A Modular Approach for Neural Entity Linking on Wikipedia and Wikidata
M. Ravi
Kuldeep Singh
I. Mulang'
Saeedeh Shekarpour
Johannes Hoffart
Jens Lehmann
KELM
82
36
0
25 Jan 2021
GP: Context-free Grammar Pre-training for Text-to-SQL Parsers
Liang Zhao
Hexin Cao
Yunsong Zhao
AI4CE
63
11
0
25 Jan 2021
ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement Learning
Yufei Wang
Ian D. Wood
Stephen Wan
Mark Johnson
47
8
0
25 Jan 2021
FakeFlow: Fake News Detection by Modeling the Flow of Affective Information
Bilal Ghanem
Simone Paolo Ponzetto
Paolo Rosso
Francisco Rangel
111
69
0
24 Jan 2021
Belief-based Generation of Argumentative Claims
Milad Alshomary
Wei-Fan Chen
Timon Ziegenbein
Henning Wachsmuth
185
25
0
24 Jan 2021
Modern Machine and Deep Learning Systems as a way to achieve Man-Computer Symbiosis
Chirag Gupta
74
0
0
24 Jan 2021
RomeBERT: Robust Training of Multi-Exit BERT
Shijie Geng
Peng Gao
Zuohui Fu
Yongfeng Zhang
81
28
0
24 Jan 2021
Stereotype and Skew: Quantifying Gender Bias in Pre-trained and Fine-tuned Language Models
Daniel de Vassimon Manela
D. Errington
Thomas Fisher
B. V. Breugel
Pasquale Minervini
54
96
0
24 Jan 2021
Dictionary-based Debiasing of Pre-trained Word Embeddings
Masahiro Kaneko
Danushka Bollegala
FaML
97
38
0
23 Jan 2021
Debiasing Pre-trained Contextualised Embeddings
Masahiro Kaneko
Danushka Bollegala
269
143
0
23 Jan 2021
Training Multilingual Pre-trained Language Model with Byte-level Subwords
Junqiu Wei
Qun Liu
Yinpeng Guo
Xin Jiang
63
20
0
23 Jan 2021
WebSRC: A Dataset for Web-Based Structural Reading Comprehension
Xingyu Chen
Zihan Zhao
Lu Chen
Danyang Zhang
Jiabao Ji
Ao Luo
Yuxuan Xiong
Kai Yu
RALM
90
98
0
23 Jan 2021
Advances and Challenges in Conversational Recommender Systems: A Survey
Chongming Gao
Wenqiang Lei
Xiangnan He
Maarten de Rijke
Tat-Seng Chua
258
284
0
23 Jan 2021
Reproducibility, Replicability and Beyond: Assessing Production Readiness of Aspect Based Sentiment Analysis in the Wild
Rajdeep Mukherjee
Shreyas Shetty
S. Chattopadhyay
Subhadeep Maji
S. Datta
Pawan Goyal
67
14
0
23 Jan 2021
Arabic aspect based sentiment analysis using bidirectional GRU based models
Mohammed Mustafa
T. H. Soliman
A. Taloba
Mohammed Fawzi Seedik
53
82
0
23 Jan 2021
Slot Self-Attentive Dialogue State Tracking
Fanghua Ye
Jarana Manotumruksa
Qiang Zhang
Shenghui Li
Emine Yilmaz
141
63
0
22 Jan 2021
Effects of Pre- and Post-Processing on type-based Embeddings in Lexical Semantic Change Detection
Jens Kaiser
Sinan Kurtyigit
Serge Kotchourko
Dominik Schlechtweg
70
10
0
22 Jan 2021
BERT Transformer model for Detecting Arabic GPT2 Auto-Generated Tweets
Fouzi Harrag
Maria Dabbah
Kareem Darwish
Ahmed Abdelali
DeLMO
53
24
0
22 Jan 2021
A Comprehensive Survey on Hardware-Aware Neural Architecture Search
Hadjer Benmeziane
Kaoutar El Maghraoui
Hamza Ouarnoughi
Smail Niar
Martin Wistuba
Naigang Wang
118
109
0
22 Jan 2021
Censorship of Online Encyclopedias: Implications for NLP Models
Eddie Yang
Margaret E. Roberts
43
16
0
22 Jan 2021
Extracting Lifestyle Factors for Alzheimer's Disease from Clinical Notes Using Deep Learning with Weak Supervision
Zitao Shen
Yoonkwon Yi
A. Bompelli
Fang Yu
Yanshan Wang
Rui Zhang
81
11
0
22 Jan 2021
The Impact of Multiple Parallel Phrase Suggestions on Email Input and Composition Behaviour of Native and Non-Native English Writers
Daniel Buschek
Martin Zurn
Malin Eiband
181
107
0
22 Jan 2021
The heads hypothesis: A unifying statistical approach towards understanding multi-headed attention in BERT
Madhura Pande
Aakriti Budhraja
Preksha Nema
Pratyush Kumar
Mitesh M. Khapra
76
19
0
22 Jan 2021
A multi-perspective combined recall and rank framework for Chinese procedure terminology normalization
Ming Liang
Kui Xue
Tong Ruan
85
0
0
22 Jan 2021
Enhanced word embeddings using multi-semantic representation through lexical chains
Terry Ruas
C. H. P. Ferreira
W. Grosky
F. O. França
D. D. Medeiros
93
18
0
22 Jan 2021
HASOCOne@FIRE-HASOC2020: Using BERT and Multilingual BERT models for Hate Speech Detection
Suman Dowlagar
R. Mamidi
43
21
0
22 Jan 2021
Visual Question Answering based on Local-Scene-Aware Referring Expression Generation
Jungjun Kim
Dong-Gyu Lee
Jialin Wu
Hong G Jung
Seong-Whan Lee
ObjD
96
22
0
22 Jan 2021
Fringe News Networks: Dynamics of US News Viewership following the 2020 Presidential Election
Ashiqur R. KhudaBukhsh
Rupak Sarkar
M. Kamlet
Tom Michael Mitchell
44
23
0
22 Jan 2021
Distilling Large Language Models into Tiny and Effective Students using pQRNN
P. Kaliamoorthi
Aditya Siddhant
Edward Li
Melvin Johnson
MQ
62
17
0
21 Jan 2021
PalmTree: Learning an Assembly Language Model for Instruction Embedding
Xuezixiang Li
Qu Yu
Heng Yin
87
155
0
21 Jan 2021
SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation
Brendan Duke
Abdalla Ahmed
Christian Wolf
P. Aarabi
Graham W. Taylor
VOS
74
167
0
21 Jan 2021
AI Choreographer: Music Conditioned 3D Dance Generation with AIST++
Ruilong Li
Sha Yang
David A. Ross
Angjoo Kanazawa
ViT
294
506
0
21 Jan 2021
Knowledge-Preserving Incremental Social Event Detection via Heterogeneous GNNs
Dongyuan Li
Hao Peng
Hongzhi Zhang
Yingtong Dou
Jianxin Li
Philip S. Yu
82
98
0
21 Jan 2021
Learning rich touch representations through cross-modal self-supervision
Martina Zambelli
Y. Aytar
Francesco Visin
Yuxiang Zhou
R. Hadsell
SSL
82
16
0
21 Jan 2021
Adv-OLM: Generating Textual Adversaries via OLM
Vijit Malik
A. Bhat
Ashutosh Modi
134
6
0
21 Jan 2021
Exponential Moving Average Normalization for Self-supervised and Semi-supervised Learning
Zhaowei Cai
Avinash Ravichandran
Subhransu Maji
Charless C. Fowlkes
Zhuowen Tu
Stefano Soatto
145
120
0
21 Jan 2021
Segmenting Transparent Object in the Wild with Transformer
Enze Xie
Wenjia Wang
Wenhai Wang
Pei Sun
Hang Xu
Ding Liang
Ping Luo
ViT
298
28
0
21 Jan 2021
Invariance, encodings, and generalization: learning identity effects with neural networks
Simone Brugiapaglia
Matthew Liu
P. Tupper
OOD
72
5
0
21 Jan 2021
ParaSCI: A Large Scientific Paraphrase Dataset for Longer Paraphrase Generation
Qingxiu Dong
Xiaojun Wan
Yue Cao
178
35
0
21 Jan 2021
Previous
1
2
3
...
365
366
367
...
473
474
475
Next