Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,708 papers shown
Title
Domain-Adversarial Training of Self-Attention Based Networks for Land Cover Classification using Multi-temporal Sentinel-2 Satellite Imagery
Mauro Martini
Vittorio Mazzia
Aleem Khaliq
Marcello Chiaberge
136
40
0
01 Apr 2021
WakaVT: A Sequential Variational Transformer for Waka Generation
Yuka Takeishi
Mingxuan Niu
Jing Luo
ZhongYi Jin
Xinyu Yang
88
1
0
01 Apr 2021
UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
Mingyang Zhou
Luowei Zhou
Shuohang Wang
Yu Cheng
Linjie Li
Zhou Yu
Jingjing Liu
MLLM
VLM
99
92
0
01 Apr 2021
Normal vs. Adversarial: Salience-based Analysis of Adversarial Samples for Relation Extraction
Luoqiu Li
Xiang Chen
Zhen Bi
Xin Xie
Shumin Deng
Ningyu Zhang
Chuanqi Tan
Mosha Chen
Huajun Chen
AAML
115
7
0
01 Apr 2021
Evaluating Neural Word Embeddings for Sanskrit
Kevin Qinghong Lin
Om Adideva
Digumarthi Komal
Laxmidhar Behera
Pawan Goyal
109
12
0
01 Apr 2021
A Survey on Natural Language Video Localization
Xinfang Liu
Xiushan Nie
Zhifang Tan
Jie Guo
Yilong Yin
123
7
0
01 Apr 2021
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
Or Patashnik
Zongze Wu
Eli Shechtman
Daniel Cohen-Or
Dani Lischinski
CLIP
VLM
247
1,213
0
31 Mar 2021
Going deeper with Image Transformers
Hugo Touvron
Matthieu Cord
Alexandre Sablayrolles
Gabriel Synnaeve
Hervé Jégou
ViT
225
1,026
0
31 Mar 2021
Learning Spatio-Temporal Transformer for Visual Tracking
Bin Yan
Houwen Peng
Jianlong Fu
Dong Wang
Huchuan Lu
ViT
103
735
0
31 Mar 2021
A Neighbourhood Framework for Resource-Lean Content Flagging
Sheikh Muhammad Sarwar
Dimitrina Zlatkova
Momchil Hardalov
Yoan Dinkov
Isabelle Augenstein
Preslav Nakov
70
5
0
31 Mar 2021
Deep Neural Approaches to Relation Triplets Extraction: A Comprehensive Survey
Tapas Nayak
Navonil Majumder
Pawan Goyal
Soujanya Poria
ViT
62
52
0
31 Mar 2021
Few-shot learning through contextual data augmentation
Farid Arthaud
Rachel Bawden
Alexandra Birch
49
12
0
31 Mar 2021
Exploring Plausible Patches Using Source Code Embeddings in JavaScript
Viktor Csuvik
Dániel Horváth
Márk Lajkó
László Vidács
21
5
0
31 Mar 2021
Learning Generalizable Robotic Reward Functions from "In-The-Wild" Human Videos
Annie S. Chen
Suraj Nair
Chelsea Finn
94
141
0
31 Mar 2021
Self-Supervised Euphemism Detection and Identification for Content Moderation
Wanzheng Zhu
Hongyu Gong
Rohan Bansal
Zachary Weinberg
Nicolas Christin
Giulia Fanti
S. Bhat
77
40
0
31 Mar 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Alana de Santana Correia
Esther Luna Colombini
HAI
132
198
0
31 Mar 2021
Dual Contrastive Loss and Attention for GANs
Ning Yu
Guilin Liu
Aysegül Dündar
Andrew Tao
Bryan Catanzaro
Larry S. Davis
Mario Fritz
GAN
137
61
0
31 Mar 2021
Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark
Tianlin Li
Xiujun Shu
Zhipeng Zhang
Bo Jiang
Yaowei Wang
Yonghong Tian
Feng Wu
94
165
0
31 Mar 2021
BASE Layers: Simplifying Training of Large, Sparse Models
M. Lewis
Shruti Bhosale
Tim Dettmers
Naman Goyal
Luke Zettlemoyer
MoE
231
285
0
30 Mar 2021
Probabilistic Analogical Mapping with Semantic Relation Networks
Hongjing Lu
Nicholas Ichien
K. Holyoak
90
36
0
30 Mar 2021
Pre-training for low resource speech-to-intent applications
Pu Wang
Hugo Van hamme
45
4
0
30 Mar 2021
DAP: Detection-Aware Pre-training with Weak Supervision
Yuanyi Zhong
Jianfeng Wang
Lijuan Wang
Jian-wei Peng
Yu-Xiong Wang
Lei Zhang
76
15
0
30 Mar 2021
Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers
Antoine Miech
Jean-Baptiste Alayrac
Ivan Laptev
Josef Sivic
Andrew Zisserman
ViT
110
139
0
30 Mar 2021
Spatiotemporal Transformer for Video-based Person Re-identification
Tianyu Zhang
Longhui Wei
Lingxi Xie
Zijie Zhuang
Yongfei Zhang
Yue Liu
Qi Tian
ViT
103
32
0
30 Mar 2021
EnergyVis: Interactively Tracking and Exploring Energy Consumption for ML Models
Omar Shaikh
Jon Saad-Falcon
Austin P. Wright
Nilaksh Das
Scott Freitas
O. Asensio
Duen Horng Chau
59
20
0
30 Mar 2021
Grounding Dialogue Systems via Knowledge Graph Aware Decoding with Pre-trained Transformers
Debanjan Chaudhuri
Md. Rony
Jens Lehmann
73
12
0
30 Mar 2021
Locally-Contextual Nonlinear CRFs for Sequence Labeling
Harshil Shah
Tim Z. Xiao
David Barber
58
4
0
30 Mar 2021
AfriKI: Machine-in-the-Loop Afrikaans Poetry Generation
Imke van Heerden
Anil Bas
61
3
0
30 Mar 2021
Autocorrect in the Process of Translation -- Multi-task Learning Improves Dialogue Machine Translation
Tao Wang
Chengqi Zhao
Mingxuan Wang
Lei Li
Deyi Xiong
58
13
0
30 Mar 2021
Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
Mingchen Zhuge
D. Gao
Deng-Ping Fan
Linbo Jin
Ben Chen
Hao Zhou
Minghui Qiu
Ling Shao
VLM
103
121
0
30 Mar 2021
Automatic Graph Partitioning for Very Large-scale Deep Learning
Masahiro Tanaka
Kenjiro Taura
T. Hanawa
Kentaro Torisawa
GNN
AI4CE
58
21
0
30 Mar 2021
Grounding Open-Domain Instructions to Automate Web Support Tasks
N. Xu
Sam Masling
Michael Du
Giovanni Campagna
Larry Heck
James A. Landay
M. Lam
LLMAG
AI4TS
77
44
0
30 Mar 2021
Self-supervised Image-text Pre-training With Mixed Data In Chest X-rays
Xiaosong Wang
Ziyue Xu
Leo K. Tam
Dong Yang
Daguang Xu
ViT
MedIm
73
24
0
30 Mar 2021
Domain-robust VQA with diverse datasets and methods but no target labels
Ruotong Wang
Tristan D. Maidment
Ahmad Diab
Adriana Kovashka
R. Hwa
OOD
131
23
0
29 Mar 2021
Contextual Text Embeddings for Twi
P. Azunre
Salomey Osei
S. Addo
Lawrence Asamoah Adu-Gyamfi
Stephen E. Moore
...
Standylove Birago Mensah
Lucien Mensah
Mark Amoako Marcel
A. Amponsah
J. B. Hayfron-Acquah
62
6
0
29 Mar 2021
TREC 2020 Podcasts Track Overview
R. Jones
Ben Carteree
Ann Clion
Maria Eskevich
G. Jones
Jussi Karlgren
Aasish Pappu
S. Reddy
Yongze Yu
3DGS
74
36
0
29 Mar 2021
Transformer visualization via dictionary learning: contextualized embedding as a linear superposition of transformer factors
Zeyu Yun
Yubei Chen
Bruno A. Olshausen
Yann LeCun
65
78
0
29 Mar 2021
Unsupervised Machine Translation On Dravidian Languages
Sai Koneru
Danni Liu
Jan Niehues
107
7
0
29 Mar 2021
CvT: Introducing Convolutions to Vision Transformers
Haiping Wu
Bin Xiao
Noel Codella
Mengchen Liu
Xiyang Dai
Lu Yuan
Lei Zhang
ViT
182
1,929
0
29 Mar 2021
Shrinking Bigfoot: Reducing wav2vec 2.0 footprint
Zilun Peng
Akshay Budhkar
Ilana Tuil
J. Levy
Parinaz Sobhani
Raphael Cohen
J. Nassour
55
33
0
29 Mar 2021
Retraining DistilBERT for a Voice Shopping Assistant by Using Universal Dependencies
P. Jayarao
Arpit Sharma
67
2
0
29 Mar 2021
CaSiNo: A Corpus of Campsite Negotiation Dialogues for Automatic Negotiation Systems
Kushal Chawla
Jaysa Ramirez
Rene Clever
Gale M. Lucas
Jonathan May
Jonathan Gratch
100
52
0
29 Mar 2021
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
242
2,178
0
29 Mar 2021
On the Adversarial Robustness of Vision Transformers
Rulin Shao
Zhouxing Shi
Jinfeng Yi
Pin-Yu Chen
Cho-Jui Hsieh
ViT
117
146
0
29 Mar 2021
Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models
Wenkai Yang
Lei Li
Zhiyuan Zhang
Xuancheng Ren
Xu Sun
Bin He
SILM
114
153
0
29 Mar 2021
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
Li Xu
He Huang
Jun Liu
ViT
LRM
114
88
0
29 Mar 2021
Transformer Tracking
Xin Chen
Bin Yan
Jiawen Zhu
Dong Wang
Xiaoyun Yang
Huchuan Lu
ViT
83
967
0
29 Mar 2021
Efficient Explanations from Empirical Explainers
Robert Schwarzenberg
Nils Feldhus
Sebastian Möller
FAtt
93
9
0
29 Mar 2021
Visual Distant Supervision for Scene Graph Generation
Yuan Yao
Ao Zhang
Xu Han
Mengdi Li
C. Weber
Zhiyuan Liu
S. Wermter
Maosong Sun
70
39
0
29 Mar 2021
Changing the Mind of Transformers for Topically-Controllable Language Generation
Haw-Shiuan Chang
Jiaming Yuan
Mohit Iyyer
Andrew McCallum
82
9
0
29 Mar 2021
Previous
1
2
3
...
351
352
353
...
473
474
475
Next