ResearchTrend.AI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (arXiv:1810.04805)

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM, SSL, SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,708 papers shown
Title
Domain-Adversarial Training of Self-Attention Based Networks for Land Cover Classification using Multi-temporal Sentinel-2 Satellite Imagery
Mauro Martini
Vittorio Mazzia
Aleem Khaliq
Marcello Chiaberge
136
40
0
01 Apr 2021
WakaVT: A Sequential Variational Transformer for Waka Generation
Yuka Takeishi
Mingxuan Niu
Jing Luo
ZhongYi Jin
Xinyu Yang
88
1
0
01 Apr 2021
UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
Mingyang Zhou
Luowei Zhou
Shuohang Wang
Yu Cheng
Linjie Li
Zhou Yu
Jingjing Liu
MLLM, VLM
99
92
0
01 Apr 2021
Normal vs. Adversarial: Salience-based Analysis of Adversarial Samples for Relation Extraction
Luoqiu Li
Xiang Chen
Zhen Bi
Xin Xie
Shumin Deng
Ningyu Zhang
Chuanqi Tan
Mosha Chen
Huajun Chen
AAML
115
7
0
01 Apr 2021
Evaluating Neural Word Embeddings for Sanskrit
Kevin Qinghong Lin
Om Adideva
Digumarthi Komal
Laxmidhar Behera
Pawan Goyal
109
12
0
01 Apr 2021
A Survey on Natural Language Video Localization
Xinfang Liu
Xiushan Nie
Zhifang Tan
Jie Guo
Yilong Yin
123
7
0
01 Apr 2021
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
Or Patashnik
Zongze Wu
Eli Shechtman
Daniel Cohen-Or
Dani Lischinski
CLIP, VLM
247
1,213
0
31 Mar 2021
Going deeper with Image Transformers
Hugo Touvron
Matthieu Cord
Alexandre Sablayrolles
Gabriel Synnaeve
Hervé Jégou
ViT
225
1,026
0
31 Mar 2021
Learning Spatio-Temporal Transformer for Visual Tracking
Bin Yan
Houwen Peng
Jianlong Fu
Dong Wang
Huchuan Lu
ViT
103
735
0
31 Mar 2021
A Neighbourhood Framework for Resource-Lean Content Flagging
Sheikh Muhammad Sarwar
Dimitrina Zlatkova
Momchil Hardalov
Yoan Dinkov
Isabelle Augenstein
Preslav Nakov
70
5
0
31 Mar 2021
Deep Neural Approaches to Relation Triplets Extraction: A Comprehensive Survey
Tapas Nayak
Navonil Majumder
Pawan Goyal
Soujanya Poria
ViT
62
52
0
31 Mar 2021
Few-shot learning through contextual data augmentation
Farid Arthaud
Rachel Bawden
Alexandra Birch
49
12
0
31 Mar 2021
Exploring Plausible Patches Using Source Code Embeddings in JavaScript
Viktor Csuvik
Dániel Horváth
Márk Lajkó
László Vidács
21
5
0
31 Mar 2021
Learning Generalizable Robotic Reward Functions from "In-The-Wild" Human Videos
Annie S. Chen
Suraj Nair
Chelsea Finn
94
141
0
31 Mar 2021
Self-Supervised Euphemism Detection and Identification for Content Moderation
Wanzheng Zhu
Hongyu Gong
Rohan Bansal
Zachary Weinberg
Nicolas Christin
Giulia Fanti
S. Bhat
77
40
0
31 Mar 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Alana de Santana Correia
Esther Luna Colombini
HAI
132
198
0
31 Mar 2021
Dual Contrastive Loss and Attention for GANs
Ning Yu
Guilin Liu
Aysegül Dündar
Andrew Tao
Bryan Catanzaro
Larry S. Davis
Mario Fritz
GAN
137
61
0
31 Mar 2021
Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark
Tianlin Li
Xiujun Shu
Zhipeng Zhang
Bo Jiang
Yaowei Wang
Yonghong Tian
Feng Wu
94
165
0
31 Mar 2021
BASE Layers: Simplifying Training of Large, Sparse Models
M. Lewis
Shruti Bhosale
Tim Dettmers
Naman Goyal
Luke Zettlemoyer
MoE
231
285
0
30 Mar 2021
Probabilistic Analogical Mapping with Semantic Relation Networks
Hongjing Lu
Nicholas Ichien
K. Holyoak
90
36
0
30 Mar 2021
Pre-training for low resource speech-to-intent applications
Pu Wang
Hugo Van hamme
45
4
0
30 Mar 2021
DAP: Detection-Aware Pre-training with Weak Supervision
Yuanyi Zhong
Jianfeng Wang
Lijuan Wang
Jian-wei Peng
Yu-Xiong Wang
Lei Zhang
76
15
0
30 Mar 2021
Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers
Antoine Miech
Jean-Baptiste Alayrac
Ivan Laptev
Josef Sivic
Andrew Zisserman
ViT
110
139
0
30 Mar 2021
Spatiotemporal Transformer for Video-based Person Re-identification
Tianyu Zhang
Longhui Wei
Lingxi Xie
Zijie Zhuang
Yongfei Zhang
Yue Liu
Qi Tian
ViT
103
32
0
30 Mar 2021
EnergyVis: Interactively Tracking and Exploring Energy Consumption for ML Models
Omar Shaikh
Jon Saad-Falcon
Austin P. Wright
Nilaksh Das
Scott Freitas
O. Asensio
Duen Horng Chau
59
20
0
30 Mar 2021
Grounding Dialogue Systems via Knowledge Graph Aware Decoding with Pre-trained Transformers
Debanjan Chaudhuri
Md. Rony
Jens Lehmann
73
12
0
30 Mar 2021
Locally-Contextual Nonlinear CRFs for Sequence Labeling
Harshil Shah
Tim Z. Xiao
David Barber
58
4
0
30 Mar 2021
AfriKI: Machine-in-the-Loop Afrikaans Poetry Generation
Imke van Heerden
Anil Bas
61
3
0
30 Mar 2021
Autocorrect in the Process of Translation -- Multi-task Learning Improves Dialogue Machine Translation
Tao Wang
Chengqi Zhao
Mingxuan Wang
Lei Li
Deyi Xiong
58
13
0
30 Mar 2021
Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
Mingchen Zhuge
D. Gao
Deng-Ping Fan
Linbo Jin
Ben Chen
Hao Zhou
Minghui Qiu
Ling Shao
VLM
103
121
0
30 Mar 2021
Automatic Graph Partitioning for Very Large-scale Deep Learning
Masahiro Tanaka
Kenjiro Taura
T. Hanawa
Kentaro Torisawa
GNN, AI4CE
58
21
0
30 Mar 2021
Grounding Open-Domain Instructions to Automate Web Support Tasks
N. Xu
Sam Masling
Michael Du
Giovanni Campagna
Larry Heck
James A. Landay
M. Lam
LLMAG, AI4TS
77
44
0
30 Mar 2021
Self-supervised Image-text Pre-training With Mixed Data In Chest X-rays
Xiaosong Wang
Ziyue Xu
Leo K. Tam
Dong Yang
Daguang Xu
ViT, MedIm
73
24
0
30 Mar 2021
Domain-robust VQA with diverse datasets and methods but no target labels
Ruotong Wang
Tristan D. Maidment
Ahmad Diab
Adriana Kovashka
R. Hwa
OOD
131
23
0
29 Mar 2021
Contextual Text Embeddings for Twi
P. Azunre
Salomey Osei
S. Addo
Lawrence Asamoah Adu-Gyamfi
Stephen E. Moore
...
Standylove Birago Mensah
Lucien Mensah
Mark Amoako Marcel
A. Amponsah
J. B. Hayfron-Acquah
62
6
0
29 Mar 2021
TREC 2020 Podcasts Track Overview
R. Jones
Ben Carterette
Ann Clifton
Maria Eskevich
G. Jones
Jussi Karlgren
Aasish Pappu
S. Reddy
Yongze Yu
3DGS
74
36
0
29 Mar 2021
Transformer visualization via dictionary learning: contextualized embedding as a linear superposition of transformer factors
Zeyu Yun
Yubei Chen
Bruno A. Olshausen
Yann LeCun
65
78
0
29 Mar 2021
Unsupervised Machine Translation On Dravidian Languages
Sai Koneru
Danni Liu
Jan Niehues
107
7
0
29 Mar 2021
CvT: Introducing Convolutions to Vision Transformers
Haiping Wu
Bin Xiao
Noel Codella
Mengchen Liu
Xiyang Dai
Lu Yuan
Lei Zhang
ViT
182
1,929
0
29 Mar 2021
Shrinking Bigfoot: Reducing wav2vec 2.0 footprint
Zilun Peng
Akshay Budhkar
Ilana Tuil
J. Levy
Parinaz Sobhani
Raphael Cohen
J. Nassour
55
33
0
29 Mar 2021
Retraining DistilBERT for a Voice Shopping Assistant by Using Universal Dependencies
P. Jayarao
Arpit Sharma
67
2
0
29 Mar 2021
CaSiNo: A Corpus of Campsite Negotiation Dialogues for Automatic Negotiation Systems
Kushal Chawla
Jaysa Ramirez
Rene Clever
Gale M. Lucas
Jonathan May
Jonathan Gratch
100
52
0
29 Mar 2021
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
242
2,178
0
29 Mar 2021
On the Adversarial Robustness of Vision Transformers
Rulin Shao
Zhouxing Shi
Jinfeng Yi
Pin-Yu Chen
Cho-Jui Hsieh
ViT
117
146
0
29 Mar 2021
Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models
Wenkai Yang
Lei Li
Zhiyuan Zhang
Xuancheng Ren
Xu Sun
Bin He
SILM
114
153
0
29 Mar 2021
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
Li Xu
He Huang
Jun Liu
ViT, LRM
114
88
0
29 Mar 2021
Transformer Tracking
Xin Chen
Bin Yan
Jiawen Zhu
Dong Wang
Xiaoyun Yang
Huchuan Lu
ViT
83
967
0
29 Mar 2021
Efficient Explanations from Empirical Explainers
Robert Schwarzenberg
Nils Feldhus
Sebastian Möller
FAtt
93
9
0
29 Mar 2021
Visual Distant Supervision for Scene Graph Generation
Yuan Yao
Ao Zhang
Xu Han
Mengdi Li
C. Weber
Zhiyuan Liu
S. Wermter
Maosong Sun
70
39
0
29 Mar 2021
Changing the Mind of Transformers for Topically-Controllable Language Generation
Haw-Shiuan Chang
Jiaming Yuan
Mohit Iyyer
Andrew McCallum
82
9
0
29 Mar 2021