ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,688 papers shown
Title
On the Evolution of Syntactic Information Encoded by BERT's
  Contextualized Representations
On the Evolution of Syntactic Information Encoded by BERT's Contextualized Representations
Laura Pérez-Mayos
Roberto Carlini
Miguel Ballesteros
Leo Wanner
64
7
0
27 Jan 2021
Spatial-Channel Transformer Network for Trajectory Prediction on the
  Traffic Scenes
Spatial-Channel Transformer Network for Trajectory Prediction on the Traffic Scenes
Jingwen Zhao
Xuanpeng Li
Qifan Xue
Weigong Zhang
ViT
78
15
0
27 Jan 2021
KoreALBERT: Pretraining a Lite BERT Model for Korean Language
  Understanding
KoreALBERT: Pretraining a Lite BERT Model for Korean Language Understanding
HyunJae Lee
Jaewoong Yoon
Bonggyu Hwang
Seongho Joe
Seungjai Min
Youngjune Gwon
SSeg
58
16
0
27 Jan 2021
An explainable Transformer-based deep learning model for the prediction
  of incident heart failure
An explainable Transformer-based deep learning model for the prediction of incident heart failure
Shishir Rao
Yikuan Li
R. Ramakrishnan
A. Hassaine
D. Canoy
J. Cleland
Thomas Lukasiewicz
G. Salimi-Khorshidi
K. Rahimi
MedIm
54
72
0
27 Jan 2021
On the Interpretability of Deep Learning Based Models for Knowledge
  Tracing
On the Interpretability of Deep Learning Based Models for Knowledge Tracing
Xinyi Ding
Eric C. Larson
56
9
0
27 Jan 2021
Adversarial Stylometry in the Wild: Transferable Lexical Substitution
  Attacks on Author Profiling
Adversarial Stylometry in the Wild: Transferable Lexical Substitution Attacks on Author Profiling
Chris Emmery
Ákos Kádár
Grzegorz Chrupała
AAML
94
20
0
27 Jan 2021
Language Modelling as a Multi-Task Problem
Language Modelling as a Multi-Task Problem
Leon Weber
Jaap Jumelet
Elia Bruni
Dieuwke Hupkes
86
13
0
27 Jan 2021
VisualMRC: Machine Reading Comprehension on Document Images
VisualMRC: Machine Reading Comprehension on Document Images
Ryota Tanaka
Kyosuke Nishida
Sen Yoshida
101
146
0
27 Jan 2021
Joint Coreference Resolution and Character Linking for Multiparty
  Conversation
Joint Coreference Resolution and Character Linking for Multiparty Conversation
Jiaxin Bai
Hongming Zhang
Yangqiu Song
Kun Xu
89
7
0
27 Jan 2021
Neural Sentence Ordering Based on Constraint Graphs
Neural Sentence Ordering Based on Constraint Graphs
Yutao Zhu
Kun Zhou
J. Nie
Shengchao Liu
Zhicheng Dou
NAI
89
23
0
27 Jan 2021
Named Entity Recognition in the Style of Object Detection
Named Entity Recognition in the Style of Object Detection
Bing Li
49
4
0
26 Jan 2021
Cross-Lingual Named Entity Recognition Using Parallel Corpus: A New
  Approach Using XLM-RoBERTa Alignment
Cross-Lingual Named Entity Recognition Using Parallel Corpus: A New Approach Using XLM-RoBERTa Alignment
Bing Li
Yujie He
Wenjin Xu
89
24
0
26 Jan 2021
First Align, then Predict: Understanding the Cross-Lingual Ability of
  Multilingual BERT
First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT
Benjamin Muller
Yanai Elazar
Benoît Sagot
Djamé Seddah
LRM
67
77
0
26 Jan 2021
Text2Gestures: A Transformer-Based Network for Generating Emotive Body
  Gestures for Virtual Agents
Text2Gestures: A Transformer-Based Network for Generating Emotive Body Gestures for Virtual Agents
Uttaran Bhattacharya
Nicholas Rewkowski
A. Banerjee
P. Guhan
Aniket Bera
Tianyi Zhou
LM&Ro
88
153
0
26 Jan 2021
Adaptivity without Compromise: A Momentumized, Adaptive, Dual Averaged
  Gradient Method for Stochastic Optimization
Adaptivity without Compromise: A Momentumized, Adaptive, Dual Averaged Gradient Method for Stochastic Optimization
Aaron Defazio
Samy Jelassi
ODL
90
70
0
26 Jan 2021
Deep Subjecthood: Higher-Order Grammatical Features in Multilingual BERT
Deep Subjecthood: Higher-Order Grammatical Features in Multilingual BERT
Isabel Papadimitriou
Ethan A. Chi
Richard Futrell
Kyle Mahowald
90
44
0
26 Jan 2021
Muppet: Massive Multi-task Representations with Pre-Finetuning
Muppet: Massive Multi-task Representations with Pre-Finetuning
Armen Aghajanyan
Anchit Gupta
Akshat Shrivastava
Xilun Chen
Luke Zettlemoyer
Sonal Gupta
100
270
0
26 Jan 2021
A Case Study of Deep Learning Based Multi-Modal Methods for Predicting
  the Age-Suitability Rating of Movie Trailers
A Case Study of Deep Learning Based Multi-Modal Methods for Predicting the Age-Suitability Rating of Movie Trailers
Mahsa Shafaei
C. Smailis
I. Kakadiaris
Thamar Solorio
414
1
0
26 Jan 2021
An Efficient Statistical-based Gradient Compression Technique for
  Distributed Training Systems
An Efficient Statistical-based Gradient Compression Technique for Distributed Training Systems
A. Abdelmoniem
Ahmed Elzanaty
Mohamed-Slim Alouini
Marco Canini
132
77
0
26 Jan 2021
Summarising Historical Text in Modern Languages
Summarising Historical Text in Modern Languages
Xutan Peng
Yifei Zheng
Chenghua Lin
Advaith Siddharthan
AILaw
63
12
0
26 Jan 2021
Regulatory Compliance through Doc2Doc Information Retrieval: A case
  study in EU/UK legislation where text similarity has limitations
Regulatory Compliance through Doc2Doc Information Retrieval: A case study in EU/UK legislation where text similarity has limitations
Ilias Chalkidis
Manos Fergadiotis
Nikolaos Manginas
Eva Katakalou
Prodromos Malakasiotis
AILaw
60
27
0
26 Jan 2021
Combining Deep Generative Models and Multi-lingual Pretraining for
  Semi-supervised Document Classification
Combining Deep Generative Models and Multi-lingual Pretraining for Semi-supervised Document Classification
Yi Zhu
Ehsan Shareghi
Yingzhen Li
Roi Reichart
Anna Korhonen
VLM
49
5
0
26 Jan 2021
Exploring Transitivity in Neural NLI Models through Veridicality
Exploring Transitivity in Neural NLI Models through Veridicality
Hitomi Yanaka
K. Mineshima
Kentaro Inui
84
23
0
26 Jan 2021
Evaluation of BERT and ALBERT Sentence Embedding Performance on
  Downstream NLP Tasks
Evaluation of BERT and ALBERT Sentence Embedding Performance on Downstream NLP Tasks
Hyunjin Choi
Judong Kim
Seongho Joe
Youngjune Gwon
SSeg
97
104
0
26 Jan 2021
Representations for Question Answering from Documents with Tables and
  Text
Representations for Question Answering from Documents with Tables and Text
Vicky Zayats
Kristina Toutanova
Mari Ostendorf
LMTD
112
37
0
26 Jan 2021
RESPER: Computationally Modelling Resisting Strategies in Persuasive
  Conversations
RESPER: Computationally Modelling Resisting Strategies in Persuasive Conversations
Ritam Dutt
S. Sinha
Rishabh Joshi
S. Chakraborty
Meredith Riggs
Xinru Yan
Haogang Bao
Carolyn Rose
216
20
0
26 Jan 2021
El Volumen Louder Por Favor: Code-switching in Task-oriented Semantic
  Parsing
El Volumen Louder Por Favor: Code-switching in Task-oriented Semantic Parsing
Arash Einolghozati
Abhinav Arora
Lorena Sainz-Maza Lecanda
Anuj Kumar
Sonal Gupta
112
9
0
26 Jan 2021
Contrastive analysis for scatterplot-based representations of
  dimensionality reduction
Contrastive analysis for scatterplot-based representations of dimensionality reduction
Wilson E. Marcílio-Jr
D. M. Eler
Rogério E. Garcia
46
14
0
26 Jan 2021
On the Evaluation of Vision-and-Language Navigation Instructions
On the Evaluation of Vision-and-Language Navigation Instructions
Mingde Zhao
Peter Anderson
Vihan Jain
Su Wang
Alexander Ku
Jason Baldridge
Eugene Ie
326
53
0
26 Jan 2021
Model-agnostic interpretation by visualization of feature perturbations
Model-agnostic interpretation by visualization of feature perturbations
Wilson E. Marcílio-Jr
D. M. Eler
Fabricio A. Breve
AAML
43
1
0
26 Jan 2021
English Machine Reading Comprehension Datasets: A Survey
English Machine Reading Comprehension Datasets: A Survey
Daria Dzendzik
Carl Vogel
Jennifer Foster
RALMAIMat
92
49
0
25 Jan 2021
droidlet: modular, heterogenous, multi-modal agents
droidlet: modular, heterogenous, multi-modal agents
Anurag Pratik
Soumith Chintala
Kavya Srinet
Dhiraj Gandhi
Rebecca Qian
...
Anoushka Tiwari
Tucker Hart
Mary Williamson
Abhinav Gupta
Arthur Szlam
VLMLM&Ro
57
3
0
25 Jan 2021
Curriculum Learning: A Survey
Curriculum Learning: A Survey
Petru Soviany
Radu Tudor Ionescu
Paolo Rota
N. Sebe
ODL
201
364
0
25 Jan 2021
Meta-Learning for Effective Multi-task and Multilingual Modelling
Meta-Learning for Effective Multi-task and Multilingual Modelling
Ishan Tarunesh
Sushil Khyalia
Vishwajeet Kumar
Ganesh Ramakrishnan
Preethi Jyothi
81
16
0
25 Jan 2021
Transferable Interactiveness Knowledge for Human-Object Interaction
  Detection
Transferable Interactiveness Knowledge for Human-Object Interaction Detection
Yong-Lu Li
Xinpeng Liu
Xiaoqian Wu
Xijie Huang
Liang Xu
Cewu Lu
78
5
0
25 Jan 2021
PAWLS: PDF Annotation With Labels and Structure
PAWLS: PDF Annotation With Labels and Structure
Mark Neumann
Zejiang Shen
Sam Skjonsberg
82
20
0
25 Jan 2021
TDMSci: A Specialized Corpus for Scientific Literature Entity Tagging of
  Tasks Datasets and Metrics
TDMSci: A Specialized Corpus for Scientific Literature Entity Tagging of Tasks Datasets and Metrics
Yufang Hou
Charles Jochim
Martin Gleize
Francesca Bonin
Debasis Ganguly
76
48
0
25 Jan 2021
Learning From Revisions: Quality Assessment of Claims in Argumentation
  at Scale
Learning From Revisions: Quality Assessment of Claims in Argumentation at Scale
Gabriella Skitalinskaya
Jonas Klaff
Henning Wachsmuth
66
29
0
25 Jan 2021
A Hybrid Approach to Measure Semantic Relatedness in Biomedical Concepts
A Hybrid Approach to Measure Semantic Relatedness in Biomedical Concepts
Katikapalli Subramanyam Kalyan
S. Sangeetha
72
9
0
25 Jan 2021
Cross-lingual Visual Pre-training for Multimodal Machine Translation
Cross-lingual Visual Pre-training for Multimodal Machine Translation
Ozan Caglayan
Menekse Kuyu
Mustafa Sercan Amac
Pranava Madhyastha
Erkut Erdem
Aykut Erdem
Lucia Specia
VLM
82
46
0
25 Jan 2021
SpanEmo: Casting Multi-label Emotion Classification as Span-prediction
SpanEmo: Casting Multi-label Emotion Classification as Span-prediction
Hassan Alhuzali
Sophia Ananiadou
120
90
0
25 Jan 2021
Adversarial Text-to-Image Synthesis: A Review
Adversarial Text-to-Image Synthesis: A Review
Stanislav Frolov
Tobias Hinz
Federico Raue
Jörn Hees
Andreas Dengel
EGVM
88
178
0
25 Jan 2021
CHOLAN: A Modular Approach for Neural Entity Linking on Wikipedia and
  Wikidata
CHOLAN: A Modular Approach for Neural Entity Linking on Wikipedia and Wikidata
M. Ravi
Kuldeep Singh
I. Mulang'
Saeedeh Shekarpour
Johannes Hoffart
Jens Lehmann
KELM
82
36
0
25 Jan 2021
GP: Context-free Grammar Pre-training for Text-to-SQL Parsers
GP: Context-free Grammar Pre-training for Text-to-SQL Parsers
Liang Zhao
Hexin Cao
Yunsong Zhao
AI4CE
63
11
0
25 Jan 2021
ECOL-R: Encouraging Copying in Novel Object Captioning with
  Reinforcement Learning
ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement Learning
Yufei Wang
Ian D. Wood
Stephen Wan
Mark Johnson
47
8
0
25 Jan 2021
FakeFlow: Fake News Detection by Modeling the Flow of Affective
  Information
FakeFlow: Fake News Detection by Modeling the Flow of Affective Information
Bilal Ghanem
Simone Paolo Ponzetto
Paolo Rosso
Francisco Rangel
111
69
0
24 Jan 2021
Belief-based Generation of Argumentative Claims
Belief-based Generation of Argumentative Claims
Milad Alshomary
Wei-Fan Chen
Timon Ziegenbein
Henning Wachsmuth
185
25
0
24 Jan 2021
Modern Machine and Deep Learning Systems as a way to achieve
  Man-Computer Symbiosis
Modern Machine and Deep Learning Systems as a way to achieve Man-Computer Symbiosis
Chirag Gupta
74
0
0
24 Jan 2021
RomeBERT: Robust Training of Multi-Exit BERT
RomeBERT: Robust Training of Multi-Exit BERT
Shijie Geng
Peng Gao
Zuohui Fu
Yongfeng Zhang
81
28
0
24 Jan 2021
Stereotype and Skew: Quantifying Gender Bias in Pre-trained and
  Fine-tuned Language Models
Stereotype and Skew: Quantifying Gender Bias in Pre-trained and Fine-tuned Language Models
Daniel de Vassimon Manela
D. Errington
Thomas Fisher
B. V. Breugel
Pasquale Minervini
54
96
0
24 Jan 2021
Previous
123...364365366...472473474
Next