ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,573 papers shown
Title
DuTongChuan: Context-aware Translation Model for Simultaneous
  Interpreting
DuTongChuan: Context-aware Translation Model for Simultaneous Interpreting
Hao Xiong
Ruiqing Zhang
Chuanqiang Zhang
Zhongjun He
Hua Wu
Haifeng Wang
76
28
0
30 Jul 2019
Machine Translation Evaluation with BERT Regressor
Machine Translation Evaluation with BERT Regressor
Hiroki Shimanaka
Tomoyuki Kajiwara
Mamoru Komachi
87
25
0
29 Jul 2019
Neural Mention Detection
Neural Mention Detection
Juntao Yu
Bernd Bohnet
Massimo Poesio
77
17
0
29 Jul 2019
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
S. Rothe
Shashi Narayan
Aliaksei Severyn
SILM
155
438
0
29 Jul 2019
VIANA: Visual Interactive Annotation of Argumentation
VIANA: Visual Interactive Annotation of Argumentation
F. Sperrle
Rita Sevastjanova
Rebecca Kehlbeck
Mennatallah El-Assady
58
25
0
29 Jul 2019
ERNIE 2.0: A Continual Pre-training Framework for Language Understanding
ERNIE 2.0: A Continual Pre-training Framework for Language Understanding
Yu Sun
Shuohuan Wang
Yukun Li
Shikun Feng
Hao Tian
Hua Wu
Haifeng Wang
CLL
118
813
0
29 Jul 2019
An Empirical Study on Leveraging Scene Graphs for Visual Question
  Answering
An Empirical Study on Leveraging Scene Graphs for Visual Question Answering
Cheng Zhang
Wei-Lun Chao
D. Xuan
77
51
0
28 Jul 2019
A Hybrid Neural Network Model for Commonsense Reasoning
A Hybrid Neural Network Model for Commonsense Reasoning
Pengcheng He
Xiaodong Liu
Weizhu Chen
Jianfeng Gao
LRM
80
29
0
27 Jul 2019
Is BERT Really Robust? A Strong Baseline for Natural Language Attack on
  Text Classification and Entailment
Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment
Di Jin
Zhijing Jin
Qiufeng Wang
Peter Szolovits
SILMAAML
326
1,098
0
27 Jul 2019
DeepCABAC: A Universal Compression Algorithm for Deep Neural Networks
DeepCABAC: A Universal Compression Algorithm for Deep Neural Networks
Simon Wiedemann
H. Kirchhoffer
Stefan Matlage
Paul Haase
Arturo Marbán
...
Ahmed Osman
D. Marpe
H. Schwarz
Thomas Wiegand
Wojciech Samek
111
98
0
27 Jul 2019
DLGNet: A Transformer-based Model for Dialogue Response Generation
DLGNet: A Transformer-based Model for Dialogue Response Generation
O. Olabiyi
Erik T. Mueller
98
12
0
26 Jul 2019
Supervised and Unsupervised Neural Approaches to Text Readability
Supervised and Unsupervised Neural Approaches to Text Readability
Matej Martinc
Senja Pollak
Marko Robnik-Šikonja
99
145
0
26 Jul 2019
Automatically Learning Construction Injury Precursors from Text
Automatically Learning Construction Injury Precursors from Text
Henrietta Baker
Matthew R. Hallowell
A. Tixier
82
102
0
26 Jul 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
1.1K
24,645
0
26 Jul 2019
Investigating Self-Attention Network for Chinese Word Segmentation
Investigating Self-Attention Network for Chinese Word Segmentation
Leilei Gan
Yue Zhang
46
11
0
26 Jul 2019
Visual Interaction with Deep Learning Models through Collaborative
  Semantic Inference
Visual Interaction with Deep Learning Models through Collaborative Semantic Inference
Sebastian Gehrmann
Hendrik Strobelt
Robert Krüger
Hanspeter Pfister
Alexander M. Rush
HAI
101
58
0
24 Jul 2019
Careful Selection of Knowledge to solve Open Book Question Answering
Careful Selection of Knowledge to solve Open Book Question Answering
Pratyay Banerjee
Kuntal Kumar Pal
Arindam Mitra
Chitta Baral
78
62
0
24 Jul 2019
Cross-Attention End-to-End ASR for Two-Party Conversations
Cross-Attention End-to-End ASR for Two-Party Conversations
Suyoun Kim
Siddharth Dalmia
Florian Metze
52
18
0
24 Jul 2019
Generic Intent Representation in Web Search
Generic Intent Representation in Web Search
Hongfei Zhang
Xia Song
Chenyan Xiong
Corby Rosset
Paul N. Bennett
Nick Craswell
Saurabh Tiwary
107
52
0
24 Jul 2019
WinoGrande: An Adversarial Winograd Schema Challenge at Scale
WinoGrande: An Adversarial Winograd Schema Challenge at Scale
Keisuke Sakaguchi
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
99
223
0
24 Jul 2019
SpanBERT: Improving Pre-training by Representing and Predicting Spans
SpanBERT: Improving Pre-training by Representing and Predicting Spans
Mandar Joshi
Danqi Chen
Yinhan Liu
Daniel S. Weld
Luke Zettlemoyer
Omer Levy
306
1,974
0
24 Jul 2019
Unbabel's Participation in the WMT19 Translation Quality Estimation
  Shared Task
Unbabel's Participation in the WMT19 Translation Quality Estimation Shared Task
Fabio Kepler
Jonay Trénous
Marcos Vinícius Treviso
M. Vera
António Góis
M. Amin Farajian
António Vilarinho Lopes
André F. T. Martins
94
58
0
24 Jul 2019
Tripartite Heterogeneous Graph Propagation for Large-scale Social
  Recommendation
Tripartite Heterogeneous Graph Propagation for Large-scale Social Recommendation
KyungHyun Kim
Donghyun Kwak
Hanock Kwak
Young-Jin Park
Sangkwon Sim
Jae-Han Cho
Minkyu Kim
Jihun Kwon
Nako Sung
Jung-Woo Ha
71
19
0
24 Jul 2019
Zero-Shot Sign Language Recognition: Can Textual Data Uncover Sign
  Languages?
Zero-Shot Sign Language Recognition: Can Textual Data Uncover Sign Languages?
Yunus Can Bilge
Nazli Ikizler-Cinbis
R. G. Cinbis
SLR
72
29
0
24 Jul 2019
CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context
  in Morphology
CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology
Aditi Chaudhary
Elizabeth Salesky
G. Bhat
David R. Mortensen
J. Carbonell
Yulia Tsvetkov
70
4
0
23 Jul 2019
Structured Fusion Networks for Dialog
Structured Fusion Networks for Dialog
Shikib Mehri
Tejas Srinivasan
M. Eskénazi
109
89
0
23 Jul 2019
Bilinear Graph Networks for Visual Question Answering
Bilinear Graph Networks for Visual Question Answering
Dalu Guo
Chang Xu
Dacheng Tao
GNN
93
54
0
23 Jul 2019
EmotionX-HSU: Adopting Pre-trained BERT for Emotion Classification
EmotionX-HSU: Adopting Pre-trained BERT for Emotion Classification
Li Luo
Yue Wang
55
26
0
23 Jul 2019
Green AI
Green AI
Roy Schwartz
Jesse Dodge
Noah A. Smith
Oren Etzioni
160
1,164
0
22 Jul 2019
BEHRT: Transformer for Electronic Health Records
BEHRT: Transformer for Electronic Health Records
Yikuan Li
Shishir Rao
J. R. A. Solares
A. Hassaine
D. Canoy
Yajie Zhu
K. Rahimi
G. Salimi-Khorshidi
OOD
124
469
0
22 Jul 2019
Emotion Detection in Text: Focusing on Latent Representation
Emotion Detection in Text: Focusing on Latent Representation
Armin Seyeditabari
N. Tabari
Shafie Gholizadeh
Wlodek Zadrozny
34
15
0
22 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
141
136
0
22 Jul 2019
ELI5: Long Form Question Answering
ELI5: Long Form Question Answering
Angela Fan
Yacine Jernite
Ethan Perez
David Grangier
Jason Weston
Michael Auli
AI4MHELM
157
627
0
22 Jul 2019
GEAR: Graph-based Evidence Aggregating and Reasoning for Fact
  Verification
GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification
Jie Zhou
Xu Han
Cheng Yang
Zhiyuan Liu
Lifeng Wang
Changcheng Li
Maosong Sun
104
201
0
22 Jul 2019
Generating Sentiment-Preserving Fake Online Reviews Using Neural
  Language Models and Their Human- and Machine-based Detection
Generating Sentiment-Preserving Fake Online Reviews Using Neural Language Models and Their Human- and Machine-based Detection
David Ifeoluwa Adelani
H. Mai
Fuming Fang
H. Nguyen
Junichi Yamagishi
Isao Echizen
DeLMO
121
122
0
22 Jul 2019
Realistic Channel Models Pre-training
Realistic Channel Models Pre-training
Yourui Huangfu
Jian Wang
Chen Xu
Rong Li
Yiqun Ge
Xianbin Wang
Huazi Zhang
Jun Wang
53
6
0
22 Jul 2019
ER-AE: Differentially Private Text Generation for Authorship
  Anonymization
ER-AE: Differentially Private Text Generation for Authorship Anonymization
Haohan Bo
Steven H. H. Ding
Benjamin C. M. Fung
Farkhund Iqbal
DeLMO
78
38
0
20 Jul 2019
What is this Article about? Extreme Summarization with Topic-aware
  Convolutional Neural Networks
What is this Article about? Extreme Summarization with Topic-aware Convolutional Neural Networks
Shashi Narayan
Shay B. Cohen
Mirella Lapata
AILaw
79
18
0
19 Jul 2019
Structure-Invariant Testing for Machine Translation
Structure-Invariant Testing for Machine Translation
Pinjia He
Clara Meister
Z. Su
75
106
0
19 Jul 2019
A Pragmatics-Centered Evaluation Framework for Natural Language
  Understanding
A Pragmatics-Centered Evaluation Framework for Natural Language Understanding
Damien Sileo
Tim Van de Cruys
Camille Pradel
Philippe Muller
ELM
40
3
0
19 Jul 2019
DaiMoN: A Decentralized Artificial Intelligence Model Network
DaiMoN: A Decentralized Artificial Intelligence Model Network
Surat Teerapittayanon
H. T. Kung
FedML
39
3
0
19 Jul 2019
WriterForcing: Generating more interesting story endings
WriterForcing: Generating more interesting story endings
Prakhar Gupta
Vinayshekhar Bannihatti Kumar
Mukul Bhutani
A. Black
66
18
0
18 Jul 2019
Joint Learning of Named Entity Recognition and Entity Linking
Joint Learning of Named Entity Recognition and Entity Linking
Pedro Henrique Martins
Zita Marinho
André F. T. Martins
146
95
0
18 Jul 2019
Self-Attentional Credit Assignment for Transfer in Reinforcement
  Learning
Self-Attentional Credit Assignment for Transfer in Reinforcement Learning
Johan Ferret
Raphaël Marinier
Matthieu Geist
Olivier Pietquin
OffRL
86
6
0
18 Jul 2019
ELG: An Event Logic Graph
ELG: An Event Logic Graph
Xiao Ding
Zhongyang Li
Ting Liu
Kuo Liao
63
40
0
18 Jul 2019
Deep Neural Models for Medical Concept Normalization in User-Generated
  Texts
Deep Neural Models for Medical Concept Normalization in User-Generated Texts
Z. Miftahutdinov
E. Tutubalina
MedIm
56
44
0
18 Jul 2019
What Should/Do/Can LSTMs Learn When Parsing Auxiliary Verb
  Constructions?
What Should/Do/Can LSTMs Learn When Parsing Auxiliary Verb Constructions?
Miryam de Lhoneux
Sara Stymne
Joakim Nivre
47
3
0
18 Jul 2019
Self-Attentive Hawkes Processes
Self-Attentive Hawkes Processes
Qiang Zhang
Aldo Lipani
Ömer Kirnap
Emine Yilmaz
AI4TS
125
45
0
17 Jul 2019
Low-Shot Classification: A Comparison of Classical and Deep Transfer
  Machine Learning Approaches
Low-Shot Classification: A Comparison of Classical and Deep Transfer Machine Learning Approaches
Peter Usherwood
S. Smit
VLM
49
11
0
17 Jul 2019
SUMBT: Slot-Utterance Matching for Universal and Scalable Belief
  Tracking
SUMBT: Slot-Utterance Matching for Universal and Scalable Belief Tracking
Hwaran Lee
Jinsik Lee
Tae-Yoon Kim
60
162
0
17 Jul 2019
Previous
123...457458459...470471472
Next