BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM, SSL, SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,534 papers shown
Controllable Unsupervised Text Attribute Transfer via Editing Entangled Latent Representation
Ke Wang
Hang Hua
Xiaojun Wan
82
89
0
30 May 2019
A Compare-Aggregate Model with Latent Clustering for Answer Selection
Seunghyun Yoon
Franck Dernoncourt
Doo Soon Kim
Trung Bui
Kyomin Jung
72
69
0
30 May 2019
Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention
Wenhu Chen
Jianshu Chen
Pengda Qin
Xifeng Yan
William Yang Wang
90
129
0
30 May 2019
A Simple but Effective Method to Incorporate Multi-turn Context with BERT for Conversational Machine Comprehension
Yasuhito Ohsugi
Itsumi Saito
Kyosuke Nishida
Hisako Asano
J. Tomita
85
43
0
30 May 2019
A Generalized Framework of Sequence Generation with Application to Undirected Sequence Models
Elman Mansimov
Alex Jinpeng Wang
Sean Welleck
Kyunghyun Cho
AIMat
93
46
0
29 May 2019
Educating Text Autoencoders: Latent Representation Guidance via Denoising
T. Shen
Jonas W. Mueller
Regina Barzilay
Tommi Jaakkola
46
4
0
29 May 2019
Unsupervised Paraphrasing without Translation
Aurko Roy
David Grangier
BDL, LRM
124
61
0
29 May 2019
Adapting Text Embeddings for Causal Inference
Victor Veitch
Dhanya Sridhar
David M. Blei
CML
61
21
0
29 May 2019
Defending Against Neural Fake News
Rowan Zellers
Ari Holtzman
Hannah Rashkin
Yonatan Bisk
Ali Farhadi
Franziska Roesner
Yejin Choi
AAML
153
1,035
0
29 May 2019
Towards better substitution-based word sense induction
Asaf Amrami
Yoav Goldberg
100
40
0
29 May 2019
Learning Task-specific Representation for Novel Words in Sequence Labeling
Minlong Peng
Qi Zhang
Xiaoyu Xing
Tao Gui
Jinlan Fu
Xuanjing Huang
70
8
0
29 May 2019
Strategies for Pre-training Graph Neural Networks
Weihua Hu
Bowen Liu
Joseph Gomes
Marinka Zitnik
Percy Liang
Vijay S. Pande
J. Leskovec
SSL, AI4CE
133
1,424
0
29 May 2019
Better Long-Range Dependency By Bootstrapping A Mutual Information Regularizer
Yanshuai Cao
Peng Xu
28
2
0
28 May 2019
On Variational Learning of Controllable Representations for Text without Supervision
Peng Xu
Jackie C.K. Cheung
Yanshuai Cao
SSL, DRL
70
9
0
28 May 2019
EDUCE: Explaining model Decisions through Unsupervised Concepts Extraction
Diane Bouchacourt
Ludovic Denoyer
FAtt
74
21
0
28 May 2019
Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain)
Mariya Toneva
Leila Wehbe
MILM, AI4CE
108
235
0
28 May 2019
DSReg: Using Distant Supervision as a Regularizer
Yuxian Meng
Muyu Li
Xiaoya Li
Wei Wu
Jiwei Li
84
3
0
28 May 2019
XLDA: Cross-Lingual Data Augmentation for Natural Language Inference and Question Answering
Jasdeep Singh
Bryan McCann
N. Keskar
Caiming Xiong
R. Socher
ELM
81
81
0
27 May 2019
Combating Adversarial Misspellings with Robust Word Recognition
Danish Pruthi
Bhuwan Dhingra
Zachary Chase Lipton
221
309
0
27 May 2019
STAR-GCN: Stacked and Reconstructed Graph Convolutional Networks for Recommender Systems
Jiani Zhang
Xingjian Shi
Shenglin Zhao
Irwin King
63
228
0
27 May 2019
SCAN: A Scalable Neural Networks Framework Towards Compact and Efficient Models
Linfeng Zhang
Zhanhong Tan
Jiebo Song
Jingwei Chen
Chenglong Bao
Kaisheng Ma
55
71
0
27 May 2019
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
187
372
0
27 May 2019
Levenshtein Transformer
Jiatao Gu
Changhan Wang
Jake Zhao
165
359
0
27 May 2019
AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence
Jeff Clune
148
122
0
27 May 2019
QuesNet: A Unified Representation for Heterogeneous Test Questions
Yu Yin
Qi Liu
Zhenya Huang
Enhong Chen
Wei Tong
Shijin Wang
Yu-Ho Su
28
47
0
27 May 2019
Extreme Multi-Label Legal Text Classification: A case study in EU Legislation
Ilias Chalkidis
Manos Fergadiotis
Prodromos Malakasiotis
Nikolaos Aletras
Ion Androutsopoulos
AILaw
81
75
0
26 May 2019
Where's My Head? Definition, Dataset and Models for Numeric Fused-Heads Identification and Resolution
Yanai Elazar
Yoav Goldberg
108
23
0
26 May 2019
TIGS: An Inference Algorithm for Text Infilling with Gradient Search
Dayiheng Liu
Jie Fu
Pengfei Liu
Jiancheng Lv
DiffM
139
27
0
26 May 2019
TACAM: Topic And Context Aware Argument Mining
Michael Fromm
Evgeniy Faerman
T. Seidl
75
25
0
26 May 2019
Hashing based Answer Selection
Dong Xu
Wu-Jun Li
53
6
0
26 May 2019
Graph Attention Auto-Encoders
Amin Salehi
H. Davulcu
GNN
72
125
0
26 May 2019
Are Sixteen Heads Really Better than One?
Paul Michel
Omer Levy
Graham Neubig
MoE
120
1,073
0
25 May 2019
Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers
Liwei Wu
Shuqing Li
Cho-Jui Hsieh
James Sharpnack
84
33
0
25 May 2019
SemEval-2019 Task 8: Fact Checking in Community Question Answering Forums
Tsvetomila Mihaylova
Georgi Karadzhov
Pepa Atanasova
R. Baly
Mitra Mohtarami
Preslav Nakov
79
62
0
25 May 2019
Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark
Nikita Nangia
Samuel R. Bowman
ELM, ALM
84
76
0
24 May 2019
Discrete Flows: Invertible Generative Models of Discrete Data
Dustin Tran
Keyon Vafa
Kumar Krishna Agrawal
Laurent Dinh
Ben Poole
DRL
166
117
0
24 May 2019
SCRAM: Spatially Coherent Randomized Attention Maps
D. A. Calian
P. Roelants
Jacques Calì
B. Carr
K. Dubba
John E. Reid
Dell Zhang
54
2
0
24 May 2019
Controlling Risk of Web Question Answering
Lixin Su
Jiafeng Guo
Yixing Fan
Yanyan Lan
Xueqi Cheng
56
9
0
24 May 2019
Label-aware Document Representation via Hybrid Attention for Extreme Multi-Label Text Classification
Xin Huang
Boli Chen
Lin Xiao
L. Jing
62
36
0
24 May 2019
BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions
Christopher Clark
Kenton Lee
Ming-Wei Chang
Tom Kwiatkowski
Michael Collins
Kristina Toutanova
399
1,564
0
24 May 2019
Personalizing Dialogue Agents via Meta-Learning
Zhaojiang Lin
Andrea Madotto
Chien-Sheng Wu
Pascale Fung
140
187
0
24 May 2019
Fair is Better than Sensational: Man is to Doctor as Woman is to Doctor
Malvina Nissim
Rik van Noord
Rob van der Goot
FaML
92
103
0
23 May 2019
Zero-shot Knowledge Transfer via Adversarial Belief Matching
P. Micaelli
Amos Storkey
82
230
0
23 May 2019
Misspelling Oblivious Word Embeddings
Bora Edizel
Aleksandra Piktus
Piotr Bojanowski
Rui A. Ferreira
Edouard Grave
Fabrizio Silvestri
74
65
0
23 May 2019
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment
Jivitesh Sharma
Per-Arne Andersen
Ole-Christoffer Granmo
M. G. Olsen
AI4CE
76
70
0
23 May 2019
An Investigation of Transfer Learning-Based Sentiment Analysis in Japanese
Enkhbold Bataa
Joshua Wu
78
33
0
23 May 2019
AMSI-Based Detection of Malicious PowerShell Code Using Contextual Embeddings
Amir Rubin
Shay Kels
Danny Hendler
57
2
0
23 May 2019
Data-Efficient Image Recognition with Contrastive Predictive Coding
Olivier J. Hénaff
A. Srinivas
J. Fauw
Ali Razavi
Carl Doersch
S. M. Ali Eslami
Aaron van den Oord
SSL
204
1,437
0
22 May 2019
Deeper Text Understanding for IR with Contextual Neural Language Modeling
Zhuyun Dai
Jamie Callan
80
449
0
22 May 2019
Simplified Neural Unsupervised Domain Adaptation
Timothy A. Miller
67
29
0
22 May 2019