ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 19,366 papers shown
Title
Masked Non-Autoregressive Image Captioning
Masked Non-Autoregressive Image Captioning
Junlong Gao
Xi Meng
Shiqi Wang
Xia Li
Shanshe Wang
Siwei Ma
Wen Gao
30
36
0
03 Jun 2019
BAYHENN: Combining Bayesian Deep Learning and Homomorphic Encryption for
  Secure DNN Inference
BAYHENN: Combining Bayesian Deep Learning and Homomorphic Encryption for Secure DNN Inference
Peichen Xie
Bingzhe Wu
Guangyu Sun
BDL
FedML
19
33
0
03 Jun 2019
Efficient 8-Bit Quantization of Transformer Neural Machine Language
  Translation Model
Efficient 8-Bit Quantization of Transformer Neural Machine Language Translation Model
Aishwarya Bhandare
Vamsi Sripathi
Deepthi Karkada
Vivek V. Menon
Sun Choi
Kushal Datta
V. Saletore
MQ
30
131
0
03 Jun 2019
A Survey of Natural Language Generation Techniques with a Focus on
  Dialogue Systems - Past, Present and Future Directions
A Survey of Natural Language Generation Techniques with a Focus on Dialogue Systems - Past, Present and Future Directions
Sashank Santhanam
Samira Shaikh
3DV
36
52
0
02 Jun 2019
Pretraining Methods for Dialog Context Representation Learning
Pretraining Methods for Dialog Context Representation Learning
Shikib Mehri
E. Razumovskaia
Tiancheng Zhao
M. Eskénazi
37
84
0
02 Jun 2019
Pre-training of Graph Augmented Transformers for Medication
  Recommendation
Pre-training of Graph Augmented Transformers for Medication Recommendation
Junyuan Shang
Tengfei Ma
Cao Xiao
Jimeng Sun
27
283
0
02 Jun 2019
Adversarial Generation and Encoding of Nested Texts
Adversarial Generation and Encoding of Nested Texts
A. Rozental
GAN
22
0
0
01 Jun 2019
Scoring Sentence Singletons and Pairs for Abstractive Summarization
Scoring Sentence Singletons and Pairs for Abstractive Summarization
Logan Lebanoff
Kaiqiang Song
Franck Dernoncourt
Doo Soon Kim
Seokhwan Kim
W. Chang
Fei Liu
CVBM
35
105
0
31 May 2019
Pre-Training Graph Neural Networks for Generic Structural Feature
  Extraction
Pre-Training Graph Neural Networks for Generic Structural Feature Extraction
Ziniu Hu
Changjun Fan
Ting-Li Chen
Kai-Wei Chang
Yizhou Sun
38
43
0
31 May 2019
Do Human Rationales Improve Machine Explanations?
Do Human Rationales Improve Machine Explanations?
Julia Strout
Ye Zhang
Raymond J. Mooney
19
57
0
31 May 2019
Investigating an Effective Character-level Embedding in Korean Sentence
  Classification
Investigating an Effective Character-level Embedding in Korean Sentence Classification
Won Ik Cho
Seokhwan Kim
N. Kim
30
8
0
31 May 2019
MultiQA: An Empirical Investigation of Generalization and Transfer in
  Reading Comprehension
MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension
Alon Talmor
Jonathan Berant
32
172
0
31 May 2019
Fine-Grained Spoiler Detection from Large-Scale Review Corpora
Fine-Grained Spoiler Detection from Large-Scale Review Corpora
Mengting Wan
Rishabh Misra
Ndapandula Nakashole
Julian McAuley
9
130
0
31 May 2019
Rewarding Smatch: Transition-Based AMR Parsing with Reinforcement
  Learning
Rewarding Smatch: Transition-Based AMR Parsing with Reinforcement Learning
Tahira Naseem
Abhishek Shah
Hui Wan
Radu Florian
Salim Roukos
Miguel Ballesteros
27
59
0
31 May 2019
A Lightweight Recurrent Network for Sequence Modeling
A Lightweight Recurrent Network for Sequence Modeling
Biao Zhang
Rico Sennrich
27
7
0
30 May 2019
Unbabel's Submission to the WMT2019 APE Shared Task: BERT-based
  Encoder-Decoder for Automatic Post-Editing
Unbabel's Submission to the WMT2019 APE Shared Task: BERT-based Encoder-Decoder for Automatic Post-Editing
António Vilarinho Lopes
M. Amin Farajian
Gonçalo M. Correia
Jonay Trénous
André F. T. Martins
33
35
0
30 May 2019
Semantically Conditioned Dialog Response Generation via Hierarchical
  Disentangled Self-Attention
Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention
Wenhu Chen
Jianshu Chen
Pengda Qin
Xifeng Yan
William Yang Wang
36
129
0
30 May 2019
A Simple but Effective Method to Incorporate Multi-turn Context with
  BERT for Conversational Machine Comprehension
A Simple but Effective Method to Incorporate Multi-turn Context with BERT for Conversational Machine Comprehension
Yasuhito Ohsugi
Itsumi Saito
Kyosuke Nishida
Hisako Asano
J. Tomita
38
43
0
30 May 2019
A Generalized Framework of Sequence Generation with Application to
  Undirected Sequence Models
A Generalized Framework of Sequence Generation with Application to Undirected Sequence Models
Elman Mansimov
Alex Jinpeng Wang
Sean Welleck
Kyunghyun Cho
AIMat
28
46
0
29 May 2019
Educating Text Autoencoders: Latent Representation Guidance via
  Denoising
Educating Text Autoencoders: Latent Representation Guidance via Denoising
T. Shen
Jonas W. Mueller
Regina Barzilay
Tommi Jaakkola
19
4
0
29 May 2019
Unsupervised Paraphrasing without Translation
Unsupervised Paraphrasing without Translation
Aurko Roy
David Grangier
BDL
LRM
16
61
0
29 May 2019
Adapting Text Embeddings for Causal Inference
Adapting Text Embeddings for Causal Inference
Victor Veitch
Dhanya Sridhar
David M. Blei
CML
22
21
0
29 May 2019
Defending Against Neural Fake News
Defending Against Neural Fake News
Rowan Zellers
Ari Holtzman
Hannah Rashkin
Yonatan Bisk
Ali Farhadi
Franziska Roesner
Yejin Choi
AAML
62
1,006
0
29 May 2019
Interpreting and improving natural-language processing (in machines)
  with natural language-processing (in the brain)
Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain)
Mariya Toneva
Leila Wehbe
MILM
AI4CE
47
223
0
28 May 2019
Combating Adversarial Misspellings with Robust Word Recognition
Combating Adversarial Misspellings with Robust Word Recognition
Danish Pruthi
Bhuwan Dhingra
Zachary Chase Lipton
31
302
0
27 May 2019
STAR-GCN: Stacked and Reconstructed Graph Convolutional Networks for
  Recommender Systems
STAR-GCN: Stacked and Reconstructed Graph Convolutional Networks for Recommender Systems
Jiani Zhang
Xingjian Shi
Shenglin Zhao
Irwin King
29
225
0
27 May 2019
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
143
360
0
27 May 2019
Levenshtein Transformer
Levenshtein Transformer
Jiatao Gu
Changhan Wang
Jake Zhao
59
359
0
27 May 2019
AI-GAs: AI-generating algorithms, an alternate paradigm for producing
  general artificial intelligence
AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence
Jeff Clune
34
118
0
27 May 2019
Where's My Head? Definition, Dataset and Models for Numeric Fused-Heads
  Identification and Resolution
Where's My Head? Definition, Dataset and Models for Numeric Fused-Heads Identification and Resolution
Yanai Elazar
Yoav Goldberg
45
23
0
26 May 2019
TIGS: An Inference Algorithm for Text Infilling with Gradient Search
TIGS: An Inference Algorithm for Text Infilling with Gradient Search
Dayiheng Liu
Jie Fu
Pengfei Liu
Jiancheng Lv
DiffM
34
27
0
26 May 2019
Hashing based Answer Selection
Hashing based Answer Selection
Dong Xu
Wu-Jun Li
24
6
0
26 May 2019
Stochastic Shared Embeddings: Data-driven Regularization of Embedding
  Layers
Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers
Liwei Wu
Shuqing Li
Cho-Jui Hsieh
James Sharpnack
38
31
0
25 May 2019
Human vs. Muppet: A Conservative Estimate of Human Performance on the
  GLUE Benchmark
Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark
Nikita Nangia
Samuel R. Bowman
ELM
ALM
34
75
0
24 May 2019
Discrete Flows: Invertible Generative Models of Discrete Data
Discrete Flows: Invertible Generative Models of Discrete Data
Dustin Tran
Keyon Vafa
Kumar Krishna Agrawal
Laurent Dinh
Ben Poole
DRL
29
115
0
24 May 2019
BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions
BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions
Christopher Clark
Kenton Lee
Ming-Wei Chang
Tom Kwiatkowski
Michael Collins
Kristina Toutanova
101
1,425
0
24 May 2019
Personalizing Dialogue Agents via Meta-Learning
Personalizing Dialogue Agents via Meta-Learning
Zhaojiang Lin
Andrea Madotto
Chien-Sheng Wu
Pascale Fung
79
181
0
24 May 2019
Zero-shot Knowledge Transfer via Adversarial Belief Matching
Zero-shot Knowledge Transfer via Adversarial Belief Matching
P. Micaelli
Amos Storkey
19
228
0
23 May 2019
Misspelling Oblivious Word Embeddings
Misspelling Oblivious Word Embeddings
Bora Edizel
Aleksandra Piktus
Piotr Bojanowski
Rui A. Ferreira
Edouard Grave
Fabrizio Silvestri
27
63
0
23 May 2019
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire
  Evacuation Environment
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment
Jivitesh Sharma
Per-Arne Andersen
Ole-Christoffer Granmo
M. G. Olsen
AI4CE
46
68
0
23 May 2019
An Investigation of Transfer Learning-Based Sentiment Analysis in
  Japanese
An Investigation of Transfer Learning-Based Sentiment Analysis in Japanese
Enkhbold Bataa
Joshua Wu
29
33
0
23 May 2019
Data-Efficient Image Recognition with Contrastive Predictive Coding
Data-Efficient Image Recognition with Contrastive Predictive Coding
Olivier J. Hénaff
A. Srinivas
J. Fauw
Ali Razavi
Carl Doersch
S. M. Ali Eslami
Aaron van den Oord
SSL
72
1,419
0
22 May 2019
Deeper Text Understanding for IR with Contextual Neural Language
  Modeling
Deeper Text Understanding for IR with Contextual Neural Language Modeling
Zhuyun Dai
Jamie Callan
21
445
0
22 May 2019
AMR Parsing as Sequence-to-Graph Transduction
AMR Parsing as Sequence-to-Graph Transduction
Sheng Zhang
Xutai Ma
Kevin Duh
Benjamin Van Durme
36
148
0
21 May 2019
Answering while Summarizing: Multi-task Learning for Multi-hop QA with
  Evidence Extraction
Answering while Summarizing: Multi-task Learning for Multi-hop QA with Evidence Extraction
Kosuke Nishida
Kyosuke Nishida
Masaaki Nagata
Atsushi Otsuka
Itsumi Saito
Hisako Asano
J. Tomita
RALM
24
102
0
21 May 2019
Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate
  Representation
Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation
Jiaqi Guo
Zecheng Zhan
Yan Gao
Yan Xiao
Jian-Guang Lou
Ting Liu
Dongmei Zhang
25
374
0
20 May 2019
Interpretable Neural Predictions with Differentiable Binary Variables
Interpretable Neural Predictions with Differentiable Binary Variables
Jasmijn Bastings
Wilker Aziz
Ivan Titov
43
212
0
20 May 2019
HellaSwag: Can a Machine Really Finish Your Sentence?
HellaSwag: Can a Machine Really Finish Your Sentence?
Rowan Zellers
Ari Holtzman
Yonatan Bisk
Ali Farhadi
Yejin Choi
35
2,289
0
19 May 2019
Human-like machine thinking: Language guided imagination
Human-like machine thinking: Language guided imagination
Feng Qi
Wenchuan Wu
AI4CE
MLLM
16
5
0
18 May 2019
Story Ending Prediction by Transferable BERT
Story Ending Prediction by Transferable BERT
Zhongyang Li
Xiao Ding
Ting Liu
39
52
0
17 May 2019
Previous
123...382383384...386387388
Next