ResearchTrend.AI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 18,282 papers shown
Synthetic QA Corpora Generation with Roundtrip Consistency
Chris Alberti
D. Andor
Emily Pitler
Jacob Devlin
Michael Collins
SyDa
36
244
0
12 Jun 2019
Neural Arabic Question Answering
Hussein Mozannar
Karl El Hajal
Elie Maamary
Hazem M. Hajj
21
134
0
12 Jun 2019
Explore, Propose, and Assemble: An Interpretable Model for Multi-Hop Reading Comprehension
Yichen Jiang
Nitish Joshi
Yen-Chun Chen
Joey Tianyi Zhou
RALM
24
39
0
12 Jun 2019
Toward Interpretable Music Tagging with Self-Attention
Minz Won
Sanghyuk Chun
Xavier Serra
ViT
18
80
0
12 Jun 2019
Learning the Graphical Structure of Electronic Health Records with Graph Convolutional Transformer
Edward Choi
Zhen Xu
Yujia Li
Michael W. Dusenberry
Gerardo Flores
Yuan Xue
Andrew M. Dai
MedIm
24
238
0
11 Jun 2019
Retrieve, Read, Rerank: Towards End-to-End Multi-Document Reading Comprehension
Minghao Hu
Yuxing Peng
Zhen Huang
Dongsheng Li
RALM
14
58
0
11 Jun 2019
Modeling Sentiment Dependencies with Graph Convolutional Networks for Aspect-level Sentiment Classification
Pinlong Zhao
Linlin Hou
Ou Wu
GNN
35
172
0
11 Jun 2019
Future Data Helps Training: Modeling Future Contexts for Session-based Recommendation
Fajie Yuan
Xiangnan He
Haochuan Jiang
G. Guo
Jian Xiong
Zhezhao Xu
Yilin Xiong
AI4TS
18
103
0
11 Jun 2019
Self-Supervised Learning for Contextualized Extractive Summarization
Hong Wang
Xin Eric Wang
Wenhan Xiong
Mo Yu
Xiaoxiao Guo
Shiyu Chang
William Yang Wang
SSL
37
56
0
11 Jun 2019
Lightweight and Efficient Neural Natural Language Processing with Quaternion Networks
Yi Tay
Aston Zhang
Anh Tuan Luu
J. Rao
Shuai Zhang
Shuohang Wang
Jie Fu
S. Hui
23
55
0
11 Jun 2019
DoubleTransfer at MEDIQA 2019: Multi-Source Transfer Learning for Natural Language Understanding in the Medical Domain
Yichong Xu
Xiaodong Liu
Chunyuan Li
Hoifung Poon
Jianfeng Gao
MedIm
24
15
0
11 Jun 2019
What Does BERT Look At? An Analysis of BERT's Attention
Kevin Clark
Urvashi Khandelwal
Omer Levy
Christopher D. Manning
MILM
120
1,584
0
11 Jun 2019
GLTR: Statistical Detection and Visualization of Generated Text
Sebastian Gehrmann
Hendrik Strobelt
Alexander M. Rush
DeLMO
17
519
0
10 Jun 2019
CAiRE_HKUST at SemEval-2019 Task 3: Hierarchical Attention for Dialogue Emotion Classification
Genta Indra Winata
Andrea Madotto
Zhaojiang Lin
Jamin Shin
Yan Xu
Peng Xu
Pascale Fung
23
22
0
10 Jun 2019
Open-Domain Targeted Sentiment Analysis via Span-Based Extraction and Classification
Minghao Hu
Yuxing Peng
Zhen Huang
Dongsheng Li
Yiwei Lv
19
189
0
10 Jun 2019
Gendered Pronoun Resolution using BERT and an extractive question answering formulation
Rakesh Chada
FaML
19
10
0
09 Jun 2019
Hierarchical Taxonomy-Aware and Attentional Graph Capsule RCNNs for Large-Scale Multi-Label Text Classification
Hao Peng
Jianxin Li
Qiran Gong
Senzhang Wang
Lifang He
Bo Li
Lihong Wang
Philip S. Yu
GNN
14
141
0
09 Jun 2019
Leveraging BERT for Extractive Text Summarization on Lectures
Derek Miller
21
242
0
07 Jun 2019
Analyzing the Structure of Attention in a Transformer Language Model
Jesse Vig
Yonatan Belinkov
30
357
0
07 Jun 2019
From Caesar Cipher to Unsupervised Learning: A New Method for Classifier Parameter Estimation
Yu Liu
Li Deng
Jianshu Chen
C. Chen
SSL
26
0
0
06 Jun 2019
Conversing by Reading: Contentful Neural Conversation with On-demand Machine Reading
Lianhui Qin
Michel Galley
Chris Brockett
Xiaodong Liu
Xiang Gao
W. Dolan
Yejin Choi
Jianfeng Gao
29
109
0
06 Jun 2019
Visualizing and Measuring the Geometry of BERT
Andy Coenen
Emily Reif
Ann Yuan
Been Kim
Adam Pearce
F. Viégas
Martin Wattenberg
MILM
43
415
0
06 Jun 2019
Cross-Lingual Syntactic Transfer through Unsupervised Adaptation of Invertible Projections
Junxian He
Zhisong Zhang
Taylor Berg-Kirkpatrick
Graham Neubig
33
21
0
06 Jun 2019
Unsupervised Pivot Translation for Distant Languages
Yichong Leng
Xu Tan
Tao Qin
Xiang-Yang Li
Tie-Yan Liu
33
30
0
06 Jun 2019
Extracting Symptoms and their Status from Clinical Conversations
Nan Du
Kai Chen
Anjuli Kannan
Linh Tran
Yuhui Chen
Izhak Shafran
20
68
0
05 Jun 2019
Large-Scale Multi-Label Text Classification on EU Legislation
Ilias Chalkidis
Manos Fergadiotis
Prodromos Malakasiotis
Ion Androutsopoulos
AILaw
19
214
0
05 Jun 2019
From Balustrades to Pierre Vinken: Looking for Syntax in Transformer Self-Attentions
David Marecek
Rudolf Rosa
28
52
0
05 Jun 2019
The Secrets of Machine Learning: Ten Things You Wish You Had Known Earlier to be More Effective at Data Analysis
Cynthia Rudin
David Carlson
HAI
30
34
0
04 Jun 2019
KERMIT: Generative Insertion-Based Modeling for Sequences
William Chan
Nikita Kitaev
Kelvin Guu
Mitchell Stern
Jakob Uszkoreit
VLM
23
65
0
04 Jun 2019
Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual Evaluation
Benjamin Heinzerling
Michael Strube
16
35
0
04 Jun 2019
How multilingual is Multilingual BERT?
Telmo Pires
Eva Schlinger
Dan Garrette
LRM
VLM
95
1,374
0
04 Jun 2019
Converse Attention Knowledge Transfer for Low-Resource Named Entity Recognition
Shengfei Lyu
Linghao Sun
Huixiong Yi
Yong-jin Liu
Huanhuan Chen
Steven C. H. Hoi
21
0
0
04 Jun 2019
Detecting Local Insights from Global Labels: Supervised & Zero-Shot Sequence Labeling via a Convolutional Decomposition
A. Schmaltz
27
8
0
04 Jun 2019
Episodic Memory in Lifelong Language Learning
Cyprien de Masson d'Autume
Sebastian Ruder
Lingpeng Kong
Dani Yogatama
CLL
KELM
34
281
0
03 Jun 2019
Learning Representations by Maximizing Mutual Information Across Views
Philip Bachman
R. Devon Hjelm
William Buchwalter
SSL
105
1,459
0
03 Jun 2019
Masked Non-Autoregressive Image Captioning
Junlong Gao
Xi Meng
Shiqi Wang
Xia Li
Shanshe Wang
Siwei Ma
Wen Gao
19
36
0
03 Jun 2019
BAYHENN: Combining Bayesian Deep Learning and Homomorphic Encryption for Secure DNN Inference
Peichen Xie
Bingzhe Wu
Guangyu Sun
BDL
FedML
16
33
0
03 Jun 2019
Efficient 8-Bit Quantization of Transformer Neural Machine Language Translation Model
Aishwarya Bhandare
Vamsi Sripathi
Deepthi Karkada
Vivek V. Menon
Sun Choi
Kushal Datta
V. Saletore
MQ
30
131
0
03 Jun 2019
A Survey of Natural Language Generation Techniques with a Focus on Dialogue Systems - Past, Present and Future Directions
Sashank Santhanam
Samira Shaikh
3DV
31
52
0
02 Jun 2019
Pretraining Methods for Dialog Context Representation Learning
Shikib Mehri
E. Razumovskaia
Tiancheng Zhao
M. Eskénazi
22
84
0
02 Jun 2019
Adversarial Generation and Encoding of Nested Texts
A. Rozental
GAN
19
0
0
01 Jun 2019
Scoring Sentence Singletons and Pairs for Abstractive Summarization
Logan Lebanoff
Kaiqiang Song
Franck Dernoncourt
Doo Soon Kim
Seokhwan Kim
W. Chang
Fei Liu
CVBM
30
103
0
31 May 2019
Do Human Rationales Improve Machine Explanations?
Julia Strout
Ye Zhang
Raymond J. Mooney
19
57
0
31 May 2019
Investigating an Effective Character-level Embedding in Korean Sentence Classification
Won Ik Cho
Seokhwan Kim
N. Kim
28
8
0
31 May 2019
MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension
Alon Talmor
Jonathan Berant
20
172
0
31 May 2019
Fine-Grained Spoiler Detection from Large-Scale Review Corpora
Mengting Wan
Rishabh Misra
Ndapandula Nakashole
Julian McAuley
9
130
0
31 May 2019
Rewarding Smatch: Transition-Based AMR Parsing with Reinforcement Learning
Tahira Naseem
Abhishek Shah
Hui Wan
Radu Florian
Salim Roukos
Miguel Ballesteros
25
59
0
31 May 2019
A Lightweight Recurrent Network for Sequence Modeling
Biao Zhang
Rico Sennrich
27
7
0
30 May 2019
Unbabel's Submission to the WMT2019 APE Shared Task: BERT-based Encoder-Decoder for Automatic Post-Editing
António Vilarinho Lopes
M. Amin Farajian
Gonçalo M. Correia
Jonay Trénous
André F. T. Martins
33
35
0
30 May 2019
Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention
Wenhu Chen
Jianshu Chen
Pengda Qin
Xifeng Yan
William Yang Wang
31
129
0
30 May 2019