BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM, SSL, SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,553 papers shown
Title
Microsoft AI Challenge India 2018: Learning to Rank Passages for Web Question Answering with Deep Attention Networks
Chaitanya Sai Alaparthi
41
5
0
14 Jun 2019
Learning to Ask Unanswerable Questions for Machine Reading Comprehension
Haichao Zhu
Li Dong
Furu Wei
Wenhui Wang
Bing Qin
Ting Liu
RALM
70
31
0
14 Jun 2019
Image Captioning: Transforming Objects into Words
Simão Herdade
Armin Kappeler
K. Boakye
Joao Soares
ViT
170
476
0
14 Jun 2019
Stand-Alone Self-Attention in Vision Models
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
VLM, SLR, ViT
184
1,218
0
13 Jun 2019
On the Effect of Word Order on Cross-lingual Sentiment Analysis
Àlex R. Atrio
Toni Badia
Jeremy Barnes
38
4
0
13 Jun 2019
Sentiment analysis is not solved! Assessing and probing sentiment classification
Jeremy Barnes
Lilja Øvrelid
Erik Velldal
67
32
0
13 Jun 2019
Telephonetic: Making Neural Language Models Robust to ASR and Semantic Noise
Christopher Larson
Tarek Lahlou
Diana Mingels
Zachary Kulis
Erik T. Mueller
38
2
0
13 Jun 2019
Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index
Minjoon Seo
Jinhyuk Lee
Tom Kwiatkowski
Ankur P. Parikh
Ali Farhadi
Hannaneh Hajishirzi
RALM
94
157
0
13 Jun 2019
Learning Video Representations using Contrastive Bidirectional Transformer
Chen Sun
Fabien Baradel
Kevin Patrick Murphy
Cordelia Schmid
SSL, ViT
136
134
0
13 Jun 2019
2D Attentional Irregular Scene Text Recognizer
Pengyuan Lyu
Zhicheng Yang
Xinhang Leng
Xiaojun Wu
Ruiyu Li
Xiaoyong Shen
3DV
113
50
0
13 Jun 2019
Proactive Human-Machine Conversation with Explicit Conversation Goals
Wenquan Wu
Zhen Guo
Xiangyang Zhou
Hua Wu
Xiyuan Zhang
Rongzhong Lian
Haifeng Wang
84
195
0
13 Jun 2019
Lattice Transformer for Speech Translation
Pei Zhang
Boxing Chen
Niyu Ge
Kai Fan
80
50
0
13 Jun 2019
Interpretable ICD Code Embeddings with Self- and Mutual-Attention Mechanisms
Dixin Luo
Hongteng Xu
Lawrence Carin
27
5
0
13 Jun 2019
Transfer Learning in Biomedical Natural Language Processing: An Evaluation of BERT and ELMo on Ten Benchmarking Datasets
Yifan Peng
Shankai Yan
Zhiyong Lu
LM&MA, AI4MH
141
847
0
13 Jun 2019
Synthetic QA Corpora Generation with Roundtrip Consistency
Chris Alberti
D. Andor
Emily Pitler
Jacob Devlin
Michael Collins
SyDa
77
249
0
12 Jun 2019
Neural Arabic Question Answering
Hussein Mozannar
Karl El Hajal
Elie Maamary
Hazem M. Hajj
84
136
0
12 Jun 2019
E3: Entailment-driven Extracting and Editing for Conversational Machine Reading
Victor Zhong
Luke Zettlemoyer
67
28
0
12 Jun 2019
COMET: Commonsense Transformers for Automatic Knowledge Graph Construction
Antoine Bosselut
Hannah Rashkin
Maarten Sap
Chaitanya Malaviya
Asli Celikyilmaz
Yejin Choi
161
914
0
12 Jun 2019
A Multiscale Visualization of Attention in the Transformer Model
Jesse Vig
ViT
117
583
0
12 Jun 2019
Explore, Propose, and Assemble: An Interpretable Model for Multi-Hop Reading Comprehension
Yichen Jiang
Nitish Joshi
Yen-Chun Chen
Joey Tianyi Zhou
RALM
73
39
0
12 Jun 2019
Unsupervised Question Answering by Cloze Translation
Patrick Lewis
Ludovic Denoyer
Sebastian Riedel
58
139
0
12 Jun 2019
Toward Interpretable Music Tagging with Self-Attention
Minz Won
Sanghyuk Chun
Xavier Serra
ViT
74
82
0
12 Jun 2019
A Systematic Comparison of English Noun Compound Representations
Vered Shwartz
NAI
36
8
0
11 Jun 2019
Learning the Graphical Structure of Electronic Health Records with Graph Convolutional Transformer
Edward Choi
Zhen Xu
Yujia Li
Michael W. Dusenberry
Gerardo Flores
Yuan Xue
Andrew M. Dai
MedIm
88
246
0
11 Jun 2019
Retrieve, Read, Rerank: Towards End-to-End Multi-Document Reading Comprehension
Minghao Hu
Yuxing Peng
Zhen Huang
Dongsheng Li
RALM
80
59
0
11 Jun 2019
Modeling Sentiment Dependencies with Graph Convolutional Networks for Aspect-level Sentiment Classification
Pinlong Zhao
Linlin Hou
Ou Wu
GNN
69
178
0
11 Jun 2019
Future Data Helps Training: Modeling Future Contexts for Session-based Recommendation
Fajie Yuan
Xiangnan He
Haochuan Jiang
G. Guo
Jian Xiong
Zhezhao Xu
Yilin Xiong
AI4TS
92
104
0
11 Jun 2019
Self-Supervised Learning for Contextualized Extractive Summarization
Hong Wang
Xin Eric Wang
Wenhan Xiong
Mo Yu
Xiaoxiao Guo
Shiyu Chang
William Yang Wang
SSL
117
56
0
11 Jun 2019
Lightweight and Efficient Neural Natural Language Processing with Quaternion Networks
Yi Tay
Aston Zhang
Anh Tuan Luu
J. Rao
Shuai Zhang
Shuohang Wang
Jie Fu
S. Hui
85
57
0
11 Jun 2019
DoubleTransfer at MEDIQA 2019: Multi-Source Transfer Learning for Natural Language Understanding in the Medical Domain
Yichong Xu
Xiaodong Liu
Chunyuan Li
Hoifung Poon
Jianfeng Gao
MedIm
81
15
0
11 Jun 2019
What Does BERT Look At? An Analysis of BERT's Attention
Kevin Clark
Urvashi Khandelwal
Omer Levy
Christopher D. Manning
MILM
319
1,610
0
11 Jun 2019
Performance Analysis and Characterization of Training Deep Learning Models on Mobile Devices
Jie Liu
Jiawen Liu
Wan Du
Dong Li
HAI
52
5
0
10 Jun 2019
Label-Agnostic Sequence Labeling by Copying Nearest Neighbors
Sam Wiseman
K. Stratos
71
65
0
10 Jun 2019
GLTR: Statistical Detection and Visualization of Generated Text
Sebastian Gehrmann
Hendrik Strobelt
Alexander M. Rush
DeLMO
157
548
0
10 Jun 2019
CAiRE_HKUST at SemEval-2019 Task 3: Hierarchical Attention for Dialogue Emotion Classification
Genta Indra Winata
Andrea Madotto
Zhaojiang Lin
Jamin Shin
Yan Xu
Peng Xu
Pascale Fung
92
22
0
10 Jun 2019
A Survey of Reinforcement Learning Informed by Natural Language
Jelena Luketina
Nantas Nardelli
Gregory Farquhar
Jakob N. Foerster
Jacob Andreas
Edward Grefenstette
Shimon Whiteson
Tim Rocktäschel
LM&Ro, KELM, OffRL, LRM
113
282
0
10 Jun 2019
Learning to combine Grammatical Error Corrections
Yoav Kantor
Yoav Katz
Leshem Choshen
Edo Cohen-Karlik
Naftali Liberman
Assaf Toledo
Amir Menczel
Noam Slonim
69
29
0
10 Jun 2019
A Survey on Neural Machine Reading Comprehension
Boyu Qiu
Xu Chen
Jungang Xu
Yingfei Sun
FaML, AIMat
71
31
0
10 Jun 2019
Open-Domain Targeted Sentiment Analysis via Span-Based Extraction and Classification
Minghao Hu
Yuxing Peng
Zhen Huang
Dongsheng Li
Yiwei Lv
124
197
0
10 Jun 2019
Gendered Pronoun Resolution using BERT and an extractive question answering formulation
Rakesh Chada
FaML
52
10
0
09 Jun 2019
Encouraging Paragraph Embeddings to Remember Sentence Identity Improves Classification
Tu Vu
Mohit Iyyer
44
2
0
09 Jun 2019
Probing for Semantic Classes: Diagnosing the Meaning Content of Word Embeddings
Yadollah Yaghoobzadeh
Katharina Kann
Timothy J. Hazen
Eneko Agirre
Hinrich Schütze
83
38
0
09 Jun 2019
A Survey on Neural Network Language Models
Kun Jing
Jungang Xu
58
56
0
09 Jun 2019
Hierarchical Taxonomy-Aware and Attentional Graph Capsule RCNNs for Large-Scale Multi-Label Text Classification
Hao Peng
Jianxin Li
Qiran Gong
Senzhang Wang
Lifang He
Bo Li
Lihong Wang
Philip S. Yu
GNN
77
142
0
09 Jun 2019
Sentence Centrality Revisited for Unsupervised Summarization
Hao Zheng
Mirella Lapata
72
172
0
08 Jun 2019
Real or Fake? Learning to Discriminate Machine from Human Generated Text
A. Bakhtin
Sam Gross
Myle Ott
Yuntian Deng
Marc'Aurelio Ranzato
Arthur Szlam
DeLMO
103
173
0
07 Jun 2019
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips
Antoine Miech
Dimitri Zhukov
Jean-Baptiste Alayrac
Makarand Tapaswi
Ivan Laptev
Josef Sivic
VGen
133
1,212
0
07 Jun 2019
Leveraging BERT for Extractive Text Summarization on Lectures
Derek Miller
68
244
0
07 Jun 2019
Matching the Blanks: Distributional Similarity for Relation Learning
Livio Baldini Soares
Nicholas FitzGerald
Jeffrey Ling
Tom Kwiatkowski
100
777
0
07 Jun 2019
Analyzing the Structure of Attention in a Transformer Language Model
Jesse Vig
Yonatan Belinkov
85
371
0
07 Jun 2019