ResearchTrend.AI

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM, SSL, SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,491 papers shown
Multilingual Constituency Parsing with Self-Attention and Pre-Training
Nikita Kitaev
Steven Cao
Dan Klein
LRM
93
255
0
31 Dec 2018
A neural joint model for Vietnamese word segmentation, POS tagging and dependency parsing
Dat Quoc Nguyen
81
12
0
30 Dec 2018
Double Neural Counterfactual Regret Minimization
Hui Li
Kailiang Hu
Zhibang Ge
Tao Jiang
Yuan Qi
Le Song
76
52
0
27 Dec 2018
Adversarial Attack and Defense on Graph Data: A Survey
Lichao Sun
Yingtong Dou
Carl Yang
Ji Wang
Yixin Liu
Philip S. Yu
Lifang He
Yangqiu Song
GNN, AAML
139
286
0
26 Dec 2018
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
Mikel Artetxe
Holger Schwenk
3DV
194
1,020
0
26 Dec 2018
Deep Representation Learning for Clustering of Health Tweets
O. Gencoglu
SSL
30
10
0
25 Dec 2018
Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification
Mozhi Zhang
Yoshinari Fujinuma
Jordan L. Boyd-Graber
85
21
0
22 Dec 2018
Joint Slot Filling and Intent Detection via Capsule Neural Networks
Chenwei Zhang
Yaliang Li
Nan Du
Wei Fan
Philip S. Yu
62
234
0
22 Dec 2018
A Survey on Deep Learning for Named Entity Recognition
Junlin Li
Aixin Sun
Jianglei Han
Chenliang Li
3DV
110
1,173
0
22 Dec 2018
Graph Neural Networks: A Review of Methods and Applications
Jie Zhou
Ganqu Cui
Shengding Hu
Zhengyan Zhang
Cheng Yang
Zhiyuan Liu
Lifeng Wang
Changcheng Li
Maosong Sun
AI4CE, GNN
1.2K
5,605
0
20 Dec 2018
Found in Translation: Learning Robust Joint Representations by Cyclic Translations Between Modalities
Hai Pham
Paul Pu Liang
Thomas Manzini
Louis-Philippe Morency
Barnabás Póczós
89
417
0
19 Dec 2018
A Tutorial on Deep Latent Variable Models of Natural Language
Yoon Kim
Sam Wiseman
Alexander M. Rush
BDL, VLM
121
42
0
17 Dec 2018
Conditional BERT Contextual Augmentation
Xing Wu
Shangwen Lv
Liangjun Zang
Jizhong Han
Songlin Hu
77
315
0
17 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Peng Gao
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Hongsheng Li
AIMat
106
368
0
13 Dec 2018
Detecting weak and strong Islamophobic hate speech on social media
Bertie Vidgen
T. Yasseri
92
140
0
12 Dec 2018
SMIT: Stochastic Multi-Label Image-to-Image Translation
Andrés Romero
Pablo Arbelaez
Luc Van Gool
Radu Timofte
73
66
0
10 Dec 2018
SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering
Chenguang Zhu
Michael Zeng
Xuedong Huang
87
125
0
10 Dec 2018
What is the Effect of Importance Weighting in Deep Learning?
Jonathon Byrd
Zachary Chase Lipton
165
467
0
08 Dec 2018
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
182
704
0
06 Dec 2018
Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Tian Shi
Yaser Keneshloo
Naren Ramakrishnan
Chandan K. Reddy
140
234
0
05 Dec 2018
Efficient Attention: Attention with Linear Complexities
Zhuoran Shen
Mingyuan Zhang
Haiyu Zhao
Shuai Yi
Hongsheng Li
138
536
0
04 Dec 2018
Practical Text Classification With Large Pre-Trained Language Models
Neel Kant
Raul Puri
Nikolai Yakovenko
Bryan Catanzaro
VLM
59
68
0
04 Dec 2018
Flexible and Scalable State Tracking Framework for Goal-Oriented Dialogue Systems
Rahul Goel
Shachi Paul
Tagyoung Chung
Jérémie Lecomte
Arindam Mandal
Dilek Z. Hakkani-Tür
61
15
0
30 Nov 2018
Visual Question Answering as Reading Comprehension
Hui Li
Peng Wang
Chunhua Shen
Anton Van Den Hengel
62
41
0
29 Nov 2018
Unsupervised Multi-modal Neural Machine Translation
Yuanhang Su
Kai Fan
Nguyen Bach
C.-C. Jay Kuo
Fei Huang
152
63
0
28 Nov 2018
From Recognition to Cognition: Visual Commonsense Reasoning
Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
LRM, BDL, OCL, ReLM
248
885
0
27 Nov 2018
Synergistic Drug Combination Prediction by Integrating Multi-omics Data in Deep Learning Models
Tianyu Zhang
Liwei Zhang
Philip R. O. Payne
Fuhai Li
45
98
0
16 Nov 2018
Survey of Computational Approaches to Lexical Semantic Change
Nina Tahmasebi
L. Borin
Adam Jatowt
115
164
0
15 Nov 2018
An Introductory Survey on Attention Mechanisms in NLP Problems
Dichao Hu
AIMat
83
247
0
12 Nov 2018
Speech Intention Understanding in a Head-final Language: A Disambiguation Utilizing Intonation-dependency
Won Ik Cho
Hyeon Seung Lee
J. Yoon
Seokhwan Kim
N. Kim
85
5
0
10 Nov 2018
Language GANs Falling Short
Massimo Caccia
Lucas Caccia
W. Fedus
Hugo Larochelle
Joelle Pineau
Laurent Charlin
236
220
0
06 Nov 2018
Elastic CRFs for Open-ontology Slot Filling
Yinpei Dai
Yichi Zhang
Hong Liu
Zhijian Ou
Yanmeng Wang
Junlan Feng
87
2
0
04 Nov 2018
Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs
Gaurav Maheshwari
Priyansh Trivedi
Denis Lukovnikov
Nilesh Chakraborty
Asja Fischer
Jens Lehmann
GNN
87
72
0
02 Nov 2018
Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks
Jason Phang
Thibault Févry
Samuel R. Bowman
125
470
0
02 Nov 2018
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge
Alon Talmor
Jonathan Herzig
Nicholas Lourie
Jonathan Berant
RALM
172
1,754
0
02 Nov 2018
On the Generation of Medical Question-Answer Pairs
Sheng Shen
Yaliang Li
Nan Du
X. Wu
Yusheng Xie
Shen Ge
Tao Yang
Kai Wang
Xin-Fang Liang
Wei Fan
MedIm
61
21
0
01 Nov 2018
A Corpus for Reasoning About Natural Language Grounded in Photographs
Alane Suhr
Stephanie Zhou
Ally Zhang
Iris Zhang
Huajun Bai
Yoav Artzi
LRM
122
610
0
01 Nov 2018
Improving Machine Reading Comprehension with General Reading Strategies
Kai Sun
Dian Yu
Dong Yu
Claire Cardie
AI4CE
87
116
0
31 Oct 2018
Cross-Lingual Transfer Learning for Multilingual Task Oriented Dialog
Sebastian Schuster
S. Gupta
Rushin Shah
M. Lewis
112
286
0
31 Oct 2018
Automated Machine Learning: From Principles to Practices
Quanming Yao
Mengshuo Wang
Hugo Jair Escalante
Huan Zhao
Qiang Yang
121
259
0
31 Oct 2018
Giving Space to Your Message: Assistive Word Segmentation for the Electronic Typing of Digital Minorities
Won Ik Cho
Sung Jun Cheon
Woohyun Kang
Jiwon Kim
N. Kim
41
2
0
31 Oct 2018
GraphIE: A Graph-Based Framework for Information Extraction
Yujie Qian
Enrico Santus
Zhijing Jin
Jiang Guo
Regina Barzilay
GNN
94
113
0
31 Oct 2018
Advancing PICO Element Detection in Biomedical Text via Deep Neural Networks
Di Jin
Peter Szolovits
56
32
0
30 Oct 2018
A Simple Recurrent Unit with Reduced Tensor Product Representations
Shuai Tang
P. Smolensky
V. D. Sa
90
2
0
29 Oct 2018
A Pragmatic Guide to Geoparsing Evaluation
Milan Gritta
Mohammad Taher Pilehvar
Nigel Collier
97
69
0
29 Oct 2018
A Hitchhiker's Guide On Distributed Training of Deep Neural Networks
K. Chahal
Manraj Singh Grover
Kuntal Dey
3DH, OOD
90
54
0
28 Oct 2018
Variational Semi-supervised Aspect-term Sentiment Analysis via Transformer
Xingyi Cheng
Weidi Xu
Taifeng Wang
Wei Chu
63
23
0
24 Oct 2018
Testing the Generalization Power of Neural Network Models Across NLI Benchmarks
Aarne Talman
S. Chatzikyriakidis
ELM
91
48
0
23 Oct 2018
Compositional Coding Capsule Network with K-Means Routing for Text Classification
Hao Ren
Hong-wei Lu
104
53
0
22 Oct 2018
pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference
Mandar Joshi
Eunsol Choi
Omer Levy
Daniel S. Weld
Luke Zettlemoyer
CoGe
67
47
0
20 Oct 2018