ResearchTrend.AI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 19,366 papers shown
BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer
Guan-Lin Chao
Ian Lane
6
103
0
05 Jul 2019
Graph Representation Learning via Hard and Channel-Wise Attention Networks
Hongyang Gao
Shuiwang Ji
GNN
25
57
0
05 Jul 2019
Invariant Risk Minimization
Martín Arjovsky
Léon Bottou
Ishaan Gulrajani
David Lopez-Paz
OOD
116
2,177
0
05 Jul 2019
Multi-lingual Intent Detection and Slot Filling in a Joint BERT-based Model
Giuseppe Castellucci
Valentina Bellomaria
Andrea Favalli
Raniero Romagnoli
VLM
19
73
0
05 Jul 2019
Head-Driven Phrase Structure Grammar Parsing on Penn Treebank
Junru Zhou
Zhao Hai
47
144
0
05 Jul 2019
Improving Chemical Named Entity Recognition in Patents with Contextualized Word Embeddings
Zenan Zhai
Dat Quoc Nguyen
S. Akhondi
Camilo Thorne
Christian Druckenbrodt
Trevor Cohn
M. Gregory
Karin Verspoor
14
42
0
05 Jul 2019
Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study
Derek Howard
M. Maslej
Justin Lee
Jacob Ritchie
G. Woollard
L. French
AI4MH
26
30
0
04 Jul 2019
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems
Hung Le
Doyen Sahoo
Nancy F. Chen
Guosheng Lin
22
111
0
02 Jul 2019
Few-Shot Representation Learning for Out-Of-Vocabulary Words
Ziniu Hu
Ting-Li Chen
Kai-Wei Chang
Yizhou Sun
40
76
0
01 Jul 2019
Patent Claim Generation by Fine-Tuning OpenAI GPT-2
Jieh-Sheng Lee
J. Hsiang
21
145
0
01 Jul 2019
ICDAR 2019 Competition on Scene Text Visual Question Answering
Ali Furkan Biten
Rubèn Pérez Tito
Andrés Mafla
Lluís Gómez
Marçal Rusiñol
Minesh Mathew
C. V. Jawahar
Ernest Valveny
Dimosthenis Karatzas
24
76
0
30 Jun 2019
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language Recognition
Shaoshi Ling
Julian Salazar
Yuzong Liu
Katrin Kirchhoff
SSL
33
28
0
30 Jun 2019
Self-Supervised Dialogue Learning
Jiawei Wu
Xin Eric Wang
William Yang Wang
SSL
19
58
0
30 Jun 2019
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting
Shiyang Li
Xiaoyong Jin
Yao Xuan
Xiyou Zhou
Wenhu Chen
Yu Wang
Xifeng Yan
AI4TS
26
1,391
0
29 Jun 2019
Deep Gamblers: Learning to Abstain with Portfolio Theory
Liu Ziyin
Zhikang T. Wang
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
Masahito Ueda
40
111
0
29 Jun 2019
GPT-based Generation for Classical Chinese Poetry
Yi-Lun Liao
Yasheng Wang
Qun Liu
Xin Jiang
29
40
0
29 Jun 2019
Relating Simple Sentence Representations in Deep Neural Networks and the Brain
Sharmistha Jat
Hao Tang
Partha P. Talukdar
Tom Michael Mitchell
22
21
0
27 Jun 2019
Good Secretaries, Bad Truck Drivers? Occupational Gender Stereotypes in Sentiment Analysis
J. Bhaskaran
Isha Bhallamudi
27
47
0
24 Jun 2019
Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation
Daniel Loureiro
A. Jorge
24
138
0
24 Jun 2019
LIAAD at SemDeep-5 Challenge: Word-in-Context (WiC)
Daniel Loureiro
A. Jorge
22
17
0
24 Jun 2019
Classification and Clustering of Arguments with Contextualized Word Embeddings
Nils Reimers
Benjamin Schiller
Tilman Beck
Johannes Daxenberger
Christian Stab
Iryna Gurevych
22
165
0
24 Jun 2019
EQuANt (Enhanced Question Answer Network)
François-Xavier Aubet
D. Danks
Yuchen Zhu
26
3
0
24 Jun 2019
Evaluating the Supervised and Zero-shot Performance of Multi-lingual Translation Models
Chris Hokamp
John Glover
D. Ghalandari
26
14
0
24 Jun 2019
Deep Leakage from Gradients
Ligeng Zhu
Zhijian Liu
Song Han
FedML
43
2,169
0
21 Jun 2019
Graph Star Net for Generalized Multi-Task Learning
H. Lu
Seth H. Huang
Tian Ye
Xiuyan Guo
GNN
33
46
0
21 Jun 2019
SMILES-X: autonomous molecular compounds characterization for small datasets without descriptors
G. Lambard
Ekaterina Gracheva
27
21
0
20 Jun 2019
Learning Compressed Sentence Representations for On-Device Text Processing
Dinghan Shen
Pengyu Cheng
Dhanasekar Sundararaman
Xinyuan Zhang
Qian Yang
Meng Tang
Asli Celikyilmaz
Lawrence Carin
23
22
0
19 Jun 2019
SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly Representative Neural Architectures
Hsin-Pai Cheng
Tunhou Zhang
Yukun Yang
Feng Yan
Shiyu Li
Harris Teague
H. Li
Yiran Chen
25
11
0
19 Jun 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
124
8,361
0
19 Jun 2019
Evaluating Protein Transfer Learning with TAPE
Roshan Rao
Nicholas Bhattacharya
Neil Thomas
Yan Duan
Xi Chen
John F. Canny
Pieter Abbeel
Yun S. Song
SSL
61
783
0
19 Jun 2019
Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction
Christoph Alt
Marc Hübner
Leonhard Hennig
20
119
0
19 Jun 2019
Improving Sentiment Analysis with Multi-task Learning of Negation
Jeremy Barnes
Erik Velldal
Lilja Øvrelid
26
36
0
18 Jun 2019
Zero-Shot Entity Linking by Reading Entity Descriptions
Lajanugen Logeswaran
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
Jacob Devlin
Honglak Lee
VLM
17
252
0
18 Jun 2019
Measuring Bias in Contextualized Word Representations
Keita Kurita
Nidhi Vyas
Ayush Pareek
A. Black
Yulia Tsvetkov
63
448
0
18 Jun 2019
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models
Wei Fang
Yu-An Chung
James R. Glass
26
27
0
17 Jun 2019
Coherent and Controllable Outfit Generation
Kedan Li
Chen Liu
David A. Forsyth
51
15
0
17 Jun 2019
Open Domain Event Extraction Using Neural Latent Variable Models
Xiao Liu
Heyan Huang
Yue Zhang
BDL
DRL
27
57
0
17 Jun 2019
ParNet: Position-aware Aggregated Relation Network for Image-Text matching
Yaxian Xia
Lun Huang
Wenmin Wang
Xiao-Yong Wei
Jie Chen
32
1
0
17 Jun 2019
Meta-learning Pseudo-differential Operators with Deep Neural Networks
Jordi Feliu-Fabà
Yuwei Fan
Lexing Ying
24
39
0
16 Jun 2019
One Epoch Is All You Need
Aran Komatsuzaki
29
50
0
16 Jun 2019
Multi-Hop Paragraph Retrieval for Open-Domain Question Answering
Yair Feldman
Ran El-Yaniv
RALM
32
100
0
15 Jun 2019
Context is Key: Grammatical Error Detection with Contextual Word Representations
Samuel J. Bell
H. Yannakoudakis
Marek Rei
37
41
0
15 Jun 2019
Can neural networks understand monotonicity reasoning?
Hitomi Yanaka
K. Mineshima
D. Bekki
Kentaro Inui
Satoshi Sekine
Lasha Abzianidze
Johan Bos
LRM
41
80
0
15 Jun 2019
Scalable Syntax-Aware Language Models Using Knowledge Distillation
A. Kuncoro
Chris Dyer
Laura Rimell
S. Clark
Phil Blunsom
40
26
0
14 Jun 2019
"My Way of Telling a Story": Persona based Grounded Story Generation
Shrimai Prabhumoye
Khyathi Chandu
Ruslan Salakhutdinov
A. Black
32
35
0
14 Jun 2019
Augmenting Neural Networks with First-order Logic
Tao Li
Vivek Srikumar
21
109
0
14 Jun 2019
A Simple and Effective Approach to Automatic Post-Editing with Transfer Learning
Gonçalo M. Correia
André F. T. Martins
19
42
0
14 Jun 2019
DocRED: A Large-Scale Document-Level Relation Extraction Dataset
Yuan Yao
Deming Ye
Peng Li
Xu Han
Yankai Lin
Zhenghao Liu
Zhiyuan Liu
Lixin Huang
Jie Zhou
Maosong Sun
22
448
0
14 Jun 2019
Learning to Ask Unanswerable Questions for Machine Reading Comprehension
Haichao Zhu
Li Dong
Furu Wei
Wenhui Wang
Bing Qin
Ting Liu
RALM
26
31
0
14 Jun 2019
Image Captioning: Transforming Objects into Words
Simão Herdade
Armin Kappeler
K. Boakye
Joao Soares
ViT
62
464
0
14 Jun 2019