ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 19,144 papers shown
Title
Visual Interaction with Deep Learning Models through Collaborative
  Semantic Inference
Visual Interaction with Deep Learning Models through Collaborative Semantic Inference
Sebastian Gehrmann
Hendrik Strobelt
Robert Krüger
Hanspeter Pfister
Alexander M. Rush
HAI
21
57
0
24 Jul 2019
SpanBERT: Improving Pre-training by Representing and Predicting Spans
SpanBERT: Improving Pre-training by Representing and Predicting Spans
Mandar Joshi
Danqi Chen
Yinhan Liu
Daniel S. Weld
Luke Zettlemoyer
Omer Levy
80
1,947
0
24 Jul 2019
Unbabel's Participation in the WMT19 Translation Quality Estimation
  Shared Task
Unbabel's Participation in the WMT19 Translation Quality Estimation Shared Task
Fabio Kepler
Jonay Trénous
Marcos Vinícius Treviso
M. Vera
António Góis
M. Amin Farajian
António Vilarinho Lopes
André F. T. Martins
28
58
0
24 Jul 2019
Tripartite Heterogeneous Graph Propagation for Large-scale Social
  Recommendation
Tripartite Heterogeneous Graph Propagation for Large-scale Social Recommendation
KyungHyun Kim
Donghyun Kwak
Hanock Kwak
Young-Jin Park
Sangkwon Sim
Jae-Han Cho
Minkyu Kim
Jihun Kwon
Nako Sung
Jung-Woo Ha
13
19
0
24 Jul 2019
Zero-Shot Sign Language Recognition: Can Textual Data Uncover Sign
  Languages?
Zero-Shot Sign Language Recognition: Can Textual Data Uncover Sign Languages?
Yunus Can Bilge
Nazli Ikizler-Cinbis
R. G. Cinbis
SLR
29
29
0
24 Jul 2019
CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context
  in Morphology
CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology
Aditi Chaudhary
Elizabeth Salesky
G. Bhat
David R. Mortensen
J. Carbonell
Yulia Tsvetkov
29
4
0
23 Jul 2019
EmotionX-HSU: Adopting Pre-trained BERT for Emotion Classification
EmotionX-HSU: Adopting Pre-trained BERT for Emotion Classification
Li Luo
Yue Wang
39
26
0
23 Jul 2019
BEHRT: Transformer for Electronic Health Records
BEHRT: Transformer for Electronic Health Records
Yikuan Li
Shishir Rao
J. R. A. Solares
A. Hassaine
D. Canoy
Yajie Zhu
K. Rahimi
G. Salimi-Khorshidi
OOD
45
447
0
22 Jul 2019
Emotion Detection in Text: Focusing on Latent Representation
Emotion Detection in Text: Focusing on Latent Representation
Armin Seyeditabari
N. Tabari
Shafie Gholizadeh
Wlodek Zadrozny
25
14
0
22 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of
  Tasks, Datasets, and Methods
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
35
133
0
22 Jul 2019
ELI5: Long Form Question Answering
ELI5: Long Form Question Answering
Angela Fan
Yacine Jernite
Ethan Perez
David Grangier
Jason Weston
Michael Auli
AI4MH
ELM
43
601
0
22 Jul 2019
GEAR: Graph-based Evidence Aggregating and Reasoning for Fact
  Verification
GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification
Jie Zhou
Xu Han
Cheng Yang
Zhiyuan Liu
Lifeng Wang
Changcheng Li
Maosong Sun
24
198
0
22 Jul 2019
ER-AE: Differentially Private Text Generation for Authorship
  Anonymization
ER-AE: Differentially Private Text Generation for Authorship Anonymization
Haohan Bo
Steven H. H. Ding
Benjamin C. M. Fung
Farkhund Iqbal
DeLMO
39
38
0
20 Jul 2019
What is this Article about? Extreme Summarization with Topic-aware
  Convolutional Neural Networks
What is this Article about? Extreme Summarization with Topic-aware Convolutional Neural Networks
Shashi Narayan
Shay B. Cohen
Mirella Lapata
AILaw
42
18
0
19 Jul 2019
Structure-Invariant Testing for Machine Translation
Structure-Invariant Testing for Machine Translation
Pinjia He
Clara Meister
Z. Su
32
104
0
19 Jul 2019
WriterForcing: Generating more interesting story endings
WriterForcing: Generating more interesting story endings
Prakhar Gupta
Vinayshekhar Bannihatti Kumar
Mukul Bhutani
A. Black
40
18
0
18 Jul 2019
Joint Learning of Named Entity Recognition and Entity Linking
Joint Learning of Named Entity Recognition and Entity Linking
Pedro Henrique Martins
Zita Marinho
André F. T. Martins
69
93
0
18 Jul 2019
Deep Neural Models for Medical Concept Normalization in User-Generated
  Texts
Deep Neural Models for Medical Concept Normalization in User-Generated Texts
Z. Miftahutdinov
E. Tutubalina
MedIm
23
44
0
18 Jul 2019
What Should/Do/Can LSTMs Learn When Parsing Auxiliary Verb
  Constructions?
What Should/Do/Can LSTMs Learn When Parsing Auxiliary Verb Constructions?
Miryam de Lhoneux
Sara Stymne
Joakim Nivre
12
3
0
18 Jul 2019
Probing Neural Network Comprehension of Natural Language Arguments
Probing Neural Network Comprehension of Natural Language Arguments
Timothy Niven
Hung-Yu kao
AAML
45
453
0
17 Jul 2019
Fake News Detection as Natural Language Inference
Fake News Detection as Natural Language Inference
Kai-Chou Yang
Timothy Niven
Hung-Yu kao
18
36
0
17 Jul 2019
DeepTrax: Embedding Graphs of Financial Transactions
DeepTrax: Embedding Graphs of Financial Transactions
C. Bayan Bruss
Anish Khazane
Jonathan Rider
R. Serpe
Antonia Gogoglou
Keegan E. Hines
AIFin
GNN
32
43
0
16 Jul 2019
Multi-modal Sentiment Analysis using Deep Canonical Correlation Analysis
Multi-modal Sentiment Analysis using Deep Canonical Correlation Analysis
Zhongkai Sun
P. Sarma
W. Sethares
E. Bucy
24
23
0
15 Jul 2019
Myers-Briggs Personality Classification and Personality-Specific
  Language Generation Using Pre-trained Language Models
Myers-Briggs Personality Classification and Personality-Specific Language Generation Using Pre-trained Language Models
Sedrick Scott Keh
Immensee Cheng
44
49
0
15 Jul 2019
A Novel User Representation Paradigm for Making Personalized Candidate
  Retrieval
A Novel User Representation Paradigm for Making Personalized Candidate Retrieval
Zheng Liu
Yu Xing
Jianxun Lian
Defu Lian
Ziyao Li
Xing Xie
38
3
0
15 Jul 2019
TWEETQA: A Social Media Focused Question Answering Dataset
TWEETQA: A Social Media Focused Question Answering Dataset
Wenhan Xiong
Jiawei Wu
Hong Wang
Vivek Kulkarni
Mo Yu
Shiyu Chang
Xiaoxiao Guo
William Yang Wang
26
75
0
14 Jul 2019
Task Selection Policies for Multitask Learning
Task Selection Policies for Multitask Learning
John Glover
Chris Hokamp
OffRL
29
7
0
14 Jul 2019
Microsoft Translator at WMT 2019: Towards Large-Scale Document-Level
  Neural Machine Translation
Microsoft Translator at WMT 2019: Towards Large-Scale Document-Level Neural Machine Translation
Marcin Junczys-Dowmunt
21
156
0
14 Jul 2019
The University of Edinburgh's Submissions to the WMT19 News Translation
  Task
The University of Edinburgh's Submissions to the WMT19 News Translation Task
Rachel Bawden
Nikolay Bogoychev
Ulrich Germann
Roman Grundkiewicz
Faheem Kirefu
Antonio Valerio Miceli Barone
Alexandra Birch
22
32
0
12 Jul 2019
R-Transformer: Recurrent Neural Network Enhanced Transformer
R-Transformer: Recurrent Neural Network Enhanced Transformer
Z. Wang
Yao Ma
Zitao Liu
Jiliang Tang
ViT
24
105
0
12 Jul 2019
LakhNES: Improving multi-instrumental music generation with cross-domain
  pre-training
LakhNES: Improving multi-instrumental music generation with cross-domain pre-training
Chris Donahue
H. H. Mao
Yiting Li
G. Cottrell
Julian McAuley
46
117
0
10 Jul 2019
Sparse Networks from Scratch: Faster Training without Losing Performance
Sparse Networks from Scratch: Faster Training without Losing Performance
Tim Dettmers
Luke Zettlemoyer
20
335
0
10 Jul 2019
BAM! Born-Again Multi-Task Networks for Natural Language Understanding
BAM! Born-Again Multi-Task Networks for Natural Language Understanding
Kevin Clark
Minh-Thang Luong
Urvashi Khandelwal
Christopher D. Manning
Quoc V. Le
30
228
0
10 Jul 2019
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural
  Language Processing
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing
Jian Guo
He He
Tong He
Leonard Lausen
Mu Li
...
Hang Zhang
Zhi-Li Zhang
Zhongyue Zhang
Shuai Zheng
Yi Zhu
VLM
BDL
29
196
0
09 Jul 2019
Transfer Learning from Audio-Visual Grounding to Speech Recognition
Transfer Learning from Audio-Visual Grounding to Speech Recognition
Wei-Ning Hsu
David Harwath
James R. Glass
SSL
26
32
0
09 Jul 2019
To Tune or Not To Tune? How About the Best of Both Worlds?
To Tune or Not To Tune? How About the Best of Both Worlds?
Ran A. Wang
Haibo Su
Chunye Wang
Kailin Ji
J. Ding
VLM
36
17
0
09 Jul 2019
Incorporating Query Term Independence Assumption for Efficient Retrieval
  and Ranking using Deep Neural Networks
Incorporating Query Term Independence Assumption for Efficient Retrieval and Ranking using Deep Neural Networks
Bhaskar Mitra
Corby Rosset
D. Hawking
Nick Craswell
Fernando Diaz
Emine Yilmaz
24
30
0
08 Jul 2019
Improving short text classification through global augmentation methods
Improving short text classification through global augmentation methods
Vukosi Marivate
T. Sefara
VLM
26
95
0
07 Jul 2019
Neural Aspect and Opinion Term Extraction with Mined Rules as Weak
  Supervision
Neural Aspect and Opinion Term Extraction with Mined Rules as Weak Supervision
Hongliang Dai
Yangqiu Song
21
107
0
07 Jul 2019
Graph based Neural Networks for Event Factuality Prediction using
  Syntactic and Semantic Structures
Graph based Neural Networks for Event Factuality Prediction using Syntactic and Semantic Structures
Amir Pouran Ben Veyseh
Thien Huu Nguyen
Dejing Dou
48
45
0
07 Jul 2019
Graph Representation Learning via Hard and Channel-Wise Attention
  Networks
Graph Representation Learning via Hard and Channel-Wise Attention Networks
Hongyang Gao
Shuiwang Ji
GNN
25
57
0
05 Jul 2019
Invariant Risk Minimization
Invariant Risk Minimization
Martín Arjovsky
Léon Bottou
Ishaan Gulrajani
David Lopez-Paz
OOD
116
2,167
0
05 Jul 2019
Multi-lingual Intent Detection and Slot Filling in a Joint BERT-based
  Model
Multi-lingual Intent Detection and Slot Filling in a Joint BERT-based Model
Giuseppe Castellucci
Valentina Bellomaria
Andrea Favalli
Raniero Romagnoli
VLM
19
73
0
05 Jul 2019
Head-Driven Phrase Structure Grammar Parsing on Penn Treebank
Head-Driven Phrase Structure Grammar Parsing on Penn Treebank
Junru Zhou
Zhao Hai
47
144
0
05 Jul 2019
Transfer Learning for Risk Classification of Social Media Posts: Model
  Evaluation Study
Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study
Derek Howard
M. Maslej
Justin Lee
Jacob Ritchie
G. Woollard
L. French
AI4MH
26
30
0
04 Jul 2019
Few-Shot Representation Learning for Out-Of-Vocabulary Words
Few-Shot Representation Learning for Out-Of-Vocabulary Words
Ziniu Hu
Ting-Li Chen
Kai-Wei Chang
Yizhou Sun
40
76
0
01 Jul 2019
Patent Claim Generation by Fine-Tuning OpenAI GPT-2
Patent Claim Generation by Fine-Tuning OpenAI GPT-2
Jieh-Sheng Lee
J. Hsiang
16
145
0
01 Jul 2019
ICDAR 2019 Competition on Scene Text Visual Question Answering
ICDAR 2019 Competition on Scene Text Visual Question Answering
Ali Furkan Biten
Rubèn Pérez Tito
Andrés Mafla
Lluís Gómez
Marçal Rusiñol
Minesh Mathew
C. V. Jawahar
Ernest Valveny
Dimosthenis Karatzas
24
75
0
30 Jun 2019
BERTphone: Phonetically-Aware Encoder Representations for
  Utterance-Level Speaker and Language Recognition
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language Recognition
Shaoshi Ling
Julian Salazar
Yuzong Liu
Katrin Kirchhoff
SSL
33
28
0
30 Jun 2019
Self-Supervised Dialogue Learning
Self-Supervised Dialogue Learning
Jiawei Wu
Xin Eric Wang
William Yang Wang
SSL
19
58
0
30 Jun 2019
Previous
123...375376377...381382383
Next