ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 19,767 papers shown
Title
Multi-modal Sentiment Analysis using Deep Canonical Correlation Analysis
Multi-modal Sentiment Analysis using Deep Canonical Correlation Analysis
Zhongkai Sun
P. Sarma
W. Sethares
E. Bucy
24
23
0
15 Jul 2019
Myers-Briggs Personality Classification and Personality-Specific
  Language Generation Using Pre-trained Language Models
Myers-Briggs Personality Classification and Personality-Specific Language Generation Using Pre-trained Language Models
Sedrick Scott Keh
Immensee Cheng
47
49
0
15 Jul 2019
A Novel User Representation Paradigm for Making Personalized Candidate
  Retrieval
A Novel User Representation Paradigm for Making Personalized Candidate Retrieval
Zheng Liu
Yu Xing
Jianxun Lian
Defu Lian
Ziyao Li
Xing Xie
38
3
0
15 Jul 2019
TWEETQA: A Social Media Focused Question Answering Dataset
TWEETQA: A Social Media Focused Question Answering Dataset
Wenhan Xiong
Jiawei Wu
Hong Wang
Vivek Kulkarni
Mo Yu
Shiyu Chang
Xiaoxiao Guo
William Yang Wang
26
75
0
14 Jul 2019
Task Selection Policies for Multitask Learning
Task Selection Policies for Multitask Learning
John Glover
Chris Hokamp
OffRL
34
7
0
14 Jul 2019
Microsoft Translator at WMT 2019: Towards Large-Scale Document-Level
  Neural Machine Translation
Microsoft Translator at WMT 2019: Towards Large-Scale Document-Level Neural Machine Translation
Marcin Junczys-Dowmunt
21
156
0
14 Jul 2019
The University of Edinburgh's Submissions to the WMT19 News Translation
  Task
The University of Edinburgh's Submissions to the WMT19 News Translation Task
Rachel Bawden
Nikolay Bogoychev
Ulrich Germann
Roman Grundkiewicz
Faheem Kirefu
Antonio Valerio Miceli Barone
Alexandra Birch
22
32
0
12 Jul 2019
R-Transformer: Recurrent Neural Network Enhanced Transformer
R-Transformer: Recurrent Neural Network Enhanced Transformer
Z. Wang
Yao Ma
Zitao Liu
Jiliang Tang
ViT
24
105
0
12 Jul 2019
LakhNES: Improving multi-instrumental music generation with cross-domain
  pre-training
LakhNES: Improving multi-instrumental music generation with cross-domain pre-training
Chris Donahue
H. H. Mao
Yiting Li
G. Cottrell
Julian McAuley
46
117
0
10 Jul 2019
Sparse Networks from Scratch: Faster Training without Losing Performance
Sparse Networks from Scratch: Faster Training without Losing Performance
Tim Dettmers
Luke Zettlemoyer
20
335
0
10 Jul 2019
BAM! Born-Again Multi-Task Networks for Natural Language Understanding
BAM! Born-Again Multi-Task Networks for Natural Language Understanding
Kevin Clark
Minh-Thang Luong
Urvashi Khandelwal
Christopher D. Manning
Quoc V. Le
35
228
0
10 Jul 2019
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural
  Language Processing
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing
Jian Guo
He He
Tong He
Leonard Lausen
Mu Li
...
Hang Zhang
Zhi-Li Zhang
Zhongyue Zhang
Shuai Zheng
Yi Zhu
VLM
BDL
29
196
0
09 Jul 2019
Transfer Learning from Audio-Visual Grounding to Speech Recognition
Transfer Learning from Audio-Visual Grounding to Speech Recognition
Wei-Ning Hsu
David Harwath
James R. Glass
SSL
26
32
0
09 Jul 2019
To Tune or Not To Tune? How About the Best of Both Worlds?
To Tune or Not To Tune? How About the Best of Both Worlds?
Ran A. Wang
Haibo Su
Chunye Wang
Kailin Ji
J. Ding
VLM
36
17
0
09 Jul 2019
Incorporating Query Term Independence Assumption for Efficient Retrieval
  and Ranking using Deep Neural Networks
Incorporating Query Term Independence Assumption for Efficient Retrieval and Ranking using Deep Neural Networks
Bhaskar Mitra
Corby Rosset
D. Hawking
Nick Craswell
Fernando Diaz
Emine Yilmaz
24
30
0
08 Jul 2019
Improving short text classification through global augmentation methods
Improving short text classification through global augmentation methods
Vukosi Marivate
T. Sefara
VLM
28
95
0
07 Jul 2019
Neural Aspect and Opinion Term Extraction with Mined Rules as Weak
  Supervision
Neural Aspect and Opinion Term Extraction with Mined Rules as Weak Supervision
Hongliang Dai
Yangqiu Song
21
107
0
07 Jul 2019
Graph based Neural Networks for Event Factuality Prediction using
  Syntactic and Semantic Structures
Graph based Neural Networks for Event Factuality Prediction using Syntactic and Semantic Structures
Amir Pouran Ben Veyseh
Thien Huu Nguyen
Dejing Dou
51
45
0
07 Jul 2019
BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional
  Encoder Representations from Transformer
BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer
Guan-Lin Chao
Ian Lane
13
103
0
05 Jul 2019
Graph Representation Learning via Hard and Channel-Wise Attention
  Networks
Graph Representation Learning via Hard and Channel-Wise Attention Networks
Hongyang Gao
Shuiwang Ji
GNN
25
57
0
05 Jul 2019
Invariant Risk Minimization
Invariant Risk Minimization
Martín Arjovsky
Léon Bottou
Ishaan Gulrajani
David Lopez-Paz
OOD
116
2,177
0
05 Jul 2019
Multi-lingual Intent Detection and Slot Filling in a Joint BERT-based
  Model
Multi-lingual Intent Detection and Slot Filling in a Joint BERT-based Model
Giuseppe Castellucci
Valentina Bellomaria
Andrea Favalli
Raniero Romagnoli
VLM
24
74
0
05 Jul 2019
Head-Driven Phrase Structure Grammar Parsing on Penn Treebank
Head-Driven Phrase Structure Grammar Parsing on Penn Treebank
Junru Zhou
Zhao Hai
47
144
0
05 Jul 2019
Improving Chemical Named Entity Recognition in Patents with
  Contextualized Word Embeddings
Improving Chemical Named Entity Recognition in Patents with Contextualized Word Embeddings
Zenan Zhai
Dat Quoc Nguyen
S. Akhondi
Camilo Thorne
Christian Druckenbrodt
Trevor Cohn
M. Gregory
Karin Verspoor
14
42
0
05 Jul 2019
Transfer Learning for Risk Classification of Social Media Posts: Model
  Evaluation Study
Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study
Derek Howard
M. Maslej
Justin Lee
Jacob Ritchie
G. Woollard
L. French
AI4MH
26
30
0
04 Jul 2019
Depth Growing for Neural Machine Translation
Depth Growing for Neural Machine Translation
Lijun Wu
Yiren Wang
Yingce Xia
Fei Tian
Fei Gao
Tao Qin
Jianhuang Lai
Tie-Yan Liu
21
41
0
03 Jul 2019
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue
  Systems
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems
Hung Le
Doyen Sahoo
Nancy F. Chen
Guosheng Lin
22
111
0
02 Jul 2019
Few-Shot Representation Learning for Out-Of-Vocabulary Words
Few-Shot Representation Learning for Out-Of-Vocabulary Words
Ziniu Hu
Ting-Li Chen
Kai-Wei Chang
Yizhou Sun
40
76
0
01 Jul 2019
Patent Claim Generation by Fine-Tuning OpenAI GPT-2
Patent Claim Generation by Fine-Tuning OpenAI GPT-2
Jieh-Sheng Lee
J. Hsiang
21
147
0
01 Jul 2019
ICDAR 2019 Competition on Scene Text Visual Question Answering
ICDAR 2019 Competition on Scene Text Visual Question Answering
Ali Furkan Biten
Rubèn Pérez Tito
Andrés Mafla
Lluís Gómez
Marçal Rusiñol
Minesh Mathew
C. V. Jawahar
Ernest Valveny
Dimosthenis Karatzas
24
76
0
30 Jun 2019
BERTphone: Phonetically-Aware Encoder Representations for
  Utterance-Level Speaker and Language Recognition
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language Recognition
Shaoshi Ling
Julian Salazar
Yuzong Liu
Katrin Kirchhoff
SSL
33
28
0
30 Jun 2019
Self-Supervised Dialogue Learning
Self-Supervised Dialogue Learning
Jiawei Wu
Xin Eric Wang
William Yang Wang
SSL
19
58
0
30 Jun 2019
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer
  on Time Series Forecasting
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting
Shiyang Li
Xiaoyong Jin
Yao Xuan
Xiyou Zhou
Wenhu Chen
Yu Wang
Xifeng Yan
AI4TS
26
1,391
0
29 Jun 2019
Deep Gamblers: Learning to Abstain with Portfolio Theory
Deep Gamblers: Learning to Abstain with Portfolio Theory
Liu Ziyin
Zhikang T. Wang
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
Masahito Ueda
40
110
0
29 Jun 2019
GPT-based Generation for Classical Chinese Poetry
GPT-based Generation for Classical Chinese Poetry
Yi-Lun Liao
Yasheng Wang
Qun Liu
Xin Jiang
29
40
0
29 Jun 2019
Relating Simple Sentence Representations in Deep Neural Networks and the
  Brain
Relating Simple Sentence Representations in Deep Neural Networks and the Brain
Sharmistha Jat
Hao Tang
Partha P. Talukdar
Tom Michael Mitchell
22
21
0
27 Jun 2019
Good Secretaries, Bad Truck Drivers? Occupational Gender Stereotypes in
  Sentiment Analysis
Good Secretaries, Bad Truck Drivers? Occupational Gender Stereotypes in Sentiment Analysis
J. Bhaskaran
Isha Bhallamudi
27
47
0
24 Jun 2019
Language Modelling Makes Sense: Propagating Representations through
  WordNet for Full-Coverage Word Sense Disambiguation
Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation
Daniel Loureiro
A. Jorge
24
138
0
24 Jun 2019
LIAAD at SemDeep-5 Challenge: Word-in-Context (WiC)
LIAAD at SemDeep-5 Challenge: Word-in-Context (WiC)
Daniel Loureiro
A. Jorge
22
17
0
24 Jun 2019
Classification and Clustering of Arguments with Contextualized Word
  Embeddings
Classification and Clustering of Arguments with Contextualized Word Embeddings
Nils Reimers
Benjamin Schiller
Tilman Beck
Johannes Daxenberger
Christian Stab
Iryna Gurevych
22
166
0
24 Jun 2019
EQuANt (Enhanced Question Answer Network)
EQuANt (Enhanced Question Answer Network)
Franccois-Xavier Aubet
D. Danks
Yuchen Zhu
26
3
0
24 Jun 2019
Evaluating the Supervised and Zero-shot Performance of Multi-lingual
  Translation Models
Evaluating the Supervised and Zero-shot Performance of Multi-lingual Translation Models
Chris Hokamp
John Glover
D. Ghalandari
26
14
0
24 Jun 2019
Deep Leakage from Gradients
Deep Leakage from Gradients
Ligeng Zhu
Zhijian Liu
Song Han
FedML
43
2,176
0
21 Jun 2019
Graph Star Net for Generalized Multi-Task Learning
Graph Star Net for Generalized Multi-Task Learning
H. Lu
Seth H. Huang
Tian Ye
Xiuyan Guo
GNN
35
46
0
21 Jun 2019
SMILES-X: autonomous molecular compounds characterization for small
  datasets without descriptors
SMILES-X: autonomous molecular compounds characterization for small datasets without descriptors
G. Lambard
Ekaterina Gracheva
27
21
0
20 Jun 2019
Learning Compressed Sentence Representations for On-Device Text
  Processing
Learning Compressed Sentence Representations for On-Device Text Processing
Dinghan Shen
Pengyu Cheng
Dhanasekar Sundararaman
Xinyuan Zhang
Qian Yang
Meng Tang
Asli Celikyilmaz
Lawrence Carin
23
22
0
19 Jun 2019
SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly
  Representative Neural Architectures
SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly Representative Neural Architectures
Hsin-Pai Cheng
Tunhou Zhang
Yukun Yang
Feng Yan
Shiyu Li
Harris Teague
H. Li
Yiran Chen
25
11
0
19 Jun 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
129
8,361
0
19 Jun 2019
Evaluating Protein Transfer Learning with TAPE
Evaluating Protein Transfer Learning with TAPE
Roshan Rao
Nicholas Bhattacharya
Neil Thomas
Yan Duan
Xi Chen
John F. Canny
Pieter Abbeel
Yun S. Song
SSL
61
786
0
19 Jun 2019
Fine-tuning Pre-Trained Transformer Language Models to Distantly
  Supervised Relation Extraction
Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction
Christoph Alt
Marc Hübner
Leonhard Hennig
20
119
0
19 Jun 2019
Previous
123...387388389...394395396
Next