ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 18,335 papers shown
Title
PRNet: Self-Supervised Learning for Partial-to-Partial Registration
PRNet: Self-Supervised Learning for Partial-to-Partial Registration
Yue Wang
Justin Solomon
SSL
3DPC
28
379
0
27 Oct 2019
Fair Generative Modeling via Weak Supervision
Fair Generative Modeling via Weak Supervision
Kristy Choi
Aditya Grover
Trisha Singh
Rui Shu
Stefano Ermon
36
133
0
26 Oct 2019
FineText: Text Classification via Attention-based Language Model
  Fine-tuning
FineText: Text Classification via Attention-based Language Model Fine-tuning
Yunzhe Tao
Saurabh Gupta
Satyapriya Krishna
Xiong Zhou
Orchid Majumder
Vineet Khare
21
3
0
25 Oct 2019
Improving Graph Attention Networks with Large Margin-based Constraints
Improving Graph Attention Networks with Large Margin-based Constraints
Guangtao Wang
Rex Ying
Jing-ling Huang
J. Leskovec
22
80
0
25 Oct 2019
Current Limitations in Cyberbullying Detection: on Evaluation Criteria,
  Reproducibility, and Data Scarcity
Current Limitations in Cyberbullying Detection: on Evaluation Criteria, Reproducibility, and Data Scarcity
Chris Emmery
B. Verhoeven
G. Pauw
Gilles Jacobs
Cynthia Van Hee
Els Lefever
Bart Desmet
Véronique Hoste
Walter Daelemans
28
43
0
25 Oct 2019
On the Cross-lingual Transferability of Monolingual Representations
On the Cross-lingual Transferability of Monolingual Representations
Mikel Artetxe
Sebastian Ruder
Dani Yogatama
30
780
0
25 Oct 2019
Evaluation of Sentence Representations in Polish
Evaluation of Sentence Representations in Polish
Slawomir Dadas
Michal Perelkiewicz
Rafal Poswiata
33
13
0
25 Oct 2019
DENS: A Dataset for Multi-class Emotion Analysis
DENS: A Dataset for Multi-class Emotion Analysis
Chen Cecilia Liu
Muhammad Osama
Anderson de Andrade
AI4CE
30
37
0
25 Oct 2019
Meta-Learning with Dynamic-Memory-Based Prototypical Network for
  Few-Shot Event Detection
Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection
Shumin Deng
Ningyu Zhang
Jiaojian Kang
Yichi Zhang
Wei Zhang
Huajun Chen
31
131
0
25 Oct 2019
SpeechBERT: An Audio-and-text Jointly Learned Language Model for
  End-to-end Spoken Question Answering
SpeechBERT: An Audio-and-text Jointly Learned Language Model for End-to-end Spoken Question Answering
Yung-Sung Chuang
Chi-Liang Liu
Hung-yi Lee
Lin-shan Lee
AuLLM
30
39
0
25 Oct 2019
HUBERT Untangles BERT to Improve Transfer across NLP Tasks
HUBERT Untangles BERT to Improve Transfer across NLP Tasks
M. Moradshahi
Hamid Palangi
M. Lam
P. Smolensky
Jianfeng Gao
31
16
0
25 Oct 2019
L2RS: A Learning-to-Rescore Mechanism for Automatic Speech Recognition
L2RS: A Learning-to-Rescore Mechanism for Automatic Speech Recognition
Yuanfeng Song
Di Jiang
Xuefang Zhao
Qian Xu
Raymond Chi-Wing Wong
Lixin Fan
Qiang Yang
29
17
0
25 Oct 2019
The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and
  Cross-Lingual Transfer for Inflection
The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection
Arya D. McCarthy
Ekaterina Vylomova
Shijie Wu
Chaitanya Malaviya
Lawrence Wolf-Sonkin
...
Miikka Silfverberg
Sabrina J. Mielke
Jeffrey Heinz
Ryan Cotterell
Mans Hulden
41
112
0
25 Oct 2019
Mockingjay: Unsupervised Speech Representation Learning with Deep
  Bidirectional Transformer Encoders
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
47
372
0
25 Oct 2019
A Unified MRC Framework for Named Entity Recognition
A Unified MRC Framework for Named Entity Recognition
Xiaoya Li
Jingrong Feng
Yuxian Meng
Qinghong Han
Fei Wu
Jiwei Li
34
629
0
25 Oct 2019
QASC: A Dataset for Question Answering via Sentence Composition
QASC: A Dataset for Question Answering via Sentence Composition
Tushar Khot
Peter Clark
Michal Guerquin
Peter Alexander Jansen
Ashish Sabharwal
CoGe
41
319
0
25 Oct 2019
ÚFAL MRPipe at MRP 2019: UDPipe Goes Semantic in the Meaning
  Representation Parsing Shared Task
ÚFAL MRPipe at MRP 2019: UDPipe Goes Semantic in the Meaning Representation Parsing Shared Task
Milan Straka
Jana Straková
AI4CE
20
7
0
24 Oct 2019
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta
  Reinforcement Learning
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Tianhe Yu
Deirdre Quillen
Zhanpeng He
Ryan Julian
Avnish Narayan
Hayden Shively
Adithya Bellathur
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
103
1,132
0
24 Oct 2019
Selective Attention Based Graph Convolutional Networks for Aspect-Level
  Sentiment Classification
Selective Attention Based Graph Convolutional Networks for Aspect-Level Sentiment Classification
Xiaochen Hou
Jing Huang
Guangtao Wang
Xiaodong He
Bowen Zhou
34
53
0
24 Oct 2019
Hierarchical Transformers for Long Document Classification
Hierarchical Transformers for Long Document Classification
R. Pappagari
Piotr Żelasko
Jesús Villalba
Yishay Carmiel
Najim Dehak
33
239
0
23 Oct 2019
Correction of Automatic Speech Recognition with Transformer
  Sequence-to-sequence Model
Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model
Oleksii Hrinchuk
Mariya Popova
Boris Ginsburg
VLM
20
87
0
23 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
144
19,578
0
23 Oct 2019
Generative Pre-Training for Speech with Autoregressive Predictive Coding
Generative Pre-Training for Speech with Autoregressive Predictive Coding
Yu-An Chung
James R. Glass
SSL
31
173
0
23 Oct 2019
BanditRank: Learning to Rank Using Contextual Bandits
BanditRank: Learning to Rank Using Contextual Bandits
Phanideep Gampa
Sumio Fujita
OffRL
27
10
0
23 Oct 2019
KnowIT VQA: Answering Knowledge-Based Questions about Videos
KnowIT VQA: Answering Knowledge-Based Questions about Videos
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
30
77
0
23 Oct 2019
Depth-Adaptive Transformer
Depth-Adaptive Transformer
Maha Elbayad
Jiatao Gu
Edouard Grave
Michael Auli
19
187
0
22 Oct 2019
Improving Transformer-based Speech Recognition Using Unsupervised
  Pre-training
Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Dongwei Jiang
Xiaoning Lei
Wubo Li
Ne Luo
Yuxuan Hu
Wei Zou
Xiangang Li
24
99
0
22 Oct 2019
Automatic Extraction of Personality from Text: Challenges and
  Opportunities
Automatic Extraction of Personality from Text: Challenges and Opportunities
N. Akrami
Johan Fernquist
T. Isbister
Lisa Kaati
Björn Pelzer
6
10
0
22 Oct 2019
IPOD: An Industrial and Professional Occupations Dataset and its
  Applications to Occupational Data Mining and Analysis
IPOD: An Industrial and Professional Occupations Dataset and its Applications to Occupational Data Mining and Analysis
Junhua Liu
Yung Chuen Ng
Kristin L. Wood
Kwan Hui Lim
31
6
0
22 Oct 2019
Two-Step Sound Source Separation: Training on Learned Latent Targets
Two-Step Sound Source Separation: Training on Learned Latent Targets
Efthymios Tzinis
Shrikant Venkataramani
Zhepei Wang
Y. C. Sübakan
Paris Smaragdis
26
64
0
22 Oct 2019
Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Yongqiang Wang
Abdel-rahman Mohamed
Duc Le
Chunxi Liu
Alex Xiao
...
Xiaohui Zhang
Frank Zhang
Christian Fuegen
Geoffrey Zweig
M. Seltzer
16
248
0
22 Oct 2019
MRQA 2019 Shared Task: Evaluating Generalization in Reading
  Comprehension
MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension
Adam Fisch
Alon Talmor
Robin Jia
Minjoon Seo
Eunsol Choi
Danqi Chen
33
302
0
22 Oct 2019
Composite Neural Network: Theory and Application to PM2.5 Prediction
Composite Neural Network: Theory and Application to PM2.5 Prediction
M. Yang
Meng Chang Chen
PINN
19
9
0
22 Oct 2019
Learning to Make Generalizable and Diverse Predictions for
  Retrosynthesis
Learning to Make Generalizable and Diverse Predictions for Retrosynthesis
Benson Chen
T. Shen
Tommi Jaakkola
Regina Barzilay
24
46
0
21 Oct 2019
Designovel's system description for Fashion-IQ challenge 2019
Designovel's system description for Fashion-IQ challenge 2019
Jianri Li
Xieyuanli Chen
Zongtan Zhou
Ki-young Shin
Huimin Lu
24
6
0
21 Oct 2019
Domain-agnostic Question-Answering with Adversarial Training
Domain-agnostic Question-Answering with Adversarial Training
Seanie Lee
Donggyu Kim
Jangwon Park
OOD
35
72
0
21 Oct 2019
A Neural Entity Coreference Resolution Review
A Neural Entity Coreference Resolution Review
Nikolaos Stylianou
I. Vlahavas
24
38
0
21 Oct 2019
Diversify Your Datasets: Analyzing Generalization via Controlled
  Variance in Adversarial Datasets
Diversify Your Datasets: Analyzing Generalization via Controlled Variance in Adversarial Datasets
Ohad Rozen
Vered Shwartz
Roee Aharoni
Ido Dagan
AAML
19
37
0
21 Oct 2019
Localization of Fake News Detection via Multitask Transfer Learning
Localization of Fake News Detection via Multitask Transfer Learning
Jan Christian Blaise Cruz
Julianne Agatha Tan
C. Cheng
28
33
0
21 Oct 2019
Constructing Artificial Data for Fine-tuning for Low-Resource Biomedical
  Text Tagging with Applications in PICO Annotation
Constructing Artificial Data for Fine-tuning for Low-Resource Biomedical Text Tagging with Applications in PICO Annotation
Gaurav Singh
Zahra Sabet
John Shawe-Taylor
James Thomas
26
7
0
21 Oct 2019
Good, Better, Best: Textual Distractors Generation for Multiple-Choice
  Visual Question Answering via Reinforcement Learning
Good, Better, Best: Textual Distractors Generation for Multiple-Choice Visual Question Answering via Reinforcement Learning
Jiaying Lu
Xin Ye
Yi Ren
Yezhou Yang
18
10
0
21 Oct 2019
Discovering the Compositional Structure of Vector Representations with
  Role Learning Networks
Discovering the Compositional Structure of Vector Representations with Role Learning Networks
Paul Soulos
R. Thomas McCoy
Tal Linzen
P. Smolensky
CoGe
29
43
0
21 Oct 2019
Findings of the NLP4IF-2019 Shared Task on Fine-Grained Propaganda
  Detection
Findings of the NLP4IF-2019 Shared Task on Fine-Grained Propaganda Detection
Giovanni Da San Martino
Alberto Barrón-Cedeño
Preslav Nakov
25
80
0
20 Oct 2019
Improving Sequence Modeling Ability of Recurrent Neural Networks via
  Sememes
Improving Sequence Modeling Ability of Recurrent Neural Networks via Sememes
Yujia Qin
Fanchao Qi
Sicong Ouyang
Zhiyuan Liu
Cheng Yang
Yasheng Wang
Qun Liu
Maosong Sun
28
5
0
20 Oct 2019
XL-Editor: Post-editing Sentences with XLNet
XL-Editor: Post-editing Sentences with XLNet
Yong-Siang Shih
Wei-Cheng Chang
Yiming Yang
KELM
25
11
0
19 Oct 2019
Keyphrase Extraction from Scholarly Articles as Sequence Labeling using
  Contextualized Embeddings
Keyphrase Extraction from Scholarly Articles as Sequence Labeling using Contextualized Embeddings
Dhruva Sahrawat
Debanjan Mahata
Mayank Kulkarni
Haimin Zhang
Rakesh Gosangi
Amanda Stent
Agniv Sharma
Yaman Kumar Singla
R. Shah
Roger Zimmermann
17
30
0
19 Oct 2019
Natural Question Generation with Reinforcement Learning Based
  Graph-to-Sequence Model
Natural Question Generation with Reinforcement Learning Based Graph-to-Sequence Model
Yu Chen
Lingfei Wu
Mohammed J Zaki
19
11
0
19 Oct 2019
Model Compression with Two-stage Multi-teacher Knowledge Distillation
  for Web Question Answering System
Model Compression with Two-stage Multi-teacher Knowledge Distillation for Web Question Answering System
Ze Yang
Linjun Shou
Ming Gong
Wutao Lin
Daxin Jiang
28
92
0
18 Oct 2019
A Mutual Information Maximization Perspective of Language Representation
  Learning
A Mutual Information Maximization Perspective of Language Representation Learning
Lingpeng Kong
Cyprien de Masson dÁutume
Wang Ling
Lei Yu
Zihang Dai
Dani Yogatama
SSL
226
166
0
18 Oct 2019
Theoretical Investigation of Composite Neural Network
Theoretical Investigation of Composite Neural Network
M. Yang
Meng Chang Chen
PINN
11
3
0
18 Oct 2019
Previous
123...353354355...365366367
Next