BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
arXiv: 1810.04805

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 19,144 papers shown
A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning
Yichi Zhang
Zhijian Ou
Huixin Wang
Junlan Feng
RALM
31
67
0
17 Sep 2020
Code-switching pre-training for neural machine translation
Zhen Yang
Bojie Hu
Ambyera Han
Shen Huang
Qi Ju
32
71
0
17 Sep 2020
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning
Bingbing Li
Zhenglun Kong
Tianyun Zhang
Ji Li
Zechao Li
Hang Liu
Caiwen Ding
VLM
34
64
0
17 Sep 2020
Tasty Burgers, Soggy Fries: Probing Aspect Robustness in Aspect-Based Sentiment Analysis
Xiaoyu Xing
Zhijing Jin
Di Jin
Bingning Wang
Qi Zhang
Xuanjing Huang
CoGe
21
44
0
16 Sep 2020
Transformer Based Multi-Source Domain Adaptation
Dustin Wright
Isabelle Augenstein
32
53
0
16 Sep 2020
GLUCOSE: GeneraLized and COntextualized Story Explanations
N. Mostafazadeh
Aditya Kalyanpur
Lori Moon
David W. Buchanan
Lauren Berkowitz
Or Biran
Jennifer Chu-Carroll
32
121
0
16 Sep 2020
Automated Source Code Generation and Auto-completion Using Deep Learning: Comparing and Discussing Current Language-Model-Related Approaches
Juan Cruz-Benito
Sanjay Vishwakarma
Francisco Martín-Fernández
Ismael Faro
27
31
0
16 Sep 2020
SelfAugment: Automatic Augmentation Policies for Self-Supervised Learning
Colorado Reed
Sean L. Metzger
A. Srinivas
Trevor Darrell
Kurt Keutzer
SSL
33
49
0
16 Sep 2020
Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News
Reuben Tan
Bryan A. Plummer
Kate Saenko
AAML
26
72
0
16 Sep 2020
Reasoning about Goals, Steps, and Temporal Ordering with WikiHow
Li Zhang
Qing Lyu
Chris Callison-Burch
ReLM
LRM
32
86
0
16 Sep 2020
Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT
Alexandra Chronopoulou
Dario Stojanovski
Alexander Fraser
21
33
0
16 Sep 2020
UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation
Jian Guan
Minlie Huang
29
69
0
16 Sep 2020
Answering Any-hop Open-domain Questions with Iterative Document Reranking
Ping Nie
Yuyu Zhang
Arun Ramamurthy
Le Song
33
20
0
16 Sep 2020
Grounded Adaptation for Zero-shot Executable Semantic Parsing
Victor Zhong
M. Lewis
Sida I. Wang
Luke Zettlemoyer
46
99
0
16 Sep 2020
Multi-span Style Extraction for Generative Reading Comprehension
Junjie Yang
Zhuosheng Zhang
Hai Zhao
SyDa
24
14
0
15 Sep 2020
Evaluating representations by the complexity of learning low-loss predictors
William F. Whitney
M. Song
David Brandfonbrener
Jaan Altosaar
Kyunghyun Cho
33
23
0
15 Sep 2020
Augmented Natural Language for Generative Sequence Labeling
Ben Athiwaratkun
Cicero Nogueira dos Santos
Jason Krone
Bing Xiang
VLM
24
61
0
15 Sep 2020
BERT-QE: Contextualized Query Expansion for Document Re-ranking
Zhi Zheng
Kai Hui
Ben He
Xianpei Han
Le Sun
Andrew Yates
27
93
0
15 Sep 2020
Critical Thinking for Language Models
Gregor Betz
Christian Voigt
Kyle Richardson
SyDa
ReLM
LRM
AI4CE
31
35
0
15 Sep 2020
Multimodal Joint Attribute Prediction and Value Extraction for E-commerce Product
Tiangang Zhu
Yue Wang
Haoran Li
Youzheng Wu
Xiaodong He
Bowen Zhou
27
69
0
15 Sep 2020
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
Timo Schick
Hinrich Schütze
62
958
0
15 Sep 2020
MLMLM: Link Prediction with Mean Likelihood Masked Language Model
Louis Clouâtre
P. Trempe
Amal Zouaq
Sarath Chandar
25
43
0
15 Sep 2020
Global-aware Beam Search for Neural Abstractive Summarization
Ye Ma
Zixun Lan
Lu Zong
Kaizhu Huang
28
12
0
15 Sep 2020
MatScIE: An automated tool for the generation of databases of methods and parameters used in the computational materials science literature
Souradip Guha
Ankan Mullick
Jatin Agrawal
Swetarekha Ram
Samir Ghui
Seung-Cheol Lee
S. Bhattacharjee
Pawan Goyal
21
15
0
15 Sep 2020
Efficient Transformers: A Survey
Yi Tay
Mostafa Dehghani
Dara Bahri
Donald Metzler
VLM
123
1,105
0
14 Sep 2020
GeDi: Generative Discriminator Guided Sequence Generation
Ben Krause
Akhilesh Deepak Gotmare
Bryan McCann
N. Keskar
Shafiq Joty
R. Socher
Nazneen Rajani
56
393
0
14 Sep 2020
A Systematic Literature Review on the Use of Deep Learning in Software Engineering Research
Cody Watson
Nathan Cooper
David Nader-Palacio
Kevin Moran
Denys Poshyvanyk
31
111
0
14 Sep 2020
Beyond Accuracy: ROI-driven Data Analytics of Empirical Data
Gouri Deshpande
Guenther Ruhe
6
6
0
14 Sep 2020
Learning an Effective Context-Response Matching Model with Self-Supervised Tasks for Retrieval-based Dialogues
Ruijian Xu
Chongyang Tao
Daxin Jiang
Xueliang Zhao
Dongyan Zhao
Rui Yan
37
70
0
14 Sep 2020
Contrastive Triple Extraction with Generative Transformer
Hongbin Ye
Ningyu Zhang
Shumin Deng
Mosha Chen
Chuanqi Tan
Fei Huang
Huajun Chen
22
128
0
14 Sep 2020
On Robustness and Bias Analysis of BERT-based Relation Extraction
Luoqiu Li
Xiang Chen
Hongbin Ye
Zhen Bi
Shumin Deng
Ningyu Zhang
Huajun Chen
53
18
0
14 Sep 2020
SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition
Liangzhi Li
Bowen Wang
Manisha Verma
Yuta Nakashima
R. Kawasaki
Hajime Nagahara
OCL
23
49
0
14 Sep 2020
Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding
Shuohang Wang
Luowei Zhou
Zhe Gan
Yen-Chun Chen
Yuwei Fang
S. Sun
Yu Cheng
Jingjing Liu
48
28
0
13 Sep 2020
Cosine meets Softmax: A tough-to-beat baseline for visual grounding
N. Rufus
U. R. Nair
K. M. Krishna
Vineet Gandhi
27
13
0
13 Sep 2020
DualDE: Dually Distilling Knowledge Graph Embedding for Faster and Cheaper Reasoning
Yushan Zhu
Wen Zhang
Yin Hua
Hui Chen
Xu-Xin Cheng
Wei Zhang
Huajun Chen
27
27
0
13 Sep 2020
Differentially Private Language Models Benefit from Public Pre-training
Gavin Kerrigan
Dylan Slack
Jens Tuyls
24
56
0
13 Sep 2020
Improving Machine Reading Comprehension with Contextualized Commonsense Knowledge
Kai Sun
Dian Yu
Jianshu Chen
Dong Yu
Claire Cardie
38
12
0
12 Sep 2020
CIA_NITT at WNUT-2020 Task 2: Classification of COVID-19 Tweets Using Pre-trained Language Models
Yandrapati Prakash Babu
Eswari Rajagopal
31
9
0
12 Sep 2020
Abstractive Information Extraction from Scanned Invoices (AIESI) using End-to-end Sequential Approach
Shreeshiv Patel
Dvijesh N Bhatt
30
11
0
12 Sep 2020
Improving Indonesian Text Classification Using Multilingual Language Model
Ilham Firdausi Putra
Ayu Purwarianti
18
7
0
12 Sep 2020
Alfie: An Interactive Robot with a Moral Compass
Cigdem Turan
P. Schramowski
Constantin Rothkopf
Kristian Kersting
LM&Ro
16
0
0
11 Sep 2020
UPB at SemEval-2020 Task 11: Propaganda Detection with Domain-Specific Trained BERT
Andrei Paraschiv
Dumitru-Clementin Cercel
M. Dascalu
19
17
0
11 Sep 2020
Transfer Learning of Graph Neural Networks with Ego-graph Information Maximization
Qi Zhu
Carl Yang
Yidan Xu
Haonan Wang
Chao Zhang
Jiawei Han
50
117
0
11 Sep 2020
Sparsifying Transformer Models with Trainable Representation Pooling
Michal Pietruszka
Łukasz Borchmann
Lukasz Garncarek
28
10
0
10 Sep 2020
FILTER: An Enhanced Fusion Method for Cross-lingual Language Understanding
Yuwei Fang
Shuohang Wang
Zhe Gan
S. Sun
Jingjing Liu
VLM
28
58
0
10 Sep 2020
Rank over Class: The Untapped Potential of Ranking in Natural Language Processing
Amir Atapour-Abarghouei
Stephen Bonner
A. McGough
28
4
0
10 Sep 2020
Multi-Task Learning with Deep Neural Networks: A Survey
M. Crawshaw
CVBM
58
610
0
10 Sep 2020
Investigating Gender Bias in BERT
Rishabh Bhardwaj
Navonil Majumder
Soujanya Poria
33
106
0
10 Sep 2020
Multi-Hop Fact Checking of Political Claims
W. Ostrowski
Arnav Arora
Pepa Atanasova
Isabelle Augenstein
LRM
27
39
0
10 Sep 2020
Brain2Word: Decoding Brain Activity for Language Generation
Nicolas Affolter
Béni Egressy
Damian Pascual
Roger Wattenhofer
16
23
0
10 Sep 2020