ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,687 papers shown
Title
NaturalConv: A Chinese Dialogue Dataset Towards Multi-turn Topic-driven
  Conversation
NaturalConv: A Chinese Dialogue Dataset Towards Multi-turn Topic-driven Conversation
Xiaoyang Wang
Chen Li
Jianqiao Zhao
Dong Yu
82
40
0
03 Mar 2021
Weakly-Supervised Open-Retrieval Conversational Question Answering
Weakly-Supervised Open-Retrieval Conversational Question Answering
Chen Qu
Liu Yang
Cen Chen
W. Bruce Croft
Kalpesh Krishna
Mohit Iyyer
RALM
73
13
0
03 Mar 2021
OAG-BERT: Towards A Unified Backbone Language Model For Academic
  Knowledge Services
OAG-BERT: Towards A Unified Backbone Language Model For Academic Knowledge Services
Xiao Liu
Da Yin
Jingnan Zheng
Xingjian Zhang
Peng Zhang
Hongxia Yang
Yuxiao Dong
Jie Tang
VLM
115
32
0
03 Mar 2021
Relate and Predict: Structure-Aware Prediction with Jointly Optimized
  Neural DAG
Relate and Predict: Structure-Aware Prediction with Jointly Optimized Neural DAG
Arshdeep Sekhon
Zhe Wang
Yanjun Qi
GNN
38
0
0
03 Mar 2021
Video Sentiment Analysis with Bimodal Information-augmented Multi-Head
  Attention
Video Sentiment Analysis with Bimodal Information-augmented Multi-Head Attention
Ting-Wei Wu
Jun-jie Peng
Wenqiang Zhang
Huiran Zhang
Chuan Ma
Yansong Huang
77
88
0
03 Mar 2021
Simplified Data Wrangling with ir_datasets
Simplified Data Wrangling with ir_datasets
Sean MacAvaney
Andrew Yates
Sergey Feldman
Doug Downey
Arman Cohan
Nazli Goharian
157
109
0
03 Mar 2021
Data Augmentation with Hierarchical SQL-to-Question Generation for
  Cross-domain Text-to-SQL Parsing
Data Augmentation with Hierarchical SQL-to-Question Generation for Cross-domain Text-to-SQL Parsing
Kun Wu
Lijie Wang
Zhenghua Li
Ao Zhang
Xinyan Xiao
Hua Wu
Min Zhang
Haifeng Wang
66
35
0
03 Mar 2021
Zero-Shot Cross-Lingual Dependency Parsing through Contextual Embedding
  Transformation
Zero-Shot Cross-Lingual Dependency Parsing through Contextual Embedding Transformation
Haoran Xu
Philipp Koehn
58
7
0
03 Mar 2021
Gradual Fine-Tuning for Low-Resource Domain Adaptation
Gradual Fine-Tuning for Low-Resource Domain Adaptation
Haoran Xu
Seth Ebner
M. Yarmohammadi
A. White
Benjamin Van Durme
Kenton W. Murray
CLL
82
39
0
03 Mar 2021
Two-Stage Framework for Seasonal Time Series Forecasting
Two-Stage Framework for Seasonal Time Series Forecasting
Qingyang Xu
Qingsong Wen
Liang Sun
BDLAI4TS
27
7
0
03 Mar 2021
Random Feature Attention
Random Feature Attention
Hao Peng
Nikolaos Pappas
Dani Yogatama
Roy Schwartz
Noah A. Smith
Lingpeng Kong
148
362
0
03 Mar 2021
Parametric Complexity Bounds for Approximating PDEs with Neural Networks
Parametric Complexity Bounds for Approximating PDEs with Neural Networks
Tanya Marwah
Zachary Chase Lipton
Andrej Risteski
84
19
0
03 Mar 2021
Self-supervised Pretraining of Visual Features in the Wild
Self-supervised Pretraining of Visual Features in the Wild
Priya Goyal
Mathilde Caron
Benjamin Lefaudeux
Min Xu
Pengchao Wang
...
Mannat Singh
Vitaliy Liptchinsky
Ishan Misra
Armand Joulin
Piotr Bojanowski
VLMSSL
101
274
0
02 Mar 2021
WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual
  Machine Learning
WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning
Krishna Srinivasan
K. Raman
Jiecao Chen
Michael Bendersky
Marc Najork
VLM
290
322
0
02 Mar 2021
The Rediscovery Hypothesis: Language Models Need to Meet Linguistics
The Rediscovery Hypothesis: Language Models Need to Meet Linguistics
Vassilina Nikoulina
Maxat Tezekbayev
Nuradil Kozhakhmet
Madina Babazhanova
Matthias Gallé
Z. Assylbekov
69
8
0
02 Mar 2021
Emotion Ratings: How Intensity, Annotation Confidence and Agreements are
  Entangled
Emotion Ratings: How Intensity, Annotation Confidence and Agreements are Entangled
Enrica Troiano
Sebastian Padó
Roman Klinger
55
20
0
02 Mar 2021
Hate Towards the Political Opponent: A Twitter Corpus Study of the 2020
  US Elections on the Basis of Offensive Speech and Stance Detection
Hate Towards the Political Opponent: A Twitter Corpus Study of the 2020 US Elections on the Basis of Offensive Speech and Stance Detection
Lara Grimminger
Roman Klinger
73
74
0
02 Mar 2021
Disentangling Syntax and Semantics in the Brain with Deep Networks
Disentangling Syntax and Semantics in the Brain with Deep Networks
Charlotte Caucheteux
Alexandre Gramfort
J. King
129
74
0
02 Mar 2021
Missing Value Imputation on Multidimensional Time Series
Missing Value Imputation on Multidimensional Time Series
Parikshit Bansal
Prathamesh Deshpande
Sunita Sarawagi
AI4TS
163
67
0
02 Mar 2021
An End-to-End Network for Emotion-Cause Pair Extraction
An End-to-End Network for Emotion-Cause Pair Extraction
Aaditya Singh
Shreeshail Hingane
Saim Wani
Ashutosh Modi
67
38
0
02 Mar 2021
Towards Efficiently Diversifying Dialogue Generation via Embedding
  Augmentation
Towards Efficiently Diversifying Dialogue Generation via Embedding Augmentation
Yu Cao
Liang Ding
Zhiliang Tian
Meng Fang
91
14
0
02 Mar 2021
Generalizing to Unseen Domains: A Survey on Domain Generalization
Generalizing to Unseen Domains: A Survey on Domain Generalization
Jindong Wang
Cuiling Lan
Chang-Shu Liu
Yidong Ouyang
Tao Qin
Wang Lu
Yiqiang Chen
Wenjun Zeng
Philip S. Yu
OOD
299
1,241
0
02 Mar 2021
ToxCCIn: Toxic Content Classification with Interpretability
ToxCCIn: Toxic Content Classification with Interpretability
Tong Xiang
Sean MacAvaney
Eugene Yang
Nazli Goharian
126
16
0
01 Mar 2021
DEUS: A Data-driven Approach to Estimate User Satisfaction in Multi-turn
  Dialogues
DEUS: A Data-driven Approach to Estimate User Satisfaction in Multi-turn Dialogues
Ziming Li
Dookun Park
Julia Kiseleva
Young-Bum Kim
Sungjin Lee
141
6
0
01 Mar 2021
Generative Adversarial Transformers
Generative Adversarial Transformers
Drew A. Hudson
C. L. Zitnick
ViT
133
182
0
01 Mar 2021
Sentiment Analysis of Users' Reviews on COVID-19 Contact Tracing Apps
  with a Benchmark Dataset
Sentiment Analysis of Users' Reviews on COVID-19 Contact Tracing Apps with a Benchmark Dataset
Kashif Ahmad
Firoj Alam
Junaid Qadir
Basheer Qolomany
Imran Khan
...
M. Suleman
Naina Said
Syed Zohaib Hassan
Asma Gul
Ala I. Al-Fuqaha
54
7
0
01 Mar 2021
The Healthy States of America: Creating a Health Taxonomy with Social
  Media
The Healthy States of America: Creating a Health Taxonomy with Social Media
S. Šćepanović
L. Aiello
Ke Zhou
Sagar Joglekar
Daniele Quercia
21
6
0
01 Mar 2021
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding
  on Point Clouds through Instance Multi-level Contextual Referring
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring
Zhihao Yuan
Xu Yan
Yinghong Liao
Ruimao Zhang
Sheng Wang
Zhen Li
Shuguang Cui
129
135
0
01 Mar 2021
OmniNet: Omnidirectional Representations from Transformers
OmniNet: Omnidirectional Representations from Transformers
Yi Tay
Mostafa Dehghani
V. Aribandi
Jai Gupta
Philip Pham
Zhen Qin
Dara Bahri
Da-Cheng Juan
Donald Metzler
116
30
0
01 Mar 2021
A survey on Variational Autoencoders from a GreenAI perspective
A survey on Variational Autoencoders from a GreenAI perspective
Andrea Asperti
David Evangelista
E. Loli Piccolomini
DRL
91
53
0
01 Mar 2021
Adapting MARBERT for Improved Arabic Dialect Identification: Submission
  to the NADI 2021 Shared Task
Adapting MARBERT for Improved Arabic Dialect Identification: Submission to the NADI 2021 Shared Task
Badr AlKhamissi
Mohamed Gabr
Muhammad N. ElNokrashy
Khaled Essam
89
20
0
01 Mar 2021
CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double
  Back-Translation for Vision-and-Language Navigation
CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation
A. Magassouba
K. Sugiura
Hisashi Kawai
73
10
0
01 Mar 2021
A Brief Summary of Interactions Between Meta-Learning and
  Self-Supervised Learning
A Brief Summary of Interactions Between Meta-Learning and Self-Supervised Learning
Huimin Peng
SSL
32
4
0
01 Mar 2021
M6: A Chinese Multimodal Pretrainer
M6: A Chinese Multimodal Pretrainer
Junyang Lin
Rui Men
An Yang
Chan Zhou
Ming Ding
...
Yong Li
Wei Lin
Jingren Zhou
J. Tang
Hongxia Yang
VLMMoE
161
134
0
01 Mar 2021
Sandglasset: A Light Multi-Granularity Self-attentive Network For
  Time-Domain Speech Separation
Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation
Max W. Y. Lam
Jun Wang
Jane Polak Scowcroft
Dong Yu
AI4TS
121
49
0
01 Mar 2021
Query Rewriting via Cycle-Consistent Translation for E-Commerce Search
Query Rewriting via Cycle-Consistent Translation for E-Commerce Search
Yiming Qiu
Kang Zhang
Han Zhang
Songlin Wang
Sulong Xu
Yun Xiao
Bo Long
Wen-Yun Yang
95
16
0
01 Mar 2021
Single-Shot Motion Completion with Transformer
Single-Shot Motion Completion with Transformer
Yinglin Duan
Tianyang Shi
Zhengxia Zou
Yenan Lin
Zhehui Qian
Bohan Zhang
U. Michigan
ViT
93
77
0
01 Mar 2021
Self-supervised Auxiliary Learning for Graph Neural Networks via
  Meta-Learning
Self-supervised Auxiliary Learning for Graph Neural Networks via Meta-Learning
Dasol Hwang
Jinyoung Park
Sunyoung Kwon
KyungHyun Kim
Jung-Woo Ha
Hyunwoo J. Kim
OODSSL
83
8
0
01 Mar 2021
Long Document Summarization in a Low Resource Setting using Pretrained
  Language Models
Long Document Summarization in a Low Resource Setting using Pretrained Language Models
Ahsaas Bajaj
Pavitra Dangati
Kalpesh Krishna
Pradhiksha Ashok Kumar
Rheeya Uppaal
Bradford T. Windsor
Eliot Brenner
Dominic Dotterrer
Rajarshi Das
Andrew McCallum
AILawRALM
94
52
0
01 Mar 2021
Combat COVID-19 Infodemic Using Explainable Natural Language Processing
  Models
Combat COVID-19 Infodemic Using Explainable Natural Language Processing Models
Jackie Ayoub
X. J. Yang
Feng Zhou
110
132
0
01 Mar 2021
Token-Modification Adversarial Attacks for Natural Language Processing:
  A Survey
Token-Modification Adversarial Attacks for Natural Language Processing: A Survey
Tom Roth
Yansong Gao
A. Abuadbba
Surya Nepal
Wei Liu
AAML
120
12
0
01 Mar 2021
RuSentEval: Linguistic Source, Encoder Force!
RuSentEval: Linguistic Source, Encoder Force!
Vladislav Mikhailov
Ekaterina Taktasheva
Elina Sigdel
Ekaterina Artemova
VLM
53
6
0
28 Feb 2021
A Survey on Deep Semi-supervised Learning
A Survey on Deep Semi-supervised Learning
Xiangli Yang
Zixing Song
Irwin King
Zenglin Xu
120
594
0
28 Feb 2021
On the Utility of Gradient Compression in Distributed Training Systems
On the Utility of Gradient Compression in Distributed Training Systems
Saurabh Agarwal
Hongyi Wang
Shivaram Venkataraman
Dimitris Papailiopoulos
111
47
0
28 Feb 2021
Towards Conversational Humor Analysis and Design
Towards Conversational Humor Analysis and Design
Tanishq Chaudhary
Mayank Goel
R. Mamidi
46
4
0
28 Feb 2021
Citizen Participation and Machine Learning for a Better Democracy
Citizen Participation and Machine Learning for a Better Democracy
Miguel Arana Catania
Felix-Anselm van Lier
Rob Procter
N. Tkachenko
Yulan He
A. Zubiaga
Maria Liakata
73
58
0
28 Feb 2021
NLP-CUET@LT-EDI-EACL2021: Multilingual Code-Mixed Hope Speech Detection
  using Cross-lingual Representation Learner
NLP-CUET@LT-EDI-EACL2021: Multilingual Code-Mixed Hope Speech Detection using Cross-lingual Representation Learner
E. Hossain
Omar Sharif
M. M. Hoque
68
25
0
28 Feb 2021
NLP-CUET@DravidianLangTech-EACL2021: Offensive Language Detection from
  Multilingual Code-Mixed Text using Transformers
NLP-CUET@DravidianLangTech-EACL2021: Offensive Language Detection from Multilingual Code-Mixed Text using Transformers
Omar Sharif
E. Hossain
M. M. Hoque
55
36
0
28 Feb 2021
Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based
  Bias in NLP
Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP
Timo Schick
Sahana Udupa
Hinrich Schütze
380
389
0
28 Feb 2021
Transformers with Competitive Ensembles of Independent Mechanisms
Transformers with Competitive Ensembles of Independent Mechanisms
Alex Lamb
Di He
Anirudh Goyal
Guolin Ke
Chien-Feng Liao
Mirco Ravanelli
Yoshua Bengio
MoE
97
23
0
27 Feb 2021
Previous
123...357358359...472473474
Next