BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM, SSL, SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,555 papers shown
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion
Suyoun Kim
Siddharth Dalmia
Florian Metze
94
23
0
27 Jun 2019
Eliciting Knowledge from Experts: Automatic Transcript Parsing for Cognitive Task Analysis
Junyi Du
He Jiang
Jiaming Shen
Xiang Ren
58
3
0
26 Jun 2019
Determining Relative Argument Specificity and Stance for Complex Argumentative Structures
Esin Durmus
Faisal Ladhak
Claire Cardie
54
18
0
26 Jun 2019
Enhancing PIO Element Detection in Medical Text Using Contextualized Embedding
H. Mezaoui
A. Gontcharov
Isuru Gunasekara
21
5
0
26 Jun 2019
Good Secretaries, Bad Truck Drivers? Occupational Gender Stereotypes in Sentiment Analysis
J. Bhaskaran
Isha Bhallamudi
66
47
0
24 Jun 2019
Is It Worth the Attention? A Comparative Evaluation of Attention Layers for Argument Unit Segmentation
Maximilian Spliethover
Jonas Klaff
Hendrik Heuer
54
10
0
24 Jun 2019
Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation
Daniel Loureiro
A. Jorge
85
138
0
24 Jun 2019
LIAAD at SemDeep-5 Challenge: Word-in-Context (WiC)
Daniel Loureiro
A. Jorge
65
17
0
24 Jun 2019
Classification and Clustering of Arguments with Contextualized Word Embeddings
Nils Reimers
Benjamin Schiller
Tilman Beck
Johannes Daxenberger
Christian Stab
Iryna Gurevych
91
171
0
24 Jun 2019
EQuANt (Enhanced Question Answer Network)
François-Xavier Aubet
D. Danks
Yuchen Zhu
56
3
0
24 Jun 2019
Evaluating the Supervised and Zero-shot Performance of Multi-lingual Translation Models
Chris Hokamp
John Glover
D. Ghalandari
60
14
0
24 Jun 2019
Alchemy: A Quantum Chemistry Dataset for Benchmarking AI Models
Guangyong Chen
Pengfei Chen
Chang-Yu Hsieh
Chee-Kong Lee
B. Liao
...
J. Qiu
Qiming Sun
Jie Tang
R. Zemel
Shengyu Zhang
79
76
0
22 Jun 2019
Identification of Tasks, Datasets, Evaluation Metrics, and Numeric Scores for Scientific Leaderboards Construction
Yufang Hou
Charles Jochim
Martin Gleize
Francesca Bonin
Debasis Ganguly
71
95
0
21 Jun 2019
Deep Leakage from Gradients
Ligeng Zhu
Zhijian Liu
Song Han
FedML
114
2,249
0
21 Jun 2019
Graph Star Net for Generalized Multi-Task Learning
H. Lu
Seth H. Huang
Tian Ye
Xiuyan Guo
GNN
85
46
0
21 Jun 2019
Informative Image Captioning with External Sources of Information
Sanqiang Zhao
Piyush Sharma
Tomer Levinboim
Radu Soricut
65
46
0
20 Jun 2019
Few-Shot Sequence Labeling with Label Dependency Transfer and Pair-wise Embedding
Yutai Hou
Zhihan Zhou
Yijia Liu
Ning Wang
Wanxiang Che
Han Liu
Ting Liu
71
9
0
20 Jun 2019
Generating Empathetic Responses by Looking Ahead the User's Sentiment
Jamin Shin
Peng Xu
Andrea Madotto
Pascale Fung
55
48
0
20 Jun 2019
Multi-Grained Named Entity Recognition
Congying Xia
Chenwei Zhang
Tao Yang
Yaliang Li
Nan Du
Xian Wu
Wei Fan
Fenglong Ma
Philip Yu
86
87
0
20 Jun 2019
SMILES-X: autonomous molecular compounds characterization for small datasets without descriptors
G. Lambard
Ekaterina Gracheva
71
21
0
20 Jun 2019
Learning Compressed Sentence Representations for On-Device Text Processing
Dinghan Shen
Pengyu Cheng
Dhanasekar Sundararaman
Xinyuan Zhang
Qian Yang
Meng Tang
Asli Celikyilmaz
Lawrence Carin
67
23
0
19 Jun 2019
REflex: Flexible Framework for Relation Extraction in Multiple Domains
Geeticka Chauhan
Matthew B. A. McDermott
Peter Szolovits
44
13
0
19 Jun 2019
SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly Representative Neural Architectures
Hsin-Pai Cheng
Tunhou Zhang
Yukun Yang
Feng Yan
Shiyu Li
Harris Teague
H. Li
Yiran Chen
77
11
0
19 Jun 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
497
8,472
0
19 Jun 2019
Evaluating Protein Transfer Learning with TAPE
Roshan Rao
Nicholas Bhattacharya
Neil Thomas
Yan Duan
Xi Chen
John F. Canny
Pieter Abbeel
Yun S. Song
SSL
110
813
0
19 Jun 2019
Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction
Christoph Alt
Marc Hübner
Leonhard Hennig
77
122
0
19 Jun 2019
Surf at MEDIQA 2019: Improving Performance of Natural Language Inference in the Clinical Domain by Adopting Pre-trained Language Model
Jiin Nam
Seunghyun Yoon
Kyomin Jung
LM&MA
39
3
0
19 Jun 2019
Improving Sentiment Analysis with Multi-task Learning of Negation
Jeremy Barnes
Erik Velldal
Lilja Øvrelid
70
36
0
18 Jun 2019
Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability
Antoine Caubrière
N. Tomashenko
Antoine Laurent
Emmanuel Morin
Nathalie Camelin
Yannick Esteve
52
54
0
18 Jun 2019
Towards Robust Named Entity Recognition for Historic German
Stefan Schweter
Johannes Baiter
53
23
0
18 Jun 2019
Transfer Learning for Causal Sentence Detection
Manolis Kyriakakis
Ion Androutsopoulos
Joan Ginés i Ametllé
Artur Saudabayev
52
25
0
18 Jun 2019
Zero-Shot Entity Linking by Reading Entity Descriptions
Lajanugen Logeswaran
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
Jacob Devlin
Honglak Lee
VLM
91
257
0
18 Jun 2019
Measuring Bias in Contextualized Word Representations
Keita Kurita
Nidhi Vyas
Ayush Pareek
A. Black
Yulia Tsvetkov
121
454
0
18 Jun 2019
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models
Wei Fang
Yu-An Chung
James R. Glass
61
27
0
17 Jun 2019
Coherent and Controllable Outfit Generation
Kedan Li
Chen Liu
David A. Forsyth
72
15
0
17 Jun 2019
Open Domain Event Extraction Using Neural Latent Variable Models
Xiao Liu
Heyan Huang
Yue Zhang
BDL, DRL
65
57
0
17 Jun 2019
ParNet: Position-aware Aggregated Relation Network for Image-Text matching
Yaxian Xia
Lun Huang
Wenmin Wang
Xiao-Yong Wei
Jie Chen
128
1
0
17 Jun 2019
MixUp as Directional Adversarial Training
Guillaume P. Archambault
Yongyi Mao
Hongyu Guo
Richong Zhang
AAML
58
23
0
17 Jun 2019
Understanding Natural Language Instructions for Fetching Daily Objects Using GAN-Based Multimodal Target-Source Classification
A. Magassouba
K. Sugiura
Anh Trinh Quoc
Hisashi Kawai
70
34
0
17 Jun 2019
Meta-learning Pseudo-differential Operators with Deep Neural Networks
Jordi Feliu-Fabà
Yuwei Fan
Lexing Ying
66
40
0
16 Jun 2019
Theoretical Limitations of Self-Attention in Neural Sequence Models
Michael Hahn
96
276
0
16 Jun 2019
One Epoch Is All You Need
Aran Komatsuzaki
78
51
0
16 Jun 2019
Multi-Hop Paragraph Retrieval for Open-Domain Question Answering
Yair Feldman
Ran El-Yaniv
RALM
97
100
0
15 Jun 2019
Context is Key: Grammatical Error Detection with Contextual Word Representations
Samuel J. Bell
H. Yannakoudakis
Marek Rei
82
43
0
15 Jun 2019
Can neural networks understand monotonicity reasoning?
Hitomi Yanaka
K. Mineshima
D. Bekki
Kentaro Inui
Satoshi Sekine
Lasha Abzianidze
Johan Bos
LRM
67
81
0
15 Jun 2019
High-Performance Deep Learning via a Single Building Block
E. Georganas
K. Banerjee
Dhiraj D. Kalamkar
Sasikanth Avancha
Anand Venkat
Michael J. Anderson
G. Henry
Hans Pabst
A. Heinecke
43
12
0
15 Jun 2019
Scalable Syntax-Aware Language Models Using Knowledge Distillation
A. Kuncoro
Chris Dyer
Laura Rimell
S. Clark
Phil Blunsom
146
26
0
14 Jun 2019
"My Way of Telling a Story": Persona based Grounded Story Generation
Shrimai Prabhumoye
Khyathi Chandu
Ruslan Salakhutdinov
A. Black
82
35
0
14 Jun 2019
Comparison of Diverse Decoding Methods from Conditional Language Models
Daphne Ippolito
Reno Kriz
M. Kustikova
João Sedoc
Chris Callison-Burch
AI4CE
100
114
0
14 Jun 2019
Augmenting Neural Networks with First-order Logic
Tao Li
Vivek Srikumar
66
109
0
14 Jun 2019