1810.04805
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 18,335 papers shown
Title
K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters
Ruize Wang
Duyu Tang
Nan Duan
Zhongyu Wei
Xuanjing Huang
Jianshu Ji
Guihong Cao
Daxin Jiang
Ming Zhou
KELM
48
545
0
05 Feb 2020
Parsing as Pretraining
David Vilares
Michalina Strzyz
Anders Søgaard
Carlos Gómez-Rodríguez
43
31
0
05 Feb 2020
Vocoder-free End-to-End Voice Conversion with Transformer Network
June-Woo Kim
H. Jung
Minho Lee
30
4
0
05 Feb 2020
Fake News Detection by means of Uncertainty Weighted Causal Graphs
E.C. Garrido-Merchán
C. Puente
Rafael Palacios
GNN
CML
17
2
0
04 Feb 2020
How Far are We from Effective Context Modeling? An Exploratory Study on Semantic Parsing in Context
Qian Liu
Bei Chen
Jiaqi Guo
Jian-Guang Lou
Bin Zhou
Dongmei Zhang
25
72
0
03 Feb 2020
IART: Intent-aware Response Ranking with Transformers in Information-seeking Conversation Systems
Liu Yang
Minghui Qiu
Chen Qu
Cen Chen
Jiafeng Guo
Yongfeng Zhang
W. Bruce Croft
Haiqing Chen
38
38
0
03 Feb 2020
Schema-Guided Dialogue State Tracking Task at DSTC8
Abhinav Rastogi
Xiaoxue Zang
Srinivas Sunkara
Raghav Gupta
Pranav Khaitan
27
41
0
02 Feb 2020
Explaining Relationships Between Scientific Documents
Kelvin Luu
Xinyi Wu
Rik Koncel-Kedziorski
Kyle Lo
Isabel Cachola
Noah A. Smith
41
48
0
02 Feb 2020
Beat the AI: Investigating Adversarial Human Annotation for Reading Comprehension
Max Bartolo
A. Roberts
Johannes Welbl
Sebastian Riedel
Pontus Stenetorp
AAML
37
167
0
02 Feb 2020
Improving Domain-Adapted Sentiment Classification by Deep Adversarial Mutual Learning
Qian Xue
Wei Zhang
H. Zha
32
39
0
01 Feb 2020
FEA-Net: A Physics-guided Data-driven Model for Efficient Mechanical Response Prediction
Houpu Yao
Yi Gao
Yongming Liu
AI4CE
71
66
0
31 Jan 2020
Self-attention-based BiGRU and capsule network for named entity recognition
Jianfeng Deng
Lianglun Cheng
Zhuowei Wang
16
10
0
30 Jan 2020
Are Pre-trained Language Models Aware of Phrases? Simple but Strong Baselines for Grammar Induction
Taeuk Kim
Jihun Choi
Daniel Edmiston
Sang-goo Lee
22
90
0
30 Jan 2020
On the Importance of Word Order Information in Cross-lingual Sequence Labeling
Zihan Liu
Genta Indra Winata
Samuel Cahyawijaya
Andrea Madotto
Zhaojiang Lin
Pascale Fung
24
3
0
30 Jan 2020
MEMO: A Deep Network for Flexible Combination of Episodic Memories
Andrea Banino
Adria Puigdomenech Badia
Raphael Köster
Martin Chadwick
V. Zambaldi
Demis Hassabis
Caswell Barry
M. Botvinick
D. Kumaran
Charles Blundell
KELM
26
33
0
29 Jan 2020
Joint Contextual Modeling for ASR Correction and Language Understanding
Yue Weng
Sai Sumanth Miryala
Chandra Khatri
Runze Wang
H. Zheng
...
Mahdi Namazifar
Alexandros Papangelis
Hugh Williams
Franziska Bell
Gokhan Tur
36
50
0
28 Jan 2020
Guiding Corpus-based Set Expansion by Auxiliary Sets Generation and Co-Expansion
Jiaxin Huang
Yiqing Xie
Yu Meng
Jiaming Shen
Yunyi Zhang
Jiawei Han
58
27
0
27 Jan 2020
Retrospective Reader for Machine Reading Comprehension
ZhuoSheng Zhang
Junjie Yang
Hai Zhao
RALM
25
226
0
27 Jan 2020
Asking Questions the Human Way: Scalable Question-Answer Generation from Text Corpus
Bang Liu
Haojie Wei
Di Niu
Haolan Chen
Yancheng He
27
92
0
27 Jan 2020
TaxoExpan: Self-supervised Taxonomy Expansion with Position-Enhanced Graph Neural Network
Jiaming Shen
Zhihong Shen
Chenyan Xiong
Chi Wang
Kuansan Wang
Jiawei Han
32
74
0
26 Jan 2020
From Stock Prediction to Financial Relevance: Repurposing Attention Weights to Assess News Relevance Without Manual Annotations
Luciano Del Corro
Johannes Hoffart
AIFin
21
3
0
26 Jan 2020
DUMA: Reading Comprehension with Transposition Thinking
Pengfei Zhu
Hai Zhao
Xiaoguang Li
AI4CE
39
35
0
26 Jan 2020
Generating Representative Headlines for News Stories
Xiaotao Gu
Yuning Mao
Jiawei Han
Jialu Liu
Hongkun Yu
You Wu
Cong Yu
Daniel Finnie
Jiaqi Zhai
Nicholas Zukoski
30
70
0
26 Jan 2020
Multi-task self-supervised learning for Robust Speech Recognition
Mirco Ravanelli
Jianyuan Zhong
Santiago Pascual
P. Swietojanski
João Monteiro
J. Trmal
Yoshua Bengio
SSL
189
288
0
25 Jan 2020
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Jie Lei
Licheng Yu
Tamara L. Berg
Mohit Bansal
121
277
0
24 Jan 2020
Multilingual Denoising Pre-training for Neural Machine Translation
Yinhan Liu
Jiatao Gu
Naman Goyal
Xian Li
Sergey Edunov
Marjan Ghazvininejad
M. Lewis
Luke Zettlemoyer
AI4CE
AIMat
76
1,777
0
22 Jan 2020
ImageBERT: Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data
Di Qi
Lin Su
Jianwei Song
Edward Cui
Taroon Bharti
Arun Sacheti
VLM
40
259
0
22 Jan 2020
Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference
Timo Schick
Hinrich Schütze
258
1,591
0
21 Jan 2020
Domain-Aware Dialogue State Tracker for Multi-Domain Dialogue Systems
Vevake Balaraman
Bernardo Magnini
26
19
0
21 Jan 2020
Multi-level Head-wise Match and Aggregation in Transformer for Textual Sequence Matching
Shuohang Wang
Yunshi Lan
Yi Tay
Jing Jiang
Jingjing Liu
ViT
32
7
0
20 Jan 2020
Accuracy vs. Complexity: A Trade-off in Visual Question Answering Models
M. Farazi
Salman H. Khan
Nick Barnes
25
17
0
20 Jan 2020
The Parallelism Motifs of Genomic Data Analysis
Katherine Yelick
A. Buluç
M. Awan
A. Azad
Benjamin Brock
...
Giulia Guidi
S. Hofmeyr
Oguz Selvitopi
Cristina Teodoropol
L. Oliker
19
17
0
20 Jan 2020
A multimodal deep learning approach for named entity recognition from social media
M. Asgari-Chenaghlu
M. Feizi-Derakhshi
Leili Farzinvash
M. Balafar
C. Motamed
19
28
0
19 Jan 2020
Capturing Evolution in Word Usage: Just Add More Clusters?
Matej Martinc
Syrielle Montariol
Elaine Zosa
Lidia Pivovarova
43
47
0
18 Jan 2020
Plato Dialogue System: A Flexible Conversational AI Research Platform
Alexandros Papangelis
Mahdi Namazifar
Chandra Khatri
Yi-Chia Wang
Piero Molino
Gokhan Tur
LLMAG
29
23
0
17 Jan 2020
A Common Semantic Space for Monolingual and Cross-Lingual Meta-Embeddings
G. R. Claramunt
Rodrigo Agerri
German Rigau
37
7
0
17 Jan 2020
RobBERT: a Dutch RoBERTa-based Language Model
Pieter Delobelle
Thomas Winters
Bettina Berendt
18
233
0
17 Jan 2020
Comparing Rule-based, Feature-based and Deep Neural Methods for De-identification of Dutch Medical Records
Jan Trienes
D. Trieschnigg
C. Seifert
D. Hiemstra
24
26
0
16 Jan 2020
FGN: Fusion Glyph Network for Chinese Named Entity Recognition
Zhenyu Xuan
Rui Bao
Shengyi Jiang
24
34
0
15 Jan 2020
Graph-Bert: Only Attention is Needed for Learning Graph Representations
Jiawei Zhang
Haopeng Zhang
Congying Xia
Li Sun
31
299
0
15 Jan 2020
A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation
Jian Guan
Fei Huang
Zhihao Zhao
Xiaoyan Zhu
Minlie Huang
LRM
SyDa
27
242
0
15 Jan 2020
"Why is 'Chicago' deceptive?" Towards Building Model-Driven Tutorials for Humans
Vivian Lai
Han Liu
Chenhao Tan
35
139
0
14 Jan 2020
Multi-Source Domain Adaptation for Text Classification via DistanceNet-Bandits
Han Guo
Ramakanth Pasunuru
Mohit Bansal
30
114
0
13 Jan 2020
CLUENER2020: Fine-grained Named Entity Recognition Dataset and Benchmark for Chinese
Liang Xu
Yu Tong
Qianqian Dong
Yixuan Liao
Cong Yu
Yin Tian
Weitang Liu
Lu Li
Caiquan Liu
Xuanwei Zhang
32
48
0
13 Jan 2020
Visually Guided Self Supervised Learning of Speech Representations
Abhinav Shukla
Konstantinos Vougioukas
Pingchuan Ma
Stavros Petridis
Maja Pantic
SSL
29
24
0
13 Jan 2020
Residual Attention Net for Superior Cross-Domain Time Sequence Modeling
Seth H. Huang
Lingjie Xu
Congwei Jiang
AI4TS
34
10
0
13 Jan 2020
ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training
Weizhen Qi
Yu Yan
Yeyun Gong
Dayiheng Liu
Nan Duan
Jiusheng Chen
Ruofei Zhang
Ming Zhou
AI4TS
27
446
0
13 Jan 2020
Deep Learning based Pedestrian Inertial Navigation: Methods, Dataset and On-Device Inference
Changhao Chen
Peijun Zhao
Chris Xiaoxuan Lu
Wei Wang
Andrew Markham
A. Trigoni
29
113
0
13 Jan 2020
A comprehensive deep learning-based approach to reduced order modeling of nonlinear time-dependent parametrized PDEs
S. Fresca
Luca Dede'
Andrea Manzoni
AI4CE
30
258
0
12 Jan 2020
Rethinking Generalization of Neural Models: A Named Entity Recognition Case Study
Jinlan Fu
Pengfei Liu
Qi Zhang
Xuanjing Huang
AI4CE
33
73
0
12 Jan 2020