1909.11942
Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,935 papers shown
Title
A Closer Look at Linguistic Knowledge in Masked Language Models: The Case of Relative Clauses in American English
Marius Mosbach
Stefania Degaetano-Ortlieb
Marie-Pauline Krielke
Badr M. Abdullah
Dietrich Klakow
27
7
0
02 Nov 2020
COSMO: Conditional SEQ2SEQ-based Mixture Model for Zero-Shot Commonsense Question Answering
Farhad Moghimifar
Zhuang Li
Terry Yue Zhuo
Mahsa Baktashmotlagh
Gholamreza Haffari
LRM
43
8
0
02 Nov 2020
I Know What You Asked: Graph Path Learning using AMR for Commonsense Reasoning
J. Lim
Dongsuk Oh
Yoonna Jang
Kisu Yang
Heuiseok Lim
ReLM
GNN
LRM
62
35
0
02 Nov 2020
Transformer-based Multi-Aspect Modeling for Multi-Aspect Multi-Sentiment Analysis
Zhanghua Wu
Chengcan Ying
Xinyu Dai
Shujian Huang
Jiajun Chen
18
9
0
01 Nov 2020
Understanding Pre-trained BERT for Aspect-based Sentiment Analysis
Hu Xu
Lei Shu
Philip S. Yu
Bing-Quan Liu
SSL
119
46
0
31 Oct 2020
SLM: Learning a Discourse Language Representation with Sentence Unshuffling
Haejun Lee
Drew A. Hudson
Kangwook Lee
Christopher D. Manning
SSL
119
52
0
30 Oct 2020
A Comprehensive Survey on Word Representation Models: From Classical to State-Of-The-Art Word Representation Language Models
Usman Naseem
Imran Razzak
S. Khan
M. Prasad
86
161
0
28 Oct 2020
Unmasking Contextual Stereotypes: Measuring and Mitigating BERT's Gender Bias
Marion Bartl
Malvina Nissim
Albert Gatt
91
125
0
27 Oct 2020
Probing Task-Oriented Dialogue Representation from Language Models
Chien-Sheng Wu
Caiming Xiong
79
20
0
26 Oct 2020
UPB at SemEval-2020 Task 12: Multilingual Offensive Language Detection on Social Media by Fine-tuning a Variety of BERT-based Models
Mircea-Adrian Tanase
Dumitru-Clementin Cercel
Costin-Gabriel Chiru
42
15
0
26 Oct 2020
Accelerating Training of Transformer-Based Language Models with Progressive Layer Dropping
Minjia Zhang
Yuxiong He
AI4CE
48
104
0
26 Oct 2020
Learning Contextualized Knowledge Structures for Commonsense Reasoning
Jun Yan
Mrigank Raman
Aaron Chan
Tianyu Zhang
Ryan Rossi
Handong Zhao
Sungchul Kim
Nedim Lipka
Xiang Ren
306
37
0
24 Oct 2020
Rethinking embedding coupling in pre-trained language models
Hyung Won Chung
Thibault Févry
Henry Tsai
Melvin Johnson
Sebastian Ruder
177
143
0
24 Oct 2020
A Frustratingly Easy Approach for Entity and Relation Extraction
Zexuan Zhong
Danqi Chen
229
108
0
24 Oct 2020
Pre-training Text-to-Text Transformers for Concept-centric Common Sense
Wangchunshu Zhou
Dong-Ho Lee
Ravi Kiran Selvam
Seyeon Lee
Bill Yuchen Lin
Xiang Ren
LRM
VLM
55
72
0
24 Oct 2020
Improving Multilingual Models with Language-Clustered Vocabularies
Hyung Won Chung
Dan Garrette
Kiat Chuan Tan
Jason Riesa
VLM
129
65
0
24 Oct 2020
ANLIzing the Adversarial Natural Language Inference Dataset
Adina Williams
Tristan Thrush
Douwe Kiela
AAML
255
47
0
24 Oct 2020
Comparative analysis of word embeddings in assessing semantic similarity of complex sentences
Dhivya Chandrasekaran
Vijay K. Mago
124
8
0
23 Oct 2020
DICT-MLM: Improved Multilingual Pre-Training using Bilingual Dictionaries
Aditi Chaudhary
K. Raman
Krishna Srinivasan
Jiecao Chen
81
25
0
23 Oct 2020
GiBERT: Introducing Linguistic Knowledge into BERT through a Lightweight Gated Injection Method
Nicole Peinelt
Marek Rei
Maria Liakata
55
2
0
23 Oct 2020
Generating Plausible Counterfactual Explanations for Deep Transformers in Financial Text Classification
Linyi Yang
Eoin M. Kenny
T. L. J. Ng
Yi Yang
Barry Smyth
Ruihai Dong
96
73
0
23 Oct 2020
Event-Driven Learning of Systematic Behaviours in Stock Markets
Xianchao Wu
AIFin
49
7
0
23 Oct 2020
Pretraining and Fine-Tuning Strategies for Sentiment Analysis of Latvian Tweets
Gaurish Thakkar
Marcis Pinnis
95
9
0
23 Oct 2020
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
Dongling Xiao
Yukun Li
Han Zhang
Yu Sun
Hao Tian
Hua Wu
Haifeng Wang
29
39
0
23 Oct 2020
Language Models are Open Knowledge Graphs
Chenguang Wang
Xiao Liu
Basel Alomair
SSL
KELM
79
137
0
22 Oct 2020
AdapterDrop: On the Efficiency of Adapters in Transformers
Andreas Rucklé
Gregor Geigle
Max Glockner
Tilman Beck
Jonas Pfeiffer
Nils Reimers
Iryna Gurevych
125
267
0
22 Oct 2020
Contrastive Self-Supervised Learning for Wireless Power Control
Navid Naderializadeh
SSL
95
7
0
22 Oct 2020
Towards Fully Bilingual Deep Language Modeling
Li-Hsin Chang
S. Pyysalo
Jenna Kanerva
Filip Ginter
67
3
0
22 Oct 2020
Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
Lingkai Kong
Haoming Jiang
Yuchen Zhuang
Jie Lyu
T. Zhao
Chao Zhang
OODD
91
26
0
22 Oct 2020
Detection of COVID-19 informative tweets using RoBERTa
Sirigireddy Dhanalaxmi
Rohit Agarwal
Aman Sinha
42
7
0
21 Oct 2020
Knowledge Distillation for Improved Accuracy in Spoken Question Answering
Chenyu You
Nuo Chen
Yuexian Zou
112
52
0
21 Oct 2020
Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering
Chenyu You
Nuo Chen
Yuexian Zou
111
38
0
21 Oct 2020
Complaint Identification in Social Media with Transformer Networks
Mali Jin
Nikolaos Aletras
44
16
0
21 Oct 2020
German's Next Language Model
Branden Chan
Stefan Schweter
Timo Möller
104
276
0
21 Oct 2020
Learning to Embed Categorical Features without Embedding Tables for Recommendation
Wang-Cheng Kang
D. Cheng
Tiansheng Yao
Xinyang Yi
Ting-Li Chen
Lichan Hong
Ed H. Chi
LMTD
CML
DML
110
72
0
21 Oct 2020
Self-supervised Graph Learning for Recommendation
Jiancan Wu
Xiang Wang
Fuli Feng
Xiangnan He
Liang Chen
Jianxun Lian
Xing Xie
SSL
202
1,182
0
21 Oct 2020
An Empirical Investigation of Contextualized Number Prediction
Daniel M. Spokoyny
Taylor Berg-Kirkpatrick
AI4TS
80
38
0
20 Oct 2020
Bayesian Attention Modules
Xinjie Fan
Shujian Zhang
Bo Chen
Mingyuan Zhou
183
62
0
20 Oct 2020
Optimal Subarchitecture Extraction For BERT
Adrian de Wynter
Daniel J. Perry
MQ
98
18
0
20 Oct 2020
BERT2DNN: BERT Distillation with Massive Unlabeled Data for Online E-Commerce Search
Yunjiang Jiang
Yue Shang
Ziyang Liu
Hongwei Shen
Yun Xiao
Wei Xiong
Sulong Xu
Weipeng P. Yan
Di Jin
62
17
0
20 Oct 2020
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters
Hicham El Boukkouri
Olivier Ferret
Thomas Lavergne
Hiroshi Noji
Pierre Zweigenbaum
Junichi Tsujii
164
161
0
20 Oct 2020
A Benchmark for Lease Contract Review
Spyretta Leivaditi
Julien Rossi
Evangelos Kanoulas
AILaw
175
37
0
20 Oct 2020
Bi-directional Cognitive Thinking Network for Machine Reading Comprehension
Wei Peng
Yue Hu
Luxi Xing
Yuqiang Xie
Jing Yu
Yajing Sun
Xiangpeng Wei
59
7
0
20 Oct 2020
CoRT: Complementary Rankings from Transformers
Marco Wrzalik
D. Krechel
44
10
0
20 Oct 2020
ChemBERTa: Large-Scale Self-Supervised Pretraining for Molecular Property Prediction
Seyone Chithrananda
Gabriel Grand
Bharath Ramsundar
AI4CE
128
415
0
19 Oct 2020
Technical Question Answering across Tasks and Domains
Wenhao Yu
Lingfei Wu
Yu Deng
Qingkai Zeng
R. Mahindru
S. Guven
Meng Jiang
60
8
0
19 Oct 2020
An Empirical Study for Vietnamese Constituency Parsing with Pre-training
Tuan-Vi Tran
Xuan-Thien Pham
Duc-Vu Nguyen
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
46
4
0
19 Oct 2020
Better Distractions: Transformer-based Distractor Generation and Multiple Choice Question Filtering
J. Offerijns
Suzan Verberne
Tessa Verhoef
59
26
0
19 Oct 2020
Knowledge-guided Open Attribute Value Extraction with Reinforcement Learning
Ye Liu
Sheng Zhang
Rui Song
Suo Feng
Yanghua Xiao
62
8
0
19 Oct 2020
Towards Interpreting BERT for Reading Comprehension Based QA
Sahana Ramnath
Preksha Nema
Deep Sahni
Mitesh M. Khapra
94
30
0
18 Oct 2020