arXiv:1909.11942
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,913 papers shown
QUACKIE: A NLP Classification Task With Ground Truth Explanations
Yves Rychener
X. Renard
Djamé Seddah
P. Frossard
Marcin Detyniecki
32
3
0
24 Dec 2020
A Survey on Visual Transformer
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
23
2,135
0
23 Dec 2020
Multi-Head Self-Attention with Role-Guided Masks
Dongsheng Wang
Casper Hansen
Lucas Chaves Lima
Christian B. Hansen
Maria Maistro
J. Simonsen
Christina Lioma
26
1
0
22 Dec 2020
Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective
S. Kiritchenko
I. Nejadgholi
Kathleen C. Fraser
AILaw
28
86
0
22 Dec 2020
Undivided Attention: Are Intermediate Layers Necessary for BERT?
S. N. Sridhar
Anthony Sarah
30
14
0
22 Dec 2020
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning
Armen Aghajanyan
Luke Zettlemoyer
Sonal Gupta
22
538
1
22 Dec 2020
RealFormer: Transformer Likes Residual Attention
Ruining He
Anirudh Ravula
Bhargav Kanagal
Joshua Ainslie
27
108
0
21 Dec 2020
Sub-Linear Memory: How to Make Performers SLiM
Valerii Likhosherstov
K. Choromanski
Jared Davis
Xingyou Song
Adrian Weller
23
19
0
21 Dec 2020
A Graph Reasoning Network for Multi-turn Response Selection via Customized Pre-training
Yongkang Liu
Shi Feng
Daling Wang
Kaisong Song
Feiliang Ren
Yifei Zhang
LRM
21
21
0
21 Dec 2020
Adaptive Bi-directional Attention: Exploring Multi-Granularity Representations for Machine Reading Comprehension
Nuo Chen
Fenglin Liu
Chenyu You
Peilin Zhou
Yuexian Zou
9
31
0
20 Dec 2020
Exploring Fluent Query Reformulations with Text-to-Text Transformers and Reinforcement Learning
Jerry Zikun Chen
S. Yu
Haoran Wang
244
5
0
18 Dec 2020
BERT Goes Shopping: Comparing Distributional Models for Product Representations
Federico Bianchi
Bingqing Yu
Jacopo Tagliabue
33
15
0
17 Dec 2020
MASKER: Masked Keyword Regularization for Reliable Text Classification
S. Moon
Sangwoo Mo
Kimin Lee
Jaeho Lee
Jinwoo Shin
32
38
0
17 Dec 2020
Costs to Consider in Adopting NLP for Your Business
Made Nindyatama Nityasya
Haryo Akbarianto Wibowo
Radityo Eko Prasojo
Alham Fikri Aji
VLM
24
3
0
16 Dec 2020
A Lightweight Neural Model for Biomedical Entity Linking
Lihu Chen
Gaël Varoquaux
Fabian M. Suchanek
MedIm
14
32
0
16 Dec 2020
Pre-Training Transformers as Energy-Based Cloze Models
Kevin Clark
Minh-Thang Luong
Quoc V. Le
Christopher D. Manning
23
79
0
15 Dec 2020
StackRec: Efficient Training of Very Deep Sequential Recommender Models by Iterative Stacking
Jiachun Wang
Fajie Yuan
Jian Chen
Qingyao Wu
Min Yang
Yang Sun
Guoxiao Zhang
BDL
40
26
0
14 Dec 2020
Parameter-Efficient Transfer Learning with Diff Pruning
Demi Guo
Alexander M. Rush
Yoon Kim
20
386
0
14 Dec 2020
MiniVLM: A Smaller and Faster Vision-Language Model
Jianfeng Wang
Xiaowei Hu
Pengchuan Zhang
Xiujun Li
Lijuan Wang
Lefei Zhang
Jianfeng Gao
Zicheng Liu
VLM
MLLM
35
59
0
13 Dec 2020
Reinforced Multi-Teacher Selection for Knowledge Distillation
Fei Yuan
Linjun Shou
J. Pei
Wutao Lin
Ming Gong
Yan Fu
Daxin Jiang
15
121
0
11 Dec 2020
Improving Task-Agnostic BERT Distillation with Layer Mapping Search
Xiaoqi Jiao
Huating Chang
Yichun Yin
Lifeng Shang
Xin Jiang
Xiao Chen
Linlin Li
Fang Wang
Qun Liu
29
12
0
11 Dec 2020
GNN-XML: Graph Neural Networks for Extreme Multi-label Text Classification
Daoming Zong
Shiliang Sun
11
9
0
10 Dec 2020
Know Your Limits: Uncertainty Estimation with ReLU Classifiers Fails at Reliable OOD Detection
Dennis Ulmer
Giovanni Cina
OODD
40
31
0
09 Dec 2020
Label Confusion Learning to Enhance Text Classification Models
Biyang Guo
Songqiao Han
Xiao Han
Hailiang Huang
Ting Lu
63
68
0
09 Dec 2020
Fusing Context Into Knowledge Graph for Commonsense Question Answering
Yichong Xu
Chenguang Zhu
Ruochen Xu
Yang Liu
Michael Zeng
Xuedong Huang
14
69
0
09 Dec 2020
Unsupervised Label Refinement Improves Dataless Text Classification
Zewei Chu
K. Stratos
Kevin Gimpel
17
15
0
08 Dec 2020
Parameter Efficient Multimodal Transformers for Video Representation Learning
Sangho Lee
Youngjae Yu
Gunhee Kim
Thomas Breuel
Jan Kautz
Yale Song
ViT
29
76
0
08 Dec 2020
Semantics Altering Modifications for Evaluating Comprehension in Machine Reading
Viktor Schlegel
Goran Nenadic
Riza Batista-Navarro
27
18
0
07 Dec 2020
Detecting Insincere Questions from Text: A Transfer Learning Approach
Ashwin Rachha
Gaurav Vanmane
12
4
0
07 Dec 2020
Reference Knowledgeable Network for Machine Reading Comprehension
Yilin Zhao
Zhuosheng Zhang
Hai Zhao
23
5
0
07 Dec 2020
Self-supervised Deep Learning for Reading Activity Classification
M. Islam
Shuji Sakamoto
Yoshihiro Yamada
Andrew W. Vargo
M. Iwata
Masakazu Iwamura
K. Kise
13
2
0
07 Dec 2020
An Empirical Survey of Unsupervised Text Representation Methods on Twitter Data
Lili Wang
Chongyang Gao
Jason W. Wei
Weicheng Ma
Ruibo Liu
Soroush Vosoughi
16
15
0
07 Dec 2020
Pre-training Protein Language Models with Label-Agnostic Binding Pairs Enhances Performance in Downstream Tasks
Modestas Filipavicius
Matteo Manica
Joris Cadow
María Rodríguez Martínez
26
13
0
05 Dec 2020
Playing Text-Based Games with Common Sense
Sahith N. Dambekodi
Spencer Frazier
Prithviraj Ammanabrolu
Mark O. Riedl
LLMAG
11
25
0
04 Dec 2020
Pre-trained language models as knowledge bases for Automotive Complaint Analysis
V. D. Viellieber
Matthias Aßenmacher
8
2
0
04 Dec 2020
WeaQA: Weak Supervision via Captions for Visual Question Answering
Pratyay Banerjee
Tejas Gokhale
Yezhou Yang
Chitta Baral
25
35
0
04 Dec 2020
Federated Learning for Personalized Humor Recognition
Xu Guo
Han Yu
Boyang Albert Li
Hao Wang
Pengwei Xing
Siwei Feng
Zaiqing Nie
Chunyan Miao
FedML
14
13
0
03 Dec 2020
Circles are like Ellipses, or Ellipses are like Circles? Measuring the Degree of Asymmetry of Static and Contextual Embeddings and the Implications to Representation Learning
Wei Zhang
Murray Campbell
Yang Yu
Yara Rizk
24
0
0
03 Dec 2020
Empirical Study on the Software Engineering Practices in Open Source ML Package Repositories
Minke Xiu
Ellis E. Eghan
Zhen Ming
Z. Jiang
Bram Adams
19
0
0
02 Dec 2020
Classification of Multimodal Hate Speech -- The Winning Solution of Hateful Memes Challenge
Xiayu Zhong
28
15
0
02 Dec 2020
Interactive Teaching for Conversational AI
Q. Ping
Fei Niu
Govind Thattai
Joel Chengottusseriyil
Qiaozi Gao
Aishwarya N. Reganti
Prashanth Rajagopal
Gokhan Tur
Dilek Z. Hakkani-Tür
Prem Nataraja
14
6
0
02 Dec 2020
ReMP: Rectified Metric Propagation for Few-Shot Learning
Yang Zhao
Chunyuan Li
Ping Yu
Changyou Chen
35
6
0
02 Dec 2020
CLIMATE-FEVER: A Dataset for Verification of Real-World Climate Claims
Thomas Diggelmann
Jordan L. Boyd-Graber
Jannis Bulian
Massimiliano Ciaramita
Markus Leippold
28
192
0
01 Dec 2020
CPM: A Large-scale Generative Chinese Pre-trained Language Model
Zhengyan Zhang
Xu Han
Hao Zhou
Pei Ke
Yuxian Gu
...
Wentao Han
Jie Tang
Juan-Zi Li
Xiaoyan Zhu
Maosong Sun
26
113
0
01 Dec 2020
Modifying Memories in Transformer Models
Chen Zhu
A. S. Rawat
Manzil Zaheer
Srinadh Bhojanapalli
Daliang Li
Felix X. Yu
Sanjiv Kumar
KELM
32
192
0
01 Dec 2020
Intrinsic Knowledge Evaluation on Chinese Language Models
Zhiruo Wang
Renfen Hu
KELM
ELM
22
1
0
29 Nov 2020
EdgeBERT: Sentence-Level Energy Optimizations for Latency-Aware Multi-Task NLP Inference
Thierry Tambe
Coleman Hooper
Lillian Pentecost
Tianyu Jia
En-Yu Yang
...
Victor Sanh
P. Whatmough
Alexander M. Rush
David Brooks
Gu-Yeon Wei
20
117
0
28 Nov 2020
Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training Speedup
Cheng Yang
Shengnan Wang
Chao Yang
Yuechuan Li
Ru He
Jingqiao Zhang
32
25
0
27 Nov 2020
CoRe: An Efficient Coarse-refined Training Framework for BERT
Cheng Yang
Shengnan Wang
Yuechuan Li
Chao Yang
Ming Yan
Jingqiao Zhang
Fangquan Lin
22
0
0
27 Nov 2020
Two Stage Transformer Model for COVID-19 Fake News Detection and Fact Checking
Rutvik Vijjali
Prathyush Potluri
S. Kumar
Sundeep Teki
MedIm
31
74
0
26 Nov 2020