1909.11942
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,935 papers shown
NumGPT: Improving Numeracy Ability of Generative Pre-trained Models
Zhihua Jin
Xin Jiang
Xingbo Wang
Qun Liu
Yong Wang
Xiaozhe Ren
Huamin Qu
77
19
0
07 Sep 2021
IndicBART: A Pre-trained Model for Indic Natural Language Generation
Raj Dabre
Himani Shrotriya
Anoop Kunchukuttan
Ratish Puduppully
Mitesh M. Khapra
Pratyush Kumar
129
74
0
07 Sep 2021
Sent2Span: Span Detection for PICO Extraction in the Biomedical Text without Span Annotations
Shifeng Liu
Yifang Sun
Bing Li
Wei Wang
Florence T. Bourgeois
A. Dunn
52
14
0
06 Sep 2021
STaCK: Sentence Ordering with Temporal Commonsense Knowledge
Deepanway Ghosal
Navonil Majumder
Rada Mihalcea
Soujanya Poria
121
11
0
06 Sep 2021
Re-entry Prediction for Online Conversations via Self-Supervised Learning
Lingzhi Wang
Xingshan Zeng
Huang Hu
Kam-Fai Wong
Daxin Jiang
68
6
0
05 Sep 2021
FewshotQA: A simple framework for few-shot learning of question answering tasks using pre-trained text-to-text models
Rakesh Chada
P. Natarajan
92
46
0
04 Sep 2021
Frustratingly Simple Pretraining Alternatives to Masked Language Modeling
Atsuki Yamaguchi
G. Chrysostomou
Katerina Margatina
Nikolaos Aletras
69
25
0
04 Sep 2021
Do Prompt-Based Models Really Understand the Meaning of their Prompts?
Albert Webson
Ellie Pavlick
LRM
136
374
0
02 Sep 2021
So Cloze yet so Far: N400 Amplitude is Better Predicted by Distributional Information than Human Predictability Judgements
J. Michaelov
S. Coulson
Benjamin Bergen
73
44
0
02 Sep 2021
Fight Fire with Fire: Fine-tuning Hate Detectors using Large Samples of Generated Hate Speech
Tomer Wullach
A. Adler
Einat Minkov
73
41
0
01 Sep 2021
Does Knowledge Help General NLU? An Empirical Study
Ruochen Xu
Yuwei Fang
Chenguang Zhu
Michael Zeng
ELM
70
9
0
01 Sep 2021
What Have Been Learned & What Should Be Learned? An Empirical Study of How to Selectively Augment Text for Classification
Biyang Guo
S. Han
Hailiang Huang
39
5
0
01 Sep 2021
It's not Rocket Science : Interpreting Figurative Language in Narratives
Tuhin Chakrabarty
Yejin Choi
Vered Shwartz
97
58
0
31 Aug 2021
Effectiveness of Deep Networks in NLP using BiDAF as an example architecture
Soumyendu Sarkar
41
2
0
31 Aug 2021
Thermostat: A Large Collection of NLP Model Explanations and Analysis Tools
Nils Feldhus
Robert Schwarzenberg
Sebastian Möller
123
14
0
31 Aug 2021
DNNFusion: Accelerating Deep Neural Networks Execution with Advanced Operator Fusion
Wei Niu
Jiexiong Guan
Yanzhi Wang
G. Agrawal
Bin Ren
AI4CE
76
153
0
30 Aug 2021
ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language Understanding
Lingyun Feng
Jianwei Yu
Deng Cai
Songxiang Liu
Haitao Zheng
Yan Wang
ELM
179
14
0
30 Aug 2021
Shatter: An Efficient Transformer Encoder with Single-Headed Self-Attention and Relative Sequence Partitioning
Ran Tian
Joshua Maynez
Ankur P. Parikh
ViT
56
2
0
30 Aug 2021
Generating Answer Candidates for Quizzes and Answer-Aware Question Generators
Kristiyan Vachev
Momchil Hardalov
Georgi Karadzhov
Georgi Georgiev
Ivan Koychev
Preslav Nakov
AI4Ed
46
5
0
29 Aug 2021
Span Fine-tuning for Pre-trained Language Models
Rongzhou Bao
Zhuosheng Zhang
Hai Zhao
50
2
0
29 Aug 2021
Analyzing and Mitigating Interference in Neural Architecture Search
Jin Xu
Xu Tan
Kaitao Song
Renqian Luo
Yichong Leng
Tao Qin
Tie-Yan Liu
Jian Li
MoMe
91
29
0
29 Aug 2021
Transfer Learning for Multi-lingual Tasks -- a Survey
A. Jafari
Behnam Heidary
R. Farahbakhsh
Mostafa Salehi
Mahdi Jalili
LRM
51
5
0
28 Aug 2021
DKM: Differentiable K-Means Clustering Layer for Neural Network Compression
Minsik Cho
Keivan Alizadeh Vahid
Saurabh N. Adya
Mohammad Rastegari
95
34
0
28 Aug 2021
Prototype-Guided Memory Replay for Continual Learning
Stella Ho
Ming Liu
Lan Du
Longxiang Gao
Yong Xiang
CLL
67
32
0
28 Aug 2021
Layer-wise Model Pruning based on Mutual Information
Chun Fan
Jiwei Li
Xiang Ao
Leilei Gan
Yuxian Meng
Xiaofei Sun
84
19
0
28 Aug 2021
Code-switched inspired losses for generic spoken dialog representations
E. Chapuis
Pierre Colombo
Matthieu Labeau
Chloe Clave
177
12
0
27 Aug 2021
A Partition Filter Network for Joint Entity and Relation Extraction
Zhiheng Yan
Chong Zhang
Jinlan Fu
Qi Zhang
Zhongyu Wei
120
140
0
27 Aug 2021
Query-Focused Extractive Summarisation for Finding Ideal Answers to Biomedical and COVID-19 Questions
Diego Mollá Aliod
Urvashi Khanna
Dima Galat
Vincent Nguyen
Maciej Rybiński
RALM
71
2
0
27 Aug 2021
Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models
Yiming Cui
Weinan Zhang
Wanxiang Che
Ting Liu
Zhigang Chen
Shijin Wang
LRM
47
9
0
26 Aug 2021
Exploring the Promises of Transformer-Based LMs for the Representation of Normative Claims in the Legal Domain
Reto Gubelmann
Peter Hongler
Siegfried Handschuh
AILaw
21
0
0
25 Aug 2021
YANMTT: Yet Another Neural Machine Translation Toolkit
Raj Dabre
Eiichiro Sumita
72
13
0
25 Aug 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation
Samuel Cahyawijaya
103
12
0
24 Aug 2021
Are the Multilingual Models Better? Improving Czech Sentiment with Transformers
Pavel Přibáň
J. Steinberger
70
11
0
24 Aug 2021
Recurrent multiple shared layers in Depth for Neural Machine Translation
Guoliang Li
Yiyang Li
MoE
48
1
0
23 Aug 2021
APObind: A Dataset of Ligand Unbound Protein Conformations for Machine Learning Applications in De Novo Drug Design
Rishal Aggarwal
Akash Gupta
U. Priyakumar
42
11
0
23 Aug 2021
Fastformer: Additive Attention Can Be All You Need
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
Xing Xie
91
121
0
20 Aug 2021
SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining
Taolin Zhang
Zerui Cai
Chengyu Wang
Minghui Qiu
Bite Yang
Xiaofeng He
AI4MH
73
54
0
20 Aug 2021
Detection of Illicit Drug Trafficking Events on Instagram: A Deep Multimodal Multilabel Learning Approach
Chuanbo Hu
Minglei Yin
Bin Liu
Xin Li
Yanfang Ye
43
15
0
19 Aug 2021
TSI: an Ad Text Strength Indicator using Text-to-CTR and Semantic-Ad-Similarity
Shaunak Mishra
Changwei Hu
Manisha Verma
Kevin Yen
Yifan Hu
M. Sviridenko
44
8
0
18 Aug 2021
EviDR: Evidence-Emphasized Discrete Reasoning for Reasoning Machine Reading Comprehension
Yongwei Zhou
Junwei Bao
Haipeng Sun
Jiahui Liang
Youzheng Wu
Xiaodong He
Bowen Zhou
Tiejun Zhao
29
5
0
18 Aug 2021
RRLFSOR: An Efficient Self-Supervised Learning Strategy of Graph Convolutional Networks
Feng Sun
Ajith Kumar
Guanci Yang
Qikui Zhu
Yiyun Zhang
Ansi Zhang
Dhruv Makwana
SSL
GNN
114
0
0
17 Aug 2021
Exploring Generalization Ability of Pretrained Language Models on Arithmetic and Logical Reasoning
Cunxiang Wang
Boyuan Zheng
Y. Niu
Yue Zhang
LRM
78
23
0
15 Aug 2021
Contrastive Self-supervised Sequential Recommendation with Robust Augmentation
Zhiwei Liu
Yong-Guang Chen
Jia Li
Philip S. Yu
Julian McAuley
Caiming Xiong
72
171
0
14 Aug 2021
FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning
Jing Zhou
Yanan Zheng
Jie Tang
Jian Li
Zhilin Yang
VLM
89
80
0
13 Aug 2021
Low-Resource Adaptation of Open-Domain Generative Chatbots
Greyson Gerhard-Young
R. Anantha
Srinivas Chappidi
Björn Hoffmeister
80
3
0
13 Aug 2021
AMMUS : A Survey of Transformer-based Pretrained Models in Natural Language Processing
Katikapalli Subramanyam Kalyan
A. Rajasekharan
S. Sangeetha
VLM
LM&MA
111
270
0
12 Aug 2021
Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval
Luyu Gao
Jamie Callan
RALM
298
342
0
12 Aug 2021
SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code Representation
Xin Wang
Yasheng Wang
Fei Mi
Pingyi Zhou
Yao Wan
Xiao Liu
Li Li
Hao Wu
Jin Liu
Xin Jiang
142
118
0
10 Aug 2021
Making Transformers Solve Compositional Tasks
Santiago Ontañón
Joshua Ainslie
Vaclav Cvicek
Zachary Kenneth Fisher
109
74
0
09 Aug 2021
Unifying Heterogeneous Electronic Health Records Systems via Text-Based Code Embedding
Kyunghoon Hur
Jiyoung Lee
Jungwoo Oh
Wesley Price
Young-Hak Kim
Edward Choi
103
19
0
08 Aug 2021