1909.11942
Cited By
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Github (3271★)
Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,935 papers shown
Title
LadRa-Net: Locally-Aware Dynamic Re-read Attention Net for Sentence Semantic Matching
Kun Zhang
Guangyi Lv
Le Wu
Enhong Chen
Qi Liu
Meng Wang
61
6
0
06 Aug 2021
Robust Transfer Learning with Pretrained Language Models through Adapters
Wenjuan Han
Bo Pang
Ying Nian Wu
69
56
0
05 Aug 2021
Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification
Shengding Hu
Ning Ding
Huadong Wang
Zhiyuan Liu
Jingang Wang
Juan-Zi Li
Wei Wu
Maosong Sun
VLM
106
373
0
04 Aug 2021
How to Query Language Models?
Leonard Adolphs
Shehzaad Dhuliawala
Thomas Hofmann
KELM
86
15
0
04 Aug 2021
Your fairness may vary: Pretrained language model fairness in toxic text classification
Ioana Baldini
Dennis L. Wei
Karthikeyan N. Ramamurthy
Mikhail Yurochkin
Moninder Singh
93
54
0
03 Aug 2021
LICHEE: Improving Language Model Pre-training with Multi-grained Tokenization
Weidong Guo
Mingjun Zhao
Lusheng Zhang
Di Niu
Jinwen Luo
Zhenhua Liu
Zhenyang Li
J. Tang
55
8
0
02 Aug 2021
From LSAT: The Progress and Challenges of Complex Reasoning
Siyuan Wang
Zhongkun Liu
Wanjun Zhong
Ming Zhou
Zhongyu Wei
Zhumin Chen
Nan Duan
ELM
85
46
0
02 Aug 2021
Transformer-Encoder-GRU (T-E-GRU) for Chinese Sentiment Analysis on Chinese Comment Text
Binlong Zhang
Wei Zhou
42
17
0
01 Aug 2021
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Xianrui Zheng
Chao Zhang
P. Woodland
34
49
0
29 Jul 2021
UIBert: Learning Generic Multimodal Representations for UI Understanding
Chongyang Bai
Xiaoxue Zang
Ying Xu
Srinivas Sunkara
Abhinav Rastogi
Jindong Chen
Blaise Agüera y Arcas
90
95
0
29 Jul 2021
AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models
Yichun Yin
Cheng Chen
Lifeng Shang
Xin Jiang
Xiao Chen
Qun Liu
VLM
69
50
0
29 Jul 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLM
SyDa
362
4,050
0
28 Jul 2021
Exceeding the Limits of Visual-Linguistic Multi-Task Learning
Cameron R. Wolfe
Keld T. Lundgaard
VLM
76
2
0
27 Jul 2021
gaBERT -- an Irish Language Model
James Barry
Joachim Wagner
Lauren Cassidy
Alan Cowap
Teresa Lynn
Abigail Walsh
Mícheál J. Ó Meachair
Jennifer Foster
65
18
0
27 Jul 2021
Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking
Jinyu Guo
Kai Shuang
Jijie Li
Zihan Wang
75
18
0
27 Jul 2021
Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction
Lu Xu
Yew Ken Chia
Lidong Bing
158
188
0
26 Jul 2021
Fine-Grained Emotion Prediction by Modeling Emotion Definitions
Gargi Singh
Dhanajit Brahma
Piyush Rai
Ashutosh Modi
62
10
0
26 Jul 2021
ICDAR 2021 Competition on Scene Video Text Spotting
Zhanzhan Cheng
Jing Lu
Baorui Zou
Shuigeng Zhou
Leilei Gan
22
4
0
26 Jul 2021
Graph-free Multi-hop Reading Comprehension: A Select-to-Guide Strategy
Bohong Wu
Zhuosheng Zhang
Hai Zhao
72
21
0
25 Jul 2021
Go Wider Instead of Deeper
Fuzhao Xue
Ziji Shi
Futao Wei
Yuxuan Lou
Yong Liu
Yang You
ViT
MoE
95
84
0
25 Jul 2021
Query2Label: A Simple Transformer Way to Multi-Label Classification
Shilong Liu
Lei Zhang
Xiao Yang
Hang Su
Jun Zhu
73
193
0
22 Jul 2021
Back-Translated Task Adaptive Pretraining: Improving Accuracy and Robustness on Text Classification
Junghoon Lee
Jounghee Kim
Pilsung Kang
VLM
72
5
0
22 Jul 2021
Multi-stage Pre-training over Simplified Multimodal Pre-training Models
Tongtong Liu
Fangxiang Feng
Xiaojie Wang
38
13
0
22 Jul 2021
CausalBERT: Injecting Causal Knowledge Into Pre-trained Models with Minimal Supervision
Zhongyang Li
Xiao Ding
Kuo Liao
Bing Qin
Ting Liu
CML
107
19
0
21 Jul 2021
Improving Sentence-Level Relation Extraction through Curriculum Learning
Seongsik Park
Harksoo Kim
60
14
0
20 Jul 2021
Token-Level Supervised Contrastive Learning for Punctuation Restoration
Qiushi Huang
Tom Ko
Lilian H. Y. Tang
Xubo Liu
Boyong Wu
65
23
0
19 Jul 2021
Clinical Relation Extraction Using Transformer-based Models
Xi Yang
Zehao Yu
Yi Guo
Jiang Bian
Yonghui Wu
LM&MA
MedIm
65
20
0
19 Jul 2021
Pre-trained Language Models as Prior Knowledge for Playing Text-based Games
Ishika Singh
Gargi Singh
Ashutosh Modi
OffRL
AI4CE
103
29
0
18 Jul 2021
Generative Pretraining for Paraphrase Evaluation
J. Weston
R. Lenain
U. Meepegama
E. Fristed
AIMat
59
10
0
17 Jul 2021
AutoBERT-Zero: Evolving BERT Backbone from Scratch
Jiahui Gao
Hang Xu
Han Shi
Xiaozhe Ren
Philip L. H. Yu
Xiaodan Liang
Xin Jiang
Zhenguo Li
85
37
0
15 Jul 2021
Automatic Task Requirements Writing Evaluation via Machine Reading Comprehension
Shiting Xu
Guowei Xu
Peilei Jia
Wenbiao Ding
Zhongqin Wu
Zitao Liu
28
1
0
15 Jul 2021
The Benchmark Lottery
Mostafa Dehghani
Yi Tay
A. Gritsenko
Zhe Zhao
N. Houlsby
Fernando Diaz
Donald Metzler
Oriol Vinyals
117
92
0
14 Jul 2021
BERT Fine-Tuning for Sentiment Analysis on Indonesian Mobile Apps Reviews
Kuncahyo Setyo Nugroho
Anantha Yullian Sukmadewa
DW Haftittah Wuswilahaken
F. A. Bachtiar
N. Yudistira
33
45
0
14 Jul 2021
Importance-based Neuron Allocation for Multilingual Neural Machine Translation
Wanying Xie
Yang Feng
Shuhao Gu
Dong Yu
111
34
0
14 Jul 2021
Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
82
9
0
13 Jul 2021
Human Attention during Goal-directed Reading Comprehension Relies on Task Optimization
Jiajie Zou
Yuran Zhang
Jialu Li
Xing Tian
Nai Ding
AIMat
92
2
0
13 Jul 2021
Combiner: Full Attention Transformer with Sparse Computation Cost
Hongyu Ren
H. Dai
Zihang Dai
Mengjiao Yang
J. Leskovec
Dale Schuurmans
Bo Dai
166
80
0
12 Jul 2021
Trustworthy AI: A Computational Perspective
Haochen Liu
Yiqi Wang
Wenqi Fan
Xiaorui Liu
Yaxin Li
Shaili Jain
Yunhao Liu
Anil K. Jain
Jiliang Tang
FaML
194
213
0
12 Jul 2021
The Brownian motion in the transformer model
Yingshi Chen
116
1
0
12 Jul 2021
LexSubCon: Integrating Knowledge from Lexical Resources into Contextual Embeddings for Lexical Substitution
George Michalopoulos
I. McKillop
Alexander Wong
Helen H. Chen
120
19
0
11 Jul 2021
Noise Stability Regularization for Improving BERT Fine-tuning
Hang Hua
Xingjian Li
Dejing Dou
Chengzhong Xu
Jiebo Luo
79
45
0
10 Jul 2021
Transformer-Based Behavioral Representation Learning Enables Transfer Learning for Mobile Sensing in Small Datasets
Michael Merrill
Tim Althoff
AI4TS
MU
MedIm
50
5
0
09 Jul 2021
An Initial Investigation of Non-Native Spoken Question-Answering
V. Raina
Mark Gales
66
1
0
09 Jul 2021
Can Deep Neural Networks Predict Data Correlations from Column Names?
Immanuel Trummer
78
8
0
09 Jul 2021
Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERT
Usman Naseem
A. Dunn
Matloob Khushi
Jinman Kim
OOD
LM&MA
AI4MH
86
43
0
09 Jul 2021
UniRE: A Unified Label Space for Entity Relation Extraction
Yijun Wang
Changzhi Sun
Yuanbin Wu
Hao Zhou
Lei Li
Junchi Yan
82
116
0
09 Jul 2021
Joint Models for Answer Verification in Question Answering Systems
Zeyu Zhang
Thuy Vu
Alessandro Moschitti
53
24
0
09 Jul 2021
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer
Zineng Tang
Jaemin Cho
Hao Tan
Joey Tianyi Zhou
VLM
59
29
0
06 Jul 2021
PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling
Xiaoxue Zang
Lijuan Liu
Maria Wang
Yang Song
Hao Zhang
Jindong Chen
VLM
101
60
0
06 Jul 2021
Sarcasm Detection: A Comparative Study
Hamed Yaghoobian
H. Arabnia
Khaled Rasheed
57
23
0
05 Jul 2021