Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.11942
Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,913 papers shown
Title
Improving Sentence-Level Relation Extraction through Curriculum Learning
Seongsik Park
Harksoo Kim
24
14
0
20 Jul 2021
Token-Level Supervised Contrastive Learning for Punctuation Restoration
Qiushi Huang
Tom Ko
Lilian H. Y. Tang
Xubo Liu
Boyong Wu
18
23
0
19 Jul 2021
Clinical Relation Extraction Using Transformer-based Models
Xi Yang
Zehao Yu
Yi Guo
Jiang Bian
Yonghui Wu
LM&MA
MedIm
32
20
0
19 Jul 2021
Pre-trained Language Models as Prior Knowledge for Playing Text-based Games
Ishika Singh
Gargi Singh
Ashutosh Modi
OffRL
AI4CE
32
29
0
18 Jul 2021
Generative Pretraining for Paraphrase Evaluation
J. Weston
R. Lenain
U. Meepegama
E. Fristed
AIMat
27
10
0
17 Jul 2021
AutoBERT-Zero: Evolving BERT Backbone from Scratch
Jiahui Gao
Hang Xu
Han Shi
Xiaozhe Ren
Philip L. H. Yu
Xiaodan Liang
Xin Jiang
Zhenguo Li
21
37
0
15 Jul 2021
Automatic Task Requirements Writing Evaluation via Machine Reading Comprehension
Shiting Xu
Guowei Xu
Peilei Jia
Wenbiao Ding
Zhongqin Wu
Zitao Liu
26
1
0
15 Jul 2021
The Benchmark Lottery
Mostafa Dehghani
Yi Tay
A. Gritsenko
Zhe Zhao
N. Houlsby
Fernando Diaz
Donald Metzler
Oriol Vinyals
47
89
0
14 Jul 2021
BERT Fine-Tuning for Sentiment Analysis on Indonesian Mobile Apps Reviews
Kuncahyo Setyo Nugroho
Anantha Yullian Sukmadewa
DW HaftittahWuswilahaken
F. A. Bachtiar
N. Yudistira
26
43
0
14 Jul 2021
Importance-based Neuron Allocation for Multilingual Neural Machine Translation
Wanying Xie
Yang Feng
Shuhao Gu
Dong Yu
44
32
0
14 Jul 2021
Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
29
9
0
13 Jul 2021
Human Attention during Goal-directed Reading Comprehension Relies on Task Optimization
Jiajie Zou
Yuran Zhang
Jialu Li
Xing Tian
Nai Ding
AIMat
40
2
0
13 Jul 2021
Combiner: Full Attention Transformer with Sparse Computation Cost
Hongyu Ren
H. Dai
Zihang Dai
Mengjiao Yang
J. Leskovec
Dale Schuurmans
Bo Dai
87
78
0
12 Jul 2021
Trustworthy AI: A Computational Perspective
Haochen Liu
Yiqi Wang
Wenqi Fan
Xiaorui Liu
Yaxin Li
Shaili Jain
Yunhao Liu
Anil K. Jain
Jiliang Tang
FaML
104
198
0
12 Jul 2021
The Brownian motion in the transformer model
Yingshi Chen
21
1
0
12 Jul 2021
LexSubCon: Integrating Knowledge from Lexical Resources into Contextual Embeddings for Lexical Substitution
George Michalopoulos
I. McKillop
Alexander Wong
Helen H. Chen
83
19
0
11 Jul 2021
Noise Stability Regularization for Improving BERT Fine-tuning
Hang Hua
Xingjian Li
Dejing Dou
Chengzhong Xu
Jiebo Luo
19
44
0
10 Jul 2021
Transformer-Based Behavioral Representation Learning Enables Transfer Learning for Mobile Sensing in Small Datasets
Michael Merrill
Tim Althoff
AI4TS
MU
MedIm
20
5
0
09 Jul 2021
An Initial Investigation of Non-Native Spoken Question-Answering
V. Raina
Mark Gales
21
1
0
09 Jul 2021
Can Deep Neural Networks Predict Data Correlations from Column Names?
Immanuel Trummer
22
8
0
09 Jul 2021
Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERT
Usman Naseem
A. Dunn
Matloob Khushi
Jinman Kim
OOD
LM&MA
AI4MH
29
43
0
09 Jul 2021
UniRE: A Unified Label Space for Entity Relation Extraction
Yijun Wang
Changzhi Sun
Yuanbin Wu
Hao Zhou
Lei Li
Junchi Yan
22
113
0
09 Jul 2021
Joint Models for Answer Verification in Question Answering Systems
Zeyu Zhang
Thuy Vu
Alessandro Moschitti
14
24
0
09 Jul 2021
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer
Zineng Tang
Jaemin Cho
Hao Tan
Joey Tianyi Zhou
VLM
38
29
0
06 Jul 2021
PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling
Xiaoxue Zang
Lijuan Liu
Maria Wang
Yang Song
Hao Zhang
Jindong Chen
VLM
35
55
0
06 Jul 2021
Sarcasm Detection: A Comparative Study
Hamed Yaghoobian
H. Arabnia
Khaled Rasheed
31
22
0
05 Jul 2021
ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Yu Sun
Shuohuan Wang
Shikun Feng
Siyu Ding
Chao Pang
...
Ouyang Xuan
Dianhai Yu
Hao Tian
Hua Wu
Haifeng Wang
61
454
0
05 Jul 2021
Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal Reasoning Models
Mingyue Han
Yinglin Wang
LRM
21
11
0
05 Jul 2021
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks
Jinghui Qin
Xiaodan Liang
Yining Hong
Jianheng Tang
Liang Lin
AIMat
AAML
29
57
0
03 Jul 2021
R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling
Xiang Hu
Haitao Mi
Zujie Wen
Yafang Wang
Yi Su
Jing Zheng
Gerard de Melo
17
22
0
02 Jul 2021
Learned Token Pruning for Transformers
Sehoon Kim
Sheng Shen
D. Thorsley
A. Gholami
Woosuk Kwon
Joseph Hassoun
Kurt Keutzer
17
146
0
02 Jul 2021
AutoFormer: Searching Transformers for Visual Recognition
Minghao Chen
Houwen Peng
Jianlong Fu
Haibin Ling
ViT
58
260
0
01 Jul 2021
Pretext Tasks selection for multitask self-supervised speech representation learning
Salah Zaiem
Titouan Parcollet
S. Essid
Abdel Heba
SSL
29
12
0
01 Jul 2021
Elbert: Fast Albert with Confidence-Window Based Early Exit
Keli Xie
Siyuan Lu
Meiqi Wang
Zhongfeng Wang
22
20
0
01 Jul 2021
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information
Zijun Sun
Xiaoya Li
Xiaofei Sun
Yuxian Meng
Xiang Ao
Qing He
Fei Wu
Jiwei Li
SSeg
59
184
0
30 Jun 2021
HySPA: Hybrid Span Generation for Scalable Text-to-Graph Extraction
Liliang Ren
Chenkai Sun
Heng Ji
J. Hockenmaier
43
14
0
30 Jun 2021
New Arabic Medical Dataset for Diseases Classification
Jaafar Hammoud
A. Vatian
N. Dobrenko
N. Vedernikov
A. Shalyto
N. Gusarova
OOD
19
6
0
29 Jun 2021
SCARF: Self-Supervised Contrastive Learning using Random Feature Corruption
Dara Bahri
Heinrich Jiang
Yi Tay
Donald Metzler
SSL
31
164
0
29 Jun 2021
TWAG: A Topic-Guided Wikipedia Abstract Generator
Fangwei Zhu
Shangqing Tu
Jiaxin Shi
Juan-Zi Li
Lei Hou
Tong Cui
11
11
0
29 Jun 2021
A Knowledge-Grounded Dialog System Based on Pre-Trained Language Models
Weijie Zhang
Jiaoxuan Chen
Haipang Wu
Sanhui Wan
Gongfeng Li
30
4
0
28 Jun 2021
A Closer Look at How Fine-tuning Changes BERT
Yichu Zhou
Vivek Srikumar
31
64
0
27 Jun 2021
Improving Sequential Recommendation Consistency with Self-Supervised Imitation
Xu Yuan
Hongshen Chen
Yonghao Song
Xiaofang Zhao
Zhuoye Ding
Zhen He
Bo Long
21
22
0
26 Jun 2021
Answering Chinese Elementary School Social Study Multiple Choice Questions
Daniel Lee
Chao-Chun Liang
Keh-Yih Su
27
1
0
26 Jun 2021
Benchmarking Differential Privacy and Federated Learning for BERT Models
Priya Basu
Tiasa Singha Roy
Rakshit Naidu
Zumrut Muftuoglu
Sahib Singh
Fatemehsadat Mireshghallah
FedML
AI4MH
24
50
0
26 Jun 2021
Knowledge-Grounded Self-Rationalization via Extractive and Natural Language Explanations
Bodhisattwa Prasad Majumder
Oana-Maria Camburu
Thomas Lukasiewicz
Julian McAuley
35
35
0
25 Jun 2021
Learning to Sample Replacements for ELECTRA Pre-Training
Y. Hao
Li Dong
Hangbo Bao
Ke Xu
Furu Wei
MU
11
11
0
25 Jun 2021
aiSTROM -- A roadmap for developing a successful AI strategy
Dorien Herremans
26
7
0
25 Jun 2021
ParaLaw Nets -- Cross-lingual Sentence-level Pretraining for Legal Text Processing
Nguyen Ha Thanh
Vu Tran
Phuong Minh Nguyen
Thi-Hai-Yen Vuong
Quan Minh Bui
Chau Nguyen
Binh Dang
Minh Le Nguyen
Kenji Satoh
AILaw
29
10
0
25 Jun 2021
Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models
Robert L Logan IV
Ivana Balavzević
Eric Wallace
Fabio Petroni
Sameer Singh
Sebastian Riedel
VPVLM
39
209
0
24 Jun 2021
Physics perception in sloshing scenes with guaranteed thermodynamic consistency
B. Moya
Alberto Badías
D. González
Francisco Chinesta
Elías Cueto
35
14
0
24 Jun 2021
Previous
1
2
3
...
39
40
41
...
57
58
59
Next