ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSL
    AIMat
ArXivPDFHTML

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,913 papers shown
Title
Improving Sentence-Level Relation Extraction through Curriculum Learning
Improving Sentence-Level Relation Extraction through Curriculum Learning
Seongsik Park
Harksoo Kim
24
14
0
20 Jul 2021
Token-Level Supervised Contrastive Learning for Punctuation Restoration
Token-Level Supervised Contrastive Learning for Punctuation Restoration
Qiushi Huang
Tom Ko
Lilian H. Y. Tang
Xubo Liu
Boyong Wu
18
23
0
19 Jul 2021
Clinical Relation Extraction Using Transformer-based Models
Clinical Relation Extraction Using Transformer-based Models
Xi Yang
Zehao Yu
Yi Guo
Jiang Bian
Yonghui Wu
LM&MA
MedIm
32
20
0
19 Jul 2021
Pre-trained Language Models as Prior Knowledge for Playing Text-based
  Games
Pre-trained Language Models as Prior Knowledge for Playing Text-based Games
Ishika Singh
Gargi Singh
Ashutosh Modi
OffRL
AI4CE
32
29
0
18 Jul 2021
Generative Pretraining for Paraphrase Evaluation
Generative Pretraining for Paraphrase Evaluation
J. Weston
R. Lenain
U. Meepegama
E. Fristed
AIMat
27
10
0
17 Jul 2021
AutoBERT-Zero: Evolving BERT Backbone from Scratch
AutoBERT-Zero: Evolving BERT Backbone from Scratch
Jiahui Gao
Hang Xu
Han Shi
Xiaozhe Ren
Philip L. H. Yu
Xiaodan Liang
Xin Jiang
Zhenguo Li
21
37
0
15 Jul 2021
Automatic Task Requirements Writing Evaluation via Machine Reading
  Comprehension
Automatic Task Requirements Writing Evaluation via Machine Reading Comprehension
Shiting Xu
Guowei Xu
Peilei Jia
Wenbiao Ding
Zhongqin Wu
Zitao Liu
26
1
0
15 Jul 2021
The Benchmark Lottery
The Benchmark Lottery
Mostafa Dehghani
Yi Tay
A. Gritsenko
Zhe Zhao
N. Houlsby
Fernando Diaz
Donald Metzler
Oriol Vinyals
47
89
0
14 Jul 2021
BERT Fine-Tuning for Sentiment Analysis on Indonesian Mobile Apps
  Reviews
BERT Fine-Tuning for Sentiment Analysis on Indonesian Mobile Apps Reviews
Kuncahyo Setyo Nugroho
Anantha Yullian Sukmadewa
DW HaftittahWuswilahaken
F. A. Bachtiar
N. Yudistira
26
43
0
14 Jul 2021
Importance-based Neuron Allocation for Multilingual Neural Machine
  Translation
Importance-based Neuron Allocation for Multilingual Neural Machine Translation
Wanying Xie
Yang Feng
Shuhao Gu
Dong Yu
44
32
0
14 Jul 2021
Conformer-based End-to-end Speech Recognition With Rotary Position
  Embedding
Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
29
9
0
13 Jul 2021
Human Attention during Goal-directed Reading Comprehension Relies on
  Task Optimization
Human Attention during Goal-directed Reading Comprehension Relies on Task Optimization
Jiajie Zou
Yuran Zhang
Jialu Li
Xing Tian
Nai Ding
AIMat
40
2
0
13 Jul 2021
Combiner: Full Attention Transformer with Sparse Computation Cost
Combiner: Full Attention Transformer with Sparse Computation Cost
Hongyu Ren
H. Dai
Zihang Dai
Mengjiao Yang
J. Leskovec
Dale Schuurmans
Bo Dai
87
78
0
12 Jul 2021
Trustworthy AI: A Computational Perspective
Trustworthy AI: A Computational Perspective
Haochen Liu
Yiqi Wang
Wenqi Fan
Xiaorui Liu
Yaxin Li
Shaili Jain
Yunhao Liu
Anil K. Jain
Jiliang Tang
FaML
104
198
0
12 Jul 2021
The Brownian motion in the transformer model
The Brownian motion in the transformer model
Yingshi Chen
21
1
0
12 Jul 2021
LexSubCon: Integrating Knowledge from Lexical Resources into Contextual
  Embeddings for Lexical Substitution
LexSubCon: Integrating Knowledge from Lexical Resources into Contextual Embeddings for Lexical Substitution
George Michalopoulos
I. McKillop
Alexander Wong
Helen H. Chen
83
19
0
11 Jul 2021
Noise Stability Regularization for Improving BERT Fine-tuning
Noise Stability Regularization for Improving BERT Fine-tuning
Hang Hua
Xingjian Li
Dejing Dou
Chengzhong Xu
Jiebo Luo
19
44
0
10 Jul 2021
Transformer-Based Behavioral Representation Learning Enables Transfer
  Learning for Mobile Sensing in Small Datasets
Transformer-Based Behavioral Representation Learning Enables Transfer Learning for Mobile Sensing in Small Datasets
Michael Merrill
Tim Althoff
AI4TS
MU
MedIm
20
5
0
09 Jul 2021
An Initial Investigation of Non-Native Spoken Question-Answering
An Initial Investigation of Non-Native Spoken Question-Answering
V. Raina
Mark Gales
21
1
0
09 Jul 2021
Can Deep Neural Networks Predict Data Correlations from Column Names?
Can Deep Neural Networks Predict Data Correlations from Column Names?
Immanuel Trummer
22
8
0
09 Jul 2021
Benchmarking for Biomedical Natural Language Processing Tasks with a
  Domain Specific ALBERT
Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERT
Usman Naseem
A. Dunn
Matloob Khushi
Jinman Kim
OOD
LM&MA
AI4MH
29
43
0
09 Jul 2021
UniRE: A Unified Label Space for Entity Relation Extraction
UniRE: A Unified Label Space for Entity Relation Extraction
Yijun Wang
Changzhi Sun
Yuanbin Wu
Hao Zhou
Lei Li
Junchi Yan
22
113
0
09 Jul 2021
Joint Models for Answer Verification in Question Answering Systems
Joint Models for Answer Verification in Question Answering Systems
Zeyu Zhang
Thuy Vu
Alessandro Moschitti
14
24
0
09 Jul 2021
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge
  Transfer
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer
Zineng Tang
Jaemin Cho
Hao Tan
Joey Tianyi Zhou
VLM
38
29
0
06 Jul 2021
PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior
  for Joint Image-Text Modeling
PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling
Xiaoxue Zang
Lijuan Liu
Maria Wang
Yang Song
Hao Zhang
Jindong Chen
VLM
35
55
0
06 Jul 2021
Sarcasm Detection: A Comparative Study
Sarcasm Detection: A Comparative Study
Hamed Yaghoobian
H. Arabnia
Khaled Rasheed
31
22
0
05 Jul 2021
ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language
  Understanding and Generation
ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Yu Sun
Shuohuan Wang
Shikun Feng
Siyu Ding
Chao Pang
...
Ouyang Xuan
Dianhai Yu
Hao Tian
Hua Wu
Haifeng Wang
61
454
0
05 Jul 2021
Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal
  Reasoning Models
Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal Reasoning Models
Mingyue Han
Yinglin Wang
LRM
21
11
0
05 Jul 2021
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks
Jinghui Qin
Xiaodan Liang
Yining Hong
Jianheng Tang
Liang Lin
AIMat
AAML
29
57
0
03 Jul 2021
R2D2: Recursive Transformer based on Differentiable Tree for
  Interpretable Hierarchical Language Modeling
R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling
Xiang Hu
Haitao Mi
Zujie Wen
Yafang Wang
Yi Su
Jing Zheng
Gerard de Melo
17
22
0
02 Jul 2021
Learned Token Pruning for Transformers
Learned Token Pruning for Transformers
Sehoon Kim
Sheng Shen
D. Thorsley
A. Gholami
Woosuk Kwon
Joseph Hassoun
Kurt Keutzer
17
146
0
02 Jul 2021
AutoFormer: Searching Transformers for Visual Recognition
AutoFormer: Searching Transformers for Visual Recognition
Minghao Chen
Houwen Peng
Jianlong Fu
Haibin Ling
ViT
58
260
0
01 Jul 2021
Pretext Tasks selection for multitask self-supervised speech
  representation learning
Pretext Tasks selection for multitask self-supervised speech representation learning
Salah Zaiem
Titouan Parcollet
S. Essid
Abdel Heba
SSL
29
12
0
01 Jul 2021
Elbert: Fast Albert with Confidence-Window Based Early Exit
Elbert: Fast Albert with Confidence-Window Based Early Exit
Keli Xie
Siyuan Lu
Meiqi Wang
Zhongfeng Wang
22
20
0
01 Jul 2021
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin
  Information
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information
Zijun Sun
Xiaoya Li
Xiaofei Sun
Yuxian Meng
Xiang Ao
Qing He
Fei Wu
Jiwei Li
SSeg
59
184
0
30 Jun 2021
HySPA: Hybrid Span Generation for Scalable Text-to-Graph Extraction
HySPA: Hybrid Span Generation for Scalable Text-to-Graph Extraction
Liliang Ren
Chenkai Sun
Heng Ji
J. Hockenmaier
43
14
0
30 Jun 2021
New Arabic Medical Dataset for Diseases Classification
New Arabic Medical Dataset for Diseases Classification
Jaafar Hammoud
A. Vatian
N. Dobrenko
N. Vedernikov
A. Shalyto
N. Gusarova
OOD
19
6
0
29 Jun 2021
SCARF: Self-Supervised Contrastive Learning using Random Feature
  Corruption
SCARF: Self-Supervised Contrastive Learning using Random Feature Corruption
Dara Bahri
Heinrich Jiang
Yi Tay
Donald Metzler
SSL
31
164
0
29 Jun 2021
TWAG: A Topic-Guided Wikipedia Abstract Generator
TWAG: A Topic-Guided Wikipedia Abstract Generator
Fangwei Zhu
Shangqing Tu
Jiaxin Shi
Juan-Zi Li
Lei Hou
Tong Cui
11
11
0
29 Jun 2021
A Knowledge-Grounded Dialog System Based on Pre-Trained Language Models
A Knowledge-Grounded Dialog System Based on Pre-Trained Language Models
Weijie Zhang
Jiaoxuan Chen
Haipang Wu
Sanhui Wan
Gongfeng Li
30
4
0
28 Jun 2021
A Closer Look at How Fine-tuning Changes BERT
A Closer Look at How Fine-tuning Changes BERT
Yichu Zhou
Vivek Srikumar
31
64
0
27 Jun 2021
Improving Sequential Recommendation Consistency with Self-Supervised
  Imitation
Improving Sequential Recommendation Consistency with Self-Supervised Imitation
Xu Yuan
Hongshen Chen
Yonghao Song
Xiaofang Zhao
Zhuoye Ding
Zhen He
Bo Long
21
22
0
26 Jun 2021
Answering Chinese Elementary School Social Study Multiple Choice
  Questions
Answering Chinese Elementary School Social Study Multiple Choice Questions
Daniel Lee
Chao-Chun Liang
Keh-Yih Su
27
1
0
26 Jun 2021
Benchmarking Differential Privacy and Federated Learning for BERT Models
Benchmarking Differential Privacy and Federated Learning for BERT Models
Priya Basu
Tiasa Singha Roy
Rakshit Naidu
Zumrut Muftuoglu
Sahib Singh
Fatemehsadat Mireshghallah
FedML
AI4MH
24
50
0
26 Jun 2021
Knowledge-Grounded Self-Rationalization via Extractive and Natural
  Language Explanations
Knowledge-Grounded Self-Rationalization via Extractive and Natural Language Explanations
Bodhisattwa Prasad Majumder
Oana-Maria Camburu
Thomas Lukasiewicz
Julian McAuley
35
35
0
25 Jun 2021
Learning to Sample Replacements for ELECTRA Pre-Training
Learning to Sample Replacements for ELECTRA Pre-Training
Y. Hao
Li Dong
Hangbo Bao
Ke Xu
Furu Wei
MU
11
11
0
25 Jun 2021
aiSTROM -- A roadmap for developing a successful AI strategy
aiSTROM -- A roadmap for developing a successful AI strategy
Dorien Herremans
26
7
0
25 Jun 2021
ParaLaw Nets -- Cross-lingual Sentence-level Pretraining for Legal Text
  Processing
ParaLaw Nets -- Cross-lingual Sentence-level Pretraining for Legal Text Processing
Nguyen Ha Thanh
Vu Tran
Phuong Minh Nguyen
Thi-Hai-Yen Vuong
Quan Minh Bui
Chau Nguyen
Binh Dang
Minh Le Nguyen
Kenji Satoh
AILaw
29
10
0
25 Jun 2021
Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with
  Language Models
Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models
Robert L Logan IV
Ivana Balavzević
Eric Wallace
Fabio Petroni
Sameer Singh
Sebastian Riedel
VPVLM
39
209
0
24 Jun 2021
Physics perception in sloshing scenes with guaranteed thermodynamic
  consistency
Physics perception in sloshing scenes with guaranteed thermodynamic consistency
B. Moya
Alberto Badías
D. González
Francisco Chinesta
Elías Cueto
35
14
0
24 Jun 2021
Previous
123...394041...575859
Next