ResearchTrend.AI
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL, AIMat
ArXiv (abs) · PDF · HTML · GitHub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Decoupling anomaly discrimination and representation learning: self-supervised learning for anomaly detection on attributed graph
Yanming Hu
Chuan Chen
Bowen Deng
Yujing Lai
Hao Lin
Zibin Zheng
Jing Bian
85
3
0
11 Apr 2023
Sim-T: Simplify the Transformer Network by Multiplexing Technique for Speech Recognition
Guangyong Wei
Zhikui Duan
Shiren Li
Guangguang Yang
Xinmei Yu
Junhua Li
63
5
0
11 Apr 2023
UATTA-EB: Uncertainty-Aware Test-Time Augmented Ensemble of BERTs for Classifying Common Mental Illnesses on Social Media Posts
Pratinav Seth
Mihir Agarwal
AI4MH
60
1
0
10 Apr 2023
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder
Z. Fu
W. Lam
Qian Yu
Anthony Man-Cho So
Shengding Hu
Zhiyuan Liu
Nigel Collier
AuLLM
69
44
0
08 Apr 2023
SSS at SemEval-2023 Task 10: Explainable Detection of Online Sexism using Majority Voted Fine-Tuned Transformers
Sriya Rallabandi
Sanchit Singhal
Pratinav Seth
21
3
0
07 Apr 2023
Bridging the Language Gap: Knowledge Injected Multilingual Question Answering
Zhichao Duan
Xiuxing Li
Zhengyan Zhang
Zhenyu Li
Ning Liu
Jianyong Wang
65
8
0
06 Apr 2023
Bengali Fake Review Detection using Semi-supervised Generative Adversarial Networks
Md. Tanvir Rouf Shawon
G. M. Shahariar
F. Shah
Mohammad Shafiul Alam
Md. Shahriar Mahbub
82
6
0
05 Apr 2023
PromptAid: Prompt Exploration, Perturbation, Testing and Iteration using Visual Analytics for Large Language Models
Aditi Mishra
Utkarsh Soni
Anjana Arunkumar
Jinbin Huang
Bum Chul Kwon
Chris Bryan
LRM
76
35
0
04 Apr 2023
San-BERT: Extractive Summarization for Sanskrit Documents using BERT and it's variants
Kartikeya Bhatnagar
Sampath Lonka
Jammi Kunal
Mahabala Rao
60
2
0
04 Apr 2023
A Survey on Contextualised Semantic Shift Detection
S. Montanelli
Francesco Periti
ObjD, AI4TS
94
35
0
04 Apr 2023
An Embedding-based Approach to Inconsistency-tolerant Reasoning with Inconsistent Ontologies
Keyu Wang
Si-Nuo Li
Jiaye Li
Guilin Qi
Qiu Ji
99
2
0
04 Apr 2023
Multidimensional Perceptron for Efficient and Explainable Long Text Classification
Yexiang Wang
Yating Zhang
Xiaozhong Liu
Changlong Sun
27
0
0
04 Apr 2023
One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era
Chaoning Zhang
Chenshuang Zhang
Chenghao Li
Yu Qiao
Sheng Zheng
...
Sung-Ho Bae
Lik-Hang Lee
Pan Hui
In So Kweon
Choong Seon Hong
LM&MA, AI4MH, LRM, ELM
106
137
0
04 Apr 2023
Safety Analysis in the Era of Large Language Models: A Case Study of STPA using ChatGPT
Yi Qi
Xingyu Zhao
Siddartha Khastgir
Xiaowei Huang
80
16
0
03 Apr 2023
Evaluation of GPT and BERT-based models on identifying protein-protein interactions in biomedical text
Hasin Rehana
Nur Bengisu Çam
Mert Basmacı
Jie Zheng
Christianah Jemiyo
Y. He
Arzucan Özgür
J. Hur
LM&MA
66
32
0
30 Mar 2023
oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes
Daniel Fernando Campos
Alexandre Marques
Mark Kurtz
Chengxiang Zhai
VLM, AAML
50
2
0
30 Mar 2023
BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection
Sihao Hu
Zhen Zhang
B. Luo
Shengliang Lu
Bingsheng He
Ling Liu
74
44
0
29 Mar 2023
PMAA: A Progressive Multi-scale Attention Autoencoder Model for High-performance Cloud Removal from Multi-temporal Satellite Imagery
Xuechao Zou
Keqin Li
Junliang Xing
Pin Tao
Yachao Cui
59
15
0
29 Mar 2023
An Information Extraction Study: Take In Mind the Tokenization!
Christos Theodoropoulos
Marie-Francine Moens
43
6
0
27 Mar 2023
Scaling Pre-trained Language Models to Deeper via Parameter-efficient Architecture
Peiyu Liu
Ze-Feng Gao
Yushuo Chen
Wayne Xin Zhao
Ji-Rong Wen
MoE
68
0
0
27 Mar 2023
Exploring the Impact of Instruction Data Scaling on Large Language Models: An Empirical Study on Real-World Use Cases
Yunjie Ji
Yong Deng
Yan Gong
Yiping Peng
Qiang Niu
Lefei Zhang
Baochang Ma
Xiangang Li
ALM
70
97
0
26 Mar 2023
Energy-efficient Task Adaptation for NLP Edge Inference Leveraging Heterogeneous Memory Architectures
Zirui Fu
Aleksandre Avaliani
M. Donato
82
1
0
25 Mar 2023
Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognition
Haoyu Tang
Zhaoyi Liu
Chang Zeng
Xinfeng Li
54
1
0
23 Mar 2023
Analyzing the Generalizability of Deep Contextualized Language Representations For Text Classification
Berfu Buyukoz
30
2
0
22 Mar 2023
TRON: Transformer Neural Network Acceleration with Non-Coherent Silicon Photonics
S. Afifi
Febin P. Sunny
Mahdi Nikdast
S. Pasricha
GNN
71
22
0
22 Mar 2023
Salient Span Masking for Temporal Understanding
Jeremy R. Cole
Aditi Chaudhary
Bhuwan Dhingra
Partha P. Talukdar
91
13
0
22 Mar 2023
GrapeQA: GRaph Augmentation and Pruning to Enhance Question-Answering
Dhaval Taunk
Lakshya Khanna
Pavan Kandru
Vasudeva Varma
Charu Sharma
Makarand Tapaswi
78
24
0
22 Mar 2023
FedMAE: Federated Self-Supervised Learning with One-Block Masked Auto-Encoder
Nan Yang
Xuanyu Chen
Charles Z. Liu
Dong Yuan
Wei Bao
Li-zhen Cui
69
3
0
20 Mar 2023
Character, Word, or Both? Revisiting the Segmentation Granularity for Chinese Pre-trained Language Models
Xinnian Liang
Zefan Zhou
Hui Huang
Shuangzhi Wu
Tong Xiao
Muyun Yang
Zhoujun Li
Chao Bian
VLM
65
2
0
20 Mar 2023
Retrieving Multimodal Information for Augmented Generation: A Survey
Ruochen Zhao
Hailin Chen
Weishi Wang
Fangkai Jiao
Do Xuan Long
...
Bosheng Ding
Xiaobao Guo
Minzhi Li
Xingxuan Li
Shafiq Joty
129
88
0
20 Mar 2023
Label Name is Mantra: Unifying Point Cloud Segmentation across Heterogeneous Datasets
Yixun Liang
Hao He
Shishi Xiao
Hao Lu
Yingke Chen
3DPC
47
3
0
19 Mar 2023
An Empirical Study of Pre-trained Language Models in Simple Knowledge Graph Question Answering
Nan Hu
Yike Wu
Guilin Qi
Dehai Min
Jiaoyan Chen
Jeff Z. Pan
Z. Ali
ELM, AI4MH
86
40
0
18 Mar 2023
CerviFormer: A Pap-smear based cervical cancer classification method using cross attention and latent transformer
Bhaswati Singha Deo
M. Pal
P. Panigrahi
A. Pradhan
MedIm
43
25
0
17 Mar 2023
Trained on 100 million words and still in shape: BERT meets British National Corpus
David Samuel
Andrey Kutuzov
Lilja Øvrelid
Erik Velldal
101
32
0
17 Mar 2023
SheffieldVeraAI at SemEval-2023 Task 3: Mono and multilingual approaches for news genre, topic and persuasion technique classification
Ben Wu
Olesya Razuvayevskaya
Freddy Heppell
João A. Leite
Carolina Scarton
Kalina Bontcheva
Xingyi Song
42
9
0
16 Mar 2023
SmartBERT: A Promotion of Dynamic Early Exiting Mechanism for Accelerating BERT Inference
Boren Hu
Yun Zhu
Jiacheng Li
Siliang Tang
58
9
0
16 Mar 2023
SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning
Mengxin Zheng
Jiaqi Xue
Zihao Wang
Xun Chen
Qian Lou
Lei Jiang
Xiaofeng Wang
102
13
0
16 Mar 2023
Enhancing Text Generation with Cooperative Training
Tong Wu
Hao Wang
Zhongshen Zeng
Wei Wang
Haimin Zheng
Jiaxing Zhang
SyDa
115
1
0
16 Mar 2023
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Steven M. Hernandez
Ding Zhao
Shaojin Ding
A. Bruguier
Rohit Prabhavalkar
Tara N. Sainath
Yanzhang He
Ian McGraw
102
9
0
15 Mar 2023
Finding Similar Exercises in Retrieval Manner
Tongwen Huang
Xihua Li
Chao Yi
Xuemin Zhao
Yunbo Cao
36
0
0
15 Mar 2023
Clinical Concept and Relation Extraction Using Prompt-based Machine Reading Comprehension
C.A.I. Peng
Xi Yang
Zehao Yu
Jiang Bian
W. Hogan
Yonghui Wu
176
24
0
14 Mar 2023
Contextualized Medication Information Extraction Using Transformer-based Deep Learning Architectures
Aokun Chen
Zehao Yu
Xi Yang
Yi Guo
Jiang Bian
Yonghui Wu
52
24
0
14 Mar 2023
The Life Cycle of Knowledge in Big Language Models: A Survey
Boxi Cao
Hongyu Lin
Xianpei Han
Le Sun
KELM
95
28
0
14 Mar 2023
Exploring ChatGPT's Ability to Rank Content: A Preliminary Study on Consistency with Human Preferences
Yunjie Ji
Yan Gong
Yiping Peng
Chao Ni
Peiyan Sun
Dongyu Pan
Baochang Ma
Xiangang Li
ELM, ALM, AI4MH
76
38
0
14 Mar 2023
Input-length-shortening and text generation via attention values
Necset Ozkan Tan
A. Peng
Joshua Bensemann
Qiming Bao
Tim Hartill
M. Gahegan
Michael Witbrock
84
1
0
14 Mar 2023
Multimodal Reinforcement Learning for Robots Collaborating with Humans
Afagh Mehri Shervedani
S. Li
Natawut Monaikul
Bahareh Abbasi
Barbara Di Eugenio
Milos Zefran
OffRL
38
4
0
13 Mar 2023
An Overview on Language Models: Recent Developments and Outlook
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
93
47
0
10 Mar 2023
Lexical Complexity Prediction: An Overview
Kai North
Marcos Zampieri
Matthew Shardlow
58
26
0
08 Mar 2023
Comprehensive Event Representations using Event Knowledge Graphs and Natural Language Processing
Tin Kuculo
NAI
60
1
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
105
555
0
07 Mar 2023