Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.11942
Cited By
v1
v2
v3
v4
v5
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,935 papers shown
Title
Decoupling anomaly discrimination and representation learning: self-supervised learning for anomaly detection on attributed graph
Yanming Hu
Chuan Chen
Bowen Deng
Yujing Lai
Hao Lin
Zibin Zheng
Jing Bian
85
3
0
11 Apr 2023
Sim-T: Simplify the Transformer Network by Multiplexing Technique for Speech Recognition
Guangyong Wei
Zhikui Duan
Shiren Li
Guangguang Yang
Xinmei Yu
Junhua Li
63
5
0
11 Apr 2023
UATTA-EB: Uncertainty-Aware Test-Time Augmented Ensemble of BERTs for Classifying Common Mental Illnesses on Social Media Posts
Pratinav Seth
Mihir Agarwal
AI4MH
60
1
0
10 Apr 2023
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder
Z. Fu
W. Lam
Qian Yu
Anthony Man-Cho So
Shengding Hu
Zhiyuan Liu
Nigel Collier
AuLLM
69
44
0
08 Apr 2023
SSS at SemEval-2023 Task 10: Explainable Detection of Online Sexism using Majority Voted Fine-Tuned Transformers
Sriya Rallabandi
Sanchit Singhal
Pratinav Seth
21
3
0
07 Apr 2023
Bridging the Language Gap: Knowledge Injected Multilingual Question Answering
Zhichao Duan
Xiuxing Li
Zhengyan Zhang
Zhenyu Li
Ning Liu
Jianyong Wang
65
8
0
06 Apr 2023
Bengali Fake Review Detection using Semi-supervised Generative Adversarial Networks
Md. Tanvir Rouf Shawon
G. M. Shahariar
F. Shah
Mohammad Shafiul Alam
Md. Shahriar Mahbub
82
6
0
05 Apr 2023
PromptAid: Prompt Exploration, Perturbation, Testing and Iteration using Visual Analytics for Large Language Models
Aditi Mishra
Utkarsh Soni
Anjana Arunkumar
Jinbin Huang
Bum Chul Kwon
Chris Bryan
LRM
76
35
0
04 Apr 2023
San-BERT: Extractive Summarization for Sanskrit Documents using BERT and it's variants
Kartikeya Bhatnagar
Sampath Lonka
Jammi Kunal
Mahabala Rao
60
2
0
04 Apr 2023
A Survey on Contextualised Semantic Shift Detection
S. Montanelli
Francesco Periti
ObjD
AI4TS
94
35
0
04 Apr 2023
An Embedding-based Approach to Inconsistency-tolerant Reasoning with Inconsistent Ontologies
Keyu Wang
Si-Nuo Li
Jiaye Li
Guilin Qi
Qiu Ji
99
2
0
04 Apr 2023
Multidimensional Perceptron for Efficient and Explainable Long Text Classification
Yexiang Wang
Yating Zhang
Xiaozhong Liu
Changlong Sun
27
0
0
04 Apr 2023
One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era
Chaoning Zhang
Chenshuang Zhang
Chenghao Li
Yu Qiao
Sheng Zheng
...
Sung-Ho Bae
Lik-Hang Lee
Pan Hui
In So Kweon
Choong Seon Hong
LM&MA
AI4MH
LRM
ELM
106
137
0
04 Apr 2023
Safety Analysis in the Era of Large Language Models: A Case Study of STPA using ChatGPT
Yi Qi
Xingyu Zhao
Siddartha Khastgir
Xiaowei Huang
80
16
0
03 Apr 2023
Evaluation of GPT and BERT-based models on identifying protein-protein interactions in biomedical text
Hasin Rehana
Nur Bengisu Çam
Mert Basmacı
Jie Zheng
Christianah Jemiyo
Y. He
Arzucan Özgür
J. Hur
LM&MA
66
32
0
30 Mar 2023
oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes
Daniel Fernando Campos
Alexandre Marques
Mark Kurtz
Chengxiang Zhai
VLM
AAML
50
2
0
30 Mar 2023
BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection
Sihao Hu
Zhen Zhang
B. Luo
Shengliang Lu
Bingsheng He
Ling Liu
74
44
0
29 Mar 2023
PMAA: A Progressive Multi-scale Attention Autoencoder Model for High-performance Cloud Removal from Multi-temporal Satellite Imagery
Xuechao Zou
Keqin Li
Junliang Xing
Pin Tao
Yachao Cui
59
15
0
29 Mar 2023
An Information Extraction Study: Take In Mind the Tokenization!
Christos Theodoropoulos
Marie-Francine Moens
43
6
0
27 Mar 2023
Scaling Pre-trained Language Models to Deeper via Parameter-efficient Architecture
Peiyu Liu
Ze-Feng Gao
Yushuo Chen
Wayne Xin Zhao
Ji-Rong Wen
MoE
68
0
0
27 Mar 2023
Exploring the Impact of Instruction Data Scaling on Large Language Models: An Empirical Study on Real-World Use Cases
Yunjie Ji
Yong Deng
Yan Gong
Yiping Peng
Qiang Niu
Lefei Zhang
Baochang Ma
Xiangang Li
ALM
70
97
0
26 Mar 2023
Energy-efficient Task Adaptation for NLP Edge Inference Leveraging Heterogeneous Memory Architectures
Zirui Fu
Aleksandre Avaliani
M. Donato
82
1
0
25 Mar 2023
Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognition
Haoyu Tang
Zhaoyi Liu
Chang Zeng
Xinfeng Li
54
1
0
23 Mar 2023
Analyzing the Generalizability of Deep Contextualized Language Representations For Text Classification
Berfu Buyukoz
30
2
0
22 Mar 2023
TRON: Transformer Neural Network Acceleration with Non-Coherent Silicon Photonics
S. Afifi
Febin P. Sunny
Mahdi Nikdast
S. Pasricha
GNN
71
22
0
22 Mar 2023
Salient Span Masking for Temporal Understanding
Jeremy R. Cole
Aditi Chaudhary
Bhuwan Dhingra
Partha P. Talukdar
91
13
0
22 Mar 2023
GrapeQA: GRaph Augmentation and Pruning to Enhance Question-Answering
Dhaval Taunk
Lakshya Khanna
Pavan Kandru
Vasudeva Varma
Charu Sharma
Makarand Tapaswi
78
24
0
22 Mar 2023
FedMAE: Federated Self-Supervised Learning with One-Block Masked Auto-Encoder
Nan Yang
Xuanyu Chen
Charles Z. Liu
Dong Yuan
Wei Bao
Li-zhen Cui
69
3
0
20 Mar 2023
Character, Word, or Both? Revisiting the Segmentation Granularity for Chinese Pre-trained Language Models
Xinnian Liang
Zefan Zhou
Hui Huang
Shuangzhi Wu
Tong Xiao
Muyun Yang
Zhoujun Li
Chao Bian
VLM
65
2
0
20 Mar 2023
Retrieving Multimodal Information for Augmented Generation: A Survey
Ruochen Zhao
Hailin Chen
Weishi Wang
Fangkai Jiao
Do Xuan Long
...
Bosheng Ding
Xiaobao Guo
Minzhi Li
Xingxuan Li
Shafiq Joty
129
88
0
20 Mar 2023
Label Name is Mantra: Unifying Point Cloud Segmentation across Heterogeneous Datasets
Yixun Liang
Hao He
Shishi Xiao
Hao Lu
Yingke Chen
3DPC
47
3
0
19 Mar 2023
An Empirical Study of Pre-trained Language Models in Simple Knowledge Graph Question Answering
Nan Hu
Yike Wu
Guilin Qi
Dehai Min
Jiaoyan Chen
Jeff Z. Pan
Z. Ali
ELM
AI4MH
86
40
0
18 Mar 2023
CerviFormer: A Pap-smear based cervical cancer classification method using cross attention and latent transformer
Bhaswati Singha Deo
M. Pal
P. Panigrahi
A. Pradhan
MedIm
43
25
0
17 Mar 2023
Trained on 100 million words and still in shape: BERT meets British National Corpus
David Samuel
Andrey Kutuzov
Lilja Øvrelid
Erik Velldal
101
32
0
17 Mar 2023
SheffieldVeraAI at SemEval-2023 Task 3: Mono and multilingual approaches for news genre, topic and persuasion technique classification
Ben Wu
Olesya Razuvayevskaya
Freddy Heppell
João A. Leite
Carolina Scarton
Kalina Bontcheva
Xingyi Song
42
9
0
16 Mar 2023
SmartBERT: A Promotion of Dynamic Early Exiting Mechanism for Accelerating BERT Inference
Boren Hu
Yun Zhu
Jiacheng Li
Siliang Tang
58
9
0
16 Mar 2023
SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning
Mengxin Zheng
Jiaqi Xue
Zihao Wang
Xun Chen
Qian Lou
Lei Jiang
Xiaofeng Wang
102
13
0
16 Mar 2023
Enhancing Text Generation with Cooperative Training
Tong Wu
Hao Wang
Zhongshen Zeng
Wei Wang
Haimin Zheng
Jiaxing Zhang
SyDa
115
1
0
16 Mar 2023
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Steven M. Hernandez
Ding Zhao
Shaojin Ding
A. Bruguier
Rohit Prabhavalkar
Tara N. Sainath
Yanzhang He
Ian McGraw
102
9
0
15 Mar 2023
Finding Similar Exercises in Retrieval Manner
Tongwen Huang
Xihua Li
Chao Yi
Xuemin Zhao
Yunbo Cao
36
0
0
15 Mar 2023
Clinical Concept and Relation Extraction Using Prompt-based Machine Reading Comprehension
C.A.I. Peng
Xi Yang
Zehao Yu
Jiang Bian
W. Hogan
Yonghui Wu
176
24
0
14 Mar 2023
Contextualized Medication Information Extraction Using Transformer-based Deep Learning Architectures
Aokun Chen
Zehao Yu
Xi Yang
Yi Guo
Jiang Bian
Yonghui Wu
52
24
0
14 Mar 2023
The Life Cycle of Knowledge in Big Language Models: A Survey
Boxi Cao
Hongyu Lin
Xianpei Han
Le Sun
KELM
95
28
0
14 Mar 2023
Exploring ChatGPT's Ability to Rank Content: A Preliminary Study on Consistency with Human Preferences
Yunjie Ji
Yan Gong
Yiping Peng
Chao Ni
Peiyan Sun
Dongyu Pan
Baochang Ma
Xiangang Li
ELM
ALM
AI4MH
76
38
0
14 Mar 2023
Input-length-shortening and text generation via attention values
Necset Ozkan Tan
A. Peng
Joshua Bensemann
Qiming Bao
Tim Hartill
M. Gahegan
Michael Witbrock
84
1
0
14 Mar 2023
Multimodal Reinforcement Learning for Robots Collaborating with Humans
Afagh Mehri Shervedani
S. Li
Natawut Monaikul
Bahareh Abbasi
Barbara Di Eugenio
Milos Zefran
OffRL
38
4
0
13 Mar 2023
An Overview on Language Models: Recent Developments and Outlook
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
93
47
0
10 Mar 2023
Lexical Complexity Prediction: An Overview
Kai North
Marcos Zampieri
Matthew Shardlow
58
26
0
08 Mar 2023
Comprehensive Event Representations using Event Knowledge Graphs and Natural Language Processing
Tin Kuculo
NAI
60
1
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
105
555
0
07 Mar 2023
Previous
1
2
3
...
19
20
21
...
57
58
59
Next