1909.11942
Cited By
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
ArXiv (abs)
PDF
HTML
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,935 papers shown
Title
How Gender Interacts with Political Values: A Case Study on Czech BERT Models
Adnan Al Ali
Jindřich Libovický
50
0
0
20 Mar 2024
Adaptive Ensembles of Fine-Tuned Transformers for LLM-Generated Text Detection
Zhixin Lai
Xuesheng Zhang
Suiyao Chen
DeLMO
75
36
0
20 Mar 2024
WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar
Runwei Guan
Liye Jia
Fengyufan Yang
Shanliang Yao
Erick Purwanto
...
Eng Gee Lim
Jeremy S. Smith
Ka Lok Man
Xuming Hu
Yutao Yue
119
9
0
19 Mar 2024
Simple Hack for Transformers against Heavy Long-Text Classification on a Time- and Memory-Limited GPU Service
Mirza Alim Mutasodirin
Radityo Eko Prasojo
Achmad F. Abka
Hanif Rasyidi
VLM
51
0
0
19 Mar 2024
Improving Generalizability of Extracting Social Determinants of Health Using Large Language Models through Prompt-tuning
C.A.I. Peng
Zehao Yu
Kaleb E. Smith
W. Lo‐Ciganic
Jiang Bian
Yonghui Wu
LM&MA
47
1
0
19 Mar 2024
Large language models in 6G security: challenges and opportunities
Tri Nguyen
Huong Nguyen
Ahmad Ijaz
Saeid Sheikhi
Athanasios V. Vasilakos
Panos Kostakos
ELM
73
13
0
18 Mar 2024
SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generator
J. Asl
Mohammad H. Rafiei
Manar Alohaly
Daniel Takabi
AAML
SILM
33
3
0
18 Mar 2024
Metaphor Understanding Challenge Dataset for LLMs
Xiaoyu Tong
Rochelle Choenni
Martha Lewis
Ekaterina Shutova
64
12
0
18 Mar 2024
Semantic-Enhanced Representation Learning for Road Networks with Temporal Dynamics
Yile Chen
Xiucheng Li
Gao Cong
Zhifeng Bao
Cheng Long
47
2
0
18 Mar 2024
A Modified Word Saliency-Based Adversarial Attack on Text Classification Models
Hetvi Waghela
Sneha Rakshit
Jaydip Sen
AAML
68
7
0
17 Mar 2024
Rethinking Multi-view Representation Learning via Distilled Disentangling
Guanzhou Ke
Bo Wang
Xiaoli Wang
Shengfeng He
108
3
0
16 Mar 2024
ATOM: Asynchronous Training of Massive Models for Deep Learning in a Decentralized Environment
Xiaofeng Wu
Jia Rao
Wei Chen
51
3
0
15 Mar 2024
ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
66
1
0
15 Mar 2024
FBPT: A Fully Binary Point Transformer
Zhixing Hou
Yuzhang Shang
Yan Yan
MQ
62
1
0
15 Mar 2024
depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning Researchers
Kaichao You
Runsheng Bai
Meng Cao
Jianmin Wang
Ion Stoica
Mingsheng Long
VLM
77
0
0
14 Mar 2024
Rethinking Referring Object Removal
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
74
0
0
14 Mar 2024
Language models scale reliably with over-training and on downstream tasks
S. Gadre
Georgios Smyrnis
Vaishaal Shankar
Suchin Gururangan
Mitchell Wortsman
...
Y. Carmon
Achal Dave
Reinhard Heckel
Niklas Muennighoff
Ludwig Schmidt
ALM
ELM
LRM
180
48
0
13 Mar 2024
Masked AutoDecoder is Effective Multi-Task Vision Generalist
Han Qiu
Jiaxing Huang
Peng Gao
Lewei Lu
Xiaoqin Zhang
Shijian Lu
85
4
0
12 Mar 2024
A Logical Pattern Memory Pre-trained Model for Entailment Tree Generation
Li Yuan
Yi Cai
Haopeng Ren
Jiexin Wang
LRM
59
5
0
11 Mar 2024
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
Jialin Li
Qiang Nie
Weifu Fu
Yuhuan Lin
Guangpin Tao
Yong-Jin Liu
Chengjie Wang
85
5
0
07 Mar 2024
On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder
Tingxu Han
Shenghan Huang
Ziqi Ding
Weisong Sun
Yebo Feng
...
Hanwei Qian
Cong Wu
Quanjun Zhang
Yang Liu
Zhenyu Chen
54
8
0
06 Mar 2024
A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching
Dongyu Yao
Asaad Alghamdi
Qingrong Xia
Xiaoye Qu
Xinyu Duan
Zhefeng Wang
Yi Zheng
Baoxing Huai
Peilun Cheng
Zhou Zhao
61
0
0
05 Mar 2024
Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
Zhenyu Zhang
Runjin Chen
Shiwei Liu
Zhewei Yao
Olatunji Ruwase
Beidi Chen
Xiaoxia Wu
Zhangyang Wang
95
36
0
05 Mar 2024
A Tutorial on the Pretrain-Finetune Paradigm for Natural Language Processing
Yu Wang
Wen Qu
76
0
0
04 Mar 2024
Vision-Language Models for Medical Report Generation and Visual Question Answering: A Review
Iryna Hartsock
Ghulam Rasool
102
80
0
04 Mar 2024
How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider Transformer Models
Xin Lu
Yanyan Zhao
Bing Qin
65
0
0
04 Mar 2024
Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment
Luyao Wang
Pengnian Qi
Xigang Bao
Chunlai Zhou
Biao Qin
66
11
0
02 Mar 2024
ATP: Enabling Fast LLM Serving via Attention on Top Principal Keys
Yue Niu
Saurav Prakash
Salman Avestimehr
51
1
0
01 Mar 2024
Hierarchical Indexing for Retrieval-Augmented Opinion Summarization
Tom Hosking
Hao Tang
Mirella Lapata
102
5
0
01 Mar 2024
Rethinking Tokenization: Crafting Better Tokenizers for Large Language Models
Jinbiao Yang
LLMAG
169
11
0
01 Mar 2024
Cause and Effect: Can Large Language Models Truly Understand Causality?
Swagata Ashwani
Kshiteesh Hegde
Nishith Reddy Mannuru
Mayank Jindal
Dushyant Singh Sengar
Krishna Chaitanya Rao Kathala
Dishant Banga
Vinija Jain
Aman Chadha
LRM
104
25
0
28 Feb 2024
Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation Models
Yunpeng Huang
Yaonan Gu
Jingwei Xu
Zhihong Zhu
Zhaorun Chen
Xiaoxing Ma
64
3
0
27 Feb 2024
Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks
Huajian Zhang
Yumo Xu
Laura Perez-Beltrachini
HILM
81
13
0
27 Feb 2024
Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology
Wenhao Tang
Fengtao Zhou
Shengyue Huang
Xiang Zhu
Yi Zhang
Bo Liu
137
25
0
27 Feb 2024
Generating Effective Ensembles for Sentiment Analysis
Itay Etelis
Avi Rosenfeld
Abraham Itzhak Weinberg
David Sarne
79
2
0
26 Feb 2024
Unveiling Vulnerability of Self-Attention
Khai Jiet Liong
Hongqiu Wu
Haizhen Zhao
69
0
0
26 Feb 2024
Layer-wise Regularized Dropout for Neural Language Models
Shiwen Ni
Min Yang
Ruifeng Xu
Chengming Li
Xiping Hu
46
0
0
26 Feb 2024
QASE Enhanced PLMs: Improved Control in Text Generation for MRC
Lin Ai
Zheng Hui
Zizhou Liu
Julia Hirschberg
53
0
0
26 Feb 2024
OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining
Fanjin Zhang
Shijie Shi
Yifan Zhu
Bo Chen
Yukuo Cen
...
Huihui Yuan
Jian Song
Xiaoyan Li
Yuxiao Dong
Jie Tang
95
20
0
24 Feb 2024
Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation Inconsistency
Yiran Liu
Ke Yang
Zehan Qi
Xiao-Yang Liu
Yang Yu
ChengXiang Zhai
107
1
0
23 Feb 2024
Second-Order Fine-Tuning without Pain for LLMs: A Hessian Informed Zeroth-Order Optimizer
Yanjun Zhao
Sizhe Dang
Haishan Ye
Guang Dai
Yi Qian
Ivor W. Tsang
148
13
0
23 Feb 2024
An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach
Mohammad Amaz Uddin
Iqbal H. Sarker
75
18
0
21 Feb 2024
EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries
Jing Han Sun
Ali Emami
69
4
0
20 Feb 2024
Detecting misinformation through Framing Theory: the Frame Element-based Model
Guan-Hua Wang
Rebecca Frederick
Jinglong Duan
William Wong
V. Rupar
Weihua Li
Quan-wei Bai
95
2
0
19 Feb 2024
Head-wise Shareable Attention for Large Language Models
Zouying Cao
Yifei Yang
Hai Zhao
59
4
0
19 Feb 2024
Utilizing BERT for Information Retrieval: Survey, Applications, Resources, and Challenges
Jiajia Wang
Jimmy Xiangji Huang
Xinhui Tu
Junmei Wang
Angela J. Huang
Md Tahmid Rahman Laskar
Amran Bhuiyan
100
37
0
18 Feb 2024
Puzzle Solving using Reasoning of Large Language Models: A Survey
Panagiotis Giadikiaroglou
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
ELM
ReLM
LRM
144
34
0
17 Feb 2024
EEG2Rep: Enhancing Self-supervised EEG Representation Through Informative Masked Inputs
Navid Mohammadi Foumani
G. Mackellar
Soheila Ghane
Saad Irtza
Nam Nguyen
Mahsa Salehi
99
17
0
17 Feb 2024
A Question Answering Based Pipeline for Comprehensive Chinese EHR Information Extraction
Huaiyuan Ying
Sheng Yu
MedIm
49
0
0
17 Feb 2024
Enhancing ESG Impact Type Identification through Early Fusion and Multilingual Models
Hariram Veeramani
Surendrabikram Thapa
Usman Naseem
50
5
0
16 Feb 2024