1909.11942
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,935 papers shown
Title
Introducing "Forecast Utterance" for Conversational Data Science
Md. Mahadi Hassan
Alex Knipper
S. Karmaker
AI4TS
51
0
0
07 Sep 2023
Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs
Chao Feng
Xinyu Zhang
Zichu Fei
KELM
83
50
0
06 Sep 2023
One Wide Feedforward is All You Need
Telmo Pires
António V. Lopes
Yannick Assogba
Hendra Setiawan
78
13
0
04 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
Anh Tuan Luu
Wei Bi
Freda Shi
Shuming Shi
RALM
LRM
HILM
150
582
0
03 Sep 2023
FusionAI: Decentralized Training and Deploying LLMs with Massive Consumer-Level GPUs
Zhenheng Tang
Yuxin Wang
Xin He
Longteng Zhang
Xinglin Pan
...
Rongfei Zeng
Kaiyong Zhao
Shaoshuai Shi
Bingsheng He
Xiaowen Chu
106
30
0
03 Sep 2023
Studying the impacts of pre-training using ChatGPT-generated text on downstream tasks
Sarthak Anand
58
0
0
02 Sep 2023
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model
Fengxiang Bie
Yibo Yang
Zhongzhu Zhou
Adam Ghanem
Minjia Zhang
...
Pareesa Ameneh Golnari
David A. Clifton
Yuxiong He
Dacheng Tao
Shuaiwen Leon Song
EGVM
105
25
0
02 Sep 2023
Learning to Taste: A Multimodal Wine Dataset
Thoranna Bender
Simon Moe Sorensen
A. Kashani
K. E. Hjorleifsson
Grethe Hyldig
Søren Hauberg
Serge Belongie
Frederik Warburg
CoGe
109
4
0
31 Aug 2023
ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation
Weihan Wang
Zhiyong Yang
Bin Xu
Juanzi Li
Yankui Sun
VLM
91
8
0
31 Aug 2023
Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection
Fatma Elsafoury
84
3
0
31 Aug 2023
ToddlerBERTa: Exploiting BabyBERTa for Grammar Learning and Language Understanding
Omer Veysel Cagatan
72
2
0
30 Aug 2023
Introducing Language Guidance in Prompt-based Continual Learning
Muhammad Gul Zain Ali Khan
Muhammad Ferjad Naeem
Luc Van Gool
D. Stricker
F. Tombari
Muhammad Zeshan Afzal
VLM
CLL
103
51
0
30 Aug 2023
Cyberbullying Detection for Low-resource Languages and Dialects: Review of the State of the Art
Tanjim Mahmud
M. Ptaszynski
J. Eronen
Fumito Masui
63
70
0
30 Aug 2023
TransPrompt v2: A Transferable Prompting Framework for Cross-task Text Classification
Jiadong Wang
Chengyu Wang
Cen Chen
Ming Gao
Jun Huang
Aoying Zhou
VLM
94
0
0
29 Aug 2023
Video Multimodal Emotion Recognition System for Real World Applications
Sun-Kyung Lee
Jong-Hwan Kim
CVBM
40
0
0
28 Aug 2023
LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors
Chengkun Wei
Wenlong Meng
Zhikun Zhang
M. Chen
Ming-Hui Zhao
Wenjing Fang
Lei Wang
Zihui Zhang
Wenzhi Chen
AAML
63
11
0
26 Aug 2023
FwdLLM: Efficient FedLLM using Forward Gradient
Mengwei Xu
Dongqi Cai
Yaozong Wu
Xiang Li
Shangguang Wang
FedML
118
26
0
26 Aug 2023
WellXplain: Wellness Concept Extraction and Classification in Reddit Posts for Mental Health Analysis
Muskan Garg
AI4MH
52
10
0
25 Aug 2023
TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs
P. Phothilimthana
Sami Abu-El-Haija
Kaidi Cao
Bahare Fatemi
Mike Burrows
Charith Mendis
Bryan Perozzi
GNN
AI4TS
127
20
0
25 Aug 2023
Construction Grammar and Language Models
Harish Tayyar Madabushi
Laurence Romain
P. Milin
Dagmar Divjak
126
5
0
25 Aug 2023
Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities
Maximilian Mozes
Xuanli He
Bennett Kleinberg
Lewis D. Griffin
87
87
0
24 Aug 2023
A Small and Fast BERT for Chinese Medical Punctuation Restoration
Tongtao Ling
Chen Liao
Lei Chen
Shilei Huang
Yi Liu
MedIm
53
1
0
24 Aug 2023
Evolution of ESG-focused DLT Research: An NLP Analysis of the Literature
Walter Hernandez Cruz
K. Tylinski
Alastair Moore
Niall Roche
Nikhil Vadgama
Horst Treiblmaier
J. Shangguan
Paolo Tasca
Jiahua Xu
135
2
0
23 Aug 2023
GOPro: Generate and Optimize Prompts in CLIP using Self-Supervised Learning
Mainak Singha
Ankit Jha
Biplab Banerjee
VLM
75
4
0
22 Aug 2023
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Xi Deng
Han Shi
Runhu Huang
Changlin Li
Hang Xu
Jianhua Han
James T. Kwok
Shen Zhao
Wei Zhang
Xiaodan Liang
CLIP
VLM
91
3
0
22 Aug 2023
Systematic Offensive Stereotyping (SOS) Bias in Language Models
Fatma Elsafoury
27
2
0
21 Aug 2023
Large Language Models for Software Engineering: A Systematic Literature Review
Xinying Hou
Yanjie Zhao
Yue Liu
Zhou Yang
Kailong Wang
Li Li
Xiapu Luo
David Lo
John C. Grundy
Haoyu Wang
123
437
0
21 Aug 2023
Learning Representations on Logs for AIOps
Pranjal Gupta
Harshit Kumar
Debanjana Kar
Karan Bhukar
Pooja Aggarwal
P. Mohapatra
52
11
0
18 Aug 2023
BERT4CTR: An Efficient Framework to Combine Pre-trained Language Model with Non-textual Features for CTR Prediction
Dong Wang
Kave Salamatian
Yunqing Xia
Weiwei Deng
Qi Zhang
56
14
0
17 Aug 2023
Lightweight Adaptation of Neural Language Models via Subspace Embedding
Amit Kumar Jaiswal
Haiming Liu
57
2
0
16 Aug 2023
BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity Recognition
Vera Pavlova
M. Makhlouf
58
3
0
16 Aug 2023
Finding Stakeholder-Material Information from 10-K Reports using Fine-Tuned BERT and LSTM Models
V. Z. Chen
59
0
0
15 Aug 2023
gSASRec: Reducing Overconfidence in Sequential Recommendation Trained with Negative Sampling
Aleksandr V. Petrov
Craig Macdonald
61
35
0
14 Aug 2023
A Novel Ehanced Move Recognition Algorithm Based on Pre-trained Models with Positional Embeddings
H. Wen
Jie Wang
Xiaodong Qiao
55
0
0
14 Aug 2023
SLoRA: Federated Parameter Efficient Fine-Tuning of Language Models
Sara Babakniya
A. Elkordy
Yahya H. Ezzeldin
Qingfeng Liu
Kee-Bong Song
Mostafa El-Khamy
Salman Avestimehr
76
72
0
12 Aug 2023
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Youliang Yuan
Wenxiang Jiao
Wenxuan Wang
Jen-tse Huang
Pinjia He
Shuming Shi
Zhaopeng Tu
SILM
121
285
0
12 Aug 2023
Identification of the Relevance of Comments in Codes Using Bag of Words and Transformer Based Models
S. Sruthi
Tanmay Basu
31
1
0
11 Aug 2023
LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach Prompts
Shangqing Tu
Zheyuan Zhang
Jifan Yu
Chunyang Li
Siyu Zhang
Zijun Yao
Lei Hou
Juanzi Li
73
11
0
11 Aug 2023
Performance Analysis of Transformer Based Models (BERT, ALBERT and RoBERTa) in Fake News Detection
Shafna Fitria Nur Azizah
Hasan Dwi Cahyono
S. W. Sihwi
Wisnu Widiarto
24
13
0
09 Aug 2023
Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval
Yi Bin
Haoxuan Li
Yahui Xu
Xing Xu
Yang Yang
Heng Tao Shen
VOS
64
19
0
08 Aug 2023
Guarding the Guardians: Automated Analysis of Online Child Sexual Abuse
J. Puentes
Angela Castillo
Wilmar Osejo
Yuly Calderón
Viviana Quintero
L. Saldarriaga
D. Agudelo
Pablo Arbelaez
55
2
0
07 Aug 2023
Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models
Seungcheol Park
Ho-Jin Choi
U. Kang
VLM
80
8
0
07 Aug 2023
Analysis of the Evolution of Advanced Transformer-Based Language Models: Experiments on Opinion Mining
Nour Eddine Zekaoui
Siham Yousfi
Maryem Rhanoui
M. Mikram
49
3
0
07 Aug 2023
Spanish Pre-trained BERT Model and Evaluation Data
J. Cañete
Gabriel Chaperon
Rodrigo Fuentes
Jou-Hui Ho
Hojin Kang
Jorge Pérez
92
667
0
06 Aug 2023
Bengali Fake Reviews: A Benchmark Dataset and Detection System
G. M. Shahariar
Rouf Shawon
F. Shah
Mohammad Shafiul Alam
Md. Shahriar Mahbub
89
6
0
03 Aug 2023
Target-point Attention Transformer: A novel trajectory predict network for end-to-end autonomous driving
Jing Du
Yang Zhao
Hong-wei Cheng
ViT
48
1
0
03 Aug 2023
Survey on Computer Vision Techniques for Internet-of-Things Devices
Ishmeet Kaur
Adwaita Janardhan Jadhav
AI4CE
51
1
0
02 Aug 2023
Contrastive Learning for API Aspect Analysis
G. M. Shahariar
Tahmid Hasan
Anindya Iqbal
Gias Uddin
45
0
0
31 Jul 2023
Multi-output Headed Ensembles for Product Item Classification
H. Shiokawa
Pradipto Das
Arthur R. Toth
Justin Chiu
23
0
0
29 Jul 2023
DPBERT: Efficient Inference for BERT based on Dynamic Planning
Weixin Wu
H. Zhuo
16
0
0
26 Jul 2023