ResearchTrend.AI

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL, AIMat
ArXiv (abs) | PDF | HTML | Github (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Introducing "Forecast Utterance" for Conversational Data Science
Md. Mahadi Hassan
Alex Knipper
S. Karmaker
AI4TS
51
0
0
07 Sep 2023
Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs
Chao Feng
Xinyu Zhang
Zichu Fei
KELM
83
50
0
06 Sep 2023
One Wide Feedforward is All You Need
Telmo Pires
António V. Lopes
Yannick Assogba
Hendra Setiawan
78
13
0
04 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
Anh Tuan Luu
Wei Bi
Freda Shi
Shuming Shi
RALM, LRM, HILM
150
582
0
03 Sep 2023
FusionAI: Decentralized Training and Deploying LLMs with Massive Consumer-Level GPUs
Zhenheng Tang
Yuxin Wang
Xin He
Longteng Zhang
Xinglin Pan
...
Rongfei Zeng
Kaiyong Zhao
Shaoshuai Shi
Bingsheng He
Xiaowen Chu
106
30
0
03 Sep 2023
Studying the impacts of pre-training using ChatGPT-generated text on downstream tasks
Sarthak Anand
58
0
0
02 Sep 2023
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model
Fengxiang Bie
Yibo Yang
Zhongzhu Zhou
Adam Ghanem
Minjia Zhang
...
Pareesa Ameneh Golnari
David A. Clifton
Yuxiong He
Dacheng Tao
Shuaiwen Leon Song
EGVM
105
25
0
02 Sep 2023
Learning to Taste: A Multimodal Wine Dataset
Thoranna Bender
Simon Moe Sorensen
A. Kashani
K. E. Hjorleifsson
Grethe Hyldig
Søren Hauberg
Serge Belongie
Frederik Warburg
CoGe
109
4
0
31 Aug 2023
ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation
Weihan Wang
Zhiyong Yang
Bin Xu
Juanzi Li
Yankui Sun
VLM
91
8
0
31 Aug 2023
Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection
Fatma Elsafoury
84
3
0
31 Aug 2023
ToddlerBERTa: Exploiting BabyBERTa for Grammar Learning and Language Understanding
Omer Veysel Cagatan
72
2
0
30 Aug 2023
Introducing Language Guidance in Prompt-based Continual Learning
Muhammad Gul Zain Ali Khan
Muhammad Ferjad Naeem
Luc Van Gool
D. Stricker
F. Tombari
Muhammad Zeshan Afzal
VLM, CLL
103
51
0
30 Aug 2023
Cyberbullying Detection for Low-resource Languages and Dialects: Review of the State of the Art
Tanjim Mahmud
M. Ptaszynski
J. Eronen
Fumito Masui
63
70
0
30 Aug 2023
TransPrompt v2: A Transferable Prompting Framework for Cross-task Text Classification
Jiadong Wang
Chengyu Wang
Cen Chen
Ming Gao
Jun Huang
Aoying Zhou
VLM
94
0
0
29 Aug 2023
Video Multimodal Emotion Recognition System for Real World Applications
Sun-Kyung Lee
Jong-Hwan Kim
CVBM
40
0
0
28 Aug 2023
LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors
Chengkun Wei
Wenlong Meng
Zhikun Zhang
M. Chen
Ming-Hui Zhao
Wenjing Fang
Lei Wang
Zihui Zhang
Wenzhi Chen
AAML
63
11
0
26 Aug 2023
FwdLLM: Efficient FedLLM using Forward Gradient
Mengwei Xu
Dongqi Cai
Yaozong Wu
Xiang Li
Shangguang Wang
FedML
118
26
0
26 Aug 2023
WellXplain: Wellness Concept Extraction and Classification in Reddit Posts for Mental Health Analysis
Muskan Garg
AI4MH
52
10
0
25 Aug 2023
TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs
P. Phothilimthana
Sami Abu-El-Haija
Kaidi Cao
Bahare Fatemi
Mike Burrows
Charith Mendis
Bryan Perozzi
GNN, AI4TS
127
20
0
25 Aug 2023
Construction Grammar and Language Models
Harish Tayyar Madabushi
Laurence Romain
P. Milin
Dagmar Divjak
126
5
0
25 Aug 2023
Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities
Maximilian Mozes
Xuanli He
Bennett Kleinberg
Lewis D. Griffin
87
87
0
24 Aug 2023
A Small and Fast BERT for Chinese Medical Punctuation Restoration
Tongtao Ling
Chen Liao
Lei Chen
Shilei Huang
Yi Liu
MedIm
53
1
0
24 Aug 2023
Evolution of ESG-focused DLT Research: An NLP Analysis of the Literature
Walter Hernandez Cruz
K. Tylinski
Alastair Moore
Niall Roche
Nikhil Vadgama
Horst Treiblmaier
J. Shangguan
Paolo Tasca
Jiahua Xu
135
2
0
23 Aug 2023
GOPro: Generate and Optimize Prompts in CLIP using Self-Supervised Learning
Mainak Singha
Ankit Jha
Biplab Banerjee
VLM
75
4
0
22 Aug 2023
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training
Xi Deng
Han Shi
Runhu Huang
Changlin Li
Hang Xu
Jianhua Han
James T. Kwok
Shen Zhao
Wei Zhang
Xiaodan Liang
CLIP, VLM
91
3
0
22 Aug 2023
Systematic Offensive Stereotyping (SOS) Bias in Language Models
Fatma Elsafoury
27
2
0
21 Aug 2023
Large Language Models for Software Engineering: A Systematic Literature Review
Xinying Hou
Yanjie Zhao
Yue Liu
Zhou Yang
Kailong Wang
Li Li
Xiapu Luo
David Lo
John C. Grundy
Haoyu Wang
123
437
0
21 Aug 2023
Learning Representations on Logs for AIOps
Pranjal Gupta
Harshit Kumar
Debanjana Kar
Karan Bhukar
Pooja Aggarwal
P. Mohapatra
52
11
0
18 Aug 2023
BERT4CTR: An Efficient Framework to Combine Pre-trained Language Model with Non-textual Features for CTR Prediction
Dong Wang
Kave Salamatian
Yunqing Xia
Weiwei Deng
Qi Zhang
56
14
0
17 Aug 2023
Lightweight Adaptation of Neural Language Models via Subspace Embedding
Amit Kumar Jaiswal
Haiming Liu
57
2
0
16 Aug 2023
BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity Recognition
Vera Pavlova
M. Makhlouf
58
3
0
16 Aug 2023
Finding Stakeholder-Material Information from 10-K Reports using Fine-Tuned BERT and LSTM Models
V. Z. Chen
59
0
0
15 Aug 2023
gSASRec: Reducing Overconfidence in Sequential Recommendation Trained with Negative Sampling
Aleksandr V. Petrov
Craig Macdonald
61
35
0
14 Aug 2023
A Novel Ehanced Move Recognition Algorithm Based on Pre-trained Models with Positional Embeddings
H. Wen
Jie Wang
Xiaodong Qiao
55
0
0
14 Aug 2023
SLoRA: Federated Parameter Efficient Fine-Tuning of Language Models
Sara Babakniya
A. Elkordy
Yahya H. Ezzeldin
Qingfeng Liu
Kee-Bong Song
Mostafa El-Khamy
Salman Avestimehr
76
72
0
12 Aug 2023
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Youliang Yuan
Wenxiang Jiao
Wenxuan Wang
Jen-tse Huang
Pinjia He
Shuming Shi
Zhaopeng Tu
SILM
121
285
0
12 Aug 2023
Identification of the Relevance of Comments in Codes Using Bag of Words and Transformer Based Models
S. Sruthi
Tanmay Basu
31
1
0
11 Aug 2023
LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach Prompts
Shangqing Tu
Zheyuan Zhang
Jifan Yu
Chunyang Li
Siyu Zhang
Zijun Yao
Lei Hou
Juanzi Li
73
11
0
11 Aug 2023
Performance Analysis of Transformer Based Models (BERT, ALBERT and RoBERTa) in Fake News Detection
Shafna Fitria Nur Azizah
Hasan Dwi Cahyono
S. W. Sihwi
Wisnu Widiarto
24
13
0
09 Aug 2023
Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval
Yi Bin
Haoxuan Li
Yahui Xu
Xing Xu
Yang Yang
Heng Tao Shen
VOS
64
19
0
08 Aug 2023
Guarding the Guardians: Automated Analysis of Online Child Sexual Abuse
J. Puentes
Angela Castillo
Wilmar Osejo
Yuly Calderón
Viviana Quintero
L. Saldarriaga
D. Agudelo
Pablo Arbelaez
55
2
0
07 Aug 2023
Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models
Seungcheol Park
Ho-Jin Choi
U. Kang
VLM
80
8
0
07 Aug 2023
Analysis of the Evolution of Advanced Transformer-Based Language Models: Experiments on Opinion Mining
Nour Eddine Zekaoui
Siham Yousfi
Maryem Rhanoui
M. Mikram
49
3
0
07 Aug 2023
Spanish Pre-trained BERT Model and Evaluation Data
J. Cañete
Gabriel Chaperon
Rodrigo Fuentes
Jou-Hui Ho
Hojin Kang
Jorge Pérez
92
667
0
06 Aug 2023
Bengali Fake Reviews: A Benchmark Dataset and Detection System
G. M. Shahariar
Rouf Shawon
F. Shah
Mohammad Shafiul Alam
Md. Shahriar Mahbub
89
6
0
03 Aug 2023
Target-point Attention Transformer: A novel trajectory predict network for end-to-end autonomous driving
Jing Du
Yang Zhao
Hong-wei Cheng
ViT
48
1
0
03 Aug 2023
Survey on Computer Vision Techniques for Internet-of-Things Devices
Ishmeet Kaur
Adwaita Janardhan Jadhav
AI4CE
51
1
0
02 Aug 2023
Contrastive Learning for API Aspect Analysis
G. M. Shahariar
Tahmid Hasan
Anindya Iqbal
Gias Uddin
45
0
0
31 Jul 2023
Multi-output Headed Ensembles for Product Item Classification
H. Shiokawa
Pradipto Das
Arthur R. Toth
Justin Chiu
23
0
0
29 Jul 2023
DPBERT: Efficient Inference for BERT based on Dynamic Planning
Weixin Wu
H. Zhuo
16
0
0
26 Jul 2023