ResearchTrend.AI
arXiv: 1909.11942
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL, AIMat
ArXiv (abs) | PDF | HTML | GitHub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
Chengxing Jia
Pengyuan Wang
Ziniu Li
Yi-Chen Li
Zhilong Zhang
Nan Tang
Yang Yu
OffRL
64
2
0
27 May 2024
Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark
Hongliu Cao
AI4TS
104
15
0
27 May 2024
SoK: Leveraging Transformers for Malware Analysis
Pradip Kunwar
Kshitiz Aryal
Maanak Gupta
Mahmoud Abdelsalam
Elisa Bertino
176
0
0
27 May 2024
Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization
Dixuan Wang
Yanda Li
Junyuan Jiang
Zepeng Ding
Ziqin Luo
Guochao Jiang
Jiaqing Liang
Deqing Yang
106
15
0
27 May 2024
Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration
Sunhao Dai
Weihao Liu
Yuqi Zhou
Liang Pang
Rongju Ruan
Gang Wang
Zhenhua Dong
Jun Xu
Jirong Wen
129
12
0
26 May 2024
Accelerating Transformers with Spectrum-Preserving Token Merging
Hoai-Chau Tran
D. M. Nguyen
Duy M. Nguyen
Trung Thanh Nguyen
Ngan Le
Pengtao Xie
Daniel Sonntag
James Y. Zou
Binh T. Nguyen
Mathias Niepert
106
13
0
25 May 2024
MoEUT: Mixture-of-Experts Universal Transformers
Róbert Csordás
Kazuki Irie
Jürgen Schmidhuber
Christopher Potts
Christopher D. Manning
MoE
83
11
0
25 May 2024
GPT is Not an Annotator: The Necessity of Human Annotation in Fairness Benchmark Construction
Virginia K. Felkner
Jennifer A. Thompson
Jonathan May
98
11
0
24 May 2024
ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking
Xudong Han
Nobuyuki Oishi
Yueying Tian
Elif Ucurum
R. Young
C. Chatwin
Philip Birch
82
5
0
24 May 2024
Optimizing Large Language Models for OpenAPI Code Completion
Bohdan Petryshyn
M. Lukoševičius
LLMAG, ALM
70
0
0
24 May 2024
Thinking Forward: Memory-Efficient Federated Finetuning of Language Models
Kunjal Panchal
Nisarg Parikh
Sunav Choudhary
Lijun Zhang
Yuriy Brun
Hui Guan
112
3
0
24 May 2024
CEEBERT: Cross-Domain Inference in Early Exit BERT
Divya J. Bajpai
M. Hanawal
LRM
79
5
0
23 May 2024
Super Tiny Language Models
Dylan Hillier
Leon Guertler
Cheston Tan
Palaash Agrawal
Ruirui Chen
Bobby Cheng
111
6
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
330
54
0
23 May 2024
WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets
Adib Hasan
Mardavij Roozbehani
M. Dahleh
AI4TS
41
0
0
22 May 2024
Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models
Abdurahmman Alzahrani
Eyad Babkier
Faisal Yanbaawi
Firas Yanbaawi
Hassan Alhuzali
66
0
0
21 May 2024
Talk2Radar: Bridging Natural Language with 4D mmWave Radar for 3D Referring Expression Comprehension
Runwei Guan
Ruixiao Zhang
Ningwei Ouyang
Tao Huang
Ka Lok Man
...
Ming Xu
Jeremy S. Smith
Eng Gee Lim
Yutao Yue
Hui Xiong
215
10
0
21 May 2024
CReMa: Crisis Response through Computational Identification and Matching of Cross-Lingual Requests and Offers Shared on Social Media
Rabindra Lamsal
M. Read
S. Karunasekera
Muhammad Imran
60
3
0
20 May 2024
Case-Based Reasoning Approach for Solving Financial Question Answering
Yikyung Kim
Jay-Yoon Lee
AIMat
46
0
0
18 May 2024
The Future of Large Language Model Pre-training is Federated
Lorenzo Sani
Alexandru Iacob
Zeyu Cao
Bill Marino
Yan Gao
...
Wanru Zhao
William F. Shen
Preslav Aleksandrov
Xinchi Qiu
Nicholas D. Lane
AI4CE
161
21
0
17 May 2024
Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities
Hao Zhou
Chengming Hu
Ye Yuan
Yufei Cui
Yili Jin
...
Di Wu
Xue Liu
Charlie Zhang
Xianbin Wang
Jiangchuan Liu
111
79
0
17 May 2024
A survey on fairness of large language models in e-commerce: progress, application, and challenge
Qingyang Ren
Zilin Jiang
Jinghan Cao
Sijia Li
Chiqu Li
Yiyang Liu
Shuning Huo
Tiange He
Yuan Chen
AILaw, FaML
101
7
0
15 May 2024
A Survey of Generative Techniques for Spatial-Temporal Data Mining
Qianru Zhang
Haixin Wang
Cheng Long
Liangcai Su
Xingwei He
...
Tailin Wu
Hongzhi Yin
Siu-Ming Yiu
Qi Tian
Christian S. Jensen
AI4TS
95
9
0
15 May 2024
A Survey on Transformers in NLP with Focus on Efficiency
Wazib Ansar
Saptarsi Goswami
Amlan Chakrabarti
MedIm
93
2
0
15 May 2024
Impact of Stickers on Multimodal Chat Sentiment Analysis and Intent Recognition: A New Task, Dataset and Baseline
Yuanchen Shi
Biao Ma
Fang Kong
57
0
0
14 May 2024
A Decoupling and Aggregating Framework for Joint Extraction of Entities and Relations
Yao Wang
Xin Liu
Weikun Kong
Hai-tao Yu
Teeradaj Racharak
Kyoung-Sook Kim
Le-Minh Nguyen
84
0
0
14 May 2024
ViWikiFC: Fact-Checking for Vietnamese Wikipedia-Based Textual Knowledge Source
Hung Tuan Le
Long Truong To
Manh Trong Nguyen
Kiet Van Nguyen
112
3
0
13 May 2024
DEPTH: Discourse Education through Pre-Training Hierarchically
Zachary Bamberger
Ofek Glick
Chaim Baskin
Yonatan Belinkov
126
0
0
13 May 2024
Branching Narratives: Character Decision Points Detection
Alexey Tikhonov
57
2
0
12 May 2024
ExplainableDetector: Exploring Transformer-based Language Modeling Approach for SMS Spam Detection with Explainability Analysis
Mohammad Amaz Uddin
Muhammad Nazrul Islam
Leandros A. Maglaras
Helge Janicke
Iqbal H. Sarker
70
3
0
12 May 2024
SaudiBERT: A Large Language Model Pretrained on Saudi Dialect Corpora
Faisal Qarah
65
6
0
10 May 2024
Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media
Zhizhen Zhang
Ning Wang
Haojie Li
Zhihui Wang
66
0
0
09 May 2024
Multi-level Shared Knowledge Guided Learning for Knowledge Graph Completion
Yongxue Shan
Jie Zhou
Jie Peng
Xin Zhou
Jiaqian Yin
Xiaodong Wang
105
3
0
08 May 2024
A Review on Discriminative Self-supervised Learning Methods in Computer Vision
Nikolaos Giakoumoglou
Tania Stathaki
Athanasios Gkelias
SSL
121
0
0
08 May 2024
Switchable Decision: Dynamic Neural Generation Networks
Shujian Zhang
Korawat Tanwisuth
Chengyue Gong
Pengcheng He
Mi Zhou
BDL
72
0
0
07 May 2024
Revisiting character-level adversarial attacks
Elias Abad Rocamora
Yongtao Wu
Fanghui Liu
Grigorios G. Chrysos
Volkan Cevher
AAML
96
4
0
07 May 2024
LingML: Linguistic-Informed Machine Learning for Enhanced Fake News Detection
Jasraj Singh
Fang Liu
Hong Xu
Bee Chin Ng
Wei Zhang
AI4CE
52
0
0
07 May 2024
Exploring prompts to elicit memorization in masked language model-based named entity recognition
Yuxi Xia
Anastasiia Sedova
Pedro Henrique Luz de Araujo
Vasiliki Kougia
Lisa Nussbaumer
Benjamin Roth
86
1
0
05 May 2024
Enabling Patient-side Disease Prediction via the Integration of Patient Narratives
Zhixiang Su
Yinan Zhang
Jiazheng Jing
Jie Xiao
Zhiqi Shen
42
0
0
05 May 2024
Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset
Hsuvas Borkakoty
Luis Espinosa-Anke
77
1
0
03 May 2024
Exploiting ChatGPT for Diagnosing Autism-Associated Language Disorders and Identifying Distinct Features
Chuanbo Hu
Wenqi Li
Mindi Ruan
Xiangxu Yu
Lynn K. Paul
Shuo Wang
Xin Li
34
3
0
03 May 2024
Large Language Models for UAVs: Current State and Pathways to the Future
Shumaila Javaid
Nasir Saeed
Bin He
96
24
0
02 May 2024
Enhancing Language Models for Financial Relation Extraction with Named Entities and Part-of-Speech
Menglin Li
Kwan Hui Lim
61
1
0
02 May 2024
A Named Entity Recognition and Topic Modeling-based Solution for Locating and Better Assessment of Natural Disasters in Social Media
Ayaz Mehmood
Muhammad Tayyab Zamir
Muhammad Asif Ayub
Nasir Ahmad
Kashif Ahmad
53
2
0
01 May 2024
EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization
Jianzong Wang
Ziqi Liang
Xulong Zhang
Ning Cheng
Jing Xiao
81
0
0
30 Apr 2024
Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension
Lin Ai
Zheng Hui
Zizhou Liu
Julia Hirschberg
53
1
0
27 Apr 2024
Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering
Chenhao Cui
Yufan Jiang
Shuangzhi Wu
Zhoujun Li
FaML
55
0
0
27 Apr 2024
CoSD: Collaborative Stance Detection with Contrastive Heterogeneous Topic Graph Learning
Yinghan Cheng
Qi Zhang
Chongyang Shi
Liang Xiao
Shufeng Hao
Liang Hu
80
0
0
26 Apr 2024
Exploring Internal Numeracy in Language Models: A Case Study on ALBERT
Ulme Wennberg
G. Henter
MILM
71
1
0
25 Apr 2024
Exploring Learngene via Stage-wise Weight Sharing for Initializing Variable-sized Models
Shiyu Xia
Wenxuan Zhu
Xu Yang
Xin Geng
54
2
0
25 Apr 2024