ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Title
TIER: Text-Image Encoder-based Regression for AIGC Image Quality
  Assessment
TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment
Jiquan Yuan
Xinyan Cao
Jinming Che
Qinyuan Wang
Sen Liang
Wei Ren
Jinlong Lin
Xixin Cao
EGVM
49
1
0
08 Jan 2024
An Exploratory Study on Automatic Identification of Assumptions in the
  Development of Deep Learning Frameworks
An Exploratory Study on Automatic Identification of Assumptions in the Development of Deep Learning Frameworks
Chen Yang
Peng Liang
Zinan Ma
38
0
0
08 Jan 2024
Building Efficient and Effective OpenQA Systems for Low-Resource
  Languages
Building Efficient and Effective OpenQA Systems for Low-Resource Languages
Emrah Budur
Riza Ozccelik
Dilara Soylu
Omar Khattab
Tunga Güngör
Christopher Potts
67
3
0
07 Jan 2024
MERBench: A Unified Evaluation Benchmark for Multimodal Emotion
  Recognition
MERBench: A Unified Evaluation Benchmark for Multimodal Emotion Recognition
Zheng Lian
Guoying Zhao
Yong Ren
Hao Gu
Haiyang Sun
Lan Chen
Bin Liu
Jianhua Tao
124
13
0
07 Jan 2024
Enhancing Context Through Contrast
Enhancing Context Through Contrast
Kshitij Ambilduke
Aneesh Shetye
Diksha Bagade
Rishika Bhagwatkar
Khurshed Fitter
P. Vagdargi
Shital S. Chiddarwar
59
0
0
06 Jan 2024
SecureReg: Combining NLP and MLP for Enhanced Detection of Malicious
  Domain Name Registrations
SecureReg: Combining NLP and MLP for Enhanced Detection of Malicious Domain Name Registrations
Furkan cColhak
Mert İlhan Ecevit
Hasan Daug
Reiner Creutzburg
41
0
0
06 Jan 2024
Lotto: Secure Participant Selection against Adversarial Servers in
  Federated Learning
Lotto: Secure Participant Selection against Adversarial Servers in Federated Learning
Zhifeng Jiang
Peng Ye
Shiqi He
Wei Wang
Ruichuan Chen
Bo Li
91
2
0
05 Jan 2024
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts
  for Instruction Tuning on General Tasks
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
Haoyuan Wu
Haisheng Zheng
Zhuolun He
Bei Yu
MoEALM
98
16
0
05 Jan 2024
Understanding LLMs: A Comprehensive Overview from Training to Inference
Understanding LLMs: A Comprehensive Overview from Training to Inference
Yi-Hsueh Liu
Haoyang He
Tianle Han
Xu-Yao Zhang
Mengyuan Liu
...
Xintao Hu
Tuo Zhang
Ning Qiang
Tianming Liu
Bao Ge
SyDa
151
77
0
04 Jan 2024
Towards a Foundation Purchasing Model: Pretrained Generative
  Autoregression on Transaction Sequences
Towards a Foundation Purchasing Model: Pretrained Generative Autoregression on Transaction Sequences
Piotr Skalski
David Sutton
Stuart Burrell
Iker Perez
Jason Wong
AI4TS
71
2
0
03 Jan 2024
Evaluating Fairness in Self-supervised and Supervised Models for
  Sequential Data
Evaluating Fairness in Self-supervised and Supervised Models for Sequential Data
Sofia Yfantidou
Dimitris Spathis
Marios Constantinides
Athena Vakali
Daniele Quercia
F. Kawsar
107
2
0
03 Jan 2024
A Comprehensive Survey of Hallucination Mitigation Techniques in Large
  Language Models
A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models
S.M. Towhidul Islam Tonmoy
S. M. M. Zaman
Vinija Jain
Anku Rani
Vipula Rawte
Aman Chadha
Amitava Das
HILM
120
208
0
02 Jan 2024
Unifying Structured Data as Graph for Data-to-Text Pre-Training
Unifying Structured Data as Graph for Data-to-Text Pre-Training
Shujie Li
Liang Li
Ruiying Geng
Min Yang
Binhua Li
...
Wanwei He
Shao Yuan
Can Ma
Fei Huang
Yongbin Li
LMTD
88
14
0
02 Jan 2024
Masked Modeling for Self-supervised Representation Learning on Vision
  and Beyond
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li
Luyuan Zhang
Zedong Wang
Di Wu
Lirong Wu
...
Jun Xia
Cheng Tan
Yang Liu
Baigui Sun
Stan Z. Li
SSL
107
15
0
31 Dec 2023
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via
  Expressive Masked Audio Gesture Modeling
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling
Haiyang Liu
Zihao Zhu
Giorgio Becherini
Yichen Peng
Mingyang Su
You Zhou
Xuefei Zhe
Naoya Iwamoto
Bo Zheng
Michael J. Black
SLR
172
36
0
31 Dec 2023
Research on the Laws of Multimodal Perception and Cognition from a
  Cross-cultural Perspective -- Taking Overseas Chinese Gardens as an Example
Research on the Laws of Multimodal Perception and Cognition from a Cross-cultural Perspective -- Taking Overseas Chinese Gardens as an Example
Ran Chen
Xueqi Yao
Jing Zhao
Shuhan Xu
Sirui Zhang
Yijun Mao
40
0
0
29 Dec 2023
Multi-Task Multi-Agent Shared Layers are Universal Cognition of
  Multi-Agent Coordination
Multi-Task Multi-Agent Shared Layers are Universal Cognition of Multi-Agent Coordination
Jiawei Wang
Jian Zhao
Zhengtao Cao
Ruili Feng
Rongjun Qin
Yang Yu
58
1
0
25 Dec 2023
Multi-level biomedical NER through multi-granularity embeddings and
  enhanced labeling
Multi-level biomedical NER through multi-granularity embeddings and enhanced labeling
Fahime Shahrokh
Nasser Ghadiri
Rasoul Samani
M. Moradi
133
0
0
24 Dec 2023
Understanding the Potential of FPGA-Based Spatial Acceleration for Large
  Language Model Inference
Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference
Hongzheng Chen
Jiahao Zhang
Yixiao Du
Shaojie Xiang
Zichao Yue
Niansong Zhang
Yaohui Cai
Zhiru Zhang
117
40
0
23 Dec 2023
Characterizing and Classifying Developer Forum Posts with their
  Intentions
Characterizing and Classifying Developer Forum Posts with their Intentions
Xingfang Wu
Eric Thibodeau-Laufer
Heng Li
Foutse Khomh
Santhosh Srinivasan
Jayden Luo
30
0
0
21 Dec 2023
DSFormer: Effective Compression of Text-Transformers by Dense-Sparse
  Weight Factorization
DSFormer: Effective Compression of Text-Transformers by Dense-Sparse Weight Factorization
Rahul Chand
Yashoteja Prabhu
Pratyush Kumar
54
3
0
20 Dec 2023
Parameter-Efficient Fine-Tuning Methods for Pretrained Language Models:
  A Critical Review and Assessment
Parameter-Efficient Fine-Tuning Methods for Pretrained Language Models: A Critical Review and Assessment
Lingling Xu
Haoran Xie
S. J. Qin
Xiaohui Tao
F. Wang
118
163
0
19 Dec 2023
Assessing Logical Reasoning Capabilities of Encoder-Only Transformer
  Models
Assessing Logical Reasoning Capabilities of Encoder-Only Transformer Models
Paulo Pirozelli
M. M. José
Paulo de Tarso P. Filho
A. Brandão
Fabio Gagliardi Cozman
LRMELM
103
2
0
18 Dec 2023
A mathematical perspective on Transformers
A mathematical perspective on Transformers
Borjan Geshkovski
Cyril Letrouit
Yury Polyanskiy
Philippe Rigollet
EDLAI4CE
138
47
0
17 Dec 2023
RDR: the Recap, Deliberate, and Respond Method for Enhanced Language
  Understanding
RDR: the Recap, Deliberate, and Respond Method for Enhanced Language Understanding
Yuxin Zi
Hariram Veeramani
Kaushik Roy
Amit P. Sheth
AI4TS
64
2
0
15 Dec 2023
BinGo: Identifying Security Patches in Binary Code with Graph
  Representation Learning
BinGo: Identifying Security Patches in Binary Code with Graph Representation Learning
Xu He
Shu Wang
Pengbin Feng
Xinda Wang
Shiyu Sun
Qi Li
Kun Sun
27
1
0
13 Dec 2023
One-Step Diffusion Distillation via Deep Equilibrium Models
One-Step Diffusion Distillation via Deep Equilibrium Models
Zhengyang Geng
Ashwini Pokle
Trevor Killeen
75
33
0
12 Dec 2023
Evaluating ChatGPT as a Question Answering System: A Comprehensive
  Analysis and Comparison with Existing Models
Evaluating ChatGPT as a Question Answering System: A Comprehensive Analysis and Comparison with Existing Models
Hossein Bahak
Farzaneh Taheri
Zahra Zojaji
Arefeh Kazemi
ELMAI4MH
73
21
0
11 Dec 2023
Why "classic" Transformers are shallow and how to make them go deep
Why "classic" Transformers are shallow and how to make them go deep
Yueyao Yu
Yin Zhang
ViT
97
0
0
11 Dec 2023
Transformer as Linear Expansion of Learngene
Transformer as Linear Expansion of Learngene
Shiyu Xia
Miaosen Zhang
Xu Yang
Ruiming Chen
Haokun Chen
Xin Geng
71
7
0
09 Dec 2023
Sim-GPT: Text Similarity via GPT Annotated Data
Sim-GPT: Text Similarity via GPT Annotated Data
Shuhe Wang
Beiming Cao
Shengyu Zhang
Xiaoya Li
Jiwei Li
Leilei Gan
Guoyin Wang
Eduard Hovy
79
2
0
09 Dec 2023
Enhanced E-Commerce Attribute Extraction: Innovating with Decorative
  Relation Correction and LLAMA 2.0-Based Annotation
Enhanced E-Commerce Attribute Extraction: Innovating with Decorative Relation Correction and LLAMA 2.0-Based Annotation
Jianghong Zhou
Weizhi Du
Md Omar Faruk Rokon
Zhaodong Wang
Jiaxuan Xu
Isha Shah
Kuang-chih Lee
Musen Wen
26
1
0
09 Dec 2023
Graph Convolutions Enrich the Self-Attention in Transformers!
Graph Convolutions Enrich the Self-Attention in Transformers!
Jeongwhan Choi
Hyowon Wi
Jayoung Kim
Yehjin Shin
Kookjin Lee
Nathaniel Trask
Noseong Park
110
5
0
07 Dec 2023
RoAST: Robustifying Language Models via Adversarial Perturbation with
  Selective Training
RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training
Jaehyung Kim
Yuning Mao
Rui Hou
Hanchao Yu
Davis Liang
Pascale Fung
Qifan Wang
Fuli Feng
Lifu Huang
Madian Khabsa
AAML
58
4
0
07 Dec 2023
Series2Vec: Similarity-based Self-supervised Representation Learning for
  Time Series Classification
Series2Vec: Similarity-based Self-supervised Representation Learning for Time Series Classification
Navid Mohammadi Foumani
Chang Wei Tan
Geoffrey I. Webb
Hamid Rezatofighi
Mahsa Salehi
SSLAI4TS
103
5
0
07 Dec 2023
Detecting Rumor Veracity with Only Textual Information by Double-Channel
  Structure
Detecting Rumor Veracity with Only Textual Information by Double-Channel Structure
Alex G. Kim
Sangwon Yoon
43
4
0
06 Dec 2023
Large Language Models on Graphs: A Comprehensive Survey
Large Language Models on Graphs: A Comprehensive Survey
Bowen Jin
Gang Liu
Chi Han
Meng Jiang
Heng Ji
Jiawei Han
AI4CE
112
161
0
05 Dec 2023
Expand BERT Representation with Visual Information via Grounded Language
  Learning with Multimodal Partial Alignment
Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment
Cong-Duy Nguyen
The-Anh Vu-Le
Thong Nguyen
Tho Quan
Anh Tuan Luu
100
6
0
04 Dec 2023
Unsupervised Approach to Evaluate Sentence-Level Fluency: Do We Really
  Need Reference?
Unsupervised Approach to Evaluate Sentence-Level Fluency: Do We Really Need Reference?
Gopichand Kanumolu
Lokesh Madasu
Pavan Baswani
Ananya Mukherjee
Manish Shrivastava
33
2
0
03 Dec 2023
Learning to Compose SuperWeights for Neural Parameter Allocation Search
Learning to Compose SuperWeights for Neural Parameter Allocation Search
Piotr Teterwak
Soren Nelson
Nikoli Dryden
D. Bashkirova
Kate Saenko
Bryan A. Plummer
103
2
0
03 Dec 2023
Adaptive Resource Allocation for Semantic Communication Networks
Adaptive Resource Allocation for Semantic Communication Networks
Lingyi Wang
Wei Wu
Fuhui Zhou
Zhaohui Yang
Zhijing Qin
114
23
0
02 Dec 2023
The Cost of Compression: Investigating the Impact of Compression on
  Parametric Knowledge in Language Models
The Cost of Compression: Investigating the Impact of Compression on Parametric Knowledge in Language Models
Srinath Namburi
Makesh Narsimhan Sreedhar
Srinath Srinivasan
Frederic Sala
MQ
63
11
0
01 Dec 2023
The Efficiency Spectrum of Large Language Models: An Algorithmic Survey
The Efficiency Spectrum of Large Language Models: An Algorithmic Survey
Tianyu Ding
Tianyi Chen
Haidong Zhu
Jiachen Jiang
Yiqi Zhong
Jinxin Zhou
Guangzhi Wang
Zhihui Zhu
Ilya Zharkov
Luming Liang
121
24
0
01 Dec 2023
Spatial-Temporal-Decoupled Masked Pre-training for Spatiotemporal
  Forecasting
Spatial-Temporal-Decoupled Masked Pre-training for Spatiotemporal Forecasting
Haotian Gao
Renhe Jiang
Zheng Dong
Jinliang Deng
Yuxin Ma
Xuan Song
AI4TS
101
21
0
01 Dec 2023
SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection
SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection
Anku Rani
Dwip Dalal
Shreya Gautam
Pankaj Gupta
Vinija Jain
Aman Chadha
Amit P. Sheth
Amitava Das
70
0
0
01 Dec 2023
Mavericks at BLP-2023 Task 1: Ensemble-based Approach Using Language
  Models for Violence Inciting Text Detection
Mavericks at BLP-2023 Task 1: Ensemble-based Approach Using Language Models for Violence Inciting Text Detection
Saurabh Page
Sudeep Mangalvedhekar
Kshitij Deshpande
Tanmay Chavan
S. Sonawane
60
1
0
30 Nov 2023
DisCGen: A Framework for Discourse-Informed Counterspeech Generation
DisCGen: A Framework for Discourse-Informed Counterspeech Generation
Sabit Hassan
Malihe Alikhani
93
14
0
29 Nov 2023
RACE-IT: A Reconfigurable Analog CAM-Crossbar Engine for In-Memory
  Transformer Acceleration
RACE-IT: A Reconfigurable Analog CAM-Crossbar Engine for In-Memory Transformer Acceleration
Lei Zhao
Luca Buonanno
Ron M. Roth
Sergey Serebryakov
Archit Gajjar
John Moon
Jim Ignowski
Giacomo Pedretti
73
4
0
29 Nov 2023
TARGET: Template-Transferable Backdoor Attack Against Prompt-based NLP
  Models via GPT4
TARGET: Template-Transferable Backdoor Attack Against Prompt-based NLP Models via GPT4
Zihao Tan
Qingliang Chen
Yongjian Huang
Chen Liang
SILMAAML
70
5
0
29 Nov 2023
LayerCollapse: Adaptive compression of neural networks
LayerCollapse: Adaptive compression of neural networks
Soheil Zibakhsh Shabgahi
Mohammad Soheil Shariff
F. Koushanfar
AI4CE
61
1
0
29 Nov 2023
Previous
123...101112...575859
Next