ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Title
Finnish SQuAD: A Simple Approach to Machine Translation of Span Annotations
Finnish SQuAD: A Simple Approach to Machine Translation of Span Annotations
Emil Nuutinen
Iiro Rastas
Filip Ginter
61
2
0
10 Jan 2025
Merging Feed-Forward Sublayers for Compressed Transformers
Merging Feed-Forward Sublayers for Compressed Transformers
Neha Verma
Kenton W. Murray
Kevin Duh
AI4CE
152
0
0
10 Jan 2025
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Nikita Neveditsin
Pawan Lingras
V. Mago
LM&MA
113
5
0
08 Jan 2025
Trust Modeling in Counseling Conversations: A Benchmark Study
Aseem Srivastava
Zuhair Hasan Shaik
Tanmoy Chakraborty
Md. Shad Akhtar
82
0
0
06 Jan 2025
Decoding News Bias: Multi Bias Detection in News Articles
Decoding News Bias: Multi Bias Detection in News Articles
Bhushan Santosh Shah
Deven Santosh Shah
Vahida Attar
146
1
0
05 Jan 2025
Swift Cross-Dataset Pruning: Enhancing Fine-Tuning Efficiency in Natural Language Understanding
Swift Cross-Dataset Pruning: Enhancing Fine-Tuning Efficiency in Natural Language Understanding
Binh-Nguyen Nguyen
Yang He
116
1
0
05 Jan 2025
Efficient support ticket resolution using Knowledge Graphs
Sherwin Varghese
James Tian
57
0
0
03 Jan 2025
Toward Corpus Size Requirements for Training and Evaluating Depression Risk Models Using Spoken Language
Tomek Rutowski
Amir Harati
Elizabeth Shriberg
Yang Lu
Piotr Chlebek
Ricardo Oliveira
129
7
0
03 Jan 2025
Text Classification: Neural Networks VS Machine Learning Models VS Pre-trained Models
Text Classification: Neural Networks VS Machine Learning Models VS Pre-trained Models
Christos Petridis
VLM
134
3
0
31 Dec 2024
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
Hanguang Xiao
Feizhong Zhou
Xianglong Liu
Tianqi Liu
Zhipeng Li
Xin Liu
Xiaoxuan Huang
AILawLM&MALRM
143
30
0
31 Dec 2024
SAFE-MEME: Structured Reasoning Framework for Robust Hate Speech Detection in Memes
SAFE-MEME: Structured Reasoning Framework for Robust Hate Speech Detection in Memes
Palash Nandi
Shivam Sharma
Tanmoy Chakraborty
69
1
0
31 Dec 2024
AIGT: AI Generative Table Based on Prompt
AIGT: AI Generative Table Based on Prompt
Mingming Zhang
Zhiqing Xiao
Guoshan Lu
Sai Wu
Weiqiang Wang
Xing Fu
Can Yi
Junbo Zhao
LMTDVLM
73
2
0
24 Dec 2024
Adversarial Robustness through Dynamic Ensemble Learning
Adversarial Robustness through Dynamic Ensemble Learning
Hetvi Waghela
Jaydip Sen
Sneha Rakshit
AAML
129
0
0
20 Dec 2024
COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated
  Sample Weighting Mechanism
COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism
Jianing He
Qi Zhang
Hongyun Zhang
Xuanjing Huang
Usman Naseem
Duoqian Miao
136
1
0
17 Dec 2024
Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental
  Health
Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental Health
Vivek Kumar
Eirini Ntoutsi
Pushpraj Singh Rajawat
Giacomo Medda
Diego Reforgiato Recupero
AI4MH
112
1
0
17 Dec 2024
Bias Vector: Mitigating Biases in Language Models with Task Arithmetic
  Approach
Bias Vector: Mitigating Biases in Language Models with Task Arithmetic Approach
Daiki Shirafuji
Makoto Takenaka
Shinya Taguchi
LLMAG
110
1
0
16 Dec 2024
One Pixel is All I Need
One Pixel is All I Need
Deng Siqin
Zhou Xiaoyi
ViT
453
0
0
14 Dec 2024
BinarySelect to Improve Accessibility of Black-Box Attack Research
BinarySelect to Improve Accessibility of Black-Box Attack Research
Shatarupa Ghosh
Jonathan Rusert
AAML
148
0
0
13 Dec 2024
SMMF: Square-Matricized Momentum Factorization for Memory-Efficient
  Optimization
SMMF: Square-Matricized Momentum Factorization for Memory-Efficient Optimization
Kwangryeol Park
Seulki Lee
88
0
0
12 Dec 2024
Coverage-based Fairness in Multi-document Summarization
Coverage-based Fairness in Multi-document Summarization
Haoyuan Li
Yusen Zhang
Rui Zhang
Snigdha Chaturvedi
181
0
0
11 Dec 2024
KITE-DDI: A Knowledge graph Integrated Transformer Model for accurately
  predicting Drug-Drug Interaction Events from Drug SMILES and Biomedical
  Knowledge Graph
KITE-DDI: A Knowledge graph Integrated Transformer Model for accurately predicting Drug-Drug Interaction Events from Drug SMILES and Biomedical Knowledge Graph
Azwad Tamir
Jiann-Shiun Yuan
93
0
0
08 Dec 2024
AntLM: Bridging Causal and Masked Language Models
AntLM: Bridging Causal and Masked Language Models
Xinru Yu
Bin Guo
Shiwei Luo
Jiadong Wang
Tao Ji
Yuanbin Wu
CLL
132
1
0
04 Dec 2024
Generative Language Models Potential for Requirement Engineering
  Applications: Insights into Current Strengths and Limitations
Generative Language Models Potential for Requirement Engineering Applications: Insights into Current Strengths and Limitations
Summra Saleem
Muhammad Nabeel Asim
L. V. Elst
Andreas Dengel
109
0
0
01 Dec 2024
DynRank: Improving Passage Retrieval with Dynamic Zero-Shot Prompting
  Based on Question Classification
DynRank: Improving Passage Retrieval with Dynamic Zero-Shot Prompting Based on Question Classification
Abdelrahman Abdallah
Jamshid Mozafari
Bhawna Piryani
Mohammed M. Abdelgwad
Adam Jatowt
145
1
0
30 Nov 2024
Turing Representational Similarity Analysis (RSA): A Flexible Method for Measuring Alignment Between Human and Artificial Intelligence
Turing Representational Similarity Analysis (RSA): A Flexible Method for Measuring Alignment Between Human and Artificial Intelligence
Mattson Ogg
Ritwik Bose
Jamie Scharf
Christopher Ratto
Michael Wolmetz
151
2
0
30 Nov 2024
Can bidirectional encoder become the ultimate winner for downstream
  applications of foundation models?
Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?
Lewen Yang
Xuanyu Zhou
Juao Fan
Xinyi Xie
Shengxin Zhu
AI4CE
123
0
0
27 Nov 2024
What Differentiates Educational Literature? A Multimodal Fusion Approach
  of Transformers and Computational Linguistics
What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics
Jordan J. Bird
121
0
0
26 Nov 2024
Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning
  Small Language Models
Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models
Y. Fu
Yin Yu
Xiaotian Han
Runchao Li
Xianxuan Long
Haotian Yu
Pan Li
SyDa
174
0
0
25 Nov 2024
Profiling Bias in LLMs: Stereotype Dimensions in Contextual Word Embeddings
Profiling Bias in LLMs: Stereotype Dimensions in Contextual Word Embeddings
Carolin M. Schuster
Maria-Alexandra Dinisor
Shashwat Ghatiwala
Georg Groh
162
2
0
25 Nov 2024
A Comparative Analysis of Transformer and LSTM Models for Detecting
  Suicidal Ideation on Reddit
A Comparative Analysis of Transformer and LSTM Models for Detecting Suicidal Ideation on Reddit
Khalid Hasan
Jamil Saquer
AI4MH
105
1
0
23 Nov 2024
IRLab@iKAT24: Learned Sparse Retrieval with Multi-aspect LLM Query
  Generation for Conversational Search
IRLab@iKAT24: Learned Sparse Retrieval with Multi-aspect LLM Query Generation for Conversational Search
Simon Lupart
Zahra Abbasiantaeb
Mohammad Aliannejadi
RALM
78
1
0
22 Nov 2024
FLARE: FP-Less PTQ and Low-ENOB ADC Based AMS-PiM for Error-Resilient,
  Fast, and Efficient Transformer Acceleration
FLARE: FP-Less PTQ and Low-ENOB ADC Based AMS-PiM for Error-Resilient, Fast, and Efficient Transformer Acceleration
Donghyeon Yi
Seoyoung Lee
Jongho Kim
Junyoung Kim
Sohmyung Ha
Ik Joon Chang
Minkyu Je
103
0
0
22 Nov 2024
BERT-Based Approach for Automating Course Articulation Matrix
  Construction with Explainable AI
BERT-Based Approach for Automating Course Articulation Matrix Construction with Explainable AI
Natenaile Asmamaw Shiferaw
Simpenzwe Honore Leandre
Aman Sinha
Dillip Rout
72
0
0
21 Nov 2024
Forecasting Future International Events: A Reliable Dataset for
  Text-Based Event Modeling
Forecasting Future International Events: A Reliable Dataset for Text-Based Event Modeling
Daehoon Gwak
Junwoo Park
Minho Park
C. Park
Hyunchan Lee
E. Choi
Jaegul Choo
110
1
0
21 Nov 2024
Mitigating Gender Bias in Contextual Word Embeddings
Mitigating Gender Bias in Contextual Word Embeddings
Navya Yarrabelly
Vinay Damodaran
Feng-Guang Su
87
0
0
18 Nov 2024
New Emerged Security and Privacy of Pre-trained Model: a Survey and
  Outlook
New Emerged Security and Privacy of Pre-trained Model: a Survey and Outlook
Meng Yang
Tianqing Zhu
Chi Liu
Wanlei Zhou
Shui Yu
Philip S. Yu
AAMLELMPILM
112
1
0
12 Nov 2024
Clustering in Causal Attention Masking
Clustering in Causal Attention Masking
Nikita Karagodin
Yury Polyanskiy
Philippe Rigollet
134
7
0
07 Nov 2024
PSformer: Parameter-efficient Transformer with Segment Attention for Time Series Forecasting
PSformer: Parameter-efficient Transformer with Segment Attention for Time Series Forecasting
Yanlong Wang
Jinfeng Xu
Fei Ma
Shao-Lun Huang
Danny Dongning Sun
Xiao-Ping Zhang
AI4TS
124
1
0
03 Nov 2024
Human-inspired Perspectives: A Survey on AI Long-term Memory
Human-inspired Perspectives: A Survey on AI Long-term Memory
Zihong He
Weizhe Lin
Hao Zheng
Fan Zhang
Matt Jones
Laurence Aitchison
X. Xu
Miao Liu
Per Ola Kristensson
Junxiao Shen
244
3
0
01 Nov 2024
ProTransformer: Robustify Transformers via Plug-and-Play Paradigm
ProTransformer: Robustify Transformers via Plug-and-Play Paradigm
Zhichao Hou
Weizhi Gao
Yuchen Shen
Feiyi Wang
Xiaorui Liu
VLM
70
2
0
30 Oct 2024
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Sangmin Bae
Adam Fisch
Hrayr Harutyunyan
Ziwei Ji
Seungyeon Kim
Tal Schuster
KELM
137
7
0
28 Oct 2024
Ensembling Finetuned Language Models for Text Classification
Ensembling Finetuned Language Models for Text Classification
Sebastian Pineda Arango
Maciej Janowski
Lennart Purucker
Arber Zela
Frank Hutter
Josif Grabocka
73
0
0
25 Oct 2024
Deep Insights into Cognitive Decline: A Survey of Leveraging
  Non-Intrusive Modalities with Deep Learning Techniques
Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques
David Ortiz-Perez
Manuel Benavent-Lledo
José García Rodríguez
David Tomás
M. Flores Vizcaya-Moreno
69
1
0
24 Oct 2024
MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
Zebin Yang
Renze Chen
Taiqiang Wu
Ngai Wong
Yun Liang
Runsheng Wang
R. Huang
Meng Li
MQ
82
1
0
23 Oct 2024
Quantifying the Risks of Tool-assisted Rephrasing to Linguistic
  Diversity
Quantifying the Risks of Tool-assisted Rephrasing to Linguistic Diversity
Mengying Wang
Andreas Spitz
26
0
0
23 Oct 2024
Acoustic Model Optimization over Multiple Data Sources: Merging and
  Valuation
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation
Victor Junqiu Wei
Weicheng Wang
Di Jiang
Conghui Tan
Rongzhong Lian
MoMe
94
0
0
21 Oct 2024
Causality for Large Language Models
Causality for Large Language Models
Anpeng Wu
Kun Kuang
Minqin Zhu
Yingrong Wang
Yujia Zheng
Kairong Han
Yangqiu Song
Guangyi Chen
Leilei Gan
Kun Zhang
LRM
115
9
0
20 Oct 2024
Pseudo-label Refinement for Improving Self-Supervised Learning Systems
Pseudo-label Refinement for Improving Self-Supervised Learning Systems
Zia-ur-Rehman
Arif Mahmood
Wenxiong Kang
74
1
0
18 Oct 2024
Attuned to Change: Causal Fine-Tuning under Latent-Confounded Shifts
Attuned to Change: Causal Fine-Tuning under Latent-Confounded Shifts
Jialin Yu
Yuxiang Zhou
Yulan He
Nevin L. Zhang
Ricardo Silva
Philip Torr
Ricardo M. A. Silva
93
0
0
18 Oct 2024
From Babbling to Fluency: Evaluating the Evolution of Language Models in
  Terms of Human Language Acquisition
From Babbling to Fluency: Evaluating the Evolution of Language Models in Terms of Human Language Acquisition
Qiyuan Yang
Pengda Wang
Luke D. Plonsky
Frederick L. Oswald
Hanjie Chen
ELM
77
2
0
17 Oct 2024
Previous
123456...575859
Next