ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Speaker Tagging Correction With Non-Autoregressive Language Models
Grigor Kirakosyan
Davit Karamyan
3DV
95
0
0
30 Aug 2024
Is Personality Prediction Possible Based on Reddit Comments?
Robert Deimann
Till Preidt
Shaptarshi Roy
Jan Stanicki
47
0
0
28 Aug 2024
A Survey of Large Language Models for European Languages
Wazir Ali
S. Pyysalo
151
3
0
27 Aug 2024
Shifted Window Fourier Transform And Retention For Image Captioning
J. Hu
Roberto Cavicchioli
Alessandro Capotondi
VLM
106
1
0
25 Aug 2024
Genetic Approach to Mitigate Hallucination in Generative IR
Hrishikesh Kulkarni
Nazli Goharian
O. Frieder
Sean MacAvaney
HILM
62
2
0
25 Aug 2024
Domain-specific long text classification from sparse relevant information
Célia D'Cruz
J. Bereder
Frédéric Precioso
Michel Riveill
105
0
0
23 Aug 2024
Instruct-DeBERTa: A Hybrid Approach for Aspect-based Sentiment Analysis on Textual Reviews
Dineth Jayakody
A. V. A. Malkith
Koshila Isuranda
Vishal Thenuwara
Nisansa de Silva
Sachintha Rajith Ponnamperuma
G. Sandamali
K. L. Sudheera
27
2
0
23 Aug 2024
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models
Wentao Wu
Fanghua Hong
Xiao Wang
Chenglong Li
Jin Tang
VLM
91
1
0
23 Aug 2024
MedDec: A Dataset for Extracting Medical Decisions from Discharge Summaries
Mohamed Elgaar
Jiali Cheng
Nidhi Vakil
Hadi Amiri
Leo Anthony Celi
58
2
0
23 Aug 2024
Internal and External Knowledge Interactive Refinement Framework for Knowledge-Intensive Question Answering
Haowei Du
Dongyan Zhao
KELM
39
0
0
23 Aug 2024
Large Language Models are Good Attackers: Efficient and Stealthy Textual Backdoor Attacks
Ziqiang Li
Yueqi Zeng
Pengfei Xia
Lei Liu
Zhangjie Fu
Bin Li
SILM
AAML
90
3
0
21 Aug 2024
BURExtract-Llama: An LLM for Clinical Concept Extraction in Breast Ultrasound Reports
Yuxuan Chen
Haoyan Yang
Hengkai Pan
Fardeen Siddiqui
Antonio Verdone
Qingyang Zhang
S. Chopra
Chen Zhao
Yiqiu Shen
33
2
0
21 Aug 2024
Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders
Yuan Xin
Zehan Li
Ning Yu
Dingfan Chen
Mario Fritz
Michael Backes
Yang Zhang
PILM
MIACV
106
2
0
20 Aug 2024
Uniting contrastive and generative learning for event sequences models
Aleksandr Yugay
Alexey Zaytsev
AI4TS
97
1
0
19 Aug 2024
MegaFake: A Theory-Driven Dataset of Fake News Generated by Large Language Models
Lionel Z. Wang
Yiming Ma
Renfei Gao
Beichen Guo
Han Zhu
Wenqi Fan
Zexin Lu
Ka Chung Ng
SyDa
75
4
0
19 Aug 2024
A Psychology-based Unified Dynamic Framework for Curriculum Learning
Guangyu Meng
Qingkai Zeng
John P. Lalor
Hong-ye Yu
76
0
0
09 Aug 2024
Investigating a Benchmark for Training-set free Evaluation of Linguistic Capabilities in Machine Reading Comprehension
Viktor Schlegel
Goran Nenadic
Riza Batista-Navarro
ELM
56
0
0
09 Aug 2024
Survey: Transformer-based Models in Data Modality Conversion
Elyas Rashno
Amir Eskandari
Aman Anand
F. Zulkernine
MedIm
91
0
0
08 Aug 2024
MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation
Xiaofeng Mao
Zhengkai Jiang
Qilin Wang
Chencan Fu
Jiangning Zhang
Jiafu Wu
Yabiao Wang
Chengjie Wang
Wei Li
Mingmin Chi
136
4
0
06 Aug 2024
Dopamin: Transformer-based Comment Classifiers through Domain Post-Training and Multi-level Layer Aggregation
Nam Le Hai
Nghi D. Q. Bui
97
5
0
06 Aug 2024
Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with Attention for Multimodal Sarcasm Detection
Sajal Aggarwal
Ananya Pandey
Dinesh Kumar Vishwakarma
73
2
0
05 Aug 2024
Large Language Model Aided QoS Prediction for Service Recommendation
Huiying Liu
Zekun Zhang
Honghao Li
Qilin Wu
Yiwen Zhang
43
2
0
05 Aug 2024
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets
Shima Foolad
Kourosh Kiani
R. Rastgoo
FaML
89
0
0
04 Aug 2024
Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process
Peng Wang
Xiaobin Wang
Chao Lou
Shengyu Mao
Pengjun Xie
Yong Jiang
78
4
0
04 Aug 2024
Cross-layer Attention Sharing for Large Language Models
Yongyu Mu
Yuzhang Wu
Yuchun Fan
Chenglong Wang
Hengyu Li
Qiaozhi He
Murun Yang
Tong Xiao
Jingbo Zhu
85
5
0
04 Aug 2024
Deep Learning based Visually Rich Document Content Understanding: A Survey
Muhammad Ali
Jean Lee
Salman Khan
Eduard Hovy
109
6
0
02 Aug 2024
Pathway to Secure and Trustworthy ZSM for LLMs: Attacks, Defense, and Opportunities
Yangzhen Wu
P. Khuwaja
Kapal Dev
H. A. Hamadi
Yiming Yang
80
1
0
01 Aug 2024
Big Cooperative Learning
Yulai Cong
AI4CE
68
0
0
31 Jul 2024
A Generic Review of Integrating Artificial Intelligence in Cognitive Behavioral Therapy
Meng Jiang
Qing Zhao
Jianqiang Li
Fan Wang
Tianyu He
Xinyan Cheng
Bing Xiang Yang
Grace W.K. Ho
Guanghui Fu
76
6
0
28 Jul 2024
Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification
Vivi Nastase
Paola Merlo
58
3
0
25 Jul 2024
Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow
Tian Guo
E. Hauptmann
AIFin
87
5
0
25 Jul 2024
Large Language Models for Anomaly Detection in Computational Workflows: from Supervised Fine-Tuning to In-Context Learning
Hongwei Jin
George Papadimitriou
Krishnan Raghavan
Pawel Zuk
Prasanna Balaprakash
Cong Wang
A. Mandal
Ewa Deelman
70
2
0
24 Jul 2024
Pre-Training and Prompting for Few-Shot Node Classification on Text-Attributed Graphs
Huan-jing Zhao
Beining Yang
Yukuo Cen
Junyu Ren
Chenhui Zhang
Yuxiao Dong
Evgeny Kharlamov
Shu Zhao
Jie Tang
VLM
94
8
0
22 Jul 2024
Token-Picker: Accelerating Attention in Text Generation with Minimized Memory Transfer via Probability Estimation
Junyoung Park
Myeonggu Kang
Yunki Han
Yang-Gon Kim
Jaekang Shin
Lee-Sup Kim
52
0
0
21 Jul 2024
Sharpness-diversity tradeoff: improving flat ensembles with SharpBalance
Haiquan Lu
Xiaotian Liu
Yefan Zhou
Qunli Li
Kurt Keutzer
Michael W. Mahoney
Yujun Yan
Huanrui Yang
Yaoqing Yang
61
1
0
17 Jul 2024
ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks
Salma Afifi
Ishan G. Thakkar
S. Pasricha
GNN
62
0
0
17 Jul 2024
Evaluating Linguistic Capabilities of Multimodal LLMs in the Lens of Few-Shot Learning
Mustafa Dogan
İlker Kesen
Iacer Calixto
Aykut Erdem
Erkut Erdem
LRM
87
1
0
17 Jul 2024
Sharif-STR at SemEval-2024 Task 1: Transformer as a Regression Model for Fine-Grained Scoring of Textual Semantic Relations
Seyedeh Fatemeh Ebrahimi
Karim Akhavan Azari
Amirmasoud Iravani
Hadi Alizadeh
Zeinab Taghavi
Hossein Sameti
60
4
0
17 Jul 2024
InstructAV: Instruction Fine-tuning Large Language Models for Authorship Verification
Yujia Hu
Zhiqiang Hu
C. Seah
Roy Ka-wei Lee
72
0
0
16 Jul 2024
TCM-FTP: Fine-Tuning Large Language Models for Herbal Prescription Prediction
Xingzhi Zhou
Xin Dong
Chunhao Li
Yuning Bai
Yulong Xu
...
Simon See
Xinpeng Song
Runshun Zhang
Xuezhong Zhou
Nevin L. Zhang
LM&MA
62
5
0
15 Jul 2024
Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules
Zhuocheng Gong
Ang Lv
Jian Guan
Junxi Yan
Wei Wu
Huishuai Zhang
Minlie Huang
Dongyan Zhao
Rui Yan
MoE
86
7
0
09 Jul 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
152
56
0
09 Jul 2024
Noise-Free Explanation for Driving Action Prediction
Hongbo Zhu
Theodor Wulff
R. S. Maharjan
Jinpei Han
Angelo Cangelosi
AAML
FAtt
64
0
0
08 Jul 2024
AI Safety in Generative AI Large Language Models: A Survey
Jaymari Chua
Yun Yvonna Li
Shiyi Yang
Chen Wang
Lina Yao
LM&MA
100
19
0
06 Jul 2024
Beyond Perplexity: Multi-dimensional Safety Evaluation of LLM Compression
Zhichao Xu
Ashim Gupta
Tao Li
Oliver Bentham
Vivek Srikumar
107
13
0
06 Jul 2024
Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition
Aditya K Surikuchi
Raquel Fernández
Sandro Pezzelle
59
6
0
05 Jul 2024
Multi-modal Masked Siamese Network Improves Chest X-Ray Representation Learning
Saeed Shurrab
Alejandro Guerra-Manzanares
Farah E. Shamout
89
1
0
05 Jul 2024
ESQA: Event Sequences Question Answering
Irina Abdullaeva
Andrei Filatov
Mikhail Orlov
Ivan Karpukhin
Viacheslav Vasilev
Denis Dimitrov
Andrey Kuznetsov
Ivan A Kireev
Andrey Savchenko
86
0
0
03 Jul 2024
Aspect-Based Sentiment Analysis Techniques: A Comparative Study
Dineth Jayakody
Koshila Isuranda
A. V. A. Malkith
Nisansa de Silva
Sachintha Rajith Ponnamperuma
G. Sandamali
K. L. Sudheera
65
1
0
03 Jul 2024
Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning
Haobo Song
Hao Zhao
Soumajit Majumder
Tao Lin
58
4
0
01 Jul 2024