ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL, AIMat
ArXiv (abs) | PDF | HTML | GitHub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Description-Enhanced Label Embedding Contrastive Learning for Text Classification
Kun Zhang
Le Wu
Guangyi Lv
Enhong Chen
Shulan Ruan
Jing Liu
Qing Cui
Jun Zhou
Meng Wang
VLM
52
10
0
15 Jun 2023
Understanding Privacy Over-collection in WeChat Sub-app Ecosystem
Xiaohan Zhang
Yang Wang
Xin Zhang
Ziqi Huang
Lei Zhang
Min Yang
22
5
0
14 Jun 2023
Unifying Large Language Models and Knowledge Graphs: A Roadmap
Shirui Pan
Linhao Luo
Yufei Wang
Chen Chen
Jiapu Wang
Xindong Wu
KELM
160
787
0
14 Jun 2023
SqueezeLLM: Dense-and-Sparse Quantization
Sehoon Kim
Coleman Hooper
A. Gholami
Zhen Dong
Xiuyu Li
Sheng Shen
Michael W. Mahoney
Kurt Keutzer
MQ
150
198
0
13 Jun 2023
Recurrent Attention Networks for Long-text Modeling
Xianming Li
Zongxi Li
Xiaotian Luo
Haoran Xie
Xing Lee
Yingbin Zhao
Fu Lee Wang
Qing Li
RALM
92
15
0
12 Jun 2023
QUERT: Continual Pre-training of Language Model for Query Understanding in Travel Domain Search
Jian Xie
Yidan Liang
Jingping Liu
Yanghua Xiao
Baohua Wu
Shenghua Ni
VLM, LRM
90
9
0
11 Jun 2023
Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method
Shicheng Tan
Weng Lam Tam
Yuanchun Wang
Wenwen Gong
Shuo Zhao
Peng Zhang
Jie Tang
VLM
49
1
0
11 Jun 2023
RoBERTweet: A BERT Language Model for Romanian Tweets
Iulian-Marius Tăiatu
Andrei-Marius Avram
Dumitru-Clementin Cercel
Florin-Catalin Pop
40
1
0
11 Jun 2023
Enhancing Low Resource NER Using Assisting Language And Transfer Learning
Maithili Sabane
Aparna Ranade
Onkar Litake
Parth Patil
Raviraj Joshi
Dipali M. Kadam
64
5
0
10 Jun 2023
Leveraging Language Identification to Enhance Code-Mixed Text Classification
Gauri Takawane
Abhishek Phaltankar
Varad Patwardhan
Aryan Patil
Raviraj Joshi
Mukta S. Takalikar
72
4
0
08 Jun 2023
Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning
Jaehyung Kim
Jinwoo Shin
Dongyeop Kang
64
2
0
08 Jun 2023
MobileNMT: Enabling Translation in 15MB and 30ms
Ye Lin
Xiaohui Wang
Zhexi Zhang
Mingxuan Wang
Tong Xiao
Jingbo Zhu
MQ
63
2
0
07 Jun 2023
Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation
Xiusi Chen
Yu Zhang
Jinliang Deng
Jyun-Yu Jiang
Wei Wang
62
12
0
07 Jun 2023
Randomized Schur Complement Views for Graph Contrastive Learning
Vignesh Kothapalli
117
2
0
06 Jun 2023
Causal interventions expose implicit situation models for commonsense language understanding
Takateru Yamakoshi
James L. McClelland
A. Goldberg
Robert D. Hawkins
104
6
0
06 Jun 2023
On the Difference of BERT-style and CLIP-style Text Encoders
Zhihong Chen
Guiming Hardy Chen
Shizhe Diao
Xiang Wan
Benyou Wang
VLM
67
19
0
06 Jun 2023
CUE: An Uncertainty Interpretation Framework for Text Classifiers Built on Pre-Trained Language Models
Jiazheng Li
Zhaoyue Sun
Bin Liang
Lin Gui
Yulan He
79
2
0
06 Jun 2023
Subgraph Networks Based Contrastive Learning
Jinhuan Wang
Jiafei Shao
Zeyu Wang
Shanqing Yu
Qi Xuan
Xiaoniu Yang
78
1
0
06 Jun 2023
Using Sequences of Life-events to Predict Human Lives
Germans Savcisens
Tina Eliassi-Rad
L. K. Hansen
L. Mortensen
Lau Lilleholt
Anna Rogers
Ingo Zettler
Sune Lehmann
AI4TS
94
46
0
05 Jun 2023
Probing Physical Reasoning with Counter-Commonsense Context
Kazushi Kondo
Saku Sugawara
Akiko Aizawa
LRM
74
4
0
04 Jun 2023
Auto-GPT for Online Decision Making: Benchmarks and Additional Opinions
Hui Yang
Sifu Yue
Yunzhong He
RALM
72
172
0
04 Jun 2023
Improving Generalization in Task-oriented Dialogues with Workflows and Action Plans
Stefania Raimondo
C. Pal
Xiaotian Liu
David Vazquez
Héctor Palacios
48
2
0
02 Jun 2023
Centered Self-Attention Layers
Ameen Ali
Tomer Galanti
Lior Wolf
140
8
0
02 Jun 2023
Data-Efficient French Language Modeling with CamemBERTa
Wissam Antoun
Benoît Sagot
Djamé Seddah
50
7
0
02 Jun 2023
Unsupervised Paraphrasing of Multiword Expressions
Takashi Wada
Yuji Matsumoto
Timothy Baldwin
Jey Han Lau
66
0
0
02 Jun 2023
Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition
H. Haresamudram
Irfan Essa
Thomas Ploetz
98
8
0
01 Jun 2023
Topic-Guided Sampling For Data-Efficient Multi-Domain Stance Detection
Erik Arakelyan
Arnav Arora
Isabelle Augenstein
56
10
0
01 Jun 2023
CL-MRI: Self-Supervised Contrastive Learning to Improve the Accuracy of Undersampled MRI Reconstruction
Mevan Ekanayake
Zhiwen Chen
Mehrtash Harandi
Gary Egan
Zhaolin Chen
81
3
0
01 Jun 2023
Attention-Based Methods For Audio Question Answering
Parthasaarathy Sudarsanam
Tuomas Virtanen
68
3
0
31 May 2023
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech
L. T. Nguyen
Thinh-Le-Gia Pham
Dat Quoc Nguyen
98
14
0
31 May 2023
Assessing Word Importance Using Models Trained for Semantic Tasks
Dávid Javorský
Ondrej Bojar
François Yvon
37
2
0
31 May 2023
What does the Failure to Reason with "Respectively" in Zero/Few-Shot Settings Tell Us about Language Models?
Ruixiang Cui
Seolhwa Lee
Daniel Hershcovich
Anders Søgaard
50
2
0
31 May 2023
Stable Anisotropic Regularization
William Rudman
Carsten Eickhoff
83
6
0
30 May 2023
PreQuant: A Task-agnostic Quantization Approach for Pre-trained Language Models
Zhuocheng Gong
Jiahao Liu
Qifan Wang
Yang Yang
Jingang Wang
Wei Wu
Yunsen Xian
Dongyan Zhao
Rui Yan
MQ
70
5
0
30 May 2023
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
Md Tahmid Rahman Laskar
M Saiful Bari
Mizanur Rahman
Md Amran Hossen Bhuiyan
Shafiq Joty
J. Huang
LM&MA, ELM, ALM
125
193
0
29 May 2023
Whitening-based Contrastive Learning of Sentence Embeddings
Wenjie Zhuo
Yifan Sun
Xiaohan Wang
Linchao Zhu
Yezhou Yang
59
21
0
28 May 2023
Tri-level Joint Natural Language Understanding for Multi-turn Conversational Datasets
H. Weld
Sijia Hu
Siqu Long
Josiah Poon
S. Han
51
1
0
28 May 2023
One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning
Guangtao Zeng
Peiyuan Zhang
Wei Lu
95
22
0
28 May 2023
AI Coach Assist: An Automated Approach for Call Recommendation in Contact Centers for Agent Coaching
Md Tahmid Rahman Laskar
Cheng Chen
Xue-Yong Fu
M. Azizi
Shashi Bhushan
Simon Corston-Oliver
54
2
0
28 May 2023
A Match Made in Heaven: A Multi-task Framework for Hyperbole and Metaphor Detection
Naveen Badathala
Abisek Rajakumar Kalarani
Tejpalsingh Siledar
P. Bhattacharyya
49
12
0
27 May 2023
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers
Dachuan Shi
Chaofan Tao
Anyi Rao
Zhendong Yang
Chun Yuan
Jiaqi Wang
VLM
130
23
0
27 May 2023
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks
Kai Zhang
Jun Yu
Eashan Adhikarla
Rong Zhou
Zhilin Yan
...
Xun Chen
Yong Chen
Quanzheng Li
Hongfang Liu
Lichao Sun
LM&MA, MedIm
103
185
0
26 May 2023
NormBank: A Knowledge Bank of Situational Social Norms
Caleb Ziems
Jane Dwivedi-Yu
Yi-Chia Wang
A. Halevy
Diyi Yang
109
45
0
26 May 2023
Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model
Yibo Miao
Hongcheng Gao
Hao Zhang
Zhijie Deng
DeLMO
78
20
0
26 May 2023
Don't Retrain, Just Rewrite: Countering Adversarial Perturbations by Rewriting Text
Ashim Gupta
Carter Blum
Temma Choji
Yingjie Fei
Shalin S Shah
Alakananda Vempala
Vivek Srikumar
AAML
62
9
0
25 May 2023
Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data
Aryan Patil
Varad Patwardhan
Abhishek Phaltankar
Gauri Takawane
Raviraj Joshi
76
12
0
25 May 2023
Exploring Automatically Perturbed Natural Language Explanations in Relation Extraction
Wanyun Cui
Xingran Chen
LRM, AAML
63
0
0
24 May 2023
Context-Aware Transformer Pre-Training for Answer Sentence Selection
Luca Di Liello
Siddhant Garg
Alessandro Moschitti
71
4
0
24 May 2023
Dynamic Masking Rate Schedules for MLM Pretraining
Zachary Ankner
Naomi Saphra
Davis W. Blalock
Jonathan Frankle
Matthew L. Leavitt
101
8
0
24 May 2023
SETI: Systematicity Evaluation of Textual Inference
Xiyan Fu
Anette Frank
LRM
46
5
0
24 May 2023