ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Title
Estimating class separability of text embeddings with persistent
  homology
Estimating class separability of text embeddings with persistent homology
Kostis Gourgoulias
Najah F. Ghalyan
Maxime Labonne
Yash Satsangi
Sean J. Moran
Joseph Sabelja
92
1
0
24 May 2023
PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and
  Compositional Experts
PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts
Yunshui Li
Binyuan Hui
Zhichao Yin
Min Yang
Fei Huang
Yongbin Li
MoE
87
21
0
24 May 2023
#REVAL: a semantic evaluation framework for hashtag recommendation
#REVAL: a semantic evaluation framework for hashtag recommendation
Areej Alsini
D. Huynh
A. Datta
41
0
0
24 May 2023
Machine Reading Comprehension using Case-based Reasoning
Machine Reading Comprehension using Case-based Reasoning
Dung Ngoc Thai
Dhruv Agarwal
Mudit Chaudhary
Wenlong Zhao
Rajarshi Das
Manzil Zaheer
J. Lee
Hannaneh Hajishirzi
Andrew McCallum
98
1
0
24 May 2023
DialogVCS: Robust Natural Language Understanding in Dialogue System
  Upgrade
DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade
Zefan Cai
Xin Zheng
Tianyu Liu
Xu Wang
H. Meng
Jiaqi Han
Gang Yuan
Binghuai Lin
Baobao Chang
Yunbo Cao
70
4
0
24 May 2023
TACR: A Table-alignment-based Cell-selection and Reasoning Model for
  Hybrid Question-Answering
TACR: A Table-alignment-based Cell-selection and Reasoning Model for Hybrid Question-Answering
Jian Wu
Yicheng Xu
Yan Gao
Jian-Guang Lou
Börje F. Karlsson
Manabu Okumura
LMTD
54
3
0
24 May 2023
Few-shot Unified Question Answering: Tuning Models or Prompts?
Few-shot Unified Question Answering: Tuning Models or Prompts?
Srijan Bansal
Semih Yavuz
Bo Pang
Meghana Moorthy Bhat
Yingbo Zhou
102
2
0
23 May 2023
Training Transitive and Commutative Multimodal Transformers with LoReTTa
Training Transitive and Commutative Multimodal Transformers with LoReTTa
Manuel Tran
Yashin Dicente Cid
Amal Lahiani
Fabian J. Theis
Tingying Peng
Eldad Klaiman
54
2
0
23 May 2023
WYWEB: A NLP Evaluation Benchmark For Classical Chinese
WYWEB: A NLP Evaluation Benchmark For Classical Chinese
Bo Zhou
Qianglong Chen
Tianyu Wang
Xiaoshi Zhong
Yin Zhang
ELM
119
10
0
23 May 2023
Weakly Supervised 3D Open-vocabulary Segmentation
Weakly Supervised 3D Open-vocabulary Segmentation
Kunhao Liu
Fangneng Zhan
Jiahui Zhang
Muyu Xu
Yingchen Yu
Abdulmotaleb El Saddik
Christian Theobalt
Eric P. Xing
Shijian Lu
121
70
0
23 May 2023
Assessing Linguistic Generalisation in Language Models: A Dataset for
  Brazilian Portuguese
Assessing Linguistic Generalisation in Language Models: A Dataset for Brazilian Portuguese
Rodrigo Wilkens
Leonardo Zilio
Aline Villavicencio
56
1
0
23 May 2023
VisorGPT: Learning Visual Prior via Generative Pre-Training
VisorGPT: Learning Visual Prior via Generative Pre-Training
Jinheng Xie
Kai Ye
Yudong Li
Yuexiang Li
Kevin Qinghong Lin
Yefeng Zheng
Linlin Shen
Mike Zheng Shou
ViT
321
8
0
23 May 2023
AxomiyaBERTa: A Phonologically-aware Transformer Model for Assamese
AxomiyaBERTa: A Phonologically-aware Transformer Model for Assamese
Abhijnan Nath
Sheikh Mannan
Nikhil Krishnaswamy
68
6
0
23 May 2023
Can LLMs facilitate interpretation of pre-trained language models?
Can LLMs facilitate interpretation of pre-trained language models?
Basel Mousi
Nadir Durrani
Fahim Dalvi
93
13
0
22 May 2023
To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis
To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis
Fuzhao Xue
Yao Fu
Wangchunshu Zhou
Zangwei Zheng
Yang You
149
86
0
22 May 2023
A Pretrainer's Guide to Training Data: Measuring the Effects of Data
  Age, Domain Coverage, Quality, & Toxicity
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Shayne Longpre
Gregory Yauney
Emily Reif
Katherine Lee
Adam Roberts
...
Denny Zhou
Jason W. Wei
Kevin Robinson
David M. Mimno
Daphne Ippolito
117
168
0
22 May 2023
Bidirectional Transformer Reranker for Grammatical Error Correction
Bidirectional Transformer Reranker for Grammatical Error Correction
Ying Zhang
Hidetaka Kamigaito
Manabu Okumura
44
2
0
22 May 2023
On Bias and Fairness in NLP: Investigating the Impact of Bias and
  Debiasing in Language Models on the Fairness of Toxicity Detection
On Bias and Fairness in NLP: Investigating the Impact of Bias and Debiasing in Language Models on the Fairness of Toxicity Detection
Fatma Elsafoury
Stamos Katsigiannis
67
1
0
22 May 2023
Kanbun-LM: Reading and Translating Classical Chinese in Japanese Methods
  by Language Models
Kanbun-LM: Reading and Translating Classical Chinese in Japanese Methods by Language Models
Hao Wang
Hirofumi Shimizu
Daisuke Kawahara
73
1
0
22 May 2023
Keeping Up with the Language Models: Robustness-Bias Interplay in NLI
  Data and Models
Keeping Up with the Language Models: Robustness-Bias Interplay in NLI Data and Models
Ioana Baldini
Chhavi Yadav
Payel Das
Kush R. Varshney
MLAU
84
3
0
22 May 2023
Infor-Coef: Information Bottleneck-based Dynamic Token Downsampling for
  Compact and Efficient language model
Infor-Coef: Information Bottleneck-based Dynamic Token Downsampling for Compact and Efficient language model
Wenxin Tan
50
1
0
21 May 2023
F-PABEE: Flexible-patience-based Early Exiting for Single-label and
  Multi-label text Classification Tasks
F-PABEE: Flexible-patience-based Early Exiting for Single-label and Multi-label text Classification Tasks
Xiangxiang Gao
Wei-wei Zhu
Jiasheng Gao
Congrui Yin
VLM
92
12
0
21 May 2023
Pruning Pre-trained Language Models with Principled Importance and
  Self-regularization
Pruning Pre-trained Language Models with Principled Importance and Self-regularization
Siyu Ren
Kenny Q. Zhu
79
2
0
21 May 2023
Machine Translation by Projecting Text into the Same
  Phonetic-Orthographic Space Using a Common Encoding
Machine Translation by Projecting Text into the Same Phonetic-Orthographic Space Using a Common Encoding
Amit Kumar
Shantipriya Parida
A. Pratap
Anil Kumar Singh
78
2
0
21 May 2023
Dynamic Transformers Provide a False Sense of Efficiency
Dynamic Transformers Provide a False Sense of Efficiency
Yiming Chen
Simin Chen
Zexin Li
Wei Yang
Cong Liu
R. Tan
Haizhou Li
AAML
90
12
0
20 May 2023
"What do others think?": Task-Oriented Conversational Modeling with
  Subjective Knowledge
"What do others think?": Task-Oriented Conversational Modeling with Subjective Knowledge
Chao Zhao
Spandana Gella
Seokhwan Kim
Di Jin
Devamanyu Hazarika
Alexandros Papangelis
Behnam Hedayatnia
Mahdi Namazifar
Yang Liu
Dilek Z. Hakkani-Tür
91
7
0
20 May 2023
LLM-Pruner: On the Structural Pruning of Large Language Models
LLM-Pruner: On the Structural Pruning of Large Language Models
Xinyin Ma
Gongfan Fang
Xinchao Wang
173
445
0
19 May 2023
Zero-Shot Text Classification via Self-Supervised Tuning
Zero-Shot Text Classification via Self-Supervised Tuning
Chaoqun Liu
Wenxuan Zhang
Guizhen Chen
Xiaobao Wu
Anh Tuan Luu
Chip Hong Chang
Lidong Bing
VLM
85
11
0
19 May 2023
A Survey of Safety and Trustworthiness of Large Language Models through
  the Lens of Verification and Validation
A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation
Xiaowei Huang
Wenjie Ruan
Wei Huang
Gao Jin
Yizhen Dong
...
Sihao Wu
Peipei Xu
Dengyu Wu
André Freitas
Mustafa A. Mustafa
ALM
132
96
0
19 May 2023
How does the task complexity of masked pretraining objectives affect
  downstream performance?
How does the task complexity of masked pretraining objectives affect downstream performance?
Atsuki Yamaguchi
Hiroaki Ozaki
Terufumi Morishita
Gaku Morio
Yasuhiro Sogawa
86
2
0
18 May 2023
Diffusion Language Models Generation Can Be Halted Early
Diffusion Language Models Generation Can Be Halted Early
Sofia Maria Lo Cicero Vaina
Nikita Balagansky
Daniil Gavrilov
DiffM
84
0
0
18 May 2023
Vision-Language Pre-training with Object Contrastive Learning for 3D
  Scene Understanding
Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Understanding
Zhang Tao
Su He
D. Tao
Bin Chen
Zhi Wang
Shutao Xia
VLM
82
27
0
18 May 2023
NoisywikiHow: A Benchmark for Learning with Real-world Noisy Labels in
  Natural Language Processing
NoisywikiHow: A Benchmark for Learning with Real-world Noisy Labels in Natural Language Processing
Tingting Wu
Xiao Ding
Minji Tang
Haotian Zhang
Bing Qin
Ting Liu
NoLa
94
11
0
18 May 2023
Large-Scale Text Analysis Using Generative Language Models: A Case Study
  in Discovering Public Value Expressions in AI Patents
Large-Scale Text Analysis Using Generative Language Models: A Case Study in Discovering Public Value Expressions in AI Patents
Sergio Pelaez
Gaurav Verma
Barbara Ribeiro
P. Shapira
96
15
0
17 May 2023
UniEX: An Effective and Efficient Framework for Unified Information
  Extraction via a Span-extractive Perspective
UniEX: An Effective and Efficient Framework for Unified Information Extraction via a Span-extractive Perspective
Ping Yang
Junyu Lu
Ruyi Gan
Junjie Wang
Yuxiang Zhang
Jiaxing Zhang
Pingjian Zhang
71
11
0
17 May 2023
The Interpreter Understands Your Meaning: End-to-end Spoken Language
  Understanding Aided by Speech Translation
The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation
Mutian He
Philip N. Garner
100
4
0
16 May 2023
Weight-Inherited Distillation for Task-Agnostic BERT Compression
Weight-Inherited Distillation for Task-Agnostic BERT Compression
Taiqiang Wu
Cheng-An Hou
Shanshan Lao
Jiayi Li
Ngai Wong
Zhe Zhao
Yujiu Yang
136
10
0
16 May 2023
Exploring In-Context Learning Capabilities of Foundation Models for
  Generating Knowledge Graphs from Text
Exploring In-Context Learning Capabilities of Foundation Models for Generating Knowledge Graphs from Text
H. Khorashadizadeh
Nandana Mihindukulasooriya
Sanju Tiwari
Jinghua Groppe
Sven Groppe
73
23
0
15 May 2023
MeeQA: Natural Questions in Meeting Transcripts
MeeQA: Natural Questions in Meeting Transcripts
Reut Apel
Tom Braude
Amir Kantor
Eyal Kolman
RALM
59
2
0
15 May 2023
Coreference-aware Double-channel Attention Network for Multi-party
  Dialogue Reading Comprehension
Coreference-aware Double-channel Attention Network for Multi-party Dialogue Reading Comprehension
Yanling Li
Bowei Zou
Yifan Fan
Mengxing Dong
Yu Hong
70
4
0
15 May 2023
From Pretraining Data to Language Models to Downstream Tasks: Tracking
  the Trails of Political Biases Leading to Unfair NLP Models
From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models
Shangbin Feng
Chan Young Park
Yuhan Liu
Yulia Tsvetkov
104
248
0
15 May 2023
Distinguish Before Answer: Generating Contrastive Explanation as
  Knowledge for Commonsense Question Answering
Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering
Qianglong Chen
Guohai Xu
Mingshi Yan
Ji Zhang
Fei Huang
Luo Si
Yin Zhang
79
10
0
14 May 2023
Make Prompt-based Black-Box Tuning Colorful: Boosting Model
  Generalization from Three Orthogonal Perspectives
Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives
Qiushi Sun
Chengcheng Han
Nuo Chen
Renyu Zhu
Jing Gong
Xiang Li
Ming Gao
VLM
47
9
0
14 May 2023
Constructing Holistic Measures for Social Biases in Masked Language Models
Yang Liu
Yuexian Hou
25
0
0
12 May 2023
Towards Versatile and Efficient Visual Knowledge Integration into
  Pre-trained Language Models with Cross-Modal Adapters
Towards Versatile and Efficient Visual Knowledge Integration into Pre-trained Language Models with Cross-Modal Adapters
Xinyun Zhang
Haochen Tan
Han Wu
Bei Yu
KELM
36
1
0
12 May 2023
ChatGPT-Like Large-Scale Foundation Models for Prognostics and Health
  Management: A Survey and Roadmaps
ChatGPT-Like Large-Scale Foundation Models for Prognostics and Health Management: A Survey and Roadmaps
Yanfang Li
Huan Wang
Muxia Sun
LM&MAAI4TSAI4CE
103
59
0
10 May 2023
SPSQL: Step-by-step Parsing Based Framework for Text-to-SQL Generation
SPSQL: Step-by-step Parsing Based Framework for Text-to-SQL Generation
Ran Shen
Gang Sun
Hao Shen
Yiling Li
Liangfeng Jin
Han Jiang
53
5
0
10 May 2023
Investigating Forgetting in Pre-Trained Representations Through
  Continual Learning
Investigating Forgetting in Pre-Trained Representations Through Continual Learning
Yun Luo
Zhen Yang
Xuefeng Bai
Fandong Meng
Jie Zhou
Yue Zhang
CLLKELM
103
17
0
10 May 2023
Similarity of Neural Network Models: A Survey of Functional and Representational Measures
Similarity of Neural Network Models: A Survey of Functional and Representational Measures
Max Klabunde
Tobias Schumacher
M. Strohmaier
Florian Lemmerich
183
75
0
10 May 2023
Vision-Language Models in Remote Sensing: Current Progress and Future
  Trends
Vision-Language Models in Remote Sensing: Current Progress and Future Trends
Xiang Li
Congcong Wen
Yuan Hu
Zhenghang Yuan
Xiao Xiang Zhu
VLM
82
82
0
09 May 2023
Previous
123...171819...575859
Next