ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSL AIMat

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Chemical transformer compression for accelerating both training and inference of molecular modeling
Yi Yu
K. Börjesson
67
0
0
16 May 2022
Transkimmer: Transformer Learns to Layer-wise Skim
Yue Guan
Zhengyi Li
Jingwen Leng
Zhouhan Lin
Minyi Guo
125
40
0
15 May 2022
TiBERT: Tibetan Pre-trained Language Model
Yuan Sun
Sisi Liu
Junjie Deng
Xiaobing Zhao
94
10
0
15 May 2022
Adaptive Prompt Learning-based Few-Shot Sentiment Analysis
Pengfei Zhang
Tingting Chai
Yongdong Xu
VLM
80
13
0
15 May 2022
Evaluating Generalizability of Fine-Tuned Models for Fake News Detection
Abhijit Suprem
C. Pu
72
4
0
15 May 2022
A Property Induction Framework for Neural Language Models
Kanishka Misra
Julia Taylor Rayz
Allyson Ettinger
76
12
0
13 May 2022
Improving Contextual Representation with Gloss Regularized Pre-training
Yu Lin
Zhecheng An
Peihao Wu
Zejun Ma
83
5
0
13 May 2022
AEON: A Method for Automatic Evaluation of NLP Test Cases
Jen-tse Huang
Jianping Zhang
Wenxuan Wang
Pinjia He
Yuxin Su
Michael R. Lyu
83
23
0
13 May 2022
Predicting Human Psychometric Properties Using Computational Language Models
Antonio Laverghetta
Animesh Nighojkar
Jamshidbek Mirzakhalov
John Licato
62
9
0
12 May 2022
DTW at Qur'an QA 2022: Utilising Transfer Learning with Transformers for Question Answering in a Low-resource Domain
Damith Premasiri
Tharindu Ranasinghe
Wajdi Zaghouani
R. Mitkov
59
12
0
12 May 2022
e-CARE: a New Dataset for Exploring Explainable Causal Reasoning
Li Du
Xiao Ding
Kai Xiong
Ting Liu
Bing Qin
CML
82
67
0
12 May 2022
Towards Unified Prompt Tuning for Few-shot Text Classification
Jiadong Wang
Chengyu Wang
Fuli Luo
Chuanqi Tan
Minghui Qiu
Fei Yang
Qiuhui Shi
Songfang Huang
Ming Gao
VLM
70
28
0
11 May 2022
ALLSH: Active Learning Guided by Local Sensitivity and Hardness
Shujian Zhang
Chengyue Gong
Xingchao Liu
Pengcheng He
Weizhu Chen
Mingyuan Zhou
100
26
0
10 May 2022
BLINK with Elasticsearch for Efficient Entity Linking in Business Conversations
Md Tahmid Rahman Laskar
Cheng Chen
Aliaksandr Martsinovich
Jonathan Johnston
Xue-Yong Fu
TN ShashiBhushan
Simon Corston-Oliver
79
17
0
09 May 2022
Empathetic Conversational Systems: A Review of Current Advances, Gaps, and Opportunities
Aravind Sesagiri Raamkumar
Yinping Yang
112
33
0
09 May 2022
Beyond Distributional Hypothesis: Let Language Models Learn Meaning-Text Correspondence
Myeongjun Jang
Frank Mtumbuka
Thomas Lukasiewicz
87
10
0
08 May 2022
RoViST:Learning Robust Metrics for Visual Storytelling
Eileen Wang
S. Han
Josiah Poon
49
10
0
08 May 2022
Vector Representations of Idioms in Conversational Systems
Tosin Adewumi
F. Liwicki
Marcus Liwicki
73
9
0
07 May 2022
Fine-grained Intent Classification in the Legal Domain
Ankan Mullick
Abhilash Nandy
M. Kapadnis
Sohan Patnaik
R. Raghav
AILaw
28
9
0
06 May 2022
Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media
Lixing Zhu
Zheng Fang
Gabriele Pergola
Rob Procter
Yulan He
62
8
0
06 May 2022
Collective Relevance Labeling for Passage Retrieval
Jihyuk Kim
Minsoo Kim
Seung-won Hwang
VLM
34
8
0
06 May 2022
IMU Based Deep Stride Length Estimation With Self-Supervised Learning
Jien-De Sui
Tian-Sheuan Chang
SSL
24
14
0
06 May 2022
RaFoLa: A Rationale-Annotated Corpus for Detecting Indicators of Forced Labour
Erick Mendez Guzman
Viktor Schlegel
Riza Batista-Navarro
38
8
0
05 May 2022
METGEN: A Module-Based Entailment Tree Generation Framework for Answer Explanation
Ruixin Hong
Hongming Zhang
Xintong Yu
Changshui Zhang
ReLM LRM
95
33
0
05 May 2022
Knowledge Distillation of Russian Language Models with Reduction of Vocabulary
A. Kolesnikova
Yuri Kuratov
Vasily Konovalov
Andrey Kravchenko
VLM
44
10
0
04 May 2022
MAD: Self-Supervised Masked Anomaly Detection Task for Multivariate Time Series
Yiwei Fu
Feng Xue
AI4TS
36
15
0
04 May 2022
Crystal Twins: Self-supervised Learning for Crystalline Material Property Prediction
Rishikesh Magar
Yuyang Wang
Amir Barati Farimani
81
63
0
04 May 2022
Visual Commonsense in Pretrained Unimodal and Multimodal Models
Chenyu Zhang
Benjamin Van Durme
Zhuowan Li
Elias Stengel-Eskin
VLM SSL
79
41
0
04 May 2022
Unified Semantic Typing with Meaningful Label Inference
James Y. Huang
Bangzheng Li
Lyne Tchapmi
Muhao Chen
88
32
0
04 May 2022
ElitePLM: An Empirical Study on General Language Ability Evaluation of Pretrained Language Models
Junyi Li
Tianyi Tang
Zheng Gong
Lixin Yang
Zhuohao Yu
Zhongfu Chen
Jingyuan Wang
Wayne Xin Zhao
Ji-Rong Wen
LM&MA ELM
49
8
0
03 May 2022
Textual Entailment for Event Argument Extraction: Zero- and Few-Shot with Multi-Source Learning
Oscar Sainz
Itziar Gonzalez-Dios
Oier López de Lacalle
Bonan Min
Eneko Agirre
79
50
0
03 May 2022
SemAttack: Natural Textual Attacks via Different Semantic Spaces
Wei Ping
Chejian Xu
Xiangyu Liu
Yuk-Kit Cheng
Yue Liu
SILM AAML
119
53
0
03 May 2022
Paragraph-based Transformer Pre-training for Multi-Sentence Inference
Luca Di Liello
Siddhant Garg
Luca Soldaini
Alessandro Moschitti
53
8
0
02 May 2022
Gender Bias in Masked Language Models for Multiple Languages
Masahiro Kaneko
Aizhan Imankulova
Danushka Bollegala
Naoaki Okazaki
108
64
0
01 May 2022
Visualizing and Explaining Language Models
Adrian M. P. Braşoveanu
Razvan Andonie
MILM VLM
109
5
0
30 Apr 2022
Clues Before Answers: Generation-Enhanced Multiple-Choice QA
Zixian Huang
Ao Wu
Jiaying Zhou
Yu Gu
Yue Zhao
Gong Cheng
41
27
0
30 Apr 2022
Heterogeneous Graph Neural Networks using Self-supervised Reciprocally Contrastive Learning
Cuiying Huo
Dongxiao He
Peican Zhu
Di Jin
Jianwu Dang
Weixiong Zhang
Witold Pedrycz
Lingfei Wu
SSL
65
4
0
30 Apr 2022
Solution of DeBERTaV3 on CommonsenseQA
Letian Peng
Zuchao Li
Hai Zhao
32
0
0
30 Apr 2022
Approximating Permutations with Neural Network Components for Travelling Photographer Problem
S. Chong
69
0
0
30 Apr 2022
End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Chenyu You
Nuo Chen
Fenglin Liu
Shen Ge
Xian Wu
Yuexian Zou
AuLLM
63
44
0
29 Apr 2022
OPERA:Operation-Pivoted Discrete Reasoning over Text
Yongwei Zhou
Junwei Bao
Chaoqun Duan
Haipeng Sun
Jiahui Liang
Yifan Wang
Jing Zhao
Youzheng Wu
Xiaodong He
Tiejun Zhao
LRM
81
11
0
29 Apr 2022
Attention Mechanism in Neural Networks: Where it Comes and Where it Goes
Derya Soydaner
3DV
127
182
0
27 Apr 2022
NLU++: A Multi-Label, Slot-Rich, Generalisable Dataset for Natural Language Understanding in Task-Oriented Dialogue
I. Casanueva
Ivan Vulić
Georgios P. Spithourakis
Paweł Budzianowski
88
11
0
27 Apr 2022
PLOD: An Abbreviation Detection Dataset for Scientific Documents
Leonardo Zilio
Hadeel Saadany
Prashant Sharma
Diptesh Kanojia
Constantin Orasan
19
4
0
26 Apr 2022
Reprint: a randomized extrapolation based on principal components for data augmentation
Jiale Wei
Qiyuan Chen
Pai Peng
Benjamin Guedj
Le Li
43
2
0
26 Apr 2022
A Survey on Word Meta-Embedding Learning
Danushka Bollegala
James O'Neill
58
12
0
25 Apr 2022
Transformation Invariant Cancerous Tissue Classification Using Spatially Transformed DenseNet
Omar Mahdi
Ali Bou Nassif
MedIm
26
2
0
23 Apr 2022
Investigating Neural Architectures by Synthetic Dataset Design
Adrien Courtois
Jean-Michel Morel
Pablo Arias
72
4
0
23 Apr 2022
Rethinking Offensive Text Detection as a Multi-Hop Reasoning Problem
Qiang Zhang
Jason Naradowsky
Yusuke Miyao
LRM
52
5
0
22 Apr 2022
Progressive Training of A Two-Stage Framework for Video Restoration
Mei Zheng
Qunliang Xing
Minglang Qiao
Mai Xu
Lai Jiang
Huaida Liu
Ying-Cong Chen
91
11
0
21 Apr 2022