ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSL, AIMat

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture Search
Jin Xu
Xu Tan
Renqian Luo
Kaitao Song
Jian Li
Tao Qin
Tie-Yan Liu
MQ
62
79
0
30 May 2021
Sentiment analysis in tweets: an assessment study from classical to modern text representation models
Sérgio Barreto
Ricardo Moura
Jonnathan Carvalho
A. Paes
A. Plastino
78
14
0
29 May 2021
NeuralLog: Natural Language Inference with Joint Neural and Logical Reasoning
Zeming Chen
Qiyue Gao
Lawrence S. Moss
FedML, NAI
86
42
0
29 May 2021
Cisco at SemEval-2021 Task 5: What's Toxic?: Leveraging Transformers for Multiple Toxic Span Extraction from Online Comments
Sreyan Ghosh
Sonal Kumar
51
8
0
28 May 2021
Knowledge Inheritance for Pre-trained Language Models
Yujia Qin
Yankai Lin
Jing Yi
Jiajie Zhang
Xu Han
...
Yusheng Su
Zhiyuan Liu
Peng Li
Maosong Sun
Jie Zhou
VLM
78
50
0
28 May 2021
Accelerating BERT Inference for Sequence Labeling via Early-Exit
Xiaonan Li
Yunfan Shao
Tianxiang Sun
Hang Yan
Xipeng Qiu
Xuanjing Huang
84
41
0
28 May 2021
Lightweight Cross-Lingual Sentence Representation Learning
Zhuoyuan Mao
Prakhar Gupta
Pei Wang
Chenhui Chu
Martin Jaggi
Sadao Kurohashi
VLM
120
9
0
28 May 2021
Early Exiting with Ensemble Internal Classifiers
Tianxiang Sun
Yunhua Zhou
Xiangyang Liu
Xinyu Zhang
Hao Jiang
Bo Zhao
Xuanjing Huang
Xipeng Qiu
70
31
0
28 May 2021
An Explanatory Query-Based Framework for Exploring Academic Expertise
O. Cocarascu
A. McLean
Paul French
Francesca Toni
35
0
0
28 May 2021
Hierarchical Transformer Encoders for Vietnamese Spelling Correction
H. Tran
C. Dinh
Long Phan
S. T. Nguyen
62
12
0
28 May 2021
Inspecting the concept knowledge graph encoded by modern language models
Carlos Aspillaga
Marcelo Mendoza
Alvaro Soto
72
13
0
27 May 2021
Verb Sense Clustering using Contextualized Word Representations for Semantic Frame Induction
Kosuke Yamada
Ryohei Sasano
Koichi Takeda
37
7
0
27 May 2021
A Full-Stack Search Technique for Domain Optimized Deep Learning Accelerators
Dan Zhang
Safeen Huda
Ebrahim M. Songhori
Kartik Prabhu
Quoc V. Le
Anna Goldie
Azalia Mirhoseini
94
53
0
26 May 2021
TreeBERT: A Tree-Based Pre-Trained Model for Programming Language
Xue Jiang
Zhuoran Zheng
Chen Lyu
Liang Li
Lei Lyu
85
91
0
26 May 2021
LMMS Reloaded: Transformer-based Sense Embeddings for Disambiguation and Beyond
Daniel Loureiro
A. Jorge
Jose Camacho-Collados
92
26
0
26 May 2021
NEUer at SemEval-2021 Task 4: Complete Summary Representation by Filling Answers into Question for Matching Reading Comprehension
Zhixiang Chen
Yikun Lei
Pai Liu
G. Guo
133
0
0
25 May 2021
Dynamic Semantic Graph Construction and Reasoning for Explainable Multi-hop Science Question Answering
Weiwen Xu
Huihui Zhang
Deng Cai
Wai Lam
81
35
0
25 May 2021
True Few-Shot Learning with Language Models
Ethan Perez
Douwe Kiela
Kyunghyun Cho
142
440
0
24 May 2021
Using Adversarial Attacks to Reveal the Statistical Bias in Machine Reading Comprehension Models
Jieyu Lin
Jiajie Zou
Nai Ding
AAML
81
45
0
24 May 2021
One4all User Representation for Recommender Systems in E-commerce
Kyuyong Shin
Hanock Kwak
KyungHyun Kim
Minkyu Kim
Young-Jin Park
Jisu Jeong
Seungjae Jung
62
28
0
24 May 2021
Structural Pre-training for Dialogue Comprehension
Zhuosheng Zhang
Hai Zhao
94
31
0
23 May 2021
Pretrained Language Models for Text Generation: A Survey
Junyi Li
Tianyi Tang
Wayne Xin Zhao
Ji-Rong Wen
LM&MA, VLM, SyDa
106
191
0
21 May 2021
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking
Zhiyi Ma
Kawin Ethayarajh
Tristan Thrush
Somya Jain
Ledell Yu Wu
Robin Jia
Christopher Potts
Adina Williams
Douwe Kiela
ELM
115
59
0
21 May 2021
Boosting Span-based Joint Entity and Relation Extraction via Squence Tagging Mechanism
Shezheng Song
Shasha Li
Jie Yu
Jun Ma
Bin Ji
44
4
0
21 May 2021
KLUE: Korean Language Understanding Evaluation
Sungjoon Park
Jihyung Moon
Sungdong Kim
Won Ik Cho
Jiyoon Han
...
Seonghyun Kim
Lucy Park
Alice Oh
Jung-Woo Ha
Kyunghyun Cho
ELM, VLM
123
198
0
20 May 2021
Towards Detecting Need for Empathetic Response in Motivational Interviewing
Zixiu "Alex" Wu
Rim Helaoui
Vivek Kumar (Ph.D)
Diego Reforgiato Recupero
Daniele Riboni
31
14
0
20 May 2021
Self-supervised Heterogeneous Graph Neural Network with Co-contrastive Learning
Xiao Wang
Nian Liu
Hui-jun Han
C. Shi
SSL
84
396
0
19 May 2021
Investigating Math Word Problems using Pretrained Multilingual Language Models
Minghuan Tan
Lei Wang
Lingxiao Jiang
Jing Jiang
LRM
94
33
0
19 May 2021
Relative Positional Encoding for Transformers with Linear Complexity
Antoine Liutkus
Ondřej Cífka
Shih-Lun Wu
Umut Simsekli
Yi-Hsuan Yang
Gaël Richard
84
48
0
18 May 2021
SHARE: a System for Hierarchical Assistive Recipe Editing
Shuyang Li
Yufei Li
Jianmo Ni
Julian McAuley
52
20
0
17 May 2021
SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising
K. Xuan
Yongbo Wang
Yongliang Wang
Zujie Wen
Yang Dong
VLM
76
54
0
17 May 2021
Self-supervised Learning on Graphs: Contrastive, Generative,or Predictive
Lirong Wu
Haitao Lin
Zhangyang Gao
Cheng Tan
Stan.Z.Li
SSL
95
262
0
16 May 2021
A Deep Metric Learning Approach to Account Linking
Aleem Khan
Elizabeth Fleming
N. Schofield
M. Bishop
Nicholas Andrews
59
23
0
15 May 2021
On the Distributional Properties of Adaptive Gradients
Z. Zhiyi
Liu Ziyin
53
4
0
15 May 2021
Distilling BERT for low complexity network training
Bansidhar Mangalwedhekar
26
1
0
13 May 2021
Addressing "Documentation Debt" in Machine Learning Research: A Retrospective Datasheet for BookCorpus
Jack Bandy
Nicholas Vincent
69
57
0
11 May 2021
Benchmarking down-scaled (not so large) pre-trained language models
Yi Men
P. Schulze
C. Heumann
31
1
0
11 May 2021
Improving Factual Consistency of Abstractive Summarization via Question Answering
Feng Nan
Cicero Nogueira dos Santos
Henghui Zhu
Patrick Ng
Kathleen McKeown
Ramesh Nallapati
Dejiao Zhang
Zhiguo Wang
Andrew O. Arnold
Bing Xiang
HILM
75
88
0
10 May 2021
Dispatcher: A Message-Passing Approach To Language Modelling
A. Cetoli
84
0
0
09 May 2021
Which transformer architecture fits my data? A vocabulary bottleneck in self-attention
Noam Wies
Yoav Levine
Daniel Jannai
Amnon Shashua
92
20
0
09 May 2021
Self-Supervised Adversarial Example Detection by Disentangled Representation
Zhaoxi Zhang
L. Zhang
Xufei Zheng
Jinyu Tian
Jiantao Zhou
AAML, DRL
65
9
0
08 May 2021
Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text
Siyuan Wang
Wanjun Zhong
Duyu Tang
Zhongyu Wei
Zhihao Fan
Daxin Jiang
Ming Zhou
Nan Duan
NAI
134
73
0
08 May 2021
Empirical Evaluation of Pre-trained Transformers for Human-Level NLP: The Role of Sample Size and Dimensionality
Adithya Ganesan
Matthew Matero
Aravind Reddy Ravula
Huy-Hien Vu
H. Andrew Schwartz
90
35
0
07 May 2021
Adapting by Pruning: A Case Study on BERT
Yang Gao
Nicolo Colombo
Wen Wang
49
17
0
07 May 2021
Are Pre-trained Convolutions Better than Pre-trained Transformers?
Yi Tay
Mostafa Dehghani
J. Gupta
Dara Bahri
V. Aribandi
Zhen Qin
Donald Metzler
AI4CE
77
49
0
07 May 2021
VAULT: VAriable Unified Long Text Representation for Machine Reading Comprehension
Haoyang Wen
Anthony Ferritto
Heng Ji
Radu Florian
Avirup Sil
33
3
0
07 May 2021
Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing Regressions In NLP Model Updates
Yuqing Xie
Yi-An Lai
Yuanjun Xiong
Yi Zhang
Stefano Soatto
UQCV
59
16
0
07 May 2021
Do language models learn typicality judgments from text?
Kanishka Misra
Allyson Ettinger
Julia Taylor Rayz
56
34
0
06 May 2021
HerBERT: Efficiently Pretrained Transformer-based Language Model for Polish
Robert Mroczkowski
Piotr Rybak
Alina Wróblewska
Ireneusz Gawlik
86
85
0
04 May 2021
An Estimation of Online Video User Engagement from Features of Continuous Emotions
Lukas Stappen
Alice Baird
Michelle Lienhart
Annalena Batz
Björn Schuller
60
3
0
04 May 2021