ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Title
LadRa-Net: Locally-Aware Dynamic Re-read Attention Net for Sentence
  Semantic Matching
LadRa-Net: Locally-Aware Dynamic Re-read Attention Net for Sentence Semantic Matching
Kun Zhang
Guangyi Lv
Le Wu
Enhong Chen
Qi Liu
Meng Wang
61
6
0
06 Aug 2021
Robust Transfer Learning with Pretrained Language Models through
  Adapters
Robust Transfer Learning with Pretrained Language Models through Adapters
Wenjuan Han
Bo Pang
Ying Nian Wu
69
56
0
05 Aug 2021
Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt
  Verbalizer for Text Classification
Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification
Shengding Hu
Ning Ding
Huadong Wang
Zhiyuan Liu
Jingang Wang
Juan-Zi Li
Wei Wu
Maosong Sun
VLM
106
373
0
04 Aug 2021
How to Query Language Models?
How to Query Language Models?
Leonard Adolphs
Shehzaad Dhuliawala
Thomas Hofmann
KELM
86
15
0
04 Aug 2021
Your fairness may vary: Pretrained language model fairness in toxic text
  classification
Your fairness may vary: Pretrained language model fairness in toxic text classification
Ioana Baldini
Dennis L. Wei
Karthikeyan N. Ramamurthy
Mikhail Yurochkin
Moninder Singh
93
54
0
03 Aug 2021
LICHEE: Improving Language Model Pre-training with Multi-grained
  Tokenization
LICHEE: Improving Language Model Pre-training with Multi-grained Tokenization
Weidong Guo
Mingjun Zhao
Lusheng Zhang
Di Niu
Jinwen Luo
Zhenhua Liu
Zhenyang Li
J. Tang
55
8
0
02 Aug 2021
From LSAT: The Progress and Challenges of Complex Reasoning
From LSAT: The Progress and Challenges of Complex Reasoning
Siyuan Wang
Zhongkun Liu
Wanjun Zhong
Ming Zhou
Zhongyu Wei
Zhumin Chen
Nan Duan
ELM
85
46
0
02 Aug 2021
Transformer-Encoder-GRU (T-E-GRU) for Chinese Sentiment Analysis on
  Chinese Comment Text
Transformer-Encoder-GRU (T-E-GRU) for Chinese Sentiment Analysis on Chinese Comment Text
Binlong Zhang
Wei Zhou
42
17
0
01 Aug 2021
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Xianrui Zheng
Chao Zhang
P. Woodland
34
49
0
29 Jul 2021
UIBert: Learning Generic Multimodal Representations for UI Understanding
UIBert: Learning Generic Multimodal Representations for UI Understanding
Chongyang Bai
Xiaoxue Zang
Ying Xu
Srinivas Sunkara
Abhinav Rastogi
Jindong Chen
Blaise Agüera y Arcas
90
95
0
29 Jul 2021
AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient
  Pre-trained Language Models
AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models
Yichun Yin
Cheng Chen
Lifeng Shang
Xin Jiang
Xiao Chen
Qun Liu
VLM
69
50
0
29 Jul 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods
  in Natural Language Processing
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLMSyDa
350
4,050
0
28 Jul 2021
Exceeding the Limits of Visual-Linguistic Multi-Task Learning
Exceeding the Limits of Visual-Linguistic Multi-Task Learning
Cameron R. Wolfe
Keld T. Lundgaard
VLM
76
2
0
27 Jul 2021
gaBERT -- an Irish Language Model
gaBERT -- an Irish Language Model
James Barry
Joachim Wagner
Lauren Cassidy
Alan Cowap
Teresa Lynn
Abigail Walsh
Mícheál J. Ó Meachair
Jennifer Foster
65
18
0
27 Jul 2021
Dual Slot Selector via Local Reliability Verification for Dialogue State
  Tracking
Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking
Jinyu Guo
Kai Shuang
Jijie Li
Zihan Wang
75
18
0
27 Jul 2021
Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction
Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction
Lu Xu
Yew Ken Chia
Lidong Bing
155
188
0
26 Jul 2021
Fine-Grained Emotion Prediction by Modeling Emotion Definitions
Fine-Grained Emotion Prediction by Modeling Emotion Definitions
Gargi Singh
Dhanajit Brahma
Piyush Rai
Ashutosh Modi
62
10
0
26 Jul 2021
ICDAR 2021 Competition on Scene Video Text Spotting
ICDAR 2021 Competition on Scene Video Text Spotting
Zhanzhan Cheng
Jing Lu
Baorui Zou
Shuigeng Zhou
Leilei Gan
22
4
0
26 Jul 2021
Graph-free Multi-hop Reading Comprehension: A Select-to-Guide Strategy
Graph-free Multi-hop Reading Comprehension: A Select-to-Guide Strategy
Bohong Wu
Zhuosheng Zhang
Hai Zhao
72
21
0
25 Jul 2021
Go Wider Instead of Deeper
Go Wider Instead of Deeper
Fuzhao Xue
Ziji Shi
Futao Wei
Yuxuan Lou
Yong Liu
Yang You
ViTMoE
95
84
0
25 Jul 2021
Query2Label: A Simple Transformer Way to Multi-Label Classification
Query2Label: A Simple Transformer Way to Multi-Label Classification
Shilong Liu
Lei Zhang
Xiao Yang
Hang Su
Jun Zhu
73
193
0
22 Jul 2021
Back-Translated Task Adaptive Pretraining: Improving Accuracy and
  Robustness on Text Classification
Back-Translated Task Adaptive Pretraining: Improving Accuracy and Robustness on Text Classification
Junghoon Lee
Jounghee Kim
Pilsung Kang
VLM
72
5
0
22 Jul 2021
Multi-stage Pre-training over Simplified Multimodal Pre-training Models
Multi-stage Pre-training over Simplified Multimodal Pre-training Models
Tongtong Liu
Fangxiang Feng
Xiaojie Wang
38
13
0
22 Jul 2021
CausalBERT: Injecting Causal Knowledge Into Pre-trained Models with
  Minimal Supervision
CausalBERT: Injecting Causal Knowledge Into Pre-trained Models with Minimal Supervision
Zhongyang Li
Xiao Ding
Kuo Liao
Bing Qin
Ting Liu
CML
107
19
0
21 Jul 2021
Improving Sentence-Level Relation Extraction through Curriculum Learning
Improving Sentence-Level Relation Extraction through Curriculum Learning
Seongsik Park
Harksoo Kim
60
14
0
20 Jul 2021
Token-Level Supervised Contrastive Learning for Punctuation Restoration
Token-Level Supervised Contrastive Learning for Punctuation Restoration
Qiushi Huang
Tom Ko
Lilian H. Y. Tang
Xubo Liu
Boyong Wu
65
23
0
19 Jul 2021
Clinical Relation Extraction Using Transformer-based Models
Clinical Relation Extraction Using Transformer-based Models
Xi Yang
Zehao Yu
Yi Guo
Jiang Bian
Yonghui Wu
LM&MAMedIm
65
20
0
19 Jul 2021
Pre-trained Language Models as Prior Knowledge for Playing Text-based
  Games
Pre-trained Language Models as Prior Knowledge for Playing Text-based Games
Ishika Singh
Gargi Singh
Ashutosh Modi
OffRLAI4CE
103
29
0
18 Jul 2021
Generative Pretraining for Paraphrase Evaluation
Generative Pretraining for Paraphrase Evaluation
J. Weston
R. Lenain
U. Meepegama
E. Fristed
AIMat
59
10
0
17 Jul 2021
AutoBERT-Zero: Evolving BERT Backbone from Scratch
AutoBERT-Zero: Evolving BERT Backbone from Scratch
Jiahui Gao
Hang Xu
Han Shi
Xiaozhe Ren
Philip L. H. Yu
Xiaodan Liang
Xin Jiang
Zhenguo Li
85
37
0
15 Jul 2021
Automatic Task Requirements Writing Evaluation via Machine Reading
  Comprehension
Automatic Task Requirements Writing Evaluation via Machine Reading Comprehension
Shiting Xu
Guowei Xu
Peilei Jia
Wenbiao Ding
Zhongqin Wu
Zitao Liu
28
1
0
15 Jul 2021
The Benchmark Lottery
The Benchmark Lottery
Mostafa Dehghani
Yi Tay
A. Gritsenko
Zhe Zhao
N. Houlsby
Fernando Diaz
Donald Metzler
Oriol Vinyals
117
92
0
14 Jul 2021
BERT Fine-Tuning for Sentiment Analysis on Indonesian Mobile Apps
  Reviews
BERT Fine-Tuning for Sentiment Analysis on Indonesian Mobile Apps Reviews
Kuncahyo Setyo Nugroho
Anantha Yullian Sukmadewa
DW HaftittahWuswilahaken
F. A. Bachtiar
N. Yudistira
33
45
0
14 Jul 2021
Importance-based Neuron Allocation for Multilingual Neural Machine
  Translation
Importance-based Neuron Allocation for Multilingual Neural Machine Translation
Wanying Xie
Yang Feng
Shuhao Gu
Dong Yu
111
34
0
14 Jul 2021
Conformer-based End-to-end Speech Recognition With Rotary Position
  Embedding
Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
82
9
0
13 Jul 2021
Human Attention during Goal-directed Reading Comprehension Relies on
  Task Optimization
Human Attention during Goal-directed Reading Comprehension Relies on Task Optimization
Jiajie Zou
Yuran Zhang
Jialu Li
Xing Tian
Nai Ding
AIMat
92
2
0
13 Jul 2021
Combiner: Full Attention Transformer with Sparse Computation Cost
Combiner: Full Attention Transformer with Sparse Computation Cost
Hongyu Ren
H. Dai
Zihang Dai
Mengjiao Yang
J. Leskovec
Dale Schuurmans
Bo Dai
166
80
0
12 Jul 2021
Trustworthy AI: A Computational Perspective
Trustworthy AI: A Computational Perspective
Haochen Liu
Yiqi Wang
Wenqi Fan
Xiaorui Liu
Yaxin Li
Shaili Jain
Yunhao Liu
Anil K. Jain
Jiliang Tang
FaML
194
213
0
12 Jul 2021
The Brownian motion in the transformer model
The Brownian motion in the transformer model
Yingshi Chen
116
1
0
12 Jul 2021
LexSubCon: Integrating Knowledge from Lexical Resources into Contextual
  Embeddings for Lexical Substitution
LexSubCon: Integrating Knowledge from Lexical Resources into Contextual Embeddings for Lexical Substitution
George Michalopoulos
I. McKillop
Alexander Wong
Helen H. Chen
120
19
0
11 Jul 2021
Noise Stability Regularization for Improving BERT Fine-tuning
Noise Stability Regularization for Improving BERT Fine-tuning
Hang Hua
Xingjian Li
Dejing Dou
Chengzhong Xu
Jiebo Luo
79
45
0
10 Jul 2021
Transformer-Based Behavioral Representation Learning Enables Transfer
  Learning for Mobile Sensing in Small Datasets
Transformer-Based Behavioral Representation Learning Enables Transfer Learning for Mobile Sensing in Small Datasets
Michael Merrill
Tim Althoff
AI4TSMUMedIm
50
5
0
09 Jul 2021
An Initial Investigation of Non-Native Spoken Question-Answering
An Initial Investigation of Non-Native Spoken Question-Answering
V. Raina
Mark Gales
66
1
0
09 Jul 2021
Can Deep Neural Networks Predict Data Correlations from Column Names?
Can Deep Neural Networks Predict Data Correlations from Column Names?
Immanuel Trummer
78
8
0
09 Jul 2021
Benchmarking for Biomedical Natural Language Processing Tasks with a
  Domain Specific ALBERT
Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERT
Usman Naseem
A. Dunn
Matloob Khushi
Jinman Kim
OODLM&MAAI4MH
86
43
0
09 Jul 2021
UniRE: A Unified Label Space for Entity Relation Extraction
UniRE: A Unified Label Space for Entity Relation Extraction
Yijun Wang
Changzhi Sun
Yuanbin Wu
Hao Zhou
Lei Li
Junchi Yan
82
116
0
09 Jul 2021
Joint Models for Answer Verification in Question Answering Systems
Joint Models for Answer Verification in Question Answering Systems
Zeyu Zhang
Thuy Vu
Alessandro Moschitti
53
24
0
09 Jul 2021
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge
  Transfer
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer
Zineng Tang
Jaemin Cho
Hao Tan
Joey Tianyi Zhou
VLM
59
29
0
06 Jul 2021
PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior
  for Joint Image-Text Modeling
PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling
Xiaoxue Zang
Lijuan Liu
Maria Wang
Yang Song
Hao Zhang
Jindong Chen
VLM
101
60
0
06 Jul 2021
Sarcasm Detection: A Comparative Study
Sarcasm Detection: A Comparative Study
Hamed Yaghoobian
H. Arabnia
Khaled Rasheed
57
23
0
05 Jul 2021
Previous
123...394041...575859
Next