ResearchTrend.AI
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSL
    AIMat

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,913 papers shown
Layer-wise Model Pruning based on Mutual Information
Chun Fan
Jiwei Li
Xiang Ao
Fei Wu
Yuxian Meng
Xiaofei Sun
53
19
0
28 Aug 2021
Code-switched inspired losses for generic spoken dialog representations
E. Chapuis
Pierre Colombo
Matthieu Labeau
Chloé Clavel
32
12
0
27 Aug 2021
A Partition Filter Network for Joint Entity and Relation Extraction
Zhiheng Yan
Chong Zhang
Jinlan Fu
Qi Zhang
Zhongyu Wei
28
136
0
27 Aug 2021
Query-Focused Extractive Summarisation for Finding Ideal Answers to Biomedical and COVID-19 Questions
Diego Mollá Aliod
Urvashi Khanna
Dima Galat
Vincent Nguyen
Maciej Rybiński
RALM
43
2
0
27 Aug 2021
Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models
Yiming Cui
Weinan Zhang
Wanxiang Che
Ting Liu
Zhigang Chen
Shijin Wang
LRM
25
9
0
26 Aug 2021
Exploring the Promises of Transformer-Based LMs for the Representation of Normative Claims in the Legal Domain
Reto Gubelmann
Peter Hongler
Siegfried Handschuh
AILaw
19
0
0
25 Aug 2021
YANMTT: Yet Another Neural Machine Translation Toolkit
Raj Dabre
Eiichiro Sumita
38
13
0
25 Aug 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation
Samuel Cahyawijaya
34
12
0
24 Aug 2021
Are the Multilingual Models Better? Improving Czech Sentiment with Transformers
Pavel Přibáň
J. Steinberger
41
11
0
24 Aug 2021
Recurrent multiple shared layers in Depth for Neural Machine Translation
Guoliang Li
Yiyang Li
MoE
23
1
0
23 Aug 2021
APObind: A Dataset of Ligand Unbound Protein Conformations for Machine Learning Applications in De Novo Drug Design
Rishal Aggarwal
Akash Gupta
U. Priyakumar
6
10
0
23 Aug 2021
Fastformer: Additive Attention Can Be All You Need
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
Xing Xie
51
119
0
20 Aug 2021
SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining
Taolin Zhang
Zerui Cai
Chengyu Wang
Minghui Qiu
Bite Yang
Xiaofeng He
AI4MH
33
52
0
20 Aug 2021
Detection of Illicit Drug Trafficking Events on Instagram: A Deep Multimodal Multilabel Learning Approach
Chuanbo Hu
Minglei Yin
Bin Liu
Xin Li
Yanfang Ye
24
15
0
19 Aug 2021
TSI: an Ad Text Strength Indicator using Text-to-CTR and Semantic-Ad-Similarity
Shaunak Mishra
Changwei Hu
Manisha Verma
Kevin Yen
Yifan Hu
M. Sviridenko
19
8
0
18 Aug 2021
EviDR: Evidence-Emphasized Discrete Reasoning for Reasoning Machine Reading Comprehension
Yongwei Zhou
Junwei Bao
Haipeng Sun
Jiahui Liang
Youzheng Wu
Xiaodong He
Bowen Zhou
Tiejun Zhao
12
5
0
18 Aug 2021
RRLFSOR: An Efficient Self-Supervised Learning Strategy of Graph Convolutional Networks
Feng Sun
Ajith Kumar
Guanci Yang
Qikui Zhu
Yiyun Zhang
Ansi Zhang
Dhruv Makwana
SSL
GNN
47
0
0
17 Aug 2021
Exploring Generalization Ability of Pretrained Language Models on Arithmetic and Logical Reasoning
Cunxiang Wang
Boyuan Zheng
Y. Niu
Yue Zhang
LRM
41
22
0
15 Aug 2021
Contrastive Self-supervised Sequential Recommendation with Robust Augmentation
Zhiwei Liu
Yong-Guang Chen
Jia Li
Philip S. Yu
Julian McAuley
Caiming Xiong
20
164
0
14 Aug 2021
FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning
Jing Zhou
Yanan Zheng
Jie Tang
Jian Li
Zhilin Yang
VLM
30
76
0
13 Aug 2021
Low-Resource Adaptation of Open-Domain Generative Chatbots
Greyson Gerhard-Young
R. Anantha
Srinivas Chappidi
Björn Hoffmeister
39
3
0
13 Aug 2021
AMMUS : A Survey of Transformer-based Pretrained Models in Natural Language Processing
Katikapalli Subramanyam Kalyan
A. Rajasekharan
S. Sangeetha
VLM
LM&MA
36
261
0
12 Aug 2021
Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval
Luyu Gao
Jamie Callan
RALM
175
330
0
12 Aug 2021
SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code Representation
Xin Wang
Yasheng Wang
Fei Mi
Pingyi Zhou
Yao Wan
Xiao Liu
Li Li
Hao Wu
Jin Liu
Xin Jiang
39
114
0
10 Aug 2021
Making Transformers Solve Compositional Tasks
Santiago Ontañón
Joshua Ainslie
Vaclav Cvicek
Zachary Kenneth Fisher
44
70
0
09 Aug 2021
Unifying Heterogeneous Electronic Health Records Systems via Text-Based Code Embedding
Kyunghoon Hur
Jiyoung Lee
Jungwoo Oh
Wesley Price
Young-Hak Kim
Edward Choi
46
17
0
08 Aug 2021
LadRa-Net: Locally-Aware Dynamic Re-read Attention Net for Sentence Semantic Matching
Kun Zhang
Guangyi Lv
Le Wu
Enhong Chen
Qi Liu
Meng Wang
36
6
0
06 Aug 2021
Robust Transfer Learning with Pretrained Language Models through Adapters
Wenjuan Han
Bo Pang
Ying Nian Wu
21
54
0
05 Aug 2021
Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification
Shengding Hu
Ning Ding
Huadong Wang
Zhiyuan Liu
Jingang Wang
Juan-Zi Li
Wei Wu
Maosong Sun
VLM
41
363
0
04 Aug 2021
How to Query Language Models?
Leonard Adolphs
Shehzaad Dhuliawala
Thomas Hofmann
KELM
38
15
0
04 Aug 2021
Your fairness may vary: Pretrained language model fairness in toxic text classification
Ioana Baldini
Dennis L. Wei
Karthikeyan N. Ramamurthy
Mikhail Yurochkin
Moninder Singh
26
53
0
03 Aug 2021
LICHEE: Improving Language Model Pre-training with Multi-grained Tokenization
Weidong Guo
Mingjun Zhao
Lusheng Zhang
Di Niu
Jinwen Luo
Zhenhua Liu
Zhenyang Li
J. Tang
32
8
0
02 Aug 2021
From LSAT: The Progress and Challenges of Complex Reasoning
Siyuan Wang
Zhongkun Liu
Wanjun Zhong
Ming Zhou
Zhongyu Wei
Zhumin Chen
Nan Duan
ELM
38
45
0
02 Aug 2021
Transformer-Encoder-GRU (T-E-GRU) for Chinese Sentiment Analysis on Chinese Comment Text
Binlong Zhang
Wei Zhou
23
17
0
01 Aug 2021
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Xianrui Zheng
Chao Zhang
P. Woodland
14
46
0
29 Jul 2021
UIBert: Learning Generic Multimodal Representations for UI Understanding
Chongyang Bai
Xiaoxue Zang
Ying Xu
Srinivas Sunkara
Abhinav Rastogi
Jindong Chen
Blaise Agüera y Arcas
27
88
0
29 Jul 2021
AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models
Yichun Yin
Cheng Chen
Lifeng Shang
Xin Jiang
Xiao Chen
Qun Liu
VLM
32
50
0
29 Jul 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLM
SyDa
129
3,875
0
28 Jul 2021
Exceeding the Limits of Visual-Linguistic Multi-Task Learning
Cameron R. Wolfe
Keld T. Lundgaard
VLM
50
2
0
27 Jul 2021
gaBERT -- an Irish Language Model
James Barry
Joachim Wagner
Lauren Cassidy
Alan Cowap
Teresa Lynn
Abigail Walsh
Mícheál J. Ó Meachair
Jennifer Foster
21
18
0
27 Jul 2021
Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking
Jinyu Guo
Kai Shuang
Jijie Li
Zihan Wang
28
18
0
27 Jul 2021
Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction
Lu Xu
Yew Ken Chia
Lidong Bing
41
180
0
26 Jul 2021
Fine-Grained Emotion Prediction by Modeling Emotion Definitions
Gargi Singh
Dhanajit Brahma
Piyush Rai
Ashutosh Modi
19
10
0
26 Jul 2021
ICDAR 2021 Competition on Scene Video Text Spotting
Zhanzhan Cheng
Jing Lu
Baorui Zou
Shuigeng Zhou
Fei Wu
20
4
0
26 Jul 2021
Graph-free Multi-hop Reading Comprehension: A Select-to-Guide Strategy
Bohong Wu
Zhuosheng Zhang
Hai Zhao
34
20
0
25 Jul 2021
Go Wider Instead of Deeper
Fuzhao Xue
Ziji Shi
Futao Wei
Yuxuan Lou
Yong Liu
Yang You
ViT
MoE
33
80
0
25 Jul 2021
Query2Label: A Simple Transformer Way to Multi-Label Classification
Shilong Liu
Lei Zhang
Xiao Yang
Hang Su
Jun Zhu
31
188
0
22 Jul 2021
Back-Translated Task Adaptive Pretraining: Improving Accuracy and Robustness on Text Classification
Junghoon Lee
Jounghee Kim
Pilsung Kang
VLM
19
5
0
22 Jul 2021
Multi-stage Pre-training over Simplified Multimodal Pre-training Models
Tongtong Liu
Fangxiang Feng
Xiaojie Wang
21
14
0
22 Jul 2021
CausalBERT: Injecting Causal Knowledge Into Pre-trained Models with Minimal Supervision
Zhongyang Li
Xiao Ding
Kuo Liao
Bing Qin
Ting Liu
CML
29
17
0
21 Jul 2021