ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Title
NumGPT: Improving Numeracy Ability of Generative Pre-trained Models
NumGPT: Improving Numeracy Ability of Generative Pre-trained Models
Zhihua Jin
Xin Jiang
Xingbo Wang
Qun Liu
Yong Wang
Xiaozhe Ren
Huamin Qu
77
19
0
07 Sep 2021
IndicBART: A Pre-trained Model for Indic Natural Language Generation
IndicBART: A Pre-trained Model for Indic Natural Language Generation
Raj Dabre
Himani Shrotriya
Anoop Kunchukuttan
Ratish Puduppully
Mitesh M. Khapra
Pratyush Kumar
129
74
0
07 Sep 2021
Sent2Span: Span Detection for PICO Extraction in the Biomedical Text
  without Span Annotations
Sent2Span: Span Detection for PICO Extraction in the Biomedical Text without Span Annotations
Shifeng Liu
Yifang Sun
Bing Li
Wei Wang
Florence T. Bourgeois
A. Dunn
52
14
0
06 Sep 2021
STaCK: Sentence Ordering with Temporal Commonsense Knowledge
STaCK: Sentence Ordering with Temporal Commonsense Knowledge
Deepanway Ghosal
Navonil Majumder
Rada Mihalcea
Soujanya Poria
121
11
0
06 Sep 2021
Re-entry Prediction for Online Conversations via Self-Supervised
  Learning
Re-entry Prediction for Online Conversations via Self-Supervised Learning
Lingzhi Wang
Xingshan Zeng
Huang Hu
Kam-Fai Wong
Daxin Jiang
68
6
0
05 Sep 2021
FewshotQA: A simple framework for few-shot learning of question
  answering tasks using pre-trained text-to-text models
FewshotQA: A simple framework for few-shot learning of question answering tasks using pre-trained text-to-text models
Rakesh Chada
P. Natarajan
92
46
0
04 Sep 2021
Frustratingly Simple Pretraining Alternatives to Masked Language
  Modeling
Frustratingly Simple Pretraining Alternatives to Masked Language Modeling
Atsuki Yamaguchi
G. Chrysostomou
Katerina Margatina
Nikolaos Aletras
69
25
0
04 Sep 2021
Do Prompt-Based Models Really Understand the Meaning of their Prompts?
Do Prompt-Based Models Really Understand the Meaning of their Prompts?
Albert Webson
Ellie Pavlick
LRM
136
374
0
02 Sep 2021
So Cloze yet so Far: N400 Amplitude is Better Predicted by
  Distributional Information than Human Predictability Judgements
So Cloze yet so Far: N400 Amplitude is Better Predicted by Distributional Information than Human Predictability Judgements
J. Michaelov
S. Coulson
Benjamin Bergen
73
44
0
02 Sep 2021
Fight Fire with Fire: Fine-tuning Hate Detectors using Large Samples of
  Generated Hate Speech
Fight Fire with Fire: Fine-tuning Hate Detectors using Large Samples of Generated Hate Speech
Tomer Wullach
A. Adler
Einat Minkov
73
41
0
01 Sep 2021
Does Knowledge Help General NLU? An Empirical Study
Does Knowledge Help General NLU? An Empirical Study
Ruochen Xu
Yuwei Fang
Chenguang Zhu
Michael Zeng
ELM
70
9
0
01 Sep 2021
What Have Been Learned & What Should Be Learned? An Empirical Study of
  How to Selectively Augment Text for Classification
What Have Been Learned & What Should Be Learned? An Empirical Study of How to Selectively Augment Text for Classification
Biyang Guo
S. Han
Hailiang Huang
39
5
0
01 Sep 2021
It's not Rocket Science : Interpreting Figurative Language in Narratives
It's not Rocket Science : Interpreting Figurative Language in Narratives
Tuhin Chakrabarty
Yejin Choi
Vered Shwartz
97
58
0
31 Aug 2021
Effectiveness of Deep Networks in NLP using BiDAF as an example
  architecture
Effectiveness of Deep Networks in NLP using BiDAF as an example architecture
Soumyendu Sarkar
41
2
0
31 Aug 2021
Thermostat: A Large Collection of NLP Model Explanations and Analysis
  Tools
Thermostat: A Large Collection of NLP Model Explanations and Analysis Tools
Nils Feldhus
Robert Schwarzenberg
Sebastian Möller
123
14
0
31 Aug 2021
DNNFusion: Accelerating Deep Neural Networks Execution with Advanced
  Operator Fusion
DNNFusion: Accelerating Deep Neural Networks Execution with Advanced Operator Fusion
Wei Niu
Jiexiong Guan
Yanzhi Wang
G. Agrawal
Bin Ren
AI4CE
76
153
0
30 Aug 2021
ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language
  Understanding
ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language Understanding
Lingyun Feng
Jianwei Yu
Deng Cai
Songxiang Liu
Haitao Zheng
Yan Wang
ELM
179
14
0
30 Aug 2021
Shatter: An Efficient Transformer Encoder with Single-Headed
  Self-Attention and Relative Sequence Partitioning
Shatter: An Efficient Transformer Encoder with Single-Headed Self-Attention and Relative Sequence Partitioning
Ran Tian
Joshua Maynez
Ankur P. Parikh
ViT
56
2
0
30 Aug 2021
Generating Answer Candidates for Quizzes and Answer-Aware Question
  Generators
Generating Answer Candidates for Quizzes and Answer-Aware Question Generators
Kristiyan Vachev
Momchil Hardalov
Georgi Karadzhov
Georgi Georgiev
Ivan Koychev
Preslav Nakov
AI4Ed
46
5
0
29 Aug 2021
Span Fine-tuning for Pre-trained Language Models
Span Fine-tuning for Pre-trained Language Models
Rongzhou Bao
Zhuosheng Zhang
Hai Zhao
50
2
0
29 Aug 2021
Analyzing and Mitigating Interference in Neural Architecture Search
Analyzing and Mitigating Interference in Neural Architecture Search
Jin Xu
Xu Tan
Kaitao Song
Renqian Luo
Yichong Leng
Tao Qin
Tie-Yan Liu
Jian Li
MoMe
91
29
0
29 Aug 2021
Transfer Learning for Multi-lingual Tasks -- a Survey
Transfer Learning for Multi-lingual Tasks -- a Survey
A. Jafari
Behnam Heidary
R. Farahbakhsh
Mostafa Salehi
Mahdi Jalili
LRM
51
5
0
28 Aug 2021
DKM: Differentiable K-Means Clustering Layer for Neural Network
  Compression
DKM: Differentiable K-Means Clustering Layer for Neural Network Compression
Minsik Cho
Keivan Alizadeh Vahid
Saurabh N. Adya
Mohammad Rastegari
95
34
0
28 Aug 2021
Prototype-Guided Memory Replay for Continual Learning
Prototype-Guided Memory Replay for Continual Learning
Stella Ho
Ming Liu
Lan Du
Longxiang Gao
Yong Xiang
CLL
67
32
0
28 Aug 2021
Layer-wise Model Pruning based on Mutual Information
Layer-wise Model Pruning based on Mutual Information
Chun Fan
Jiwei Li
Xiang Ao
Leilei Gan
Yuxian Meng
Xiaofei Sun
84
19
0
28 Aug 2021
Code-switched inspired losses for generic spoken dialog representations
Code-switched inspired losses for generic spoken dialog representations
E. Chapuis
Pierre Colombo
Matthieu Labeau
Chloe Clave
177
12
0
27 Aug 2021
A Partition Filter Network for Joint Entity and Relation Extraction
A Partition Filter Network for Joint Entity and Relation Extraction
Zhiheng Yan
Chong Zhang
Jinlan Fu
Qi Zhang
Zhongyu Wei
120
140
0
27 Aug 2021
Query-Focused Extractive Summarisation for Finding Ideal Answers to
  Biomedical and COVID-19 Questions
Query-Focused Extractive Summarisation for Finding Ideal Answers to Biomedical and COVID-19 Questions
Diego Mollá Aliod
Urvashi Khanna
Dima Galat
Vincent Nguyen
Maciej Rybiński
RALM
71
2
0
27 Aug 2021
Multilingual Multi-Aspect Explainability Analyses on Machine Reading
  Comprehension Models
Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models
Yiming Cui
Weinan Zhang
Wanxiang Che
Ting Liu
Zhigang Chen
Shijin Wang
LRM
47
9
0
26 Aug 2021
Exploring the Promises of Transformer-Based LMs for the Representation
  of Normative Claims in the Legal Domain
Exploring the Promises of Transformer-Based LMs for the Representation of Normative Claims in the Legal Domain
Reto Gubelmann
Peter Hongler
Siegfried Handschuh
AILaw
21
0
0
25 Aug 2021
YANMTT: Yet Another Neural Machine Translation Toolkit
YANMTT: Yet Another Neural Machine Translation Toolkit
Raj Dabre
Eiichiro Sumita
72
13
0
25 Aug 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer
  Models via Low-Rank Approximation
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation
Samuel Cahyawijaya
103
12
0
24 Aug 2021
Are the Multilingual Models Better? Improving Czech Sentiment with
  Transformers
Are the Multilingual Models Better? Improving Czech Sentiment with Transformers
Pavel Přibáň
J. Steinberger
70
11
0
24 Aug 2021
Recurrent multiple shared layers in Depth for Neural Machine Translation
Recurrent multiple shared layers in Depth for Neural Machine Translation
Guoliang Li
Yiyang Li
MoE
48
1
0
23 Aug 2021
APObind: A Dataset of Ligand Unbound Protein Conformations for Machine
  Learning Applications in De Novo Drug Design
APObind: A Dataset of Ligand Unbound Protein Conformations for Machine Learning Applications in De Novo Drug Design
Rishal Aggarwal
Akash Gupta
U. Priyakumar
42
11
0
23 Aug 2021
Fastformer: Additive Attention Can Be All You Need
Fastformer: Additive Attention Can Be All You Need
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
Xing Xie
91
121
0
20 Aug 2021
SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with
  Structured Semantics for Medical Text Mining
SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining
Taolin Zhang
Zerui Cai
Chengyu Wang
Minghui Qiu
Bite Yang
Xiaofeng He
AI4MH
73
54
0
20 Aug 2021
Detection of Illicit Drug Trafficking Events on Instagram: A Deep
  Multimodal Multilabel Learning Approach
Detection of Illicit Drug Trafficking Events on Instagram: A Deep Multimodal Multilabel Learning Approach
Chuanbo Hu
Minglei Yin
Bin Liu
Xin Li
Yanfang Ye
43
15
0
19 Aug 2021
TSI: an Ad Text Strength Indicator using Text-to-CTR and
  Semantic-Ad-Similarity
TSI: an Ad Text Strength Indicator using Text-to-CTR and Semantic-Ad-Similarity
Shaunak Mishra
Changwei Hu
Manisha Verma
Kevin Yen
Yifan Hu
M. Sviridenko
44
8
0
18 Aug 2021
EviDR: Evidence-Emphasized Discrete Reasoning for Reasoning Machine
  Reading Comprehension
EviDR: Evidence-Emphasized Discrete Reasoning for Reasoning Machine Reading Comprehension
Yongwei Zhou
Junwei Bao
Haipeng Sun
Jiahui Liang
Youzheng Wu
Xiaodong He
Bowen Zhou
Tiejun Zhao
29
5
0
18 Aug 2021
RRLFSOR: An Efficient Self-Supervised Learning Strategy of Graph
  Convolutional Networks
RRLFSOR: An Efficient Self-Supervised Learning Strategy of Graph Convolutional Networks
Feng Sun
Ajith Kumar
Guanci Yang
Qikui Zhu
Yiyun Zhang
Ansi Zhang
Dhruv Makwana
SSLGNN
114
0
0
17 Aug 2021
Exploring Generalization Ability of Pretrained Language Models on
  Arithmetic and Logical Reasoning
Exploring Generalization Ability of Pretrained Language Models on Arithmetic and Logical Reasoning
Cunxiang Wang
Boyuan Zheng
Y. Niu
Yue Zhang
LRM
78
23
0
15 Aug 2021
Contrastive Self-supervised Sequential Recommendation with Robust
  Augmentation
Contrastive Self-supervised Sequential Recommendation with Robust Augmentation
Zhiwei Liu
Yong-Guang Chen
Jia Li
Philip S. Yu
Julian McAuley
Caiming Xiong
72
171
0
14 Aug 2021
FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning
FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning
Jing Zhou
Yanan Zheng
Jie Tang
Jian Li
Zhilin Yang
VLM
89
80
0
13 Aug 2021
Low-Resource Adaptation of Open-Domain Generative Chatbots
Low-Resource Adaptation of Open-Domain Generative Chatbots
Greyson Gerhard-Young
R. Anantha
Srinivas Chappidi
Björn Hoffmeister
80
3
0
13 Aug 2021
AMMUS : A Survey of Transformer-based Pretrained Models in Natural
  Language Processing
AMMUS : A Survey of Transformer-based Pretrained Models in Natural Language Processing
Katikapalli Subramanyam Kalyan
A. Rajasekharan
S. Sangeetha
VLMLM&MA
111
270
0
12 Aug 2021
Unsupervised Corpus Aware Language Model Pre-training for Dense Passage
  Retrieval
Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval
Luyu Gao
Jamie Callan
RALM
298
342
0
12 Aug 2021
SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code
  Representation
SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code Representation
Xin Wang
Yasheng Wang
Fei Mi
Pingyi Zhou
Yao Wan
Xiao Liu
Li Li
Hao Wu
Jin Liu
Xin Jiang
142
118
0
10 Aug 2021
Making Transformers Solve Compositional Tasks
Making Transformers Solve Compositional Tasks
Santiago Ontañón
Joshua Ainslie
Vaclav Cvicek
Zachary Kenneth Fisher
109
74
0
09 Aug 2021
Unifying Heterogeneous Electronic Health Records Systems via Text-Based
  Code Embedding
Unifying Heterogeneous Electronic Health Records Systems via Text-Based Code Embedding
Kyunghoon Hur
Jiyoung Lee
Jungwoo Oh
Wesley Price
Young-Hak Kim
Edward Choi
103
19
0
08 Aug 2021
Previous
123...383940...575859
Next