ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Title
Simple is Better! Lightweight Data Augmentation for Low Resource Slot
  Filling and Intent Classification
Simple is Better! Lightweight Data Augmentation for Low Resource Slot Filling and Intent Classification
Samuel Louvan
Bernardo Magnini
61
26
0
08 Sep 2020
kk2018 at SemEval-2020 Task 9: Adversarial Training for Code-Mixing
  Sentiment Classification
kk2018 at SemEval-2020 Task 9: Adversarial Training for Code-Mixing Sentiment Classification
Jiaxiang Liu
Xuyi Chen
Shikun Feng
Shuohuan Wang
Ouyang Xuan
Yu Sun
Zhengjie Huang
Weiyue Su
74
20
0
08 Sep 2020
Measuring Massive Multitask Language Understanding
Measuring Massive Multitask Language Understanding
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
ELMRALM
209
4,582
0
07 Sep 2020
UIT-HSE at WNUT-2020 Task 2: Exploiting CT-BERT for Identifying COVID-19
  Information on the Twitter Social Network
UIT-HSE at WNUT-2020 Task 2: Exploiting CT-BERT for Identifying COVID-19 Information on the Twitter Social Network
Khiem Vinh Tran
Hao Phu Phan
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
59
8
0
07 Sep 2020
E-BERT: A Phrase and Product Knowledge Enhanced Language Model for
  E-commerce
E-BERT: A Phrase and Product Knowledge Enhanced Language Model for E-commerce
Denghui Zhang
Zixuan Yuan
Yanchi Liu
Fuzhen Zhuang
Haifeng Chen
Hui Xiong
84
34
0
07 Sep 2020
UPB at SemEval-2020 Task 8: Joint Textual and Visual Modeling in a
  Multi-Task Learning Architecture for Memotion Analysis
UPB at SemEval-2020 Task 8: Joint Textual and Visual Modeling in a Multi-Task Learning Architecture for Memotion Analysis
G. Vlad
George-Eduard Zaharia
Dumitru-Clementin Cercel
Costin-Gabriel Chiru
Stefan Trausan-Matu
76
31
0
06 Sep 2020
A Survey on Machine Learning from Few Samples
A Survey on Machine Learning from Few Samples
Jiang Lu
Pinghua Gong
Jieping Ye
Jianwei Zhang
Changshu Zhang
98
52
0
06 Sep 2020
QiaoNing at SemEval-2020 Task 4: Commonsense Validation and Explanation
  system based on ensemble of language model
QiaoNing at SemEval-2020 Task 4: Commonsense Validation and Explanation system based on ensemble of language model
Pai Liu
LRM
66
6
0
06 Sep 2020
AutoTrans: Automating Transformer Design via Reinforced Architecture Search
Wei-wei Zhu
Xiaoling Wang
Xipeng Qiu
Yuan Ni
Guotong Xie
81
18
0
04 Sep 2020
LiftFormer: 3D Human Pose Estimation using attention models
LiftFormer: 3D Human Pose Estimation using attention models
Adrian Llopart
47
9
0
01 Sep 2020
Rethinking the Objectives of Extractive Question Answering
Rethinking the Objectives of Extractive Question Answering
Martin Fajcik
Josef Jon
Pavel Smrz
97
12
0
28 Aug 2020
Intimate Partner Violence and Injury Prediction From Radiology Reports
Intimate Partner Violence and Injury Prediction From Radiology Reports
Irene Y. Chen
Emily Alsentzer
Hyesun Park
Richard Thomas
B. Gosangi
Rahul Gujrathi
B. Khurana
50
22
0
28 Aug 2020
Short-term Traffic Prediction with Deep Neural Networks: A Survey
Short-term Traffic Prediction with Deep Neural Networks: A Survey
Kyungeun Lee
Moonjung Eo
Euna Jung
Yoonjin Yoon
Wonjong Rhee
GNNAI4TS
70
52
0
28 Aug 2020
GREEK-BERT: The Greeks visiting Sesame Street
GREEK-BERT: The Greeks visiting Sesame Street
John Koutsikakis
Ilias Chalkidis
Prodromos Malakasiotis
Ion Androutsopoulos
70
92
0
27 Aug 2020
Improvement of a dedicated model for open domain persona-aware dialogue
  generation
Improvement of a dedicated model for open domain persona-aware dialogue generation
Qiang Han
49
0
0
27 Aug 2020
AMBERT: A Pre-trained Language Model with Multi-Grained Tokenization
AMBERT: A Pre-trained Language Model with Multi-Grained Tokenization
Xinsong Zhang
Pengshuai Li
Hang Li
95
52
0
27 Aug 2020
Analysis and Evaluation of Language Models for Word Sense Disambiguation
Analysis and Evaluation of Language Models for Word Sense Disambiguation
Daniel Loureiro
Kiamehr Rezaee
Mohammad Taher Pilehvar
Jose Camacho-Collados
93
14
0
26 Aug 2020
Inno at SemEval-2020 Task 11: Leveraging Pure Transformer for
  Multi-Class Propaganda Detection
Inno at SemEval-2020 Task 11: Leveraging Pure Transformer for Multi-Class Propaganda Detection
D. Grigorev
V. Ivanov
21
2
0
26 Aug 2020
JokeMeter at SemEval-2020 Task 7: Convolutional humor
JokeMeter at SemEval-2020 Task 7: Convolutional humor
Martin Docekal
Martin Fajcik
Josef Jon
Pavel Smrz
58
2
0
25 Aug 2020
Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life
  Anecdotes
Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes
Nicholas Lourie
Ronan Le Bras
Yejin Choi
82
125
0
20 Aug 2020
BUT-FIT at SemEval-2020 Task 4: Multilingual commonsense
BUT-FIT at SemEval-2020 Task 4: Multilingual commonsense
Josef Jon
Martin Fajcik
Martin Docekal
Pavel Smrz
74
5
0
17 Aug 2020
Finding Fast Transformers: One-Shot Neural Architecture Search by
  Component Composition
Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Henry Tsai
Jayden Ooi
Chun-Sung Ferng
Hyung Won Chung
Jason Riesa
ViT
80
21
0
15 Aug 2020
On the Importance of Local Information in Transformer Based Models
On the Importance of Local Information in Transformer Based Models
Madhura Pande
Aakriti Budhraja
Preksha Nema
Pratyush Kumar
Mitesh M. Khapra
42
2
0
13 Aug 2020
Prosody Learning Mechanism for Speech Synthesis System Without Text
  Length Limit
Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit
Zhen Zeng
Jianzong Wang
Ning Cheng
Jing Xiao
61
8
0
13 Aug 2020
Compression of Deep Learning Models for Text: A Survey
Compression of Deep Learning Models for Text: A Survey
Manish Gupta
Puneet Agrawal
VLMMedImAI4CE
79
119
0
12 Aug 2020
SemEval-2020 Task 8: Memotion Analysis -- The Visuo-Lingual Metaphor!
SemEval-2020 Task 8: Memotion Analysis -- The Visuo-Lingual Metaphor!
Chhavi Sharma
Deepesh Bhageria
W. Scott
Srinivas Pykl
A. Das
Tanmoy Chakraborty
Viswanath Pulabaigari
Björn Gambäck
92
180
0
09 Aug 2020
SemEval-2020 Task 10: Emphasis Selection for Written Text in Visual
  Media
SemEval-2020 Task 10: Emphasis Selection for Written Text in Visual Media
Amirreza Shirani
Franck Dernoncourt
Nedim Lipka
P. Asente
J. Echevarria
Thamar Solorio
51
21
0
07 Aug 2020
ConvBERT: Improving BERT with Span-based Dynamic Convolution
ConvBERT: Improving BERT with Span-based Dynamic Convolution
Zihang Jiang
Weihao Yu
Daquan Zhou
Yunpeng Chen
Jiashi Feng
Shuicheng Yan
135
162
0
06 Aug 2020
Aligning AI With Shared Human Values
Aligning AI With Shared Human Values
Dan Hendrycks
Collin Burns
Steven Basart
Andrew Critch
Jingkai Li
Basel Alomair
Jacob Steinhardt
153
574
0
05 Aug 2020
DeLighT: Deep and Light-weight Transformer
DeLighT: Deep and Light-weight Transformer
Sachin Mehta
Marjan Ghazvininejad
Srini Iyer
Luke Zettlemoyer
Hannaneh Hajishirzi
VLM
90
32
0
03 Aug 2020
SemEval-2020 Task 5: Counterfactual Recognition
SemEval-2020 Task 5: Counterfactual Recognition
Xiaoyu Yang
Stephen Obadinma
Huasha Zhao
Qiong Zhang
Stan Matwin
Xiao-Dan Zhu
67
42
0
02 Aug 2020
A Survey on Text Classification: From Shallow to Deep Learning
A Survey on Text Classification: From Shallow to Deep Learning
Qian Li
Hao Peng
Jianxin Li
Congyin Xia
Renyu Yang
Lichao Sun
Philip S. Yu
Lifang He
VLM
174
358
0
02 Aug 2020
On Learning Universal Representations Across Languages
On Learning Universal Representations Across Languages
Xiangpeng Wei
Rongxiang Weng
Yue Hu
Luxi Xing
Heng Yu
Weihua Luo
SSLVLM
99
87
0
31 Jul 2020
Language Modelling for Source Code with Transformer-XL
Language Modelling for Source Code with Transformer-XL
Thomas D. Dowdell
Hongyu Zhang
57
8
0
31 Jul 2020
Deep Learning Brasil -- NLP at SemEval-2020 Task 9: Overview of
  Sentiment Analysis of Code-Mixed Tweets
Deep Learning Brasil -- NLP at SemEval-2020 Task 9: Overview of Sentiment Analysis of Code-Mixed Tweets
Manoel Veríssimo dos Santos Neto
Ayrton Amaral
Nádia Félix F. da Silva
A. S. Soares
23
4
0
28 Jul 2020
TensorCoder: Dimension-Wise Attention via Tensor Representation for
  Natural Language Modeling
TensorCoder: Dimension-Wise Attention via Tensor Representation for Natural Language Modeling
Shuai Zhang
Peng Zhang
Xindian Ma
Junqiu Wei
Ning Wang
Qun Liu
27
5
0
28 Jul 2020
ECNU-SenseMaker at SemEval-2020 Task 4: Leveraging Heterogeneous
  Knowledge Resources for Commonsense Validation and Explanation
ECNU-SenseMaker at SemEval-2020 Task 4: Leveraging Heterogeneous Knowledge Resources for Commonsense Validation and Explanation
Qiang Zhao
Siyu Tao
Jie Zhou
Linlin Wang
Xin Lin
Liang He
86
8
0
28 Jul 2020
BUT-FIT at SemEval-2020 Task 5: Automatic detection of counterfactual
  statements with deep pre-trained language representation models
BUT-FIT at SemEval-2020 Task 5: Automatic detection of counterfactual statements with deep pre-trained language representation models
Martin Fajcik
Josef Jon
Martin Docekal
Pavel Smrz
42
11
0
28 Jul 2020
Variants of BERT, Random Forests and SVM approach for Multimodal
  Emotion-Target Sub-challenge
Variants of BERT, Random Forests and SVM approach for Multimodal Emotion-Target Sub-challenge
Hoang Manh Hung
Hyung-Jeong Yang
Soohyung Kim
Gueesang Lee
28
0
0
28 Jul 2020
Public Sentiment Toward Solar Energy: Opinion Mining of Twitter Using a
  Transformer-Based Language Model
Public Sentiment Toward Solar Energy: Opinion Mining of Twitter Using a Transformer-Based Language Model
Serena Y Kim
K. Ganesan
P. Dickens
S. Panda
66
60
0
27 Jul 2020
Self-supervised Learning for Large-scale Item Recommendations
Self-supervised Learning for Large-scale Item Recommendations
Tiansheng Yao
Xinyang Yi
D. Cheng
Felix X. Yu
Ting-Li Chen
...
Lichan Hong
Ed H. Chi
S. Tjoa
Jieqi Kang
Evan Ettinger
SSL
98
49
0
25 Jul 2020
Multi-task learning for natural language processing in the 2020s: where
  are we going?
Multi-task learning for natural language processing in the 2020s: where are we going?
Joseph Worsham
Jugal Kalita
AIMat
73
81
0
22 Jul 2020
XD at SemEval-2020 Task 12: Ensemble Approach to Offensive Language
  Identification in Social Media Using Transformer Encoders
XD at SemEval-2020 Task 12: Ensemble Approach to Offensive Language Identification in Social Media Using Transformer Encoders
Xiangjue Dong
Jinho Choi
52
1
0
21 Jul 2020
CS-NET at SemEval-2020 Task 4: Siamese BERT for ComVE
CS-NET at SemEval-2020 Task 4: Siamese BERT for ComVE
S. Dash
Sandeep K. Routray
P. Varshney
Ashutosh Modi
58
3
0
21 Jul 2020
PanRep: Graph neural networks for extracting universal node embeddings
  in heterogeneous graphs
PanRep: Graph neural networks for extracting universal node embeddings in heterogeneous graphs
V. Ioannidis
Da Zheng
George Karypis
SSL
32
4
0
20 Jul 2020
Mono vs Multilingual Transformer-based Models: a Comparison across
  Several Language Tasks
Mono vs Multilingual Transformer-based Models: a Comparison across Several Language Tasks
Diego de Vargas Feijó
V. Moreira
MILM
29
7
0
19 Jul 2020
Fighting the COVID-19 Infodemic in Social Media: A Holistic Perspective
  and a Call to Arms
Fighting the COVID-19 Infodemic in Social Media: A Holistic Perspective and a Call to Arms
Firoj Alam
Fahim Dalvi
Shaden Shaar
Nadir Durrani
Hamdy Mubarak
...
Giovanni Da San Martino
Ahmed Abdelali
Hassan Sajjad
Kareem Darwish
Preslav Nakov
102
102
0
15 Jul 2020
COBE: Contextualized Object Embeddings from Narrated Instructional Video
COBE: Contextualized Object Embeddings from Narrated Instructional Video
Gedas Bertasius
Lorenzo Torresani
70
24
0
14 Jul 2020
ProtTrans: Towards Cracking the Language of Life's Code Through
  Self-Supervised Deep Learning and High Performance Computing
ProtTrans: Towards Cracking the Language of Life's Code Through Self-Supervised Deep Learning and High Performance Computing
Ahmed Elnaggar
M. Heinzinger
Christian Dallago
Ghalia Rehawi
Yu Wang
...
Tamas B. Fehér
Christoph Angerer
Martin Steinegger
D. Bhowmik
B. Rost
DRL
80
967
0
13 Jul 2020
TERA: Self-Supervised Learning of Transformer Encoder Representation for
  Speech
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech
Andy T. Liu
Shang-Wen Li
Hung-yi Lee
SSL
173
361
0
12 Jul 2020
Previous
123...525354...575859
Next