ResearchTrend.AI
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSL
    AIMat

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,913 papers shown
Hierarchical Pre-training for Sequence Labelling in Spoken Dialog
E. Chapuis
Pierre Colombo
Matteo Manica
Matthieu Labeau
Chloé Clavel
22
58
0
23 Sep 2020
Hamming OCR: A Locality Sensitive Hashing Neural Network for Scene Text Recognition
B. Li
Xin Tang
Xianbiao Qi
Yihao Chen
Rong Xiao
29
8
0
23 Sep 2020
Global-to-Local Neural Networks for Document-Level Relation Extraction
D. Wang
Wei Hu
E. Cao
Weijian Sun
NAI
18
118
0
22 Sep 2020
Preserving Integrity in Online Social Networks
A. Halevy
Cristian Canton Ferrer
Hao Ma
Umut Ozertem
Patrick Pantel
Marzieh Saeidi
Fabrizio Silvestri
Ves Stoyanov
22
57
0
22 Sep 2020
BioALBERT: A Simple and Effective Pre-trained Language Model for Biomedical Named Entity Recognition
Usman Naseem
Matloob Khushi
V. Reddy
S. Rajendran
Imran Razzak
Jinman Kim
24
63
0
19 Sep 2020
Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations
Yuan Zang
Bairu Hou
Fanchao Qi
Zhiyuan Liu
Xiaojun Meng
Maosong Sun
22
11
0
19 Sep 2020
The birth of Romanian BERT
Stefan Daniel Dumitrescu
Andrei-Marius Avram
S. Pyysalo
VLM
8
76
0
18 Sep 2020
A Multimodal Memes Classification: A Survey and Open Research Issues
Tariq Habib Afridi
A. Alam
Muhammad Numan Khan
Jawad Khan
Young-Koo Lee
29
35
0
17 Sep 2020
ISCAS at SemEval-2020 Task 5: Pre-trained Transformers for Counterfactual Statement Modeling
Yaojie Lu
Annan Li
Hongyu Lin
Xianpei Han
Le Sun
14
5
0
17 Sep 2020
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning
Bingbing Li
Zhenglun Kong
Tianyun Zhang
Ji Li
Zechao Li
Hang Liu
Caiwen Ding
VLM
32
64
0
17 Sep 2020
Answering Any-hop Open-domain Questions with Iterative Document Reranking
Ping Nie
Yuyu Zhang
Arun Ramamurthy
Le Song
33
20
0
16 Sep 2020
Question Directed Graph Attention Network for Numerical Reasoning over Text
Kunlong Chen
Weidi Xu
Xingyi Cheng
Zou Xiaochuan
Yuyu Zhang
Le Song
Taifeng Wang
Yuan Qi
Wei Chu
AIMat
OOD
30
66
0
16 Sep 2020
Multi-span Style Extraction for Generative Reading Comprehension
Junjie Yang
ZhuoSheng Zhang
Hai Zhao
SyDa
19
14
0
15 Sep 2020
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
Timo Schick
Hinrich Schütze
51
956
0
15 Sep 2020
MLMLM: Link Prediction with Mean Likelihood Masked Language Model
Louis Clouâtre
P. Trempe
Amal Zouaq
Sarath Chandar
25
43
0
15 Sep 2020
Efficient Transformers: A Survey
Yi Tay
Mostafa Dehghani
Dara Bahri
Donald Metzler
VLM
116
1,104
0
14 Sep 2020
Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding
Shuohang Wang
Luowei Zhou
Zhe Gan
Yen-Chun Chen
Yuwei Fang
S. Sun
Yu Cheng
Jingjing Liu
43
28
0
13 Sep 2020
DualDE: Dually Distilling Knowledge Graph Embedding for Faster and Cheaper Reasoning
Yushan Zhu
Wen Zhang
Mingyang Chen
Hui Chen
Xu-Xin Cheng
Wei Zhang
Huajun Chen (Zhejiang University)
22
39
0
13 Sep 2020
Syntax Role for Neural Semantic Role Labeling
Z. Li
Hai Zhao
Shexia He
Jiaxun Cai
NAI
25
19
0
12 Sep 2020
Compressed Deep Networks: Goodbye SVD, Hello Robust Low-Rank Approximation
M. Tukan
Alaa Maalouf
Matan Weksler
Dan Feldman
23
9
0
11 Sep 2020
UPB at SemEval-2020 Task 6: Pretrained Language Models for Definition Extraction
Andrei-Marius Avram
Dumitru-Clementin Cercel
Costin-Gabriel Chiru
11
7
0
11 Sep 2020
Semantic Relations and Deep Learning
Vivi Nastase
Stan Szpakowicz
GNN
14
0
0
11 Sep 2020
Rank over Class: The Untapped Potential of Ranking in Natural Language Processing
Amir Atapour-Abarghouei
Stephen Bonner
A. Mcgough
13
4
0
10 Sep 2020
Dialogue-adaptive Language Model Pre-training From Quality Estimation
Junlong Li
ZhuoSheng Zhang
Hai Zhao
OffRL
27
12
0
10 Sep 2020
Modern Methods for Text Generation
Dimas Muñoz-Montesinos
14
5
0
10 Sep 2020
Learning Universal Representations from Word to Sentence
Yian Li
Hai Zhao
SSL
12
2
0
10 Sep 2020
Comparative Study of Language Models on Cross-Domain Data with Model Agnostic Explainability
Mayank Chhipa
Hrushikesh Mahesh Vazurkar
Abhijeet Kumar
Mridul Mishra
23
0
0
09 Sep 2020
ERNIE at SemEval-2020 Task 10: Learning Word Emphasis Selection by Pre-trained Language Model
Zhengjie Huang
Shikun Feng
Weiyue Su
Xuyi Chen
Shuohuan Wang
Jiaxiang Liu
Ouyang Xuan
Yu Sun
27
8
0
08 Sep 2020
Simple is Better! Lightweight Data Augmentation for Low Resource Slot Filling and Intent Classification
Samuel Louvan
Bernardo Magnini
12
26
0
08 Sep 2020
kk2018 at SemEval-2020 Task 9: Adversarial Training for Code-Mixing Sentiment Classification
Jiaxiang Liu
Xuyi Chen
Shikun Feng
Shuohuan Wang
Ouyang Xuan
Yu Sun
Zhengjie Huang
Weiyue Su
35
19
0
08 Sep 2020
Measuring Massive Multitask Language Understanding
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
D. Song
Jacob Steinhardt
ELM
RALM
84
3,969
0
07 Sep 2020
UIT-HSE at WNUT-2020 Task 2: Exploiting CT-BERT for Identifying COVID-19 Information on the Twitter Social Network
Khiem Vinh Tran
Hao Phu Phan
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
24
8
0
07 Sep 2020
E-BERT: A Phrase and Product Knowledge Enhanced Language Model for E-commerce
Denghui Zhang
Zixuan Yuan
Yanchi Liu
Fuzhen Zhuang
Haifeng Chen
Hui Xiong
25
33
0
07 Sep 2020
UPB at SemEval-2020 Task 8: Joint Textual and Visual Modeling in a Multi-Task Learning Architecture for Memotion Analysis
G. Vlad
George-Eduard Zaharia
Dumitru-Clementin Cercel
Costin-Gabriel Chiru
Stefan Trausan-Matu
33
29
0
06 Sep 2020
A Survey on Machine Learning from Few Samples
Jiang Lu
Pinghua Gong
Jieping Ye
Jianwei Zhang
Changshu Zhang
22
47
0
06 Sep 2020
QiaoNing at SemEval-2020 Task 4: Commonsense Validation and Explanation system based on ensemble of language model
Pai Liu
LRM
30
6
0
06 Sep 2020
AutoTrans: Automating Transformer Design via Reinforced Architecture Search
Wei-wei Zhu
Xiaoling Wang
Xipeng Qiu
Yuan Ni
Guotong Xie
35
18
0
04 Sep 2020
LiftFormer: 3D Human Pose Estimation using attention models
Adrian Llopart
22
9
0
01 Sep 2020
Rethinking the Objectives of Extractive Question Answering
Martin Fajcik
Josef Jon
Pavel Smrz
15
12
0
28 Aug 2020
Intimate Partner Violence and Injury Prediction From Radiology Reports
Irene Y. Chen
Emily Alsentzer
Hyesun Park
Richard Thomas
B. Gosangi
Rahul Gujrathi
B. Khurana
11
21
0
28 Aug 2020
Short-term Traffic Prediction with Deep Neural Networks: A Survey
Kyungeun Lee
Moonjung Eo
Euna Jung
Yoonjin Yoon
Wonjong Rhee
GNN
AI4TS
24
52
0
28 Aug 2020
GREEK-BERT: The Greeks visiting Sesame Street
John Koutsikakis
Ilias Chalkidis
Prodromos Malakasiotis
Ion Androutsopoulos
24
89
0
27 Aug 2020
Improvement of a dedicated model for open domain persona-aware dialogue generation
Qiang Han
6
0
0
27 Aug 2020
AMBERT: A Pre-trained Language Model with Multi-Grained Tokenization
Xinsong Zhang
Pengshuai Li
Hang Li
18
51
0
27 Aug 2020
Analysis and Evaluation of Language Models for Word Sense Disambiguation
Daniel Loureiro
Kiamehr Rezaee
Mohammad Taher Pilehvar
Jose Camacho-Collados
33
13
0
26 Aug 2020
Inno at SemEval-2020 Task 11: Leveraging Pure Transformer for Multi-Class Propaganda Detection
D. Grigorev
V. Ivanov
19
2
0
26 Aug 2020
JokeMeter at SemEval-2020 Task 7: Convolutional humor
Martin Docekal
Martin Fajcik
Josef Jon
Pavel Smrz
29
2
0
25 Aug 2020
Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes
Nicholas Lourie
Ronan Le Bras
Yejin Choi
6
118
0
20 Aug 2020
BUT-FIT at SemEval-2020 Task 4: Multilingual commonsense
Josef Jon
Martin Fajcik
Martin Docekal
Pavel Smrz
35
5
0
17 Aug 2020
Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Henry Tsai
Jayden Ooi
Chun-Sung Ferng
Hyung Won Chung
Jason Riesa
ViT
25
21
0
15 Aug 2020