ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Title
DATE: Detecting Anomalies in Text via Self-Supervision of Transformers
DATE: Detecting Anomalies in Text via Self-Supervision of Transformers
Andrei Manolache
Florin Brad
Elena Burceanu
UQCV
68
34
0
12 Apr 2021
Factual Probing Is [MASK]: Learning vs. Learning to Recall
Factual Probing Is [MASK]: Learning vs. Learning to Recall
Zexuan Zhong
Dan Friedman
Danqi Chen
81
413
0
12 Apr 2021
Not All Attention Is All You Need
Not All Attention Is All You Need
Hongqiu Wu
Hai Zhao
Min Zhang
71
9
0
10 Apr 2021
WLV-RIT at SemEval-2021 Task 5: A Neural Transformer Framework for
  Detecting Toxic Spans
WLV-RIT at SemEval-2021 Task 5: A Neural Transformer Framework for Detecting Toxic Spans
Tharindu Ranasinghe
Diptanu Sarkar
Marcos Zampieri
Alexander Ororbia
MedIm
60
13
0
09 Apr 2021
AdCOFE: Advanced Contextual Feature Extraction in Conversations for
  emotion classification
AdCOFE: Advanced Contextual Feature Extraction in Conversations for emotion classification
Vaibhav Bhat
Anita Yadav
Sonal Yadav
Dhivya Chandrasekaran
Vijay K. Mago
40
5
0
09 Apr 2021
Did they answer? Subjective acts and intents in conversational discourse
Did they answer? Subjective acts and intents in conversational discourse
Elisa Ferracane
Greg Durrett
Junjie Li
K. Erk
77
20
0
09 Apr 2021
Larger-Context Tagging: When and Why Does It Work?
Larger-Context Tagging: When and Why Does It Work?
Jinlan Fu
Liangjing Feng
Qi Zhang
Xuanjing Huang
Pengfei Liu
57
5
0
09 Apr 2021
Transformers: "The End of History" for NLP?
Transformers: "The End of History" for NLP?
Anton Chernyavskiy
Dmitry Ilvovsky
Preslav Nakov
117
30
0
09 Apr 2021
Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via
  Layer Consistency
Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via Layer Consistency
Jinchuan Tian
Rongzhi Gu
Helin Wang
Yuexian Zou
46
0
0
08 Apr 2021
A Question-answering Based Framework for Relation Extraction Validation
A Question-answering Based Framework for Relation Extraction Validation
Cheng Jiayang
Haiyun Jiang
Deqing Yang
Yanghua Xiao
38
11
0
07 Apr 2021
Creativity and Machine Learning: A Survey
Creativity and Machine Learning: A Survey
Giorgio Franceschelli
Mirco Musolesi
VLMAI4CE
129
43
0
06 Apr 2021
Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with
  Common Sense and World Knowledge
Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge
Canwen Xu
Wangchunshu Zhou
Tao Ge
Ke Xu
Julian McAuley
Furu Wei
48
16
0
06 Apr 2021
Extremely Low Footprint End-to-End ASR System for Smart Device
Extremely Low Footprint End-to-End ASR System for Smart Device
Zhifu Gao
Yiwu Yao
Shiliang Zhang
Jun Yang
Ming Lei
Ian Mcloughlin
43
13
0
06 Apr 2021
CodeTrans: Towards Cracking the Language of Silicon's Code Through
  Self-Supervised Deep Learning and High Performance Computing
CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing
Ahmed Elnaggar
Wei Ding
Llion Jones
Tom Gibbs
Tamas B. Fehér
Christoph Angerer
Silvia Severini
Florian Matthes
B. Rost
72
72
0
06 Apr 2021
Automating Transfer Credit Assessment in Student Mobility -- A Natural
  Language Processing-based Approach
Automating Transfer Credit Assessment in Student Mobility -- A Natural Language Processing-based Approach
Dhivya Chandrasekaran
Vijay K. Mago
134
2
0
05 Apr 2021
Explainability-aided Domain Generalization for Image Classification
Explainability-aided Domain Generalization for Image Classification
Robin M. Schmidt
FAttOOD
61
1
0
05 Apr 2021
MCL@IITK at SemEval-2021 Task 2: Multilingual and Cross-lingual
  Word-in-Context Disambiguation using Augmented Data, Signals, and
  Transformers
MCL@IITK at SemEval-2021 Task 2: Multilingual and Cross-lingual Word-in-Context Disambiguation using Augmented Data, Signals, and Transformers
Rohan Gupta
Jay Mundra
Deepak Mahajan
Ashutosh Modi
43
3
0
04 Apr 2021
ReCAM@IITK at SemEval-2021 Task 4: BERT and ALBERT based Ensemble for
  Abstract Word Prediction
ReCAM@IITK at SemEval-2021 Task 4: BERT and ALBERT based Ensemble for Abstract Word Prediction
Abhishek Mittal
Ashutosh Modi
30
2
0
04 Apr 2021
Exploring the Role of BERT Token Representations to Explain Sentence
  Probing Results
Exploring the Role of BERT Token Representations to Explain Sentence Probing Results
Hosein Mohebbi
Ali Modarressi
Mohammad Taher Pilehvar
MILM
67
26
0
03 Apr 2021
Humor@IITK at SemEval-2021 Task 7: Large Language Models for Quantifying
  Humor and Offensiveness
Humor@IITK at SemEval-2021 Task 7: Large Language Models for Quantifying Humor and Offensiveness
Aishwarya Gupta
Avik Pal
Bholeshwar Khurana
Lakshay Tyagi
Ashutosh Modi
53
6
0
02 Apr 2021
Action-Based Conversations Dataset: A Corpus for Building More In-Depth
  Task-Oriented Dialogue Systems
Action-Based Conversations Dataset: A Corpus for Building More In-Depth Task-Oriented Dialogue Systems
Derek Chen
Howard Chen
Yi Yang
A. Lin
Zhou Yu
90
70
0
01 Apr 2021
Self-Supervised Euphemism Detection and Identification for Content
  Moderation
Self-Supervised Euphemism Detection and Identification for Content Moderation
Wanzheng Zhu
Hongyu Gong
Rohan Bansal
Zachary Weinberg
Nicolas Christin
Giulia Fanti
S. Bhat
77
40
0
31 Mar 2021
Pre-training for low resource speech-to-intent applications
Pre-training for low resource speech-to-intent applications
Pu Wang
Hugo Van hamme
45
4
0
30 Mar 2021
XRJL-HKUST at SemEval-2021 Task 4: WordNet-Enhanced Dual Multi-head
  Co-Attention for Reading Comprehension of Abstract Meaning
XRJL-HKUST at SemEval-2021 Task 4: WordNet-Enhanced Dual Multi-head Co-Attention for Reading Comprehension of Abstract Meaning
Yuxin Jiang
Ziyi Shou
Qijun Wang
Hao Wu
Fangzhen Lin
RALM
86
2
0
30 Mar 2021
Contextual Text Embeddings for Twi
Contextual Text Embeddings for Twi
P. Azunre
Salomey Osei
S. Addo
Lawrence Asamoah Adu-Gyamfi
Stephen E. Moore
...
Standylove Birago Mensah
Lucien Mensah
Mark Amoako Marcel
A. Amponsah
J. B. Hayfron-Acquah
50
6
0
29 Mar 2021
Retraining DistilBERT for a Voice Shopping Assistant by Using Universal
  Dependencies
Retraining DistilBERT for a Voice Shopping Assistant by Using Universal Dependencies
P. Jayarao
Arpit Sharma
62
2
0
29 Mar 2021
ViViT: A Video Vision Transformer
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
242
2,175
0
29 Mar 2021
Machine Learning Meets Natural Language Processing -- The story so far
Machine Learning Meets Natural Language Processing -- The story so far
N. Galanis
P. Vafiadis
K.-G. Mirzaev
G. Papakostas
82
7
0
27 Mar 2021
A Practical Survey on Faster and Lighter Transformers
A Practical Survey on Faster and Lighter Transformers
Quentin Fournier
G. Caron
Daniel Aloise
137
104
0
26 Mar 2021
Unsupervised Document Embedding via Contrastive Augmentation
Unsupervised Document Embedding via Contrastive Augmentation
Dongsheng Luo
Wei Cheng
Jingchao Ni
Wenchao Yu
Xuchao Zhang
...
Yanchi Liu
Zhengzhang Chen
Dongjin Song
Haifeng Chen
Xiang Zhang
SSL
67
12
0
26 Mar 2021
AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent
  Forecasting
AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting
Ye Yuan
Xinshuo Weng
Yanglan Ou
Kris Kitani
AI4TS
111
459
0
25 Mar 2021
Visual Grounding Strategies for Text-Only Natural Language Processing
Visual Grounding Strategies for Text-Only Natural Language Processing
Damien Sileo
45
8
0
25 Mar 2021
Bertinho: Galician BERT Representations
Bertinho: Galician BERT Representations
David Vilares
Marcos Garcia
Carlos Gómez-Rodríguez
90
22
0
25 Mar 2021
Pruning-then-Expanding Model for Domain Adaptation of Neural Machine
  Translation
Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation
Shuhao Gu
Yang Feng
Wanying Xie
CLLAI4CE
58
28
0
25 Mar 2021
Predicting Directionality in Causal Relations in Text
Predicting Directionality in Causal Relations in Text
Pedram Hosseini
David A. Broniatowski
Mona T. Diab
CML
45
11
0
25 Mar 2021
Finetuning Pretrained Transformers into RNNs
Finetuning Pretrained Transformers into RNNs
Jungo Kasai
Hao Peng
Yizhe Zhang
Dani Yogatama
Gabriel Ilharco
Nikolaos Pappas
Yi Mao
Weizhu Chen
Noah A. Smith
112
67
0
24 Mar 2021
Czert -- Czech BERT-like Model for Language Representation
Czert -- Czech BERT-like Model for Language Representation
Jakub Sido
O. Pražák
P. Pribán
Jan Pasek
Michal Seják
Miloslav Konopík
76
44
0
24 Mar 2021
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New
  Multitask Benchmark
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark
Nicholas Lourie
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
LRM
105
140
0
24 Mar 2021
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning
  Architectures
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning Architectures
Sushant Singh
A. Mahmood
AI4TS
111
95
0
23 Mar 2021
Tiny Transformers for Environmental Sound Classification at the Edge
Tiny Transformers for Environmental Sound Classification at the Edge
David Elliott
Carlos E. Otero
Steven Wyatt
Evan Martino
81
16
0
22 Mar 2021
End-to-End Trainable Multi-Instance Pose Estimation with Transformers
End-to-End Trainable Multi-Instance Pose Estimation with Transformers
Lucas Stoffl
Maxime Vidal
Alexander Mathis
ViT
74
52
0
22 Mar 2021
Improving and Simplifying Pattern Exploiting Training
Improving and Simplifying Pattern Exploiting Training
Derek Tam
Rakesh R Menon
Joey Tianyi Zhou
Shashank Srivastava
Colin Raffel
78
151
0
22 Mar 2021
Identifying Machine-Paraphrased Plagiarism
Identifying Machine-Paraphrased Plagiarism
Jan Philip Wahle
Terry Ruas
Tomávs Foltýnek
Norman Meuschke
Bela Gipp
83
32
0
22 Mar 2021
ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques
ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques
Yuanxin Liu
Zheng Lin
Fengcheng Yuan
VLMMQ
65
20
0
21 Mar 2021
AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive
  Summarization
AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization
Tiezheng Yu
Zihan Liu
Pascale Fung
CLL
107
81
0
21 Mar 2021
Pretraining the Noisy Channel Model for Task-Oriented Dialogue
Pretraining the Noisy Channel Model for Task-Oriented Dialogue
Qi Liu
Lei Yu
Laura Rimell
Phil Blunsom
97
26
0
18 Mar 2021
Refining Language Models with Compositional Explanations
Refining Language Models with Compositional Explanations
Huihan Yao
Ying Chen
Qinyuan Ye
Xisen Jin
Xiang Ren
89
36
0
18 Mar 2021
GLM: General Language Model Pretraining with Autoregressive Blank
  Infilling
GLM: General Language Model Pretraining with Autoregressive Blank Infilling
Zhengxiao Du
Yujie Qian
Xiao Liu
Ming Ding
J. Qiu
Zhilin Yang
Jie Tang
BDLAI4CE
162
1,565
0
18 Mar 2021
Structure Inducing Pre-Training
Structure Inducing Pre-Training
Matthew B. A. McDermott
Brendan Yap
Peter Szolovits
Marinka Zitnik
108
21
0
18 Mar 2021
Large-Scale Zero-Shot Image Classification from Rich and Diverse Textual
  Descriptions
Large-Scale Zero-Shot Image Classification from Rich and Diverse Textual Descriptions
Sebastian Bujwid
Josephine Sullivan
VLM
136
29
0
17 Mar 2021
Previous
123...444546...575859
Next