ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Title
A Closer Look at Linguistic Knowledge in Masked Language Models: The
  Case of Relative Clauses in American English
A Closer Look at Linguistic Knowledge in Masked Language Models: The Case of Relative Clauses in American English
Marius Mosbach
Stefania Degaetano-Ortlieb
Marie-Pauline Krielke
Badr M. Abdullah
Dietrich Klakow
27
7
0
02 Nov 2020
COSMO: Conditional SEQ2SEQ-based Mixture Model for Zero-Shot Commonsense
  Question Answering
COSMO: Conditional SEQ2SEQ-based Mixture Model for Zero-Shot Commonsense Question Answering
Farhad Moghimifar
Zhuang Li
Terry Yue Zhuo
Mahsa Baktashmotlagh
Gholamreza Haffari
LRM
43
8
0
02 Nov 2020
I Know What You Asked: Graph Path Learning using AMR for Commonsense
  Reasoning
I Know What You Asked: Graph Path Learning using AMR for Commonsense Reasoning
J. Lim
Dongsuk Oh
Yoonna Jang
Kisu Yang
Heuiseok Lim
ReLMGNNLRM
62
35
0
02 Nov 2020
Transformer-based Multi-Aspect Modeling for Multi-Aspect Multi-Sentiment
  Analysis
Transformer-based Multi-Aspect Modeling for Multi-Aspect Multi-Sentiment Analysis
Zhanghua Wu
Chengcan Ying
Xinyu Dai
Shujian Huang
Jiajun Chen
18
9
0
01 Nov 2020
Understanding Pre-trained BERT for Aspect-based Sentiment Analysis
Understanding Pre-trained BERT for Aspect-based Sentiment Analysis
Hu Xu
Lei Shu
Philip S. Yu
Bing-Quan Liu
SSL
119
46
0
31 Oct 2020
SLM: Learning a Discourse Language Representation with Sentence
  Unshuffling
SLM: Learning a Discourse Language Representation with Sentence Unshuffling
Haejun Lee
Drew A. Hudson
Kangwook Lee
Christopher D. Manning
SSL
119
52
0
30 Oct 2020
A Comprehensive Survey on Word Representation Models: From Classical to
  State-Of-The-Art Word Representation Language Models
A Comprehensive Survey on Word Representation Models: From Classical to State-Of-The-Art Word Representation Language Models
Usman Naseem
Imran Razzak
S. Khan
M. Prasad
86
161
0
28 Oct 2020
Unmasking Contextual Stereotypes: Measuring and Mitigating BERT's Gender
  Bias
Unmasking Contextual Stereotypes: Measuring and Mitigating BERT's Gender Bias
Marion Bartl
Malvina Nissim
Albert Gatt
91
125
0
27 Oct 2020
Probing Task-Oriented Dialogue Representation from Language Models
Probing Task-Oriented Dialogue Representation from Language Models
Chien-Sheng Wu
Caiming Xiong
79
20
0
26 Oct 2020
UPB at SemEval-2020 Task 12: Multilingual Offensive Language Detection
  on Social Media by Fine-tuning a Variety of BERT-based Models
UPB at SemEval-2020 Task 12: Multilingual Offensive Language Detection on Social Media by Fine-tuning a Variety of BERT-based Models
Mircea-Adrian Tanase
Dumitru-Clementin Cercel
Costin-Gabriel Chiru
42
15
0
26 Oct 2020
Accelerating Training of Transformer-Based Language Models with
  Progressive Layer Dropping
Accelerating Training of Transformer-Based Language Models with Progressive Layer Dropping
Minjia Zhang
Yuxiong He
AI4CE
48
104
0
26 Oct 2020
Learning Contextualized Knowledge Structures for Commonsense Reasoning
Learning Contextualized Knowledge Structures for Commonsense Reasoning
Jun Yan
Mrigank Raman
Aaron Chan
Tianyu Zhang
Ryan Rossi
Handong Zhao
Sungchul Kim
Nedim Lipka
Xiang Ren
306
37
0
24 Oct 2020
Rethinking embedding coupling in pre-trained language models
Rethinking embedding coupling in pre-trained language models
Hyung Won Chung
Thibault Févry
Henry Tsai
Melvin Johnson
Sebastian Ruder
177
143
0
24 Oct 2020
A Frustratingly Easy Approach for Entity and Relation Extraction
A Frustratingly Easy Approach for Entity and Relation Extraction
Zexuan Zhong
Danqi Chen
229
108
0
24 Oct 2020
Pre-training Text-to-Text Transformers for Concept-centric Common Sense
Pre-training Text-to-Text Transformers for Concept-centric Common Sense
Wangchunshu Zhou
Dong-Ho Lee
Ravi Kiran Selvam
Seyeon Lee
Bill Yuchen Lin
Xiang Ren
LRMVLM
55
72
0
24 Oct 2020
Improving Multilingual Models with Language-Clustered Vocabularies
Improving Multilingual Models with Language-Clustered Vocabularies
Hyung Won Chung
Dan Garrette
Kiat Chuan Tan
Jason Riesa
VLM
129
65
0
24 Oct 2020
ANLIzing the Adversarial Natural Language Inference Dataset
ANLIzing the Adversarial Natural Language Inference Dataset
Adina Williams
Tristan Thrush
Douwe Kiela
AAML
255
47
0
24 Oct 2020
Comparative analysis of word embeddings in assessing semantic similarity
  of complex sentences
Comparative analysis of word embeddings in assessing semantic similarity of complex sentences
Dhivya Chandrasekaran
Vijay K. Mago
124
8
0
23 Oct 2020
DICT-MLM: Improved Multilingual Pre-Training using Bilingual
  Dictionaries
DICT-MLM: Improved Multilingual Pre-Training using Bilingual Dictionaries
Aditi Chaudhary
K. Raman
Krishna Srinivasan
Jiecao Chen
81
25
0
23 Oct 2020
GiBERT: Introducing Linguistic Knowledge into BERT through a Lightweight
  Gated Injection Method
GiBERT: Introducing Linguistic Knowledge into BERT through a Lightweight Gated Injection Method
Nicole Peinelt
Marek Rei
Maria Liakata
55
2
0
23 Oct 2020
Generating Plausible Counterfactual Explanations for Deep Transformers
  in Financial Text Classification
Generating Plausible Counterfactual Explanations for Deep Transformers in Financial Text Classification
Linyi Yang
Eoin M. Kenny
T. L. J. Ng
Yi Yang
Barry Smyth
Ruihai Dong
96
73
0
23 Oct 2020
Event-Driven Learning of Systematic Behaviours in Stock Markets
Event-Driven Learning of Systematic Behaviours in Stock Markets
Xianchao Wu
AIFin
49
7
0
23 Oct 2020
Pretraining and Fine-Tuning Strategies for Sentiment Analysis of Latvian
  Tweets
Pretraining and Fine-Tuning Strategies for Sentiment Analysis of Latvian Tweets
Gaurish Thakkar
Marcis Pinnis
95
9
0
23 Oct 2020
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling
  for Natural Language Understanding
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
Dongling Xiao
Yukun Li
Han Zhang
Yu Sun
Hao Tian
Hua Wu
Haifeng Wang
29
39
0
23 Oct 2020
Language Models are Open Knowledge Graphs
Language Models are Open Knowledge Graphs
Chenguang Wang
Xiao Liu
Basel Alomair
SSLKELM
79
137
0
22 Oct 2020
AdapterDrop: On the Efficiency of Adapters in Transformers
AdapterDrop: On the Efficiency of Adapters in Transformers
Andreas Rucklé
Gregor Geigle
Max Glockner
Tilman Beck
Jonas Pfeiffer
Nils Reimers
Iryna Gurevych
125
267
0
22 Oct 2020
Contrastive Self-Supervised Learning for Wireless Power Control
Contrastive Self-Supervised Learning for Wireless Power Control
Navid Naderializadeh
SSL
95
7
0
22 Oct 2020
Towards Fully Bilingual Deep Language Modeling
Towards Fully Bilingual Deep Language Modeling
Li-Hsin Chang
S. Pyysalo
Jenna Kanerva
Filip Ginter
67
3
0
22 Oct 2020
Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution
  Data
Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
Lingkai Kong
Haoming Jiang
Yuchen Zhuang
Jie Lyu
T. Zhao
Chao Zhang
OODD
91
26
0
22 Oct 2020
Detection of COVID-19 informative tweets using RoBERTa
Detection of COVID-19 informative tweets using RoBERTa
Sirigireddy Dhanalaxmi
Rohit Agarwal
Aman Sinha
42
7
0
21 Oct 2020
Knowledge Distillation for Improved Accuracy in Spoken Question
  Answering
Knowledge Distillation for Improved Accuracy in Spoken Question Answering
Chenyu You
Nuo Chen
Yuexian Zou
112
52
0
21 Oct 2020
Contextualized Attention-based Knowledge Transfer for Spoken
  Conversational Question Answering
Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering
Chenyu You
Nuo Chen
Yuexian Zou
111
38
0
21 Oct 2020
Complaint Identification in Social Media with Transformer Networks
Complaint Identification in Social Media with Transformer Networks
Mali Jin
Nikolaos Aletras
44
16
0
21 Oct 2020
German's Next Language Model
German's Next Language Model
Branden Chan
Stefan Schweter
Timo Möller
104
276
0
21 Oct 2020
Learning to Embed Categorical Features without Embedding Tables for
  Recommendation
Learning to Embed Categorical Features without Embedding Tables for Recommendation
Wang-Cheng Kang
D. Cheng
Tiansheng Yao
Xinyang Yi
Ting-Li Chen
Lichan Hong
Ed H. Chi
LMTDCMLDML
110
72
0
21 Oct 2020
Self-supervised Graph Learning for Recommendation
Self-supervised Graph Learning for Recommendation
Jiancan Wu
Xiang Wang
Fuli Feng
Xiangnan He
Liang Chen
Jianxun Lian
Xing Xie
SSL
202
1,182
0
21 Oct 2020
An Empirical Investigation of Contextualized Number Prediction
An Empirical Investigation of Contextualized Number Prediction
Daniel M. Spokoyny
Taylor Berg-Kirkpatrick
AI4TS
80
38
0
20 Oct 2020
Bayesian Attention Modules
Bayesian Attention Modules
Xinjie Fan
Shujian Zhang
Bo Chen
Mingyuan Zhou
183
62
0
20 Oct 2020
Optimal Subarchitecture Extraction For BERT
Optimal Subarchitecture Extraction For BERT
Adrian de Wynter
Daniel J. Perry
MQ
98
18
0
20 Oct 2020
BERT2DNN: BERT Distillation with Massive Unlabeled Data for Online
  E-Commerce Search
BERT2DNN: BERT Distillation with Massive Unlabeled Data for Online E-Commerce Search
Yunjiang Jiang
Yue Shang
Ziyang Liu
Hongwei Shen
Yun Xiao
Wei Xiong
Sulong Xu
Weipeng P. Yan
Di Jin
62
17
0
20 Oct 2020
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary
  Representations From Characters
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters
Hicham El Boukkouri
Olivier Ferret
Thomas Lavergne
Hiroshi Noji
Pierre Zweigenbaum
Junichi Tsujii
164
161
0
20 Oct 2020
A Benchmark for Lease Contract Review
A Benchmark for Lease Contract Review
Spyretta Leivaditi
Julien Rossi
Evangelos Kanoulas
AILaw
175
37
0
20 Oct 2020
Bi-directional Cognitive Thinking Network for Machine Reading
  Comprehension
Bi-directional Cognitive Thinking Network for Machine Reading Comprehension
Wei Peng
Yue Hu
Luxi Xing
Yuqiang Xie
Jing Yu
Yajing Sun
Xiangpeng Wei
59
7
0
20 Oct 2020
CoRT: Complementary Rankings from Transformers
CoRT: Complementary Rankings from Transformers
Marco Wrzalik
D. Krechel
44
10
0
20 Oct 2020
ChemBERTa: Large-Scale Self-Supervised Pretraining for Molecular
  Property Prediction
ChemBERTa: Large-Scale Self-Supervised Pretraining for Molecular Property Prediction
Seyone Chithrananda
Gabriel Grand
Bharath Ramsundar
AI4CE
128
415
0
19 Oct 2020
Technical Question Answering across Tasks and Domains
Technical Question Answering across Tasks and Domains
Wenhao Yu
Lingfei Wu
Yu Deng
Qingkai Zeng
R. Mahindru
S. Guven
Meng Jiang
60
8
0
19 Oct 2020
An Empirical Study for Vietnamese Constituency Parsing with Pre-training
An Empirical Study for Vietnamese Constituency Parsing with Pre-training
Tuan-Vi Tran
Xuan-Thien Pham
Duc-Vu Nguyen
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
46
4
0
19 Oct 2020
Better Distractions: Transformer-based Distractor Generation and
  Multiple Choice Question Filtering
Better Distractions: Transformer-based Distractor Generation and Multiple Choice Question Filtering
J. Offerijns
Suzan Verberne
Tessa Verhoef
59
26
0
19 Oct 2020
Knowledge-guided Open Attribute Value Extraction with Reinforcement
  Learning
Knowledge-guided Open Attribute Value Extraction with Reinforcement Learning
Ye Liu
Sheng Zhang
Rui Song
Suo Feng
Yanghua Xiao
62
8
0
19 Oct 2020
Towards Interpreting BERT for Reading Comprehension Based QA
Towards Interpreting BERT for Reading Comprehension Based QA
Sahana Ramnath
Preksha Nema
Deep Sahni
Mitesh M. Khapra
94
30
0
18 Oct 2020
Previous
123...495051...575859
Next