ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Title
BERT-based Ensembles for Modeling Disclosure and Support in
  Conversational Social Media Text
BERT-based Ensembles for Modeling Disclosure and Support in Conversational Social Media Text
Tanvi Dadu
Kartikey Pant
R. Mamidi
32
9
0
01 Jun 2020
Emergence of Separable Manifolds in Deep Language Representations
Emergence of Separable Manifolds in Deep Language Representations
Jonathan Mamou
Hang Le
Miguel Angel del Rio
Cory Stephenson
Hanlin Tang
Yoon Kim
SueYeon Chung
AAMLAI4CE
107
40
0
01 Jun 2020
Conversational Machine Comprehension: a Literature Review
Conversational Machine Comprehension: a Literature Review
Somil Gupta
Bhanu Pratap Singh Rawat
Hong Yu
74
22
0
01 Jun 2020
A Survey on Transfer Learning in Natural Language Processing
A Survey on Transfer Learning in Natural Language Processing
Zaid Alyafeai
Maged S. Alshaibani
Irfan Ahmad
91
75
0
31 May 2020
LRG at SemEval-2020 Task 7: Assessing the Ability of BERT and Derivative
  Models to Perform Short-Edits based Humor Grading
LRG at SemEval-2020 Task 7: Assessing the Ability of BERT and Derivative Models to Perform Short-Edits based Humor Grading
Siddhant Mahurkar
Rajaswa Patil
46
7
0
31 May 2020
Beyond Leaderboards: A survey of methods for revealing weaknesses in
  Natural Language Inference data and models
Beyond Leaderboards: A survey of methods for revealing weaknesses in Natural Language Inference data and models
Viktor Schlegel
Goran Nenadic
Riza Batista-Navarro
ELM
84
18
0
29 May 2020
ValueNet: A Natural Language-to-SQL System that Learns from Database
  Information
ValueNet: A Natural Language-to-SQL System that Learns from Database Information
Ursin Brunner
Kurt Stockinger
44
10
0
29 May 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
1.1K
42,651
0
28 May 2020
Language Representation Models for Fine-Grained Sentiment Classification
Language Representation Models for Fine-Grained Sentiment Classification
Brian Cheang
Bailey Wei
David Kogan
H. Qiu
Masud Ahmed
AI4MH
33
8
0
27 May 2020
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
A. Kuncoro
Lingpeng Kong
Daniel Fried
Dani Yogatama
Laura Rimell
Chris Dyer
Phil Blunsom
93
34
0
27 May 2020
GECToR -- Grammatical Error Correction: Tag, Not Rewrite
GECToR -- Grammatical Error Correction: Tag, Not Rewrite
Kostiantyn Omelianchuk
Vitaliy Atrasevych
Artem Chernodub
Oleksandr Skurzhanskyi
99
317
0
26 May 2020
ParsBERT: Transformer-based Model for Persian Language Understanding
ParsBERT: Transformer-based Model for Persian Language Understanding
Mehrdad Farahani
Mohammad Gharachorloo
Marzieh Farahani
Mohammad Manthouri
91
210
0
26 May 2020
An Audio-enriched BERT-based Framework for Spoken Multiple-choice
  Question Answering
An Audio-enriched BERT-based Framework for Spoken Multiple-choice Question Answering
Chia-Chih Kuo
Shang-Bao Luo
Kuan-Yu Chen
65
17
0
25 May 2020
NILE : Natural Language Inference with Faithful Natural Language
  Explanations
NILE : Natural Language Inference with Faithful Natural Language Explanations
Sawan Kumar
Partha P. Talukdar
XAILRM
119
163
0
25 May 2020
KaLM at SemEval-2020 Task 4: Knowledge-aware Language Models for
  Comprehension And Generation
KaLM at SemEval-2020 Task 4: Knowledge-aware Language Models for Comprehension And Generation
Jiajing Wan
Xinting Huang
LRM
64
5
0
24 May 2020
Transformer-based Context-aware Sarcasm Detection in Conversation
  Threads from Social Media
Transformer-based Context-aware Sarcasm Detection in Conversation Threads from Social Media
Xiangjue Dong
Changmao Li
Jinho Choi
52
26
0
22 May 2020
Open-Retrieval Conversational Question Answering
Open-Retrieval Conversational Question Answering
Chen Qu
Liu Yang
Cen Chen
Minghui Qiu
W. Bruce Croft
Mohit Iyyer
RALM
85
175
0
22 May 2020
Comparative Study of Machine Learning Models and BERT on SQuAD
Comparative Study of Machine Learning Models and BERT on SQuAD
Devshree Patel
Param Raval
Ratnam Parikh
Yesha Shastri
15
7
0
22 May 2020
PruneNet: Channel Pruning via Global Importance
PruneNet: Channel Pruning via Global Importance
A. Khetan
Zohar Karnin
40
11
0
22 May 2020
Med-BERT: pre-trained contextualized embeddings on large-scale
  structured electronic health records for disease prediction
Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction
L. Rasmy
Yang Xiang
Z. Xie
Cui Tao
Degui Zhi
AI4MHLM&MA
116
704
0
22 May 2020
Pretraining with Contrastive Sentence Objectives Improves Discourse
  Performance of Language Models
Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models
Dan Iter
Kelvin Guu
L. Lansing
Dan Jurafsky
61
78
0
20 May 2020
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based
  Quantized DNNs
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based Quantized DNNs
Yongkweon Jeon
Baeseong Park
S. Kwon
Byeongwook Kim
Jeongin Yun
Dongsoo Lee
MQ
63
31
0
20 May 2020
FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal
  Retrieval
FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval
D. Gao
Linbo Jin
Ben Chen
Minghui Qiu
Peng Li
Yi Wei
Yitao Hu
Haozhe Jasper Wang
OOD
84
134
0
20 May 2020
Normalized Attention Without Probability Cage
Normalized Attention Without Probability Cage
Oliver Richter
Roger Wattenhofer
91
21
0
19 May 2020
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from
  Transformers by Self-supervised Learning of Sketch Gestalt
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sketch Gestalt
Hangyu Lin
Yanwei Fu
Yu-Gang Jiang
Xiangyang Xue
SSL
85
66
0
19 May 2020
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio
  Representation
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation
Po-Han Chi
Pei-Hung Chung
Tsung-Han Wu
Chun-Cheng Hsieh
Yen-Hao Chen
Shang-Wen Li
Hung-yi Lee
SSL
95
148
0
18 May 2020
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory
  Prediction
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction
Cunjun Yu
Xiao Ma
Jiawei Ren
Haiyu Zhao
Shuai Yi
90
475
0
18 May 2020
T-VSE: Transformer-Based Visual Semantic Embedding
T-VSE: Transformer-Based Visual Semantic Embedding
M. Bastan
Arnau Ramisa
Mehmet Tek
ViT
28
7
0
17 May 2020
CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP
  Deep Learning Architectures on Commonsense Reasoning Task
CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP Deep Learning Architectures on Commonsense Reasoning Task
Sirwe Saeedi
Ali (Aliakbar) Panahi
Seyran Saeedi
A. Fong
ReLMELMLRM
69
12
0
17 May 2020
Speech Recognition and Multi-Speaker Diarization of Long Conversations
Speech Recognition and Multi-Speaker Diarization of Long Conversations
H. H. Mao
Shuyang Li
Julian McAuley
G. Cottrell
VLM
85
40
0
16 May 2020
CERT: Contrastive Self-supervised Learning for Language Understanding
CERT: Contrastive Self-supervised Learning for Language Understanding
Hongchao Fang
Sicheng Wang
Meng Zhou
Jiayuan Ding
P. Xie
ELMSSL
76
345
0
16 May 2020
COVID-Twitter-BERT: A Natural Language Processing Model to Analyse
  COVID-19 Content on Twitter
COVID-Twitter-BERT: A Natural Language Processing Model to Analyse COVID-19 Content on Twitter
Martin Müller
M. Salathé
P. Kummervold
VLMMedImAI4MH
94
361
0
15 May 2020
Spelling Error Correction with Soft-Masked BERT
Spelling Error Correction with Soft-Masked BERT
Shaohua Zhang
Haoran Huang
Jicong Liu
Hang Li
60
214
0
15 May 2020
WG-WaveNet: Real-Time High-Fidelity Speech Synthesis without GPU
WG-WaveNet: Real-Time High-Fidelity Speech Synthesis without GPU
Po-Chun Hsu
Hung-yi Lee
44
16
0
15 May 2020
Machine Reading Comprehension: The Role of Contextualized Language
  Models and Beyond
Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond
Zhuosheng Zhang
Hai Zhao
Rui Wang
115
63
0
13 May 2020
Automated Extraction of Socio-political Events from News (AESPEN):
  Workshop and Shared Task Report
Automated Extraction of Socio-political Events from News (AESPEN): Workshop and Shared Task Report
Ali Hürriyetoǧlu
Vanni Zavarella
Hristo Tanev
E. Yoruk
Ali Safaya
Osman Mutlu
45
31
0
12 May 2020
A Report on the 2020 Sarcasm Detection Shared Task
A Report on the 2020 Sarcasm Detection Shared Task
Debanjan Ghosh
Avijit Vajpayee
Smaranda Muresan
64
61
0
12 May 2020
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
Hao Tian
Can Gao
Xinyan Xiao
Hao Liu
Bolei He
Hua Wu
Haifeng Wang
Feng Wu
73
237
0
12 May 2020
How Context Affects Language Models' Factual Predictions
How Context Affects Language Models' Factual Predictions
Fabio Petroni
Patrick Lewis
Aleksandra Piktus
Tim Rocktaschel
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
82
239
0
10 May 2020
schuBERT: Optimizing Elements of BERT
schuBERT: Optimizing Elements of BERT
A. Khetan
Zohar Karnin
86
30
0
09 May 2020
Modeling Document Interactions for Learning to Rank with Regularized
  Self-Attention
Modeling Document Interactions for Learning to Rank with Regularized Self-Attention
Shuo Sun
Kevin Duh
27
4
0
08 May 2020
Detecting East Asian Prejudice on Social Media
Detecting East Asian Prejudice on Social Media
Bertie Vidgen
Austin Botelho
David A. Broniatowski
E. Guest
Matthew Hall
Helen Z. Margetts
Rebekah Tromble
Zeerak Talat
Scott A. Hale
49
101
0
08 May 2020
Distilling Knowledge from Pre-trained Language Models via Text Smoothing
Distilling Knowledge from Pre-trained Language Models via Text Smoothing
Xing Wu
Yebin Liu
Xiangyang Zhou
Dianhai Yu
42
6
0
08 May 2020
SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for
  Multi-Document Summarization
SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization
Yang Gao
Wei Zhao
Steffen Eger
ELM
112
126
0
07 May 2020
The Cascade Transformer: an Application for Efficient Answer Sentence
  Selection
The Cascade Transformer: an Application for Efficient Answer Sentence Selection
Luca Soldaini
Alessandro Moschitti
90
44
0
05 May 2020
ESG2Risk: A Deep Learning Framework from ESG News to Stock Volatility
  Prediction
ESG2Risk: A Deep Learning Framework from ESG News to Stock Volatility Prediction
Tian Guo
N. Jamet
Valentin Betrix
Louis-Alexandre Piquet
E. Hauptmann
AIFin
42
31
0
05 May 2020
Establishing Baselines for Text Classification in Low-Resource Languages
Establishing Baselines for Text Classification in Low-Resource Languages
Jan Christian Blaise Cruz
C. Cheng
99
38
0
05 May 2020
ImpactCite: An XLNet-based method for Citation Impact Analysis
ImpactCite: An XLNet-based method for Citation Impact Analysis
Dominique Mercier
Syed Tahseen Raza Rizvi
Vikas Rajashekar
Andreas Dengel
Sheraz Ahmed
52
16
0
05 May 2020
CAiRE-COVID: A Question Answering and Query-focused Multi-Document
  Summarization System for COVID-19 Scholarly Information Management
CAiRE-COVID: A Question Answering and Query-focused Multi-Document Summarization System for COVID-19 Scholarly Information Management
Dan Su
Yan Xu
Tiezheng Yu
Farhad Bin Siddique
Elham J. Barezi
Pascale Fung
RALM
48
31
0
04 May 2020
Physical reservoir computing -- An introductory perspective
Physical reservoir computing -- An introductory perspective
Kohei Nakajima
86
313
0
03 May 2020
Previous
123...545556575859
Next