Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.11942
Cited By
v1
v2
v3
v4
v5
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,935 papers shown
Title
BERT-based Ensembles for Modeling Disclosure and Support in Conversational Social Media Text
Tanvi Dadu
Kartikey Pant
R. Mamidi
32
9
0
01 Jun 2020
Emergence of Separable Manifolds in Deep Language Representations
Jonathan Mamou
Hang Le
Miguel Angel del Rio
Cory Stephenson
Hanlin Tang
Yoon Kim
SueYeon Chung
AAML
AI4CE
107
40
0
01 Jun 2020
Conversational Machine Comprehension: a Literature Review
Somil Gupta
Bhanu Pratap Singh Rawat
Hong Yu
74
22
0
01 Jun 2020
A Survey on Transfer Learning in Natural Language Processing
Zaid Alyafeai
Maged S. Alshaibani
Irfan Ahmad
91
75
0
31 May 2020
LRG at SemEval-2020 Task 7: Assessing the Ability of BERT and Derivative Models to Perform Short-Edits based Humor Grading
Siddhant Mahurkar
Rajaswa Patil
46
7
0
31 May 2020
Beyond Leaderboards: A survey of methods for revealing weaknesses in Natural Language Inference data and models
Viktor Schlegel
Goran Nenadic
Riza Batista-Navarro
ELM
84
18
0
29 May 2020
ValueNet: A Natural Language-to-SQL System that Learns from Database Information
Ursin Brunner
Kurt Stockinger
44
10
0
29 May 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
1.1K
42,651
0
28 May 2020
Language Representation Models for Fine-Grained Sentiment Classification
Brian Cheang
Bailey Wei
David Kogan
H. Qiu
Masud Ahmed
AI4MH
33
8
0
27 May 2020
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
A. Kuncoro
Lingpeng Kong
Daniel Fried
Dani Yogatama
Laura Rimell
Chris Dyer
Phil Blunsom
93
34
0
27 May 2020
GECToR -- Grammatical Error Correction: Tag, Not Rewrite
Kostiantyn Omelianchuk
Vitaliy Atrasevych
Artem Chernodub
Oleksandr Skurzhanskyi
99
317
0
26 May 2020
ParsBERT: Transformer-based Model for Persian Language Understanding
Mehrdad Farahani
Mohammad Gharachorloo
Marzieh Farahani
Mohammad Manthouri
91
210
0
26 May 2020
An Audio-enriched BERT-based Framework for Spoken Multiple-choice Question Answering
Chia-Chih Kuo
Shang-Bao Luo
Kuan-Yu Chen
65
17
0
25 May 2020
NILE : Natural Language Inference with Faithful Natural Language Explanations
Sawan Kumar
Partha P. Talukdar
XAI
LRM
119
163
0
25 May 2020
KaLM at SemEval-2020 Task 4: Knowledge-aware Language Models for Comprehension And Generation
Jiajing Wan
Xinting Huang
LRM
64
5
0
24 May 2020
Transformer-based Context-aware Sarcasm Detection in Conversation Threads from Social Media
Xiangjue Dong
Changmao Li
Jinho Choi
52
26
0
22 May 2020
Open-Retrieval Conversational Question Answering
Chen Qu
Liu Yang
Cen Chen
Minghui Qiu
W. Bruce Croft
Mohit Iyyer
RALM
85
175
0
22 May 2020
Comparative Study of Machine Learning Models and BERT on SQuAD
Devshree Patel
Param Raval
Ratnam Parikh
Yesha Shastri
15
7
0
22 May 2020
PruneNet: Channel Pruning via Global Importance
A. Khetan
Zohar Karnin
40
11
0
22 May 2020
Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction
L. Rasmy
Yang Xiang
Z. Xie
Cui Tao
Degui Zhi
AI4MH
LM&MA
116
704
0
22 May 2020
Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models
Dan Iter
Kelvin Guu
L. Lansing
Dan Jurafsky
61
78
0
20 May 2020
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based Quantized DNNs
Yongkweon Jeon
Baeseong Park
S. Kwon
Byeongwook Kim
Jeongin Yun
Dongsoo Lee
MQ
63
31
0
20 May 2020
FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval
D. Gao
Linbo Jin
Ben Chen
Minghui Qiu
Peng Li
Yi Wei
Yitao Hu
Haozhe Jasper Wang
OOD
84
134
0
20 May 2020
Normalized Attention Without Probability Cage
Oliver Richter
Roger Wattenhofer
91
21
0
19 May 2020
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sketch Gestalt
Hangyu Lin
Yanwei Fu
Yu-Gang Jiang
Xiangyang Xue
SSL
85
66
0
19 May 2020
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation
Po-Han Chi
Pei-Hung Chung
Tsung-Han Wu
Chun-Cheng Hsieh
Yen-Hao Chen
Shang-Wen Li
Hung-yi Lee
SSL
95
148
0
18 May 2020
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction
Cunjun Yu
Xiao Ma
Jiawei Ren
Haiyu Zhao
Shuai Yi
90
475
0
18 May 2020
T-VSE: Transformer-Based Visual Semantic Embedding
M. Bastan
Arnau Ramisa
Mehmet Tek
ViT
28
7
0
17 May 2020
CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP Deep Learning Architectures on Commonsense Reasoning Task
Sirwe Saeedi
Ali (Aliakbar) Panahi
Seyran Saeedi
A. Fong
ReLM
ELM
LRM
69
12
0
17 May 2020
Speech Recognition and Multi-Speaker Diarization of Long Conversations
H. H. Mao
Shuyang Li
Julian McAuley
G. Cottrell
VLM
85
40
0
16 May 2020
CERT: Contrastive Self-supervised Learning for Language Understanding
Hongchao Fang
Sicheng Wang
Meng Zhou
Jiayuan Ding
P. Xie
ELM
SSL
76
345
0
16 May 2020
COVID-Twitter-BERT: A Natural Language Processing Model to Analyse COVID-19 Content on Twitter
Martin Müller
M. Salathé
P. Kummervold
VLM
MedIm
AI4MH
94
361
0
15 May 2020
Spelling Error Correction with Soft-Masked BERT
Shaohua Zhang
Haoran Huang
Jicong Liu
Hang Li
60
214
0
15 May 2020
WG-WaveNet: Real-Time High-Fidelity Speech Synthesis without GPU
Po-Chun Hsu
Hung-yi Lee
44
16
0
15 May 2020
Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond
Zhuosheng Zhang
Hai Zhao
Rui Wang
115
63
0
13 May 2020
Automated Extraction of Socio-political Events from News (AESPEN): Workshop and Shared Task Report
Ali Hürriyetoǧlu
Vanni Zavarella
Hristo Tanev
E. Yoruk
Ali Safaya
Osman Mutlu
45
31
0
12 May 2020
A Report on the 2020 Sarcasm Detection Shared Task
Debanjan Ghosh
Avijit Vajpayee
Smaranda Muresan
64
61
0
12 May 2020
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
Hao Tian
Can Gao
Xinyan Xiao
Hao Liu
Bolei He
Hua Wu
Haifeng Wang
Feng Wu
73
237
0
12 May 2020
How Context Affects Language Models' Factual Predictions
Fabio Petroni
Patrick Lewis
Aleksandra Piktus
Tim Rocktaschel
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
82
239
0
10 May 2020
schuBERT: Optimizing Elements of BERT
A. Khetan
Zohar Karnin
86
30
0
09 May 2020
Modeling Document Interactions for Learning to Rank with Regularized Self-Attention
Shuo Sun
Kevin Duh
27
4
0
08 May 2020
Detecting East Asian Prejudice on Social Media
Bertie Vidgen
Austin Botelho
David A. Broniatowski
E. Guest
Matthew Hall
Helen Z. Margetts
Rebekah Tromble
Zeerak Talat
Scott A. Hale
49
101
0
08 May 2020
Distilling Knowledge from Pre-trained Language Models via Text Smoothing
Xing Wu
Yebin Liu
Xiangyang Zhou
Dianhai Yu
42
6
0
08 May 2020
SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization
Yang Gao
Wei Zhao
Steffen Eger
ELM
112
126
0
07 May 2020
The Cascade Transformer: an Application for Efficient Answer Sentence Selection
Luca Soldaini
Alessandro Moschitti
90
44
0
05 May 2020
ESG2Risk: A Deep Learning Framework from ESG News to Stock Volatility Prediction
Tian Guo
N. Jamet
Valentin Betrix
Louis-Alexandre Piquet
E. Hauptmann
AIFin
42
31
0
05 May 2020
Establishing Baselines for Text Classification in Low-Resource Languages
Jan Christian Blaise Cruz
C. Cheng
99
38
0
05 May 2020
ImpactCite: An XLNet-based method for Citation Impact Analysis
Dominique Mercier
Syed Tahseen Raza Rizvi
Vikas Rajashekar
Andreas Dengel
Sheraz Ahmed
52
16
0
05 May 2020
CAiRE-COVID: A Question Answering and Query-focused Multi-Document Summarization System for COVID-19 Scholarly Information Management
Dan Su
Yan Xu
Tiezheng Yu
Farhad Bin Siddique
Elham J. Barezi
Pascale Fung
RALM
48
31
0
04 May 2020
Physical reservoir computing -- An introductory perspective
Kohei Nakajima
86
313
0
03 May 2020
Previous
1
2
3
...
54
55
56
57
58
59
Next