Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.11942
Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,913 papers shown
Title
PALRACE: Reading Comprehension Dataset with Human Data and Labeled Rationales
Jiajie Zou
Yuran Zhang
Peiqing Jin
Cheng Luo
Xunyi Pan
Nai Ding
FaML
29
5
0
23 Jun 2021
LegoFormer: Transformers for Block-by-Block Multi-view 3D Reconstruction
Farid Yagubbayli
Yida Wang
A. Tonioni
Federico Tombari
ViT
10
34
0
23 Jun 2021
LV-BERT: Exploiting Layer Variety for BERT
Weihao Yu
Zihang Jiang
Fei Chen
Qibin Hou
Jiashi Feng
MQ
29
0
0
22 Jun 2021
Towards Long-Form Video Understanding
Chaoxia Wu
Philipp Krahenbuhl
VLM
ViT
59
166
0
21 Jun 2021
Secure Distributed Training at Scale
Eduard A. Gorbunov
Alexander Borzunov
Michael Diskin
Max Ryabinin
FedML
31
15
0
21 Jun 2021
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning
Hao Tan
Jie Lei
Thomas Wolf
Joey Tianyi Zhou
39
66
0
21 Jun 2021
Iterative Network Pruning with Uncertainty Regularization for Lifelong Sentiment Classification
Binzong Geng
Min Yang
Fajie Yuan
Shupeng Wang
Xiang Ao
Ruifeng Xu
CLL
21
19
0
21 Jun 2021
Distributed Deep Learning in Open Collaborations
Michael Diskin
Alexey Bukhtiyarov
Max Ryabinin
Lucile Saulnier
Quentin Lhoest
...
Denis Mazur
Ilia Kobelev
Yacine Jernite
Thomas Wolf
Gennady Pekhimenko
FedML
41
54
0
18 Jun 2021
Anomaly Detection in Dynamic Graphs via Transformer
Yixin Liu
Shirui Pan
Yu Guang Wang
Fei Xiong
Liang Wang
Qingfeng Chen
V. C. Lee
34
92
0
18 Jun 2021
Application-driven Design Exploration for Dense Ferroelectric Embedded Non-volatile Memories
Mohammad Mehdi Sharifi
†∞ LillianPentecost
R. Rajaei
Arman Kazemi
Qiuwen Lou
...
David Brooks
Kai Ni
Sharon Hu
Michael Niemier
M. Donato
13
5
0
18 Jun 2021
Learning Knowledge Graph-based World Models of Textual Environments
Prithviraj Ammanabrolu
Mark O. Riedl
3DV
28
31
0
17 Jun 2021
Classifying vaccine sentiment tweets by modelling domain-specific representation and commonsense knowledge into context-aware attentive GRU
Usman Naseem
Matloob Khushi
Jinman Kim
A. Dunn
29
12
0
17 Jun 2021
Modeling Worlds in Text
Prithviraj Ammanabrolu
Mark O. Riedl
VGen
LM&Ro
19
14
0
17 Jun 2021
DravidianCodeMix: Sentiment Analysis and Offensive Language Identification Dataset for Dravidian Languages in Code-Mixed Text
Bharathi Raja Chakravarthi
R. Priyadharshini
Vigneshwaran Muralidaran
Navya Jose
Shardul Suryawanshi
E. Sherly
John P. Mccrae
27
105
0
17 Jun 2021
Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data
Haoming Jiang
Danqing Zhang
Tianyu Cao
Bing Yin
T. Zhao
NoLa
30
44
0
16 Jun 2021
Direction is what you need: Improving Word Embedding Compression in Large Language Models
Klaudia Bałazy
Mohammadreza Banaei
R. Lebret
Jacek Tabor
Karl Aberer
32
6
0
15 Jun 2021
CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
Ningyu Zhang
Mosha Chen
Zhen Bi
Xiaozhuan Liang
Lei Li
...
Jun Yan
Hongying Zan
Kunli Zhang
Buzhou Tang
Qingcai Chen
LM&MA
ELM
34
180
0
15 Jun 2021
Unsupervised Abstractive Opinion Summarization by Generating Sentences with Tree-Structured Topic Guidance
Masaru Isonuma
Junichiro Mori
Danushka Bollegala
Ichiro Sakata
21
27
0
15 Jun 2021
Incorporating Word Sense Disambiguation in Neural Language Models
Jan Philip Wahle
Terry Ruas
Norman Meuschke
Bela Gipp
9
10
0
15 Jun 2021
Bilateral Personalized Dialogue Generation with Contrastive Learning
Bin Li
Hanjun Deng
31
6
0
15 Jun 2021
Pre-Trained Models: Past, Present and Future
Xu Han
Zhengyan Zhang
Ning Ding
Yuxian Gu
Xiao Liu
...
Jie Tang
Ji-Rong Wen
Jinhui Yuan
Wayne Xin Zhao
Jun Zhu
AIFin
MQ
AI4MH
63
819
0
14 Jun 2021
InfoBehavior: Self-supervised Representation Learning for Ultra-long Behavior Sequence via Hierarchical Grouping
Runshi Liu
Pengda Qin
Yuhong Li
Weigao Wen
Dong Li
Kefeng Deng
Qiang Wu
AI4TS
15
0
0
13 Jun 2021
Can Transformer Language Models Predict Psychometric Properties?
Antonio Laverghetta
Animesh Nighojkar
Jamshidbek Mirzakhalov
John Licato
LM&MA
40
14
0
12 Jun 2021
Incorporating External POS Tagger for Punctuation Restoration
Ning Shi
Wei Wang
Wei Ping
Jinfeng Li
Xiangyu Liu
Zhouhan Lin
KELM
14
10
0
12 Jun 2021
A Discussion on Building Practical NLP Leaderboards: The Case of Machine Translation
Sebastin Santy
Prasanta Bhattacharya
LLMAG
33
2
0
11 Jun 2021
Hybrid Generative-Contrastive Representation Learning
Saehoon Kim
Sungwoong Kim
Juho Lee
SSL
22
11
0
11 Jun 2021
RefBERT: Compressing BERT by Referencing to Pre-computed Representations
Xinyi Wang
Haiqing Yang
Liang Zhao
Yang Mo
Jianping Shen
MQ
30
3
0
11 Jun 2021
Deciphering Implicit Hate: Evaluating Automated Detection Algorithms for Multimodal Hate
Austin Botelho
Bertie Vidgen
Scott A. Hale
24
8
0
10 Jun 2021
A Semi-supervised Multi-task Learning Approach to Classify Customer Contact Intents
Li Dong
Matthew C. Spencer
Amir Biagi
27
3
0
10 Jun 2021
GroupBERT: Enhanced Transformer Architecture with Efficient Grouped Structures
Ivan Chelombiev
Daniel Justus
Douglas Orr
A. Dietrich
Frithjof Gressmann
A. Koliousis
Carlo Luschi
27
5
0
10 Jun 2021
CAT: Cross Attention in Vision Transformer
Hezheng Lin
Xingyi Cheng
Xiangyu Wu
Fan Yang
Dong Shen
Zhongyuan Wang
Qing Song
Wei Yuan
ViT
35
149
0
10 Jun 2021
Linguistically Informed Masking for Representation Learning in the Patent Domain
Sophia Althammer
Mark Buckley
Sebastian Hofstatter
Allan Hanbury
45
11
0
10 Jun 2021
Low-Dimensional Structure in the Space of Language Representations is Reflected in Brain Responses
Richard Antonello
Javier S. Turek
Vy A. Vo
Alexander G. Huth
37
38
0
09 Jun 2021
Eye of the Beholder: Improved Relation Generalization for Text-based Reinforcement Learning Agents
K. Murugesan
Subhajit Chaudhury
Kartik Talamadupula
41
5
0
09 Jun 2021
Cocktail: Leveraging Ensemble Learning for Optimized Model Serving in Public Cloud
Jashwant Raj Gunasekaran
Cyan Subhra Mishra
P. Thinakaran
M. Kandemir
Chita R. Das
13
3
0
09 Jun 2021
Bayesian Attention Belief Networks
Shujian Zhang
Xinjie Fan
Bo Chen
Mingyuan Zhou
BDL
32
30
0
09 Jun 2021
Key Information Extraction From Documents: Evaluation And Generator
Oliver Bensch
Mirela C. Popa
Constantin Spille
14
13
0
09 Jun 2021
Automatic Sexism Detection with Multilingual Transformer Models
Mina Schütz
Jaqueline Boeck
Daria Liakhovets
D. Slijepcevic
Armin Kirchknopf
Manuel Hecht
Johannes Bogensperger
S. Schlarb
Alexander Schindler
Matthias Zeppelzauer
14
27
0
09 Jun 2021
TIMEDIAL: Temporal Commonsense Reasoning in Dialog
Lianhui Qin
Aditya Gupta
Shyam Upadhyay
Luheng He
Yejin Choi
Manaal Faruqui
LRM
31
65
0
08 Jun 2021
Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks
Avi Schwarzschild
Eitan Borgnia
Arjun Gupta
Furong Huang
U. Vishkin
Micah Goldblum
Tom Goldstein
24
75
0
08 Jun 2021
CLTR: An End-to-End, Transformer-Based System for Cell Level Table Retrieval and Table Question Answering
FeiFei Pan
Mustafa Canim
Michael R. Glass
A. Gliozzo
Peter Fox
VLM
LMTD
26
26
0
08 Jun 2021
Staircase Attention for Recurrent Processing of Sequences
Da Ju
Stephen Roller
Sainbayar Sukhbaatar
Jason Weston
32
11
0
08 Jun 2021
Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages Study
Yash Khemchandani
Sarvesh Mehtani
Vaidehi Patil
Abhijeet Awasthi
Partha P. Talukdar
Sunita Sarawagi
38
32
0
07 Jun 2021
Measuring and Improving BERT's Mathematical Abilities by Predicting the Order of Reasoning
Piotr Pikekos
Henryk Michalewski
Mateusz Malinowski
35
28
0
07 Jun 2021
Refiner: Refining Self-attention for Vision Transformers
Daquan Zhou
Yujun Shi
Bingyi Kang
Weihao Yu
Zihang Jiang
Yuan Li
Xiaojie Jin
Qibin Hou
Jiashi Feng
ViT
29
59
0
07 Jun 2021
PROST: Physical Reasoning of Objects through Space and Time
Stéphane Aroca-Ouellette
Cory Paik
Alessandro Roncone
Katharina Kann
LRM
19
47
0
07 Jun 2021
RoSearch: Search for Robust Student Architectures When Distilling Pre-trained Language Models
Xin Guo
Jianlei Yang
Haoyi Zhou
Xucheng Ye
Jianxin Li
46
1
0
07 Jun 2021
Relative Importance in Sentence Processing
Nora Hollenstein
Lisa Beinborn
FAtt
33
30
0
07 Jun 2021
ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias
Yufei Xu
Qiming Zhang
Jing Zhang
Dacheng Tao
ViT
65
330
0
07 Jun 2021
Understand and Improve Contrastive Learning Methods for Visual Representation: A Review
Ran Liu
SSL
29
12
0
06 Jun 2021
Previous
1
2
3
...
40
41
42
...
57
58
59
Next