Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.11942
Cited By
v1
v2
v3
v4
v5
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,935 papers shown
Title
Language Models are Realistic Tabular Data Generators
V. Borisov
Kathrin Seßler
Tobias Leemann
Martin Pawelczyk
Gjergji Kasneci
LMTD
117
255
0
12 Oct 2022
Task Compass: Scaling Multi-task Pre-training with Task Prefix
Zhuosheng Zhang
Shuohang Wang
Yichong Xu
Yuwei Fang
Wenhao Yu
Yang Liu
Han Zhao
Chenguang Zhu
Michael Zeng
SSL
LRM
70
16
0
12 Oct 2022
Pruning Pre-trained Language Models Without Fine-Tuning
Ting Jiang
Deqing Wang
Fuzhen Zhuang
Ruobing Xie
Feng Xia
100
10
0
12 Oct 2022
Text-Derived Knowledge Helps Vision: A Simple Cross-modal Distillation for Video-based Action Anticipation
Sayontan Ghosh
Tanvi Aggarwal
Minh Hoai
Niranjan Balasubramanian
VLM
88
4
0
12 Oct 2022
SEAL : Interactive Tool for Systematic Error Analysis and Labeling
Nazneen Rajani
Weixin Liang
Lingjiao Chen
Margaret Mitchell
James Zou
96
16
0
11 Oct 2022
Voteñ'Rank: Revision of Benchmarking with Social Choice Theory
Mark Rofin
Vladislav Mikhailov
Mikhail Florinskiy
A. Kravchenko
E. Tutubalina
Tatiana Shavrina
Daniel Karabekyan
Ekaterina Artemova
87
11
0
11 Oct 2022
Enriching Biomedical Knowledge for Low-resource Language Through Large-Scale Translation
Long Phan
Tai Dang
H. Tran
Trieu H. Trinh
Vy Phan
Lam D. Chau
Minh-Thang Luong
56
8
0
11 Oct 2022
Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models
Qihuang Zhong
Liang Ding
Li Shen
Peng Mi
Juhua Liu
Bo Du
Dacheng Tao
AAML
90
51
0
11 Oct 2022
Instance Regularization for Discriminative Language Model Pre-training
Zhuosheng Zhang
Hai Zhao
M. Zhou
95
1
0
11 Oct 2022
A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models
Yuanxin Liu
Fandong Meng
Zheng Lin
JiangNan Li
Peng Fu
Yanan Cao
Weiping Wang
Jie Zhou
87
6
0
11 Oct 2022
Extracting or Guessing? Improving Faithfulness of Event Temporal Relation Extraction
Haoyu Wang
Hongming Zhang
Yuqian Deng
Jacob R. Gardner
Dan Roth
Muhao Chen
68
21
0
10 Oct 2022
Better Pre-Training by Reducing Representation Confusion
Haojie Zhang
Mingfei Liang
Ruobing Xie
Zhen Sun
Bo Zhang
Leyu Lin
47
2
0
09 Oct 2022
Understanding HTML with Large Language Models
Izzeddin Gur
Ofir Nachum
Yingjie Miao
Mustafa Safdari
Austin Huang
Aakanksha Chowdhery
Sharan Narang
Noah Fiedel
Aleksandra Faust
AI4CE
225
71
0
08 Oct 2022
Hate Speech and Offensive Language Detection in Bengali
Mithun Das
Somnath Banerjee
Punyajoy Saha
Animesh Mukherjee
119
30
0
07 Oct 2022
DABERT: Dual Attention Enhanced BERT for Semantic Matching
Sirui Wang
Di Liang
Jian Song
Yun Li
Wei Wu
88
18
0
07 Oct 2022
Improving Large-scale Paraphrase Acquisition and Generation
Yao Dou
Chao Jiang
Wei Xu
99
9
0
06 Oct 2022
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs
Yujia Zhai
Chengquan Jiang
Leyuan Wang
Xiaoying Jia
Shang Zhang
Zizhong Chen
Xin Liu
Yibo Zhu
144
52
0
06 Oct 2022
Detecting Narrative Elements in Informational Text
Effi Levi
Guy Mor
Tamir Sheafer
Shaul R. Shenhav
439
7
0
06 Oct 2022
Debiasing isn't enough! -- On the Effectiveness of Debiasing MLMs and their Social Biases in Downstream Tasks
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
102
46
0
06 Oct 2022
WakeUpNet: A Mobile-Transformer based Framework for End-to-End Streaming Voice Trigger
Zixing Zhang
Thorin Farnsworth
Senling Lin
S. Karout
127
2
0
06 Oct 2022
XDoc: Unified Pre-training for Cross-Format Document Understanding
Jingye Chen
Tengchao Lv
Lei Cui
Changrong Zhang
Furu Wei
95
14
0
06 Oct 2022
Towards Better Semantic Understanding of Mobile Interfaces
Srinivas Sunkara
Maria Wang
Lijuan Liu
Gilles Baechler
Yu-Chung Hsiao
Jindong Chen
Chen
Abhanshu Sharma
James Stout
84
24
0
06 Oct 2022
COMPS: Conceptual Minimal Pair Sentences for testing Robust Property Knowledge and its Inheritance in Pre-trained Language Models
Kanishka Misra
Julia Taylor Rayz
Allyson Ettinger
132
10
0
05 Oct 2022
When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment
Zhijing Jin
Sydney Levine
Fernando Gonzalez
Ojasv Kamal
Maarten Sap
Mrinmaya Sachan
Rada Mihalcea
J. Tenenbaum
Bernhard Schölkopf
ELM
LRM
103
103
0
04 Oct 2022
Understanding Prior Bias and Choice Paralysis in Transformer-based Language Representation Models through Four Experimental Probes
Ke Shen
Mayank Kejriwal
81
4
0
03 Oct 2022
Improving Molecular Pretraining with Complementary Featurizations
Yanqiao Zhu
Dingshuo Chen
Yuanqi Du
Yingze Wang
Qiang Liu
Shu Wu
AI4CE
70
7
0
29 Sep 2022
Domain-aware Self-supervised Pre-training for Label-Efficient Meme Analysis
Shivam Sharma
Mohd Khizir Siddiqui
Md. Shad Akhtar
Tanmoy Chakraborty
SSL
38
5
0
29 Sep 2022
Bidirectional Language Models Are Also Few-shot Learners
Ajay Patel
Bryan Li
Mohammad Sadegh Rasooli
Noah Constant
Colin Raffel
Chris Callison-Burch
LRM
140
47
0
29 Sep 2022
TVLT: Textless Vision-Language Transformer
Zineng Tang
Jaemin Cho
Yixin Nie
Joey Tianyi Zhou
VLM
137
31
0
28 Sep 2022
Using contradictions improves question answering systems
Étienne Fortier-Dubois
Domenic Rosati
95
0
0
28 Sep 2022
Neural Network Panning: Screening the Optimal Sparse Network Before Training
Xiatao Kang
P. Li
Jiayi Yao
Chengxi Li
VLM
45
1
0
27 Sep 2022
Fast-FNet: Accelerating Transformer Encoder Models via Efficient Fourier Layers
Nurullah Sevim
Ege Ozan Özyedek
Furkan Şahinuç
Aykut Koç
85
12
0
26 Sep 2022
Do ever larger octopi still amplify reporting biases? Evidence from judgments of typical colour
Fangyu Liu
Julian Martin Eisenschlos
Jeremy R. Cole
Nigel Collier
90
4
0
26 Sep 2022
Dordis: Efficient Federated Learning with Dropout-Resilient Differential Privacy
Zhifeng Jiang
Wei Wang
Ruichuan Chen
108
8
0
26 Sep 2022
Application of Deep Learning in Generating Structured Radiology Reports: A Transformer-Based Technique
Seyed Ali Reza Moezzi
Abdolrahman Ghaedi
M. Rahmanian
Seyedeh Zahra Mousavi
A. Sami
MedIm
33
9
0
25 Sep 2022
Multiple-Choice Question Generation: Towards an Automated Assessment Framework
Vatsal Raina
Mark Gales
AI4Ed
ELM
75
34
0
23 Sep 2022
Adaptation of domain-specific transformer models with text oversampling for sentiment analysis of social media posts on Covid-19 vaccines
Anmol Bansal
Arjun Choudhry
Anubhav Sharma
Seba Susan
MedIm
66
4
0
22 Sep 2022
EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models
Perry Lam
Huayun Zhang
Nancy F. Chen
Berrak Sisman
29
2
0
22 Sep 2022
FV2ES: A Fully End2End Multimodal System for Fast Yet Effective Video Emotion Recognition Inference
Qinglan Wei
Xu-Juan Huang
Yuan Zhang
78
15
0
21 Sep 2022
An Efficient End-to-End Transformer with Progressive Tri-modal Attention for Multi-modal Emotion Recognition
Yang Wu
Pai Peng
Zhenyu Zhang
Yanyan Zhao
Bing Qin
43
1
0
20 Sep 2022
Joint Language Semantic and Structure Embedding for Knowledge Graph Completion
Jianhao Shen
Chenguang Wang
Linyuan Gong
Dawn Song
119
32
0
19 Sep 2022
Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Ye Bai
Jie Li
W. Han
Hao Ni
Kaituo Xu
Zhuo Zhang
Cheng Yi
Xiaorui Wang
MoE
58
2
0
17 Sep 2022
Negation, Coordination, and Quantifiers in Contextualized Language Models
A. Kalouli
Rita Sevastjanova
C. Beck
Maribel Romero
88
12
0
16 Sep 2022
CNN-Trans-Enc: A CNN-Enhanced Transformer-Encoder On Top Of Static BERT representations for Document Classification
Charaf Eddine Benarab
Shenglin Gui
41
6
0
13 Sep 2022
Computational Sarcasm Analysis on Social Media: A Systematic Review
Faria Binte Kader
Nafisa Hossain Nujat
Tasmia Binte Sogir
Mohsinul Kabir
H. Mahmud
Md. Kamrul Hasan
50
5
0
13 Sep 2022
Multi-stage Distillation Framework for Cross-Lingual Semantic Similarity Matching
Kunbo Ding
Weijie Liu
Yuejian Fang
Zhe Zhao
Qi Ju
Xuefeng Yang
41
1
0
13 Sep 2022
DECK: Behavioral Tests to Improve Interpretability and Generalizability of BERT Models Detecting Depression from Text
Jekaterina Novikova
Ksenia Shkaruta
AI4MH
71
4
0
12 Sep 2022
Evaluation of Question Answering Systems: Complexity of judging a natural language
Amer Farea
Zhen Yang
Kien Duong
Nadeesha Perera
F. Emmert-Streib
ELM
57
3
0
10 Sep 2022
Adversarial Learning-based Stance Classifier for COVID-19-related Health Policies
Feng Xie
Zhong Zhang
Xuechen Zhao
Haiyang Wang
Jiaying Zou
Lei Tian
Bin Zhou
Yusong Tan
78
8
0
10 Sep 2022
Activity report analysis with automatic single or multispan answer extraction
R. Choudhary
A. Sridhar
Erik M. Visser
31
1
0
09 Sep 2022
Previous
1
2
3
...
25
26
27
...
57
58
59
Next