1909.11942
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,935 papers shown
Title
Description-Enhanced Label Embedding Contrastive Learning for Text Classification
Kun Zhang
Le Wu
Guangyi Lv
Enhong Chen
Shulan Ruan
Jing Liu
Qing Cui
Jun Zhou
Meng Wang
VLM
52
10
0
15 Jun 2023
Understanding Privacy Over-collection in WeChat Sub-app Ecosystem
Xiaohan Zhang
Yang Wang
Xin Zhang
Ziqi Huang
Lei Zhang
Min Yang
22
5
0
14 Jun 2023
Unifying Large Language Models and Knowledge Graphs: A Roadmap
Shirui Pan
Linhao Luo
Yufei Wang
Chen Chen
Jiapu Wang
Xindong Wu
KELM
160
787
0
14 Jun 2023
SqueezeLLM: Dense-and-Sparse Quantization
Sehoon Kim
Coleman Hooper
A. Gholami
Zhen Dong
Xiuyu Li
Sheng Shen
Michael W. Mahoney
Kurt Keutzer
MQ
150
198
0
13 Jun 2023
Recurrent Attention Networks for Long-text Modeling
Xianming Li
Zongxi Li
Xiaotian Luo
Haoran Xie
Xing Lee
Yingbin Zhao
Fu Lee Wang
Qing Li
RALM
92
15
0
12 Jun 2023
QUERT: Continual Pre-training of Language Model for Query Understanding in Travel Domain Search
Jian Xie
Yidan Liang
Jingping Liu
Yanghua Xiao
Baohua Wu
Shenghua Ni
VLM
LRM
90
9
0
11 Jun 2023
Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method
Shicheng Tan
Weng Lam Tam
Yuanchun Wang
Wenwen Gong
Shuo Zhao
Peng Zhang
Jie Tang
VLM
49
1
0
11 Jun 2023
RoBERTweet: A BERT Language Model for Romanian Tweets
Iulian-Marius Tăiatu
Andrei-Marius Avram
Dumitru-Clementin Cercel
Florin-Catalin Pop
40
1
0
11 Jun 2023
Enhancing Low Resource NER Using Assisting Language And Transfer Learning
Maithili Sabane
Aparna Ranade
Onkar Litake
Parth Patil
Raviraj Joshi
Dipali M. Kadam
64
5
0
10 Jun 2023
Leveraging Language Identification to Enhance Code-Mixed Text Classification
Gauri Takawane
Abhishek Phaltankar
Varad Patwardhan
Aryan Patil
Raviraj Joshi
Mukta S. Takalikar
72
4
0
08 Jun 2023
Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning
Jaehyung Kim
Jinwoo Shin
Dongyeop Kang
64
2
0
08 Jun 2023
MobileNMT: Enabling Translation in 15MB and 30ms
Ye Lin
Xiaohui Wang
Zhexi Zhang
Mingxuan Wang
Tong Xiao
Jingbo Zhu
MQ
63
2
0
07 Jun 2023
Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation
Xiusi Chen
Yu Zhang
Jinliang Deng
Jyun-Yu Jiang
Wei Wang
62
12
0
07 Jun 2023
Randomized Schur Complement Views for Graph Contrastive Learning
Vignesh Kothapalli
117
2
0
06 Jun 2023
Causal interventions expose implicit situation models for commonsense language understanding
Takateru Yamakoshi
James L. McClelland
A. Goldberg
Robert D. Hawkins
104
6
0
06 Jun 2023
On the Difference of BERT-style and CLIP-style Text Encoders
Zhihong Chen
Guiming Hardy Chen
Shizhe Diao
Xiang Wan
Benyou Wang
VLM
67
19
0
06 Jun 2023
CUE: An Uncertainty Interpretation Framework for Text Classifiers Built on Pre-Trained Language Models
Jiazheng Li
Zhaoyue Sun
Bin Liang
Lin Gui
Yulan He
79
2
0
06 Jun 2023
Subgraph Networks Based Contrastive Learning
Jinhuan Wang
Jiafei Shao
Zeyu Wang
Shanqing Yu
Qi Xuan
Xiaoniu Yang
78
1
0
06 Jun 2023
Using Sequences of Life-events to Predict Human Lives
Germans Savcisens
Tina Eliassi-Rad
L. K. Hansen
L. Mortensen
Lau Lilleholt
Anna Rogers
Ingo Zettler
Sune Lehmann
AI4TS
94
46
0
05 Jun 2023
Probing Physical Reasoning with Counter-Commonsense Context
Kazushi Kondo
Saku Sugawara
Akiko Aizawa
LRM
74
4
0
04 Jun 2023
Auto-GPT for Online Decision Making: Benchmarks and Additional Opinions
Hui Yang
Sifu Yue
Yunzhong He
RALM
72
172
0
04 Jun 2023
Improving Generalization in Task-oriented Dialogues with Workflows and Action Plans
Stefania Raimondo
C. Pal
Xiaotian Liu
David Vazquez
Héctor Palacios
48
2
0
02 Jun 2023
Centered Self-Attention Layers
Ameen Ali
Tomer Galanti
Lior Wolf
140
8
0
02 Jun 2023
Data-Efficient French Language Modeling with CamemBERTa
Wissam Antoun
Benoît Sagot
Djamé Seddah
50
7
0
02 Jun 2023
Unsupervised Paraphrasing of Multiword Expressions
Takashi Wada
Yuji Matsumoto
Timothy Baldwin
Jey Han Lau
66
0
0
02 Jun 2023
Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition
H. Haresamudram
Irfan Essa
Thomas Ploetz
98
8
0
01 Jun 2023
Topic-Guided Sampling For Data-Efficient Multi-Domain Stance Detection
Erik Arakelyan
Arnav Arora
Isabelle Augenstein
56
10
0
01 Jun 2023
CL-MRI: Self-Supervised Contrastive Learning to Improve the Accuracy of Undersampled MRI Reconstruction
Mevan Ekanayake
Zhiwen Chen
Mehrtash Harandi
Gary Egan
Zhaolin Chen
81
3
0
01 Jun 2023
Attention-Based Methods For Audio Question Answering
Parthasaarathy Sudarsanam
Tuomas Virtanen
68
3
0
31 May 2023
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech
L. T. Nguyen
Thinh-Le-Gia Pham
Dat Quoc Nguyen
98
14
0
31 May 2023
Assessing Word Importance Using Models Trained for Semantic Tasks
Dávid Javorský
Ondrej Bojar
François Yvon
37
2
0
31 May 2023
What does the Failure to Reason with "Respectively" in Zero/Few-Shot Settings Tell Us about Language Models?
Ruixiang Cui
Seolhwa Lee
Daniel Hershcovich
Anders Søgaard
50
2
0
31 May 2023
Stable Anisotropic Regularization
William Rudman
Carsten Eickhoff
83
6
0
30 May 2023
PreQuant: A Task-agnostic Quantization Approach for Pre-trained Language Models
Zhuocheng Gong
Jiahao Liu
Qifan Wang
Yang Yang
Jingang Wang
Wei Wu
Yunsen Xian
Dongyan Zhao
Rui Yan
MQ
70
5
0
30 May 2023
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
Md Tahmid Rahman Laskar
M Saiful Bari
Mizanur Rahman
Md Amran Hossen Bhuiyan
Shafiq Joty
J. Huang
LM&MA
ELM
ALM
125
193
0
29 May 2023
Whitening-based Contrastive Learning of Sentence Embeddings
Wenjie Zhuo
Yifan Sun
Xiaohan Wang
Linchao Zhu
Yezhou Yang
59
21
0
28 May 2023
Tri-level Joint Natural Language Understanding for Multi-turn Conversational Datasets
H. Weld
Sijia Hu
Siqu Long
Josiah Poon
S. Han
51
1
0
28 May 2023
One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning
Guangtao Zeng
Peiyuan Zhang
Wei Lu
95
22
0
28 May 2023
AI Coach Assist: An Automated Approach for Call Recommendation in Contact Centers for Agent Coaching
Md Tahmid Rahman Laskar
Cheng Chen
Xue-Yong Fu
M. Azizi
Shashi Bhushan
Simon Corston-Oliver
54
2
0
28 May 2023
A Match Made in Heaven: A Multi-task Framework for Hyperbole and Metaphor Detection
Naveen Badathala
Abisek Rajakumar Kalarani
Tejpalsingh Siledar
P. Bhattacharyya
49
12
0
27 May 2023
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers
Dachuan Shi
Chaofan Tao
Anyi Rao
Zhendong Yang
Chun Yuan
Jiaqi Wang
VLM
130
23
0
27 May 2023
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks
Kai Zhang
Jun Yu
Eashan Adhikarla
Rong Zhou
Zhilin Yan
...
Xun Chen
Yong Chen
Quanzheng Li
Hongfang Liu
Lichao Sun
LM&MA
MedIm
103
185
0
26 May 2023
NormBank: A Knowledge Bank of Situational Social Norms
Caleb Ziems
Jane Dwivedi-Yu
Yi-Chia Wang
A. Halevy
Diyi Yang
109
45
0
26 May 2023
Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model
Yibo Miao
Hongcheng Gao
Hao Zhang
Zhijie Deng
DeLMO
78
20
0
26 May 2023
Don't Retrain, Just Rewrite: Countering Adversarial Perturbations by Rewriting Text
Ashim Gupta
Carter Blum
Temma Choji
Yingjie Fei
Shalin S Shah
Alakananda Vempala
Vivek Srikumar
AAML
62
9
0
25 May 2023
Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data
Aryan Patil
Varad Patwardhan
Abhishek Phaltankar
Gauri Takawane
Raviraj Joshi
76
12
0
25 May 2023
Exploring Automatically Perturbed Natural Language Explanations in Relation Extraction
Wanyun Cui
Xingran Chen
LRM
AAML
63
0
0
24 May 2023
Context-Aware Transformer Pre-Training for Answer Sentence Selection
Luca Di Liello
Siddhant Garg
Alessandro Moschitti
71
4
0
24 May 2023
Dynamic Masking Rate Schedules for MLM Pretraining
Zachary Ankner
Naomi Saphra
Davis W. Blalock
Jonathan Frankle
Matthew L. Leavitt
101
8
0
24 May 2023
SETI: Systematicity Evaluation of Textual Inference
Xiyan Fu
Anette Frank
LRM
46
5
0
24 May 2023