1909.11942
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,935 papers shown
Learning Long-form Video Prior via Generative Pre-Training
Jinheng Xie
Jiajun Feng
Zhaoxu Tian
Kevin Qinghong Lin
Yawen Huang
...
Nanxu Gong
Xu Zuo
Jiaqi Yang
Yefeng Zheng
Mike Zheng Shou
69
6
0
24 Apr 2024
A Comprehensive Survey on Evaluating Large Language Model Applications in the Medical Industry
Yining Huang
Keke Tang
Meilian Chen
Boyuan Wang
ELM
LM&MA
72
15
0
24 Apr 2024
A Survey of Decomposition-Based Evolutionary Multi-Objective Optimization: Part II -- A Data Science Perspective
Mingyu Huang
Ke Li
57
1
0
22 Apr 2024
Embarrassingly Simple Unsupervised Aspect Based Sentiment Tuple Extraction
Kevin Scaria
Abyn Scaria
Ben Scaria
CoGe
51
0
0
21 Apr 2024
PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical Structure
Feiqi Cao
S. Han
Hyunsuk Chung
53
0
0
21 Apr 2024
Explanation based Bias Decoupling Regularization for Natural Language Inference
Jianxiang Zang
Hui Liu
59
1
0
20 Apr 2024
Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge
Khuyagbaatar Batsuren
Ekaterina Vylomova
Verna Dankers
Tsetsuukhei Delgerbaatar
Omri Uzan
Yuval Pinter
Gábor Bella
96
10
0
20 Apr 2024
Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment
Danqing Ma
Meng Wang
Ao Xiang
Zongqing Qi
Qin Yang
72
19
0
19 Apr 2024
EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction
Urchade Zaratiana
Nadi Tomeh
Yann Dauxais
Pierre Holat
Thierry Charnois
74
0
0
18 Apr 2024
GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction
Urchade Zaratiana
Nadi Tomeh
Niama El Khbir
Pierre Holat
Thierry Charnois
112
1
0
18 Apr 2024
Enhance Robustness of Language Models Against Variation Attack through Graph Integration
Ziteng Xiong
Lizhi Qing
Yangyang Kang
Jiawei Liu
Hongsong Li
Changlong Sun
Xiaozhong Liu
Wei Lu
54
2
0
18 Apr 2024
Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Representation Learning
Zhengyang Liang
Meiyu Liang
Wei Huang
Yawen Li
Zhe Xue
74
1
0
16 Apr 2024
Referring Flexible Image Restoration
Runwei Guan
Rongsheng Hu
Zhuhao Zhou
Tianlang Xue
Ka Lok Man
Jeremy S. Smith
Eng Gee Lim
Weiping Ding
Yutao Yue
76
0
0
16 Apr 2024
On the Effects of Fine-tuning Language Models for Text-Based Reinforcement Learning
Mauricio G. Gruppi
Soham Dan
K. Murugesan
Subhajit Chaudhury
LLMAG
32
0
0
15 Apr 2024
Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies
Benjue Weng
LM&MA
117
10
0
13 Apr 2024
VertAttack: Taking advantage of Text Classifiers' horizontal vision
Jonathan Rusert
AAML
105
1
0
12 Apr 2024
Emerging Property of Masked Token for Effective Pre-training
Hyesong Choi
Hunsang Lee
Seyoung Joung
Hyejin Park
Jiyeong Kim
Dongbo Min
87
10
0
12 Apr 2024
Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training
Hyesong Choi
Hyejin Park
Kwang Moo Yi
Sungmin Cha
Dongbo Min
105
9
0
12 Apr 2024
On Unified Prompt Tuning for Request Quality Assurance in Public Code Review
Xinyu Chen
Lin Li
Rui Zhang
Peng Liang
84
1
0
11 Apr 2024
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers
Longwei Zou
Qingyang Wang
Han Zhao
Jiangang Kong
Yi Yang
Yangdong Deng
98
0
0
10 Apr 2024
Dimensionality Reduction in Sentence Transformer Vector Databases with Fast Fourier Transform
Vitaly Bulgakov
Alec Segal
59
2
0
09 Apr 2024
AnchorAL: Computationally Efficient Active Learning for Large and Imbalanced Datasets
Pietro Lesci
Andreas Vlachos
143
4
0
08 Apr 2024
Chinese Sequence Labeling with Semi-Supervised Boundary-Aware Language Model Pre-training
Longhui Zhang
Dingkun Long
Meishan Zhang
Yanzhao Zhang
Pengjun Xie
Min Zhang
81
3
0
08 Apr 2024
OPSD: an Offensive Persian Social media Dataset and its baseline evaluations
M. Safayani
Amir Sartipi
Amir Hossein Ahmadi
Parniyan Jalali
Amir Hossein Mansouri
Mohammad Bisheh-Niasar
Zahra Pourbahman
23
0
0
08 Apr 2024
Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts
Weilin Cai
Juyong Jiang
Le Qin
Junwei Cui
Sunghun Kim
Jiayi Huang
180
10
0
07 Apr 2024
What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models
Busayo Awobade
Mardiyyah Oduwole
Steven Kolawole
75
1
0
06 Apr 2024
Order-Based Pre-training Strategies for Procedural Text Understanding
Abhilash Nandy
Yash Kulkarni
Pawan Goyal
Niloy Ganguly
126
4
0
06 Apr 2024
A Morphology-Based Investigation of Positional Encodings
Poulami Ghosh
Shikhar Vashishth
Raj Dabre
Pushpak Bhattacharyya
68
2
0
06 Apr 2024
Multi-modal Learning for WebAssembly Reverse Engineering
Hanxian Huang
Jishen Zhao
58
3
0
04 Apr 2024
Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased?
Vagrant Gautam
Eileen Bingert
D. Zhu
Anne Lauscher
Dietrich Klakow
75
8
0
04 Apr 2024
Revisiting subword tokenization: A case study on affixal negation in large language models
Thinh Hung Truong
Yulia Otmakhova
Karin Verspoor
Trevor Cohn
Timothy Baldwin
75
2
0
03 Apr 2024
Linear Attention Sequence Parallelism
Weigao Sun
Zhen Qin
Dong Li
Xuyang Shen
Yu Qiao
Yiran Zhong
146
2
0
03 Apr 2024
Semantic Augmentation in Images using Language
Sahiti Yerramilli
Jayant Sravan Tamarapalli
Tanmay Girish Kulkarni
Jonathan M Francis
Eric Nyberg
VLM
DiffM
52
0
0
02 Apr 2024
Deconstructing In-Context Learning: Understanding Prompts via Corruption
Namrata Shivagunde
Vladislav Lialin
Sherin Muckatira
Anna Rumshisky
83
3
0
02 Apr 2024
Green AI: Exploring Carbon Footprints, Mitigation Strategies, and Trade Offs in Large Language Model Training
Vivian Liu
Yiqiao Yin
110
23
0
01 Apr 2024
Efficient Prompting Methods for Large Language Models: A Survey
Kaiyan Chang
Songcheng Xu
Chenglong Wang
Yingfeng Luo
Tong Xiao
Jingbo Zhu
LRM
117
36
0
01 Apr 2024
Efficiently Distilling LLMs for Edge Applications
Achintya Kundu
Fabian Lim
Aaron Chew
L. Wynter
Penny Chong
Rhui Dih Lee
126
6
0
01 Apr 2024
CoUDA: Coherence Evaluation via Unified Data Augmentation
Dawei Zhu
Wenhao Wu
Yifan Song
Fangwei Zhu
Ziqiang Cao
Sujian Li
51
0
0
31 Mar 2024
Addressing Both Statistical and Causal Gender Fairness in NLP Models
Hannah Chen
Yangfeng Ji
David Evans
75
4
0
30 Mar 2024
A Comprehensive Study on NLP Data Augmentation for Hate Speech Detection: Legacy Methods, BERT, and LLMs
Md Saroar Jahan
Mourad Oussalah
D. Beddiar
Jhuma Kabir Mim
Nabil Arhab
104
6
0
30 Mar 2024
Classifying Conspiratorial Narratives At Scale: False Alarms and Erroneous Connections
Ahmad Diab
Rr. Nefriana
Yu-Ru Lin
65
7
0
29 Mar 2024
The Future of Combating Rumors? Retrieval, Discrimination, and Generation
Junhao Xu
Longdi Xian
Zening Liu
Mingliang Chen
Qiuyang Yin
Fenghua Song
63
2
0
29 Mar 2024
New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark
Nadege Alavoine
G. Laperriere
Christophe Servan
Sahar Ghannay
Sophie Rosset
VLM
108
1
0
28 Mar 2024
A Benchmark Evaluation of Clinical Named Entity Recognition in French
N. Bannour
Christophe Servan
Aurélie Névéol
Xavier Tannier
46
0
0
28 Mar 2024
mALBERT: Is a Compact Multilingual BERT Model Still Worth It?
Christophe Servan
Sahar Ghannay
Sophie Rosset
46
1
0
27 Mar 2024
GPTs and Language Barrier: A Cross-Lingual Legal QA Examination
Ha-Thanh Nguyen
Hiroaki Yamada
Ken Satoh
ELM
AILaw
52
0
0
26 Mar 2024
REFeREE: A REference-FREE Model-Based Metric for Text Simplification
Yichen Huang
Ekaterina Kochmar
76
2
0
26 Mar 2024
Opportunities and challenges in the application of large artificial intelligence models in radiology
Liangrui Pan
Zhenyu Zhao
Ying Lu
Kewei Tang
Liyong Fu
Qingchun Liang
Shaoliang Peng
LM&MA
MedIm
AI4CE
81
6
0
24 Mar 2024
VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding
Phong Nguyen-Thuan Do
Son Quoc Tran
Phu Gia Hoang
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
ELM
69
5
0
23 Mar 2024
Enhancing Traffic Incident Management with Large Language Models: A Hybrid Machine Learning Approach for Severity Classification
Artur Grigorev
Khaled Saleh
Yuming Ou
Adriana-Simona Mihaita
62
6
0
20 Mar 2024