Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.11942
Cited By
v1
v2
v3
v4
v5
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,935 papers shown
Title
Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining
Qi Zhang
Tianqi Du
Haotian Huang
Yifei Wang
Yisen Wang
71
5
0
01 Jul 2024
Large Language Model Enhanced Knowledge Representation Learning: A Survey
Xin Wang
Zirui Chen
Haofen Wang
Leong Hou U
Zhao Li
Wenbin Guo
KELM
213
3
0
01 Jul 2024
FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis
Yinlin Guo
Yening Lv
Jinqiao Dou
Yan Zhang
Yuehai Wang
61
0
0
30 Jun 2024
LegalTurk Optimized BERT for Multi-Label Text Classification and NER
Farnaz Zeidi
Mehmet Fatih Amasyali
Çiğdem Erol
VLM
58
2
0
30 Jun 2024
"I understand why I got this grade": Automatic Short Answer Grading with Feedback
Dishank Aggarwal
Pushpak Bhattacharyya
Bhaskaran Raman
Pushpak Bhattacharyya
43
4
0
30 Jun 2024
BioMNER: A Dataset for Biomedical Method Entity Recognition
Chen Tang
Bohao Yang
Kun Zhao
Bo Lv
Chenghao Xiao
Frank Guerin
Chenghua Lin
63
0
0
28 Jun 2024
Protein Representation Learning with Sequence Information Embedding: Does it Always Lead to a Better Performance?
Y. Tan
Lirong Zheng
Bozitao Zhong
Liang Hong
Bingxin Zhou
62
5
0
28 Jun 2024
When Search Engine Services meet Large Language Models: Visions and Challenges
Haoyi Xiong
Jiang Bian
Yuchen Li
Xuhong Li
Jundong Li
Shuaiqiang Wang
Dawei Yin
Sumi Helal
135
36
0
28 Jun 2024
Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Ali Khaleghi Rahimian
Manish Kumar Govind
Subhajit Maity
Dominick Reilly
Christian Kummerle
Srijan Das
A. Dutta
78
1
0
27 Jun 2024
The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning
Shaobo Cui
Zhijing Jin
Bernhard Schölkopf
Boi Faltings
CML
LRM
85
4
0
27 Jun 2024
Clustering in pure-attention hardmax transformers and its role in sentiment analysis
Albert Alcalde
Giovanni Fantuzzi
Enrique Zuazua
91
4
0
26 Jun 2024
Unveiling and Controlling Anomalous Attention Distribution in Transformers
Ruiqing Yan
Xingbo Du
Haoyu Deng
Linghan Zheng
Qiuzhuang Sun
Jifang Hu
Yuhang Shao
Penghao Jiang
Jinrong Jiang
Lian Zhao
62
1
0
26 Jun 2024
ViANLI: Adversarial Natural Language Inference for Vietnamese
Tin Van Huynh
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
64
0
0
25 Jun 2024
Are there identifiable structural parts in the sentence embedding whole?
Vivi Nastase
Paola Merlo
63
3
0
24 Jun 2024
Large Vocabulary Size Improves Large Language Models
Sho Takase
Ryokan Ri
Shun Kiyono
Takuya Kato
133
4
0
24 Jun 2024
Evaluating the Effectiveness of the Foundational Models for Q&A Classification in Mental Health care
Hassan Alhuzali
Ashwag Alasmari
AI4MH
79
2
0
23 Jun 2024
Intrinsic Dimension Correlation: uncovering nonlinear connections in multimodal representations
Lorenzo Basile
Santiago Acevedo
Luca Bortolussi
Fabio Anselmi
Alex Rodriguez
86
4
0
22 Jun 2024
Brain-Like Language Processing via a Shallow Untrained Multihead Attention Network
Badr AlKhamissi
Greta Tuckute
Antoine Bosselut
Martin Schrimpf
126
6
0
21 Jun 2024
Text Serialization and Their Relationship with the Conventional Paradigms of Tabular Machine Learning
Kyoka Ono
Simon A. Lee
LMTD
52
8
0
19 Jun 2024
Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation
Branislav Pecher
Ján Cegin
Róbert Belanec
Jakub Simko
Ivan Srba
Maria Bielikova
83
1
0
18 Jun 2024
QueerBench: Quantifying Discrimination in Language Models Toward Queer Identities
Mae Sosto
Alberto Barrón-Cedeño
63
4
0
18 Jun 2024
TroL: Traversal of Layers for Large Language and Vision Models
Byung-Kwan Lee
Sangyun Chung
Chae Won Kim
Beomchan Park
Yong Man Ro
111
7
0
18 Jun 2024
CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis
Saranya Venkatraman
Nafis Irtiza Tripto
Dongwon Lee
156
14
0
18 Jun 2024
A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models
Haopeng Zhang
Philip S. Yu
Jiawei Zhang
128
27
0
17 Jun 2024
Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars
Damien Sileo
LRM
ReLM
100
4
0
16 Jun 2024
Improving Large Models with Small models: Lower Costs and Better Performance
Dong Chen
Shuo Zhang
Yueting Zhuang
Siliang Tang
Qidong Liu
Hua Wang
Mingliang Xu
96
6
0
15 Jun 2024
Adversarial Evasion Attack Efficiency against Large Language Models
João Vitorino
Eva Maia
Isabel Praça
AAML
72
2
0
12 Jun 2024
Defining and Detecting Vulnerability in Human Evaluation Guidelines: A Preliminary Study Towards Reliable NLG Evaluation
Jie Ruan
Wenqing Wang
Xiaojun Wan
AAML
ELM
80
6
0
12 Jun 2024
HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question Answering using LLMs
Pranoy Panda
Ankush Agarwal
Chaitanya Devaguptapu
Manohar Kaul
Prathosh A P
RALM
104
13
0
10 Jun 2024
Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge
Rui Liu
Zening Ma
SSL
111
1
0
10 Jun 2024
Gentle-CLIP: Exploring Aligned Semantic In Low-Quality Multimodal Data With Soft Alignment
Zijia Song
Z. Zang
Yelin Wang
Guozheng Yang
Jiangbin Zheng
Kaicheng Yu
Wanyu Chen
Stan Z. Li
75
1
0
09 Jun 2024
Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions
Cheng Tan
Dongxin Lyu
Siyuan Li
Zhangyang Gao
Jingxuan Wei
Siqi Ma
Zicheng Liu
Stan Z. Li
LLMAG
81
13
0
09 Jun 2024
Automata Extraction from Transformers
Yihao Zhang
Zeming Wei
Meng Sun
AI4CE
78
1
0
08 Jun 2024
Integrating Text and Image Pre-training for Multi-modal Algorithmic Reasoning
Zijian Zhang
Wei Liu
100
0
0
08 Jun 2024
BERTs are Generative In-Context Learners
David Samuel
85
8
0
07 Jun 2024
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs
Lingchen Meng
Jianwei Yang
Rui Tian
Xiyang Dai
Zuxuan Wu
Jianfeng Gao
Yu-Gang Jiang
VLM
88
9
0
06 Jun 2024
Pre-trained Transformer Uncovers Meaningful Patterns in Human Mobility Data
Alameen Najjar
69
0
0
06 Jun 2024
A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions
Lei Liu
Xiaoyan Yang
Junchi Lei
Xiaoyang Liu
Yue Shen
...
Peng Wei
Jinjie Gu
Zhixuan Chu
Zhan Qin
Kui Ren
LM&MA
AILaw
100
18
0
06 Jun 2024
RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization
Jinge Wu
Abul Hasan
Honghan Wu
33
1
0
05 Jun 2024
Language Model Can Do Knowledge Tracing: Simple but Effective Method to Integrate Language Model and Knowledge Tracing Task
Unggi Lee
Jiyeong Bae
Dohee Kim
Sookbun Lee
Jaekwon Park
Taekyung Ahn
Gunho Lee
Damji Stratton
Hyeoncheol Kim
AI4Ed
KELM
63
12
0
05 Jun 2024
Using Self-supervised Learning Can Improve Model Fairness
Sofia Yfantidou
Dimitris Spathis
Marios Constantinides
Athena Vakali
Daniele Quercia
F. Kawsar
126
4
0
04 Jun 2024
Robust Interaction-based Relevance Modeling for Online E-Commerce and LLM-based Retrieval
Ben Chen
Huangyu Dai
Xiang Ma
Wen Jiang
Wei Ning
32
0
0
04 Jun 2024
Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models
Jiexin Wang
Adam Jatowt
Yi Cai
AI4CE
96
1
0
04 Jun 2024
It's a Feature, Not a Bug: Measuring Creative Fluidity in Image Generators
Aditi Ramaswamy
Melane Navaratnarajah
Hana Chockler
EGVM
52
0
0
03 Jun 2024
Reward-based Input Construction for Cross-document Relation Extraction
Byeonghu Na
Suhyeon Jo
Yeongmin Kim
Il-Chul Moon
49
2
0
31 May 2024
GAMedX: Generative AI-based Medical Entity Data Extractor Using Large Language Models
Mohammed-Khalil Ghali
Abdelrahman Farrag
Hajar Sakai
Hicham El Baz
Yu Jin
Sarah Lam
LM&MA
MedIm
81
9
0
31 May 2024
Entangled Relations: Leveraging NLI and Meta-analysis to Enhance Biomedical Relation Extraction
William Hogan
Jingbo Shang
93
0
0
31 May 2024
Unlocking the Potential of Large Language Models for Clinical Text Anonymization: A Comparative Study
David Pissarra
Isabel Curioso
João Alveira
Duarte Pereira
Bruno Ribeiro
Tomas Souper
Vasco Gomes
A. Carreiro
Vitor Rolla
46
4
0
29 May 2024
On the Role of Attention Masks and LayerNorm in Transformers
Xinyi Wu
A. Ajorlou
Yifei Wang
Stefanie Jegelka
Ali Jadbabaie
98
12
0
29 May 2024
Transformers Can Do Arithmetic with the Right Embeddings
Sean McLeish
Arpit Bansal
Alex Stein
Neel Jain
John Kirchenbauer
...
B. Kailkhura
A. Bhatele
Jonas Geiping
Avi Schwarzschild
Tom Goldstein
78
37
0
27 May 2024
Previous
1
2
3
...
5
6
7
...
57
58
59
Next