Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.11942
Cited By
v1
v2
v3
v4
v5
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,935 papers shown
Title
A Survey on Prompting Techniques in LLMs
Prabin Bhandari
48
7
0
28 Nov 2023
Entity-Aspect-Opinion-Sentiment Quadruple Extraction for Fine-grained Sentiment Analysis
Dan Ma
Jun Xu
Zongyu Wang
Xuezhi Cao
Yunsen Xian
34
0
0
28 Nov 2023
Recognizing Conditional Causal Relationships about Emotions and Their Corresponding Conditions
Xinhong Chen
Zongxi Li
Yaowei Wang
Haoran Xie
Jianping Wang
Qing Li
40
0
0
28 Nov 2023
Leveraging deep active learning to identify low-resource mobility functioning information in public clinical notes
Tuan-Dung Le
Zhuqi Miao
Samuel Alvarado
Brittany Smith
William Paiva
Thanh Thieu
28
1
0
27 Nov 2023
C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing
Avigyan Bhattacharya
Mainak Singha
Ankit Jha
Biplab Banerjee
SSL
VLM
78
6
0
27 Nov 2023
A Comparative and Experimental Study on Automatic Question Answering Systems and its Robustness against Word Jumbling
Shashidhar Reddy Javaji
Haoran Hu
Sai Sameer Vennam
Vijaya Gajanan Buddhavarapu
18
0
0
27 Nov 2023
Probabilistic Transformer: A Probabilistic Dependency Model for Contextual Word Representation
Haoyi Wu
Kewei Tu
414
4
0
26 Nov 2023
General Phrase Debiaser: Debiasing Masked Language Models at a Multi-Token Level
Bingkang Shi
Xiaodan Zhang
Dehan Kong
Yulei Wu
Zongzhen Liu
Honglei Lyu
Longtao Huang
AI4CE
81
2
0
23 Nov 2023
A Multi-solution Study on GDPR AI-enabled Completeness Checking of DPAs
Muhammad Ilyas Azeem
Sallam Abualhaija
75
7
0
23 Nov 2023
Transformer-based Named Entity Recognition in Construction Supply Chain Risk Management in Australia
Milad Baghalzadeh Shishehgarkhaneh
R. Moehler
Yihai Fang
Amer A. Hijazi
Hamed Aboutorab
97
10
0
23 Nov 2023
Efficient Transformer Knowledge Distillation: A Performance Review
Nathan Brown
Ashton Williamson
Tahj Anderson
Logan Lawrence
VLM
50
5
0
22 Nov 2023
Looped Transformers are Better at Learning Learning Algorithms
Liu Yang
Kangwook Lee
Robert D. Nowak
Dimitris Papailiopoulos
97
26
0
21 Nov 2023
Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey
Yunpeng Huang
Jingwei Xu
Junyu Lai
Zixu Jiang
Taolue Chen
...
Xiaoxing Ma
Lijuan Yang
Zhou Xin
Shupeng Li
Penghao Zhao
LLMAG
KELM
98
66
0
21 Nov 2023
Long-MIL: Scaling Long Contextual Multiple Instance Learning for Histopathology Whole Slide Image Analysis
Honglin Li
Yunlong Zhang
Chenglu Zhu
Jiatong Cai
Sunyi Zheng
Lin Yang
VLM
89
4
0
21 Nov 2023
Tensor-Aware Energy Accounting
Timur Babakol
Yu David Liu
38
4
0
19 Nov 2023
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections
Lihan Zha
Yuchen Cui
Li-Heng Lin
Minae Kwon
Montse Gonzalez Arenas
Andy Zeng
Fei Xia
Dorsa Sadigh
104
37
0
17 Nov 2023
Generative AI for Hate Speech Detection: Evaluation and Findings
Sagi Pendzel
Tomer Wullach
Amir Adler
Einat Minkov
60
11
0
16 Nov 2023
Long-form Question Answering: An Iterative Planning-Retrieval-Generation Approach
Pritom Saha Akash
Kashob Kumar Roy
Lucian Popa
Kevin Chen-Chuan Chang
71
3
0
15 Nov 2023
Temporal Knowledge Question Answering via Abstract Reasoning Induction
Ziyang Chen
Dongfang Li
Xiang Zhao
Baotian Hu
Min Zhang
LRM
88
17
0
15 Nov 2023
OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining
Yihong Liu
Peiqin Lin
Mingyang Wang
Hinrich Schütze
71
29
0
15 Nov 2023
It Takes Two to Negotiate: Modeling Social Exchange in Online Multiplayer Games
Kokil Jaidka
Hansin Ahuja
Lynnette Ng
137
7
0
15 Nov 2023
GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer
Urchade Zaratiana
Nadi Tomeh
Pierre Holat
Thierry Charnois
72
41
0
14 Nov 2023
AI-generated text boundary detection with RoFT
Laida Kushnareva
T. Gaintseva
German Magai
S. Barannikov
Dmitry Abulkhanov
Kristian Kuznetsov
Eduard Tulchinskii
Irina Piontkovskaya
Sergey I. Nikolenko
DeLMO
65
7
0
14 Nov 2023
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models
.Ilker Kesen
Andrea Pedrotti
Mustafa Dogan
Michele Cafagna
Emre Can Acikgoz
...
Iacer Calixto
Anette Frank
Albert Gatt
Aykut Erdem
Erkut Erdem
94
19
0
13 Nov 2023
Training A Multi-stage Deep Classifier with Feedback Signals
Chao Xu
Yu Yang
Rong Wang
Guan Wang
Bojia Lin
35
0
0
12 Nov 2023
Tunable Soft Prompts are Messengers in Federated Learning
Chenhe Dong
Yuexiang Xie
Bolin Ding
Ying Shen
Yaliang Li
FedML
84
8
0
12 Nov 2023
Early-Exit Neural Networks with Nested Prediction Sets
Metod Jazbec
Patrick Forré
Stephan Mandt
Dan Zhang
Eric T. Nalisnick
UQCV
56
1
0
10 Nov 2023
The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models
Anton Razzhigaev
Matvey Mikhalchuk
Elizaveta Goncharova
Ivan Oseledets
Denis Dimitrov
Andrey Kuznetsov
77
10
0
10 Nov 2023
Hallucination-minimized Data-to-answer Framework for Financial Decision-makers
Sohini Roychowdhury
Andres Alvarez
Brian Moore
Marko Krema
Maria Paz Gelpi
...
Angel Rodriguez
Jose Ramon Cabrejas
Pablo Martinez Serrano
Punit Agrawal
Arijit Mukherjee
68
9
0
09 Nov 2023
A Survey of Large Language Models in Medicine: Progress, Application, and Challenge
Hongjian Zhou
Fenglin Liu
Boyang Gu
Xinyu Zou
Jinfa Huang
...
Yefeng Zheng
Lei A. Clifton
Zheng Li
Fenglin Liu
David Clifton
LM&MA
163
127
0
09 Nov 2023
Legal-HNet: Mixing Legal Long-Context Tokens with Hartley Transform
Daniele Giofré
Sneha Ghantasala
AILaw
70
0
0
09 Nov 2023
DACBERT: Leveraging Dependency Agreement for Cost-Efficient Bert Pretraining
Martin Kuo
Jianyi Zhang
Yiran Chen
52
2
0
08 Nov 2023
Pragmatic Reasoning Unlocks Quantifier Semantics for Foundation Models
Yiyuan Li
Rakesh R Menon
Sayan Ghosh
Shashank Srivastava
LRM
62
2
0
08 Nov 2023
DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding
Kehinde E. Ajayi
Xin Wei
Martin Gryder
Winston Shields
Jian Wu
Shawn M. Jones
Michal Kucer
Diane Oyen
3DV
36
4
0
07 Nov 2023
mahaNLP: A Marathi Natural Language Processing Library
Vidula Magdum
Omkar Dhekane
Sharayu Hiwarkhedkar
Saloni Mittal
Raviraj Joshi
76
5
0
05 Nov 2023
Sentiment Analysis through LLM Negotiations
Xiaofei Sun
Xiaoya Li
Shengyu Zhang
Shuhe Wang
Leilei Gan
Jiwei Li
Tianwei Zhang
Guoyin Wang
86
21
0
03 Nov 2023
TCM-GPT: Efficient Pre-training of Large Language Models for Domain Adaptation in Traditional Chinese Medicine
Guoxing Yang
Jianyu Shi
Zan Wang
Xiaohong Liu
Guangyu Wang
29
21
0
03 Nov 2023
A New Korean Text Classification Benchmark for Recognizing the Political Intents in Online Newspapers
Beomjune Kim
Eunsun Lee
Dongbin Na
45
1
0
03 Nov 2023
Successor Features for Efficient Multisubject Controlled Text Generation
Mengyao Cao
Mehdi Fatemi
Jackie Chi Kit Cheung
Samira Shabanian
BDL
82
0
0
03 Nov 2023
Adapting Fake News Detection to the Era of Large Language Models
Jinyan Su
Claire Cardie
Preslav Nakov
DeLMO
103
19
0
02 Nov 2023
Investigating Self-Supervised Deep Representations for EEG-based Auditory Attention Decoding
Karan Thakkar
Jiarui Hai
Mounya Elhilali
45
1
0
01 Nov 2023
Latent Space Translation via Semantic Alignment
Valentino Maiorca
Luca Moschella
Antonio Norelli
Marco Fumero
Francesco Locatello
Emanuele Rodolà
117
23
0
01 Nov 2023
LLMs may Dominate Information Access: Neural Retrievers are Biased Towards LLM-Generated Texts
Sunhao Dai
Yuqi Zhou
Liang Pang
Weihao Liu
Xiaolin Hu
Yong Liu
Xiao Zhang
Gang Wang
Jun Xu
121
34
0
31 Oct 2023
Do large language models solve verbal analogies like children do?
Claire E. Stevenson
Mathilde ter Veen
Rochelle Choenni
Han L. J. van der Maas
Ekaterina Shutova
LRM
28
8
0
31 Oct 2023
Learning to Play Chess from Textbooks (LEAP): a Corpus for Evaluating Chess Moves based on Sentiment Analysis
Haifa Alrdahi
Riza Batista-Navarro
62
2
0
31 Oct 2023
EELBERT: Tiny Models through Dynamic Embeddings
Gabrielle Cohn
Rishika Agarwal
Deepanshu Gupta
Siddharth Patwardhan
27
2
0
31 Oct 2023
Efficient Classification of Student Help Requests in Programming Courses Using Large Language Models
Jaromír Šavelka
Paul Denny
Mark H. Liffiton
Brad Sheese
AI4Ed
75
7
0
31 Oct 2023
MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks
Allen Nie
Yuhui Zhang
Atharva Amdekar
Chris Piech
Tatsunori Hashimoto
Tobias Gerstenberg
80
40
0
30 Oct 2023
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval
Youbo Lei
Feifei He
Chen Chen
Yingbin Mo
Sijia Li
Defeng Xie
H. Lu
VLM
87
0
0
30 Oct 2023
A Lightweight Method to Generate Unanswerable Questions in English
Vagrant Gautam
Miaoran Zhang
Dietrich Klakow
73
1
0
30 Oct 2023
Previous
1
2
3
...
11
12
13
...
57
58
59
Next