ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Title
A Survey on Prompting Techniques in LLMs
A Survey on Prompting Techniques in LLMs
Prabin Bhandari
48
7
0
28 Nov 2023
Entity-Aspect-Opinion-Sentiment Quadruple Extraction for Fine-grained
  Sentiment Analysis
Entity-Aspect-Opinion-Sentiment Quadruple Extraction for Fine-grained Sentiment Analysis
Dan Ma
Jun Xu
Zongyu Wang
Xuezhi Cao
Yunsen Xian
34
0
0
28 Nov 2023
Recognizing Conditional Causal Relationships about Emotions and Their
  Corresponding Conditions
Recognizing Conditional Causal Relationships about Emotions and Their Corresponding Conditions
Xinhong Chen
Zongxi Li
Yaowei Wang
Haoran Xie
Jianping Wang
Qing Li
40
0
0
28 Nov 2023
Leveraging deep active learning to identify low-resource mobility
  functioning information in public clinical notes
Leveraging deep active learning to identify low-resource mobility functioning information in public clinical notes
Tuan-Dung Le
Zhuqi Miao
Samuel Alvarado
Brittany Smith
William Paiva
Thanh Thieu
28
1
0
27 Nov 2023
C-SAW: Self-Supervised Prompt Learning for Image Generalization in
  Remote Sensing
C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing
Avigyan Bhattacharya
Mainak Singha
Ankit Jha
Biplab Banerjee
SSLVLM
78
6
0
27 Nov 2023
A Comparative and Experimental Study on Automatic Question Answering
  Systems and its Robustness against Word Jumbling
A Comparative and Experimental Study on Automatic Question Answering Systems and its Robustness against Word Jumbling
Shashidhar Reddy Javaji
Haoran Hu
Sai Sameer Vennam
Vijaya Gajanan Buddhavarapu
18
0
0
27 Nov 2023
Probabilistic Transformer: A Probabilistic Dependency Model for
  Contextual Word Representation
Probabilistic Transformer: A Probabilistic Dependency Model for Contextual Word Representation
Haoyi Wu
Kewei Tu
414
4
0
26 Nov 2023
General Phrase Debiaser: Debiasing Masked Language Models at a
  Multi-Token Level
General Phrase Debiaser: Debiasing Masked Language Models at a Multi-Token Level
Bingkang Shi
Xiaodan Zhang
Dehan Kong
Yulei Wu
Zongzhen Liu
Honglei Lyu
Longtao Huang
AI4CE
81
2
0
23 Nov 2023
A Multi-solution Study on GDPR AI-enabled Completeness Checking of DPAs
A Multi-solution Study on GDPR AI-enabled Completeness Checking of DPAs
Muhammad Ilyas Azeem
Sallam Abualhaija
75
7
0
23 Nov 2023
Transformer-based Named Entity Recognition in Construction Supply Chain
  Risk Management in Australia
Transformer-based Named Entity Recognition in Construction Supply Chain Risk Management in Australia
Milad Baghalzadeh Shishehgarkhaneh
R. Moehler
Yihai Fang
Amer A. Hijazi
Hamed Aboutorab
97
10
0
23 Nov 2023
Efficient Transformer Knowledge Distillation: A Performance Review
Efficient Transformer Knowledge Distillation: A Performance Review
Nathan Brown
Ashton Williamson
Tahj Anderson
Logan Lawrence
VLM
50
5
0
22 Nov 2023
Looped Transformers are Better at Learning Learning Algorithms
Looped Transformers are Better at Learning Learning Algorithms
Liu Yang
Kangwook Lee
Robert D. Nowak
Dimitris Papailiopoulos
97
26
0
21 Nov 2023
Advancing Transformer Architecture in Long-Context Large Language
  Models: A Comprehensive Survey
Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey
Yunpeng Huang
Jingwei Xu
Junyu Lai
Zixu Jiang
Taolue Chen
...
Xiaoxing Ma
Lijuan Yang
Zhou Xin
Shupeng Li
Penghao Zhao
LLMAGKELM
98
66
0
21 Nov 2023
Long-MIL: Scaling Long Contextual Multiple Instance Learning for
  Histopathology Whole Slide Image Analysis
Long-MIL: Scaling Long Contextual Multiple Instance Learning for Histopathology Whole Slide Image Analysis
Honglin Li
Yunlong Zhang
Chenglu Zhu
Jiatong Cai
Sunyi Zheng
Lin Yang
VLM
89
4
0
21 Nov 2023
Tensor-Aware Energy Accounting
Tensor-Aware Energy Accounting
Timur Babakol
Yu David Liu
38
4
0
19 Nov 2023
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation
  via Language Corrections
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections
Lihan Zha
Yuchen Cui
Li-Heng Lin
Minae Kwon
Montse Gonzalez Arenas
Andy Zeng
Fei Xia
Dorsa Sadigh
104
37
0
17 Nov 2023
Generative AI for Hate Speech Detection: Evaluation and Findings
Generative AI for Hate Speech Detection: Evaluation and Findings
Sagi Pendzel
Tomer Wullach
Amir Adler
Einat Minkov
60
11
0
16 Nov 2023
Long-form Question Answering: An Iterative Planning-Retrieval-Generation
  Approach
Long-form Question Answering: An Iterative Planning-Retrieval-Generation Approach
Pritom Saha Akash
Kashob Kumar Roy
Lucian Popa
Kevin Chen-Chuan Chang
71
3
0
15 Nov 2023
Temporal Knowledge Question Answering via Abstract Reasoning Induction
Temporal Knowledge Question Answering via Abstract Reasoning Induction
Ziyang Chen
Dongfang Li
Xiang Zhao
Baotian Hu
Min Zhang
LRM
88
17
0
15 Nov 2023
OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient
  Large-scale Multilingual Continued Pretraining
OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining
Yihong Liu
Peiqin Lin
Mingyang Wang
Hinrich Schütze
71
29
0
15 Nov 2023
It Takes Two to Negotiate: Modeling Social Exchange in Online
  Multiplayer Games
It Takes Two to Negotiate: Modeling Social Exchange in Online Multiplayer Games
Kokil Jaidka
Hansin Ahuja
Lynnette Ng
137
7
0
15 Nov 2023
GLiNER: Generalist Model for Named Entity Recognition using
  Bidirectional Transformer
GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer
Urchade Zaratiana
Nadi Tomeh
Pierre Holat
Thierry Charnois
72
41
0
14 Nov 2023
AI-generated text boundary detection with RoFT
AI-generated text boundary detection with RoFT
Laida Kushnareva
T. Gaintseva
German Magai
S. Barannikov
Dmitry Abulkhanov
Kristian Kuznetsov
Eduard Tulchinskii
Irina Piontkovskaya
Sergey I. Nikolenko
DeLMO
65
7
0
14 Nov 2023
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in
  Video-Language Models
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models
.Ilker Kesen
Andrea Pedrotti
Mustafa Dogan
Michele Cafagna
Emre Can Acikgoz
...
Iacer Calixto
Anette Frank
Albert Gatt
Aykut Erdem
Erkut Erdem
94
19
0
13 Nov 2023
Training A Multi-stage Deep Classifier with Feedback Signals
Training A Multi-stage Deep Classifier with Feedback Signals
Chao Xu
Yu Yang
Rong Wang
Guan Wang
Bojia Lin
35
0
0
12 Nov 2023
Tunable Soft Prompts are Messengers in Federated Learning
Tunable Soft Prompts are Messengers in Federated Learning
Chenhe Dong
Yuexiang Xie
Bolin Ding
Ying Shen
Yaliang Li
FedML
84
8
0
12 Nov 2023
Early-Exit Neural Networks with Nested Prediction Sets
Early-Exit Neural Networks with Nested Prediction Sets
Metod Jazbec
Patrick Forré
Stephan Mandt
Dan Zhang
Eric T. Nalisnick
UQCV
56
1
0
10 Nov 2023
The Shape of Learning: Anisotropy and Intrinsic Dimensions in
  Transformer-Based Models
The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models
Anton Razzhigaev
Matvey Mikhalchuk
Elizaveta Goncharova
Ivan Oseledets
Denis Dimitrov
Andrey Kuznetsov
77
10
0
10 Nov 2023
Hallucination-minimized Data-to-answer Framework for Financial
  Decision-makers
Hallucination-minimized Data-to-answer Framework for Financial Decision-makers
Sohini Roychowdhury
Andres Alvarez
Brian Moore
Marko Krema
Maria Paz Gelpi
...
Angel Rodriguez
Jose Ramon Cabrejas
Pablo Martinez Serrano
Punit Agrawal
Arijit Mukherjee
68
9
0
09 Nov 2023
A Survey of Large Language Models in Medicine: Progress, Application,
  and Challenge
A Survey of Large Language Models in Medicine: Progress, Application, and Challenge
Hongjian Zhou
Fenglin Liu
Boyang Gu
Xinyu Zou
Jinfa Huang
...
Yefeng Zheng
Lei A. Clifton
Zheng Li
Fenglin Liu
David Clifton
LM&MA
163
127
0
09 Nov 2023
Legal-HNet: Mixing Legal Long-Context Tokens with Hartley Transform
Legal-HNet: Mixing Legal Long-Context Tokens with Hartley Transform
Daniele Giofré
Sneha Ghantasala
AILaw
70
0
0
09 Nov 2023
DACBERT: Leveraging Dependency Agreement for Cost-Efficient Bert
  Pretraining
DACBERT: Leveraging Dependency Agreement for Cost-Efficient Bert Pretraining
Martin Kuo
Jianyi Zhang
Yiran Chen
52
2
0
08 Nov 2023
Pragmatic Reasoning Unlocks Quantifier Semantics for Foundation Models
Pragmatic Reasoning Unlocks Quantifier Semantics for Foundation Models
Yiyuan Li
Rakesh R Menon
Sayan Ghosh
Shashank Srivastava
LRM
62
2
0
08 Nov 2023
DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing
  Understanding
DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding
Kehinde E. Ajayi
Xin Wei
Martin Gryder
Winston Shields
Jian Wu
Shawn M. Jones
Michal Kucer
Diane Oyen
3DV
36
4
0
07 Nov 2023
mahaNLP: A Marathi Natural Language Processing Library
mahaNLP: A Marathi Natural Language Processing Library
Vidula Magdum
Omkar Dhekane
Sharayu Hiwarkhedkar
Saloni Mittal
Raviraj Joshi
76
5
0
05 Nov 2023
Sentiment Analysis through LLM Negotiations
Sentiment Analysis through LLM Negotiations
Xiaofei Sun
Xiaoya Li
Shengyu Zhang
Shuhe Wang
Leilei Gan
Jiwei Li
Tianwei Zhang
Guoyin Wang
86
21
0
03 Nov 2023
TCM-GPT: Efficient Pre-training of Large Language Models for Domain
  Adaptation in Traditional Chinese Medicine
TCM-GPT: Efficient Pre-training of Large Language Models for Domain Adaptation in Traditional Chinese Medicine
Guoxing Yang
Jianyu Shi
Zan Wang
Xiaohong Liu
Guangyu Wang
29
21
0
03 Nov 2023
A New Korean Text Classification Benchmark for Recognizing the Political
  Intents in Online Newspapers
A New Korean Text Classification Benchmark for Recognizing the Political Intents in Online Newspapers
Beomjune Kim
Eunsun Lee
Dongbin Na
45
1
0
03 Nov 2023
Successor Features for Efficient Multisubject Controlled Text Generation
Successor Features for Efficient Multisubject Controlled Text Generation
Mengyao Cao
Mehdi Fatemi
Jackie Chi Kit Cheung
Samira Shabanian
BDL
82
0
0
03 Nov 2023
Adapting Fake News Detection to the Era of Large Language Models
Adapting Fake News Detection to the Era of Large Language Models
Jinyan Su
Claire Cardie
Preslav Nakov
DeLMO
103
19
0
02 Nov 2023
Investigating Self-Supervised Deep Representations for EEG-based
  Auditory Attention Decoding
Investigating Self-Supervised Deep Representations for EEG-based Auditory Attention Decoding
Karan Thakkar
Jiarui Hai
Mounya Elhilali
45
1
0
01 Nov 2023
Latent Space Translation via Semantic Alignment
Latent Space Translation via Semantic Alignment
Valentino Maiorca
Luca Moschella
Antonio Norelli
Marco Fumero
Francesco Locatello
Emanuele Rodolà
117
23
0
01 Nov 2023
LLMs may Dominate Information Access: Neural Retrievers are Biased
  Towards LLM-Generated Texts
LLMs may Dominate Information Access: Neural Retrievers are Biased Towards LLM-Generated Texts
Sunhao Dai
Yuqi Zhou
Liang Pang
Weihao Liu
Xiaolin Hu
Yong Liu
Xiao Zhang
Gang Wang
Jun Xu
121
34
0
31 Oct 2023
Do large language models solve verbal analogies like children do?
Do large language models solve verbal analogies like children do?
Claire E. Stevenson
Mathilde ter Veen
Rochelle Choenni
Han L. J. van der Maas
Ekaterina Shutova
LRM
28
8
0
31 Oct 2023
Learning to Play Chess from Textbooks (LEAP): a Corpus for Evaluating
  Chess Moves based on Sentiment Analysis
Learning to Play Chess from Textbooks (LEAP): a Corpus for Evaluating Chess Moves based on Sentiment Analysis
Haifa Alrdahi
Riza Batista-Navarro
62
2
0
31 Oct 2023
EELBERT: Tiny Models through Dynamic Embeddings
EELBERT: Tiny Models through Dynamic Embeddings
Gabrielle Cohn
Rishika Agarwal
Deepanshu Gupta
Siddharth Patwardhan
27
2
0
31 Oct 2023
Efficient Classification of Student Help Requests in Programming Courses
  Using Large Language Models
Efficient Classification of Student Help Requests in Programming Courses Using Large Language Models
Jaromír Šavelka
Paul Denny
Mark H. Liffiton
Brad Sheese
AI4Ed
75
7
0
31 Oct 2023
MoCa: Measuring Human-Language Model Alignment on Causal and Moral
  Judgment Tasks
MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks
Allen Nie
Yuhui Zhang
Atharva Amdekar
Chris Piech
Tatsunori Hashimoto
Tobias Gerstenberg
80
40
0
30 Oct 2023
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient
  image-text retrieval
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval
Youbo Lei
Feifei He
Chen Chen
Yingbin Mo
Sijia Li
Defeng Xie
H. Lu
VLM
87
0
0
30 Oct 2023
A Lightweight Method to Generate Unanswerable Questions in English
A Lightweight Method to Generate Unanswerable Questions in English
Vagrant Gautam
Miaoran Zhang
Dietrich Klakow
73
1
0
30 Oct 2023
Previous
123...111213...575859
Next