ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 2,935 papers shown
Title
FinTree: Financial Dataset Pretrain Transformer Encoder for Relation
  Extraction
FinTree: Financial Dataset Pretrain Transformer Encoder for Relation Extraction
Hyunjong Ok
47
2
0
26 Jul 2023
Re-mine, Learn and Reason: Exploring the Cross-modal Semantic
  Correlations for Language-guided HOI detection
Re-mine, Learn and Reason: Exploring the Cross-modal Semantic Correlations for Language-guided HOI detection
Yichao Cao
Qingfei Tang
Fengyuan Yang
Xiu Su
Shan You
Xiaobo Lu
Chang Xu
91
17
0
25 Jul 2023
Gradient-Based Word Substitution for Obstinate Adversarial Examples
  Generation in Language Models
Gradient-Based Word Substitution for Obstinate Adversarial Examples Generation in Language Models
Yimu Wang
Peng Shi
Hongyang Zhang
SILM
33
3
0
24 Jul 2023
MythQA: Query-Based Large-Scale Check-Worthy Claim Detection through
  Multi-Answer Open-Domain Question Answering
MythQA: Query-Based Large-Scale Check-Worthy Claim Detection through Multi-Answer Open-Domain Question Answering
Yang Bai
Anthony Colas
D. Wang
HILM
46
2
0
21 Jul 2023
Teach model to answer questions after comprehending the document
Teach model to answer questions after comprehending the document
Ruiqing Sun
Ping Jian
FaML
67
0
0
18 Jul 2023
Revisiting Implicit Models: Sparsity Trade-offs Capability in
  Weight-tied Model for Vision Tasks
Revisiting Implicit Models: Sparsity Trade-offs Capability in Weight-tied Model for Vision Tasks
Haobo Song
Soumajit Majumder
Tao R. Lin
VLM
86
0
0
16 Jul 2023
Sensi-BERT: Towards Sensitivity Driven Fine-Tuning for
  Parameter-Efficient BERT
Sensi-BERT: Towards Sensitivity Driven Fine-Tuning for Parameter-Efficient BERT
Souvik Kundu
S. Nittur
Maciej Szankin
Sairam Sundaresan
MQ
62
2
0
14 Jul 2023
Vacaspati: A Diverse Corpus of Bangla Literature
Vacaspati: A Diverse Corpus of Bangla Literature
Pramit Bhattacharyya
Joydeep Mondal
S. Maji
Arnab Bhattacharya
69
7
0
11 Jul 2023
Synthetic Dataset for Evaluating Complex Compositional Knowledge for
  Natural Language Inference
Synthetic Dataset for Evaluating Complex Compositional Knowledge for Natural Language Inference
Sushma A. Akoju
Robert Vacareanu
Haris Riaz
Eduardo Blanco
Mihai Surdeanu
NAICoGe
18
1
0
11 Jul 2023
Event Extraction as Question Generation and Answering
Event Extraction as Question Generation and Answering
Di Lu
Shihao Ran
Joel R. Tetreault
A. Jaimes
73
42
0
10 Jul 2023
Reasoning over the Behaviour of Objects in Video-Clips for Adverb-Type
  Recognition
Reasoning over the Behaviour of Objects in Video-Clips for Adverb-Type Recognition
Amrithaa Seshadri
Alessandra Russo
87
0
0
09 Jul 2023
Unveiling the Potential of Knowledge-Prompted ChatGPT for Enhancing Drug
  Trafficking Detection on Social Media
Unveiling the Potential of Knowledge-Prompted ChatGPT for Enhancing Drug Trafficking Detection on Social Media
Chuanbo Hu
Bin Liu
Xin Li
Yanfang Ye
30
4
0
07 Jul 2023
A Side-by-side Comparison of Transformers for English Implicit Discourse
  Relation Classification
A Side-by-side Comparison of Transformers for English Implicit Discourse Relation Classification
Bruce W. Lee
Bongseok Yang
J. Lee
76
0
0
07 Jul 2023
Evaluating Biased Attitude Associations of Language Models in an
  Intersectional Context
Evaluating Biased Attitude Associations of Language Models in an Intersectional Context
Shiva Omrani Sabbaghi
Robert Wolfe
Aylin Caliskan
73
25
0
07 Jul 2023
Vision Language Transformers: A Survey
Vision Language Transformers: A Survey
Clayton Fields
C. Kennington
VLM
53
5
0
06 Jul 2023
BLEURT Has Universal Translations: An Analysis of Automatic Metrics by
  Minimum Risk Training
BLEURT Has Universal Translations: An Analysis of Automatic Metrics by Minimum Risk Training
Yiming Yan
Tao Wang
Chengqi Zhao
Shujian Huang
Jiajun Chen
Mingxuan Wang
89
24
0
06 Jul 2023
DeepOnto: A Python Package for Ontology Engineering with Deep Learning
DeepOnto: A Python Package for Ontology Engineering with Deep Learning
Yuan He
Jiaoyan Chen
Hang Dong
Ian Horrocks
Carlo Allocca
Taehun Kim
B. Sapkota
138
26
0
06 Jul 2023
SpaceNLI: Evaluating the Consistency of Predicting Inferences in Space
SpaceNLI: Evaluating the Consistency of Predicting Inferences in Space
Lasha Abzianidze
J. Zwarts
Yoad Winter
34
2
0
05 Jul 2023
Chain of Thought Prompting Elicits Knowledge Augmentation
Chain of Thought Prompting Elicits Knowledge Augmentation
Di Wu
Jing Zhang
Xinmei Huang
LRM
89
35
0
04 Jul 2023
SCAT: Robust Self-supervised Contrastive Learning via Adversarial
  Training for Text Classification
SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification
J. Wu
Dit-Yan Yeung
SILM
74
0
0
04 Jul 2023
Multi-Task Learning Improves Performance In Deep Argument Mining Models
Multi-Task Learning Improves Performance In Deep Argument Mining Models
Amirhossein Farzam
Shashank Shekhar
Isaac Mehlhaff
Marco Morucci
48
1
0
03 Jul 2023
A Critical Re-evaluation of Benchmark Datasets for (Deep) Learning-Based
  Matching Algorithms
A Critical Re-evaluation of Benchmark Datasets for (Deep) Learning-Based Matching Algorithms
G. Papadakis
Nishadi Kirielle
Peter Christen
Themis Palpanas
89
8
0
03 Jul 2023
Make Text Unlearnable: Exploiting Effective Patterns to Protect Personal
  Data
Make Text Unlearnable: Exploiting Effective Patterns to Protect Personal Data
Xinzhe Li
Ming Liu
Shang Gao
MU
99
8
0
02 Jul 2023
Meta-training with Demonstration Retrieval for Efficient Few-shot
  Learning
Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Aaron Mueller
Kanika Narang
Lambert Mathias
Qifan Wang
Hamed Firooz
RALM
77
3
0
30 Jun 2023
Transformers in Healthcare: A Survey
Transformers in Healthcare: A Survey
Subhash Nerella
S. Bandyopadhyay
Jiaqing Zhang
Miguel Contreras
Scott Siegel
...
Jessica Sena
B. Shickel
A. Bihorac
Kia Khezeli
Parisa Rashidi
MedImAI4CE
96
36
0
30 Jun 2023
GPT-FinRE: In-context Learning for Financial Relation Extraction using
  Large Language Models
GPT-FinRE: In-context Learning for Financial Relation Extraction using Large Language Models
P. Rajpoot
Ankur P. Parikh
86
14
0
30 Jun 2023
Pollen: High-throughput Federated Learning Simulation via Resource-Aware
  Client Placement
Pollen: High-throughput Federated Learning Simulation via Resource-Aware Client Placement
Lorenzo Sani
Pedro Gusmão
Alexandru Iacob
Wanru Zhao
Xinchi Qiu
Yan Gao
Javier Fernandez-Marques
Nicholas D. Lane
62
0
0
30 Jun 2023
CBBQ: A Chinese Bias Benchmark Dataset Curated with Human-AI
  Collaboration for Large Language Models
CBBQ: A Chinese Bias Benchmark Dataset Curated with Human-AI Collaboration for Large Language Models
Yufei Huang
Deyi Xiong
ALM
134
19
0
28 Jun 2023
A generic self-supervised learning (SSL) framework for representation
  learning from spectra-spatial feature of unlabeled remote sensing imagery
A generic self-supervised learning (SSL) framework for representation learning from spectra-spatial feature of unlabeled remote sensing imagery
Xin Zhang
Liangxiu Han
SSL
92
3
0
27 Jun 2023
SparseOptimizer: Sparsify Language Models through Moreau-Yosida
  Regularization and Accelerate via Compiler Co-design
SparseOptimizer: Sparsify Language Models through Moreau-Yosida Regularization and Accelerate via Compiler Co-design
Fu-Ming Guo
MoE
104
0
0
27 Jun 2023
Length Generalization in Arithmetic Transformers
Length Generalization in Arithmetic Transformers
Samy Jelassi
Stéphane dÁscoli
Carles Domingo-Enrich
Yuhuai Wu
Yuan-Fang Li
Franccois Charton
100
43
0
27 Jun 2023
Gender Bias in BERT -- Measuring and Analysing Biases through Sentiment
  Rating in a Realistic Downstream Classification Task
Gender Bias in BERT -- Measuring and Analysing Biases through Sentiment Rating in a Realistic Downstream Classification Task
Sophie F. Jentzsch
Cigdem Turan
74
34
0
27 Jun 2023
IDOL: Indicator-oriented Logic Pre-training for Logical Reasoning
IDOL: Indicator-oriented Logic Pre-training for Logical Reasoning
Zihang Xu
Ziqing Yang
Yiming Cui
Shijin Wang
LRMReLMMLLM
87
6
0
27 Jun 2023
Approximated Prompt Tuning for Vision-Language Pre-trained Models
Approximated Prompt Tuning for Vision-Language Pre-trained Models
Qiong Wu
Shubin Huang
Yiyi Zhou
Pingyang Dai
Annan Shu
Guannan Jiang
Rongrong Ji
VLMVPVLM
42
2
0
27 Jun 2023
WinoQueer: A Community-in-the-Loop Benchmark for Anti-LGBTQ+ Bias in
  Large Language Models
WinoQueer: A Community-in-the-Loop Benchmark for Anti-LGBTQ+ Bias in Large Language Models
Virginia K. Felkner
Ho-Chun Herbert Chang
Eugene Jang
Jonathan May
OSLM
79
37
0
26 Jun 2023
Constraint-aware and Ranking-distilled Token Pruning for Efficient
  Transformer Inference
Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference
Junyan Li
Li Zhang
Jiahang Xu
Yujing Wang
Shaoguang Yan
...
Ting Cao
Hao Sun
Weiwei Deng
Qi Zhang
Mao Yang
64
10
0
26 Jun 2023
Switch-BERT: Learning to Model Multimodal Interactions by Switching
  Attention and Input
Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input
Qingpei Guo
Kaisheng Yao
Wei Chu
MLLM
45
5
0
25 Jun 2023
My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models
  and Evaluation Benchmarks
My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation Benchmarks
Tanmay Chavan
Omkar Gokhale
Aditya Kane
Shantanu Patankar
Raviraj Joshi
66
3
0
24 Jun 2023
L3Cube-MahaSent-MD: A Multi-domain Marathi Sentiment Analysis Dataset
  and Transformer Models
L3Cube-MahaSent-MD: A Multi-domain Marathi Sentiment Analysis Dataset and Transformer Models
Aabha Pingle
Aditya Vyawahare
Isha Joshi
Rahul Tangsali
Raviraj Joshi
57
9
0
24 Jun 2023
Deep Metric Learning with Soft Orthogonal Proxies
Deep Metric Learning with Soft Orthogonal Proxies
F. Saberi-Movahed
M. K. Ebrahimpour
Farid Saberi-Movahed
Monireh Moshavash
Dorsa Rahmatian
Mahvash Mohazzebi
Mahdi Shariatzadeh
M. Eftekhari
53
3
0
22 Jun 2023
Solving Dialogue Grounding Embodied Task in a Simulated Environment
  using Further Masked Language Modeling
Solving Dialogue Grounding Embodied Task in a Simulated Environment using Further Masked Language Modeling
Weijie Zhang
61
0
0
21 Jun 2023
Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs
  for Fact-aware Language Modeling
Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language Modeling
Lin F. Yang
Hongyang Chen
Zhao Li
Xiao Ding
Xindong Wu
KELM
110
93
0
20 Jun 2023
Towards Theory-based Moral AI: Moral AI with Aggregating Models Based on
  Normative Ethical Theory
Towards Theory-based Moral AI: Moral AI with Aggregating Models Based on Normative Ethical Theory
Masashi Takeshita
Rafal Rzepka
K. Araki
54
9
0
20 Jun 2023
Graph Self-Supervised Learning for Endoscopic Image Matching
Graph Self-Supervised Learning for Endoscopic Image Matching
Manel Farhat
A. Ben-Hamadou
SSL
70
1
0
19 Jun 2023
Adversarial Robustness of Prompt-based Few-Shot Learning for Natural
  Language Understanding
Adversarial Robustness of Prompt-based Few-Shot Learning for Natural Language Understanding
Venkata Prabhakara Sarath Nookala
Gaurav Verma
Subhabrata Mukherjee
Srijan Kumar
ELM
129
6
0
19 Jun 2023
No Strong Feelings One Way or Another: Re-operationalizing Neutrality in
  Natural Language Inference
No Strong Feelings One Way or Another: Re-operationalizing Neutrality in Natural Language Inference
Animesh Nighojkar
Antonio Laverghetta
John Licato
59
4
0
16 Jun 2023
M3PT: A Multi-Modal Model for POI Tagging
M3PT: A Multi-Modal Model for POI Tagging
Jingsong Yang
Guanzhou Han
Deqing Yang
Jingping Liu
Yanghua Xiao
Xiang Xu
Baohua Wu
Shenghua Ni
93
3
0
16 Jun 2023
ChatGPT for Suicide Risk Assessment on Social Media: Quantitative
  Evaluation of Model Performance, Potentials and Limitations
ChatGPT for Suicide Risk Assessment on Social Media: Quantitative Evaluation of Model Performance, Potentials and Limitations
Hamideh Ghanadian
I. Nejadgholi
Hussein Al Osman
LM&MAAI4MH
42
17
0
15 Jun 2023
Understanding Parameter Sharing in Transformers
Understanding Parameter Sharing in Transformers
Ye Lin
Mingxuan Wang
Zhexi Zhang
Xiaohui Wang
Tong Xiao
Jingbo Zhu
MoE
77
2
0
15 Jun 2023
DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning
DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning
Hengli Li
Songchun Zhu
Zilong Zheng
54
9
0
15 Jun 2023
Previous
123...151617...575859
Next