1909.11942
Cited By
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,935 papers shown
Title
Fast Neural Kernel Embeddings for General Activations
Insu Han
A. Zandieh
Jaehoon Lee
Roman Novak
Lechao Xiao
Amin Karbasi
120
19
0
09 Sep 2022
An Analysis of Deep Reinforcement Learning Agents for Text-based Games
Chen Chen
Yue Dai
Josiah Poon
Caren Han
LLMAG
45
2
0
09 Sep 2022
5q032e@SMM4H'22: Transformer-based classification of premise in tweets related to COVID-19
Vadim Porvatov
Natalia Semenova
73
2
0
08 Sep 2022
Pre-Training a Graph Recurrent Network for Language Representation
Yile Wang
Linyi Yang
Zhiyang Teng
M. Zhou
Yue Zhang
GNN
81
1
0
08 Sep 2022
Towards explainable evaluation of language models on the semantic similarity of visual concepts
Maria Lymperaiou
George Manoliadis
Orfeas Menis Mastromichalakis
Edmund Dervakos
Giorgos Stamou
AAML
73
5
0
08 Sep 2022
SSL-WM: A Black-Box Watermarking Approach for Encoders Pre-trained by Self-supervised Learning
Peizhuo Lv
Pan Li
Shenchen Zhu
Shengzhi Zhang
Kai Chen
...
Fan Xiang
Yuling Cai
Hualong Ma
Yingjun Zhang
Guozhu Meng
AAML
86
7
0
08 Sep 2022
Blessing of Class Diversity in Pre-training
Yulai Zhao
Jianshu Chen
S. Du
AI4CE
77
3
0
07 Sep 2022
Evaluating the Susceptibility of Pre-Trained Language Models via Handcrafted Adversarial Examples
Hezekiah J. Branch
Jonathan Rodriguez Cefalu
Jeremy McHugh
Leyla Hujer
Aditya Bahl
Daniel del Castillo Iglesias
Ron Heichman
Ramesh Darwishi
ELM
SILM
AAML
70
56
0
05 Sep 2022
A Study on Representation Transfer for Few-Shot Learning
Chun-Nam Yu
Yi Xie
SSL
49
1
0
05 Sep 2022
Trust in Language Grounding: a new AI challenge for human-robot teams
David M. Bossens
C. Evers
90
1
0
05 Sep 2022
Selective Annotation Makes Language Models Better Few-Shot Learners
Hongjin Su
Jungo Kasai
Chen Henry Wu
Weijia Shi
Tianlu Wang
...
Rui Zhang
Mari Ostendorf
Luke Zettlemoyer
Noah A. Smith
Tao Yu
118
262
0
05 Sep 2022
A Survey on Measuring and Mitigating Reasoning Shortcuts in Machine Reading Comprehension
Xanh Ho
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
OffRL
92
4
0
05 Sep 2022
Query-focused Extractive Summarisation for Biomedical and COVID-19 Complex Question Answering
Diego Mollá Aliod
103
6
0
05 Sep 2022
Generalization in Neural Networks: A Broad Survey
Chris Rohlfs
OOD
AI4CE
67
7
0
04 Sep 2022
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling
Tsu-Jui Fu
Linjie Li
Zhe Gan
Kevin Qinghong Lin
William Yang Wang
Lijuan Wang
Zicheng Liu
VLM
130
65
0
04 Sep 2022
Sparse Attention Acceleration with Synergistic In-Memory Pruning and On-Chip Recomputation
Amir Yazdanbakhsh
Ashkan Moradifirouzabadi
Zheng Li
Mingu Kang
84
33
0
01 Sep 2022
Why Do Neural Language Models Still Need Commonsense Knowledge to Handle Semantic Variations in Question Answering?
Sunjae Kwon
Cheongwoong Kang
Jiyeon Han
Jaesik Choi
59
0
0
01 Sep 2022
Attack Tactic Identification by Transfer Learning of Language Model
Lily Lin
Shun-Wen Hsiao
67
2
0
01 Sep 2022
Efficient Methods for Natural Language Processing: A Survey
Marcos Vinícius Treviso
Ji-Ung Lee
Tianchu Ji
Betty van Aken
Qingqing Cao
...
Emma Strubell
Niranjan Balasubramanian
Leon Derczynski
Iryna Gurevych
Roy Schwartz
156
114
0
31 Aug 2022
Unified Knowledge Prompt Pre-training for Customer Service Dialogues
Keqing He
Jingang Wang
Chaobo Sun
Wei Wu
72
4
0
31 Aug 2022
Transformers with Learnable Activation Functions
Haishuo Fang
Ji-Ung Lee
N. Moosavi
Iryna Gurevych
AI4CE
46
8
0
30 Aug 2022
Survey: Exploiting Data Redundancy for Optimization of Deep Learning
Jou-An Chen
Wei Niu
Bin Ren
Yanzhi Wang
Xipeng Shen
60
24
0
29 Aug 2022
Addressing Token Uniformity in Transformers via Singular Value Transformation
Hanqi Yan
Lin Gui
Wenjie Li
Yulan He
75
15
0
24 Aug 2022
Not All GPUs Are Created Equal: Characterizing Variability in Large-Scale, Accelerator-Rich Systems
Prasoon Sinha
Akhil Guliani
Rutwik Jain
Brandon Tran
Matthew D. Sinclair
Shivaram Venkataraman
79
18
0
23 Aug 2022
CLOWER: A Pre-trained Language Model with Contrastive Learning over Word and Character Representations
Borun Chen
Hongyin Tang
Jiahao Bu
Kai Zhang
Jingang Wang
Qifan Wang
Haitao Zheng
Wei Wu
Liqian Yu
VLM
54
1
0
23 Aug 2022
Learning Better Masking for Better Language Model Pre-training
Dongjie Yang
Zhuosheng Zhang
Hai Zhao
80
15
0
23 Aug 2022
Learning Dynamic Contextualised Word Embeddings via Template-based Temporal Adaptation
Xiaohang Tang
Yi Zhou
Danushka Bollegala
95
6
0
23 Aug 2022
Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering
Siyuan Wang
Zhongyu Wei
Zhihao Fan
Qi Zhang
Xuanjing Huang
LRM
80
9
0
22 Aug 2022
Lost in Context? On the Sense-wise Variance of Contextualized Word Embeddings
Yile Wang
Yue Zhang
59
5
0
20 Aug 2022
SPOT: Knowledge-Enhanced Language Representations for Information Extraction
Jiacheng Li
Yannis Katsis
Tyler Baldwin
Ho-Cheol Kim
Andrew Bartko
Julian McAuley
Chun-Nan Hsu
80
17
0
20 Aug 2022
Adapting Task-Oriented Dialogue Models for Email Conversations
Soham Deshmukh
Charles Lee
57
1
0
19 Aug 2022
A Kind Introduction to Lexical and Grammatical Aspect, with a Survey of Computational Approaches
Annemarie Friedrich
Nianwen Xue
Alexis Palmer
89
3
0
18 Aug 2022
Exploring and Exploiting Multi-Granularity Representations for Machine Reading Comprehension
Nuo Chen
Chenyu You
82
0
0
18 Aug 2022
A Two-Phase Paradigm for Joint Entity-Relation Extraction
Shezheng Song
Hao Xu
Jie Yu
Shasha Li
Jun Ma
Yuke Ji
Bin Ji
113
2
0
18 Aug 2022
Transformer Encoder for Social Science
Haosen Ge
In Young Park
Xuancheng Qian
Grace Zeng
48
0
0
17 Aug 2022
Efficient Multimodal Transformer with Dual-Level Feature Restoration for Robust Multimodal Sentiment Analysis
Guoying Zhao
Zheng Lian
B. Liu
Jianhua Tao
89
54
0
16 Aug 2022
Zeus: Understanding and Optimizing GPU Energy Consumption of DNN Training
Jie You
Jaehoon Chung
Mosharaf Chowdhury
80
82
0
12 Aug 2022
On the Pros and Cons of Momentum Encoder in Self-Supervised Visual Representation Learning
T. Pham
Chaoning Zhang
Axi Niu
Kang Zhang
Chang D. Yoo
78
11
0
11 Aug 2022
Construction of English Resume Corpus and Test with Pre-trained Language Models
Chengguang Gan
Tatsunori Mori
24
3
0
05 Aug 2022
Global Pointer: Novel Efficient Span-based Approach for Named Entity Recognition
Jianlin Su
Ahmed Murtadha
Shengfeng Pan
Jing Hou
Jun Sun
Wanwei Huang
Bo Wen
Yunfeng Liu
74
80
0
05 Aug 2022
Towards No.1 in CLUE Semantic Matching Challenge: Pre-trained Language Model Erlangshen with Propensity-Corrected Loss
Junjie Wang
Yuxiang Zhang
Ping Yang
Ruyi Gan
47
2
0
05 Aug 2022
SpanDrop: Simple and Effective Counterfactual Learning for Long Sequences
Peng Qi
Guangtao Wang
Jing Huang
48
0
0
03 Aug 2022
A Comparative Study on COVID-19 Fake News Detection Using Different Transformer Based Models
Sajib Kumar Saha Joy
Dibyo Fabian Dofadar
Riyo Hayat Khan
M. Ahmed
Rafeed Rahman
MedIm
78
5
0
02 Aug 2022
To Answer or Not to Answer? Improving Machine Reading Comprehension Model with Span-based Contrastive Learning
Yunjie Ji
Liangyu Chen
Chenxiao Dou
Baochang Ma
Xiangang Li
82
5
0
02 Aug 2022
giMLPs: Gate with Inhibition Mechanism in MLPs
Cheng Kang
Jindich Prokop
Lei Tong
Huiyu Zhou
Yong Hu
Daneil Novak
33
0
0
01 Aug 2022
DictBERT: Dictionary Description Knowledge Enhanced Language Model Pre-training via Contrastive Learning
Qianglong Chen
Feng-Lin Li
Guohai Xu
Ming Yan
Ji Zhang
Yin Zhang
74
23
0
01 Aug 2022
Face-to-Face Contrastive Learning for Social Intelligence Question-Answering
Alex Wilf
Qianli Ma
Paul Pu Liang
Amir Zadeh
Louis-Philippe Morency
82
11
0
29 Jul 2022
Efficient NLP Model Finetuning via Multistage Data Filtering
Ouyang Xu
S. Ansari
F. Lin
Yangfeng Ji
74
4
0
28 Jul 2022
Measuring Causal Effects of Data Statistics on Language Model's 'Factual' Predictions
Yanai Elazar
Nora Kassner
Shauli Ravfogel
Amir Feder
Abhilasha Ravichander
Marius Mosbach
Yonatan Belinkov
Hinrich Schütze
Yoav Goldberg
CML
SyDa
MILM
110
55
0
28 Jul 2022
Rethinking Efficacy of Softmax for Lightweight Non-Local Neural Networks
Yooshin Cho
Youngsoo Kim
Hanbyel Cho
Jaesung Ahn
H. Hong
Junmo Kim
26
3
0
27 Jul 2022