Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2002.12804
Cited By
UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training
28 February 2020
Hangbo Bao
Li Dong
Furu Wei
Wenhui Wang
Nan Yang
Xiaodong Liu
Yu-Chiang Frank Wang
Songhao Piao
Jianfeng Gao
Ming Zhou
H. Hon
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training"
50 / 219 papers shown
Title
Information Extraction from Visually Rich Documents using LLM-based Organization of Documents into Independent Textual Segments
Aniket Bhattacharyya
Anurag Tripathi
Ujjal Das
Archan Karmakar
Amit Pathak
Maneesh Gupta
2
0
0
18 May 2025
Generative Sign-description Prompts with Multi-positive Contrastive Learning for Sign Language Recognition
Siyu Liang
Yunan Li
Wentian Xin
Huizhou Chen
Xujie Liu
Kang Liu
Qiguang Miao
27
0
0
05 May 2025
Robust Asymmetric Heterogeneous Federated Learning with Corrupted Clients
Xiuwen Fang
Mang Ye
Bo Du
FedML
74
1
0
12 Mar 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Min Zhang
LM&MA
AILaw
98
154
0
28 Jan 2025
Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?
Lewen Yang
Xuanyu Zhou
Juao Fan
Xinyi Xie
Shengxin Zhu
AI4CE
64
0
0
27 Nov 2024
SIRA: Scalable Inter-frame Relation and Association for Radar Perception
Ryoma Yataka
Peng Wang
P. Boufounos
R. Takahashi
43
4
0
04 Nov 2024
Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction
Nicholas Walker
29
0
0
23 Oct 2024
ReLayout: Towards Real-World Document Understanding via Layout-enhanced Pre-training
Zhouqiang Jiang
Bowen Wang
Junhao Chen
Yuta Nakashima
30
2
0
14 Oct 2024
A Novel LLM-based Two-stage Summarization Approach for Long Dialogues
Yuan-Jhe Yin
Bo-Yu Chen
Berlin Chen
28
3
0
09 Oct 2024
Towards Robust Vision Transformer via Masked Adaptive Ensemble
Fudong Lin
Jiadong Lou
Xu Yuan
Nianfeng Tzeng
ViT
AAML
36
1
0
22 Jul 2024
KEHRL: Learning Knowledge-Enhanced Language Representations with Hierarchical Reinforcement Learning
Dongyang Li
Taolin Zhang
Longtao Huang
Chengyu Wang
Xiaofeng He
Hui Xue
KELM
OffRL
33
0
0
24 Jun 2024
Capturing Temporal Components for Time Series Classification
Venkata Ragavendra Vavilthota
Ranjith Ramanathan
Sathyanarayanan N. Aakur
28
0
0
20 Jun 2024
Large Language Models for Education: A Survey
Hanyi Xu
Wensheng Gan
Zhenlian Qi
Jiayang Wu
Philip S. Yu
AI4Ed
ELM
62
14
0
12 May 2024
Multi-Head Mixture-of-Experts
Xun Wu
Shaohan Huang
Wenhui Wang
Furu Wei
MoE
39
12
0
23 Apr 2024
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training
Jin Gao
Shubo Lin
Shaoru Wang
Yutong Kou
Zeming Li
Liang Li
Congxuan Zhang
Xiaoqin Zhang
Yizheng Wang
Weiming Hu
47
1
0
18 Apr 2024
Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies
Benjue Weng
LM&MA
46
8
0
13 Apr 2024
Emerging Property of Masked Token for Effective Pre-training
Hyesong Choi
Hunsang Lee
Seyoung Joung
Hyejin Park
Jiyeong Kim
Dongbo Min
36
9
0
12 Apr 2024
Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training
Hyesong Choi
Hyejin Park
Kwang Moo Yi
Sungmin Cha
Dongbo Min
39
9
0
12 Apr 2024
Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction
Inhwan Bae
Junoh Lee
Hae-Gon Jeon
36
15
0
27 Mar 2024
Transformers and Language Models in Form Understanding: A Comprehensive Review of Scanned Document Analysis
Abdelrahman Abdallah
Daniel Eberharter
Zoe Pfister
Adam Jatowt
40
12
0
06 Mar 2024
Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models
Changyu Chen
Xiting Wang
Ting-En Lin
Ang Lv
Yuchuan Wu
Xin Gao
Ji-Rong Wen
Rui Yan
Yongbin Li
ReLM
LRM
31
9
0
04 Mar 2024
Leveraging Large Language Models for Learning Complex Legal Concepts through Storytelling
Hang Jiang
Xiajie Zhang
Robert Mahari
Daniel Kessler
Eric Ma
...
Irene Li
Alex Pentland
Yoon Kim
Deb Roy
Jad Kabbara
AILaw
30
22
0
26 Feb 2024
On Leveraging Encoder-only Pre-trained Language Models for Effective Keyphrase Generation
Di Wu
Wasi Uddin Ahmad
Kai-Wei Chang
35
3
0
21 Feb 2024
Language Model Adaptation to Specialized Domains through Selective Masking based on Genre and Topical Characteristics
Anas Belfathi
Ygor Gallina
Nicolas Hernandez
Richard Dufour
Laura Monceaux
44
1
0
19 Feb 2024
Text-to-Code Generation with Modality-relative Pre-training
Fenia Christopoulou
Guchun Zhang
Gerasimos Lampouras
AI4TS
26
1
0
08 Feb 2024
ELLA-V: Stable Neural Codec Language Modeling with Alignment-guided Sequence Reordering
Ya-Zhen Song
Zhuo Chen
Xiaofei Wang
Ziyang Ma
Xie Chen
AuLLM
21
36
0
14 Jan 2024
xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein
Bo Chen
Xingyi Cheng
Pan Li
Yangli-ao Geng
Jing Gong
...
Chiming Liu
Aohan Zeng
Yuxiao Dong
Jie Tang
Leo T. Song
42
101
0
11 Jan 2024
BIM: Block-Wise Self-Supervised Learning with Masked Image Modeling
Yixuan Luo
Mengye Ren
Sai Qian Zhang
28
0
0
28 Nov 2023
Enhancing Document Information Analysis with Multi-Task Pre-training: A Robust Approach for Information Extraction in Visually-Rich Documents
Tofik Ali
Partha Pratim Roy
16
0
0
25 Oct 2023
GenKIE: Robust Generative Multimodal Document Key Information Extraction
Panfeng Cao
Ye Wang
Qiang Zhang
Zaiqiao Meng
SyDa
29
6
0
24 Oct 2023
Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning
Hao Wang
Xiahua Chen
Rui-cang Wang
Chenhui Chu
27
0
0
23 Oct 2023
InferDPT: Privacy-Preserving Inference for Black-box Large Language Model
Meng Tong
Kejiang Chen
Jie Zhang
Yuang Qi
Weiming Zhang
Neng H. Yu
Tianwei Zhang
Zhikun Zhang
SILM
38
2
0
18 Oct 2023
Multi-Stage Pre-training Enhanced by ChatGPT for Multi-Scenario Multi-Domain Dialogue Summarization
Weixiao Zhou
Gengyao Li
Xianfu Cheng
Xinnian Liang
Junnan Zhu
Feifei Zhai
Zhoujun Li
39
6
0
16 Oct 2023
Surveying the Landscape of Text Summarization with Deep Learning: A Comprehensive Review
Guanghua Wang
Weili Wu
AI4TS
AILaw
38
3
0
13 Oct 2023
PolyTask: Learning Unified Policies through Behavior Distillation
Siddhant Haldar
Lerrel Pinto
28
7
0
12 Oct 2023
Fast-ELECTRA for Efficient Pre-training
Chengyu Dong
Liyuan Liu
Hao Cheng
Jingbo Shang
Jianfeng Gao
Xiaodong Liu
44
2
0
11 Oct 2023
MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation
Xinda Wu
Zhijie Huang
Kejun Zhang
Jiaxing Yu
Xu Tan
Tieyao Zhang
Zihao Wang
Lingyun Sun
24
5
0
19 Sep 2023
MMST-ViT: Climate Change-aware Crop Yield Prediction via Multi-Modal Spatial-Temporal Vision Transformer
Fudong Lin
Summer Crawford
Kaleb Guillot
Yihe Zhang
Yan Chen
...
Tri Setiyono
B. Tubana
Lu Peng
Magdy A. Bayoumi
N. Tzeng
42
20
0
16 Sep 2023
Learning to Predict Concept Ordering for Common Sense Generation
Tianhui Zhang
Danushka Bollegala
Bei Peng
LRM
18
2
0
12 Sep 2023
Multi-party Goal Tracking with LLMs: Comparing Pre-training, Fine-tuning, and Prompt Engineering
Angus Addlesee
Weronika Sieiñska
Nancie Gunson
Daniel Hernández García
Christian Dondrup
Oliver Lemon
19
20
0
29 Aug 2023
Artificial-Spiking Hierarchical Networks for Vision-Language Representation Learning
Ye-Ting Chen
Siyu Zhang
Yaoru Sun
Weijian Liang
Haoran Wang
40
0
0
18 Aug 2023
Challenges and Opportunities of Using Transformer-Based Multi-Task Learning in NLP Through ML Lifecycle: A Survey
Lovre Torbarina
Tin Ferkovic
Lukasz Roguski
Velimir Mihelčić
Bruno Šarlija
Z. Kraljevic
24
5
0
16 Aug 2023
Large Language Models for Information Retrieval: A Survey
Yutao Zhu
Huaying Yuan
Shuting Wang
Jiongnan Liu
Wenhan Liu
Chenlong Deng
Haonan Chen
Zhicheng Dou
Ji-Rong Wen
KELM
57
288
0
14 Aug 2023
Empowering NLG: Offline Reinforcement Learning for Informal Summarization in Online Domains
Zhiwei Tai
Po-Chuan Chen
OffRL
21
0
0
17 Jun 2023
Recipes for Sequential Pre-training of Multilingual Encoder and Seq2Seq Models
Saleh Soltan
Andrew Rosenbaum
Tobias Falke
Qin Lu
Anna Rumshisky
Wael Hamza
22
0
0
14 Jun 2023
UniPoll: A Unified Social Media Poll Generation Framework via Multi-Objective Optimization
Yixia Li
Rong Xiang
Yanlin Song
Jing Li
29
1
0
12 Jun 2023
How Can Recommender Systems Benefit from Large Language Models: A Survey
Jianghao Lin
Xinyi Dai
Yunjia Xi
Weiwen Liu
Bo Chen
...
Chenxu Zhu
Huifeng Guo
Yong Yu
Ruiming Tang
Weinan Zhang
LRM
30
196
0
09 Jun 2023
DocFormerv2: Local Features for Document Understanding
Srikar Appalaraju
Peng Tang
Qi Dong
Nishant Sankaran
Yichu Zhou
R. Manmatha
36
39
0
02 Jun 2023
Layout and Task Aware Instruction Prompt for Zero-shot Document Image Question Answering
Wenjin Wang
Yunhao Li
Yixin Ou
Yin Zhang
VLM
29
24
0
01 Jun 2023
Deliberate then Generate: Enhanced Prompting Framework for Text Generation
Bei Li
Rui Wang
Junliang Guo
Kaitao Song
Xuejiao Tan
Hany Hassan
Arul Menezes
Tong Xiao
Jiang Bian
JingBo Zhu
24
14
0
31 May 2023
1
2
3
4
5
Next