2002.12804
UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training
28 February 2020
Hangbo Bao
Li Dong
Furu Wei
Wenhui Wang
Nan Yang
Xiaodong Liu
Yu-Chiang Frank Wang
Songhao Piao
Jianfeng Gao
Ming Zhou
H. Hon
AI4CE
Papers citing
"UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training"
50 / 105 papers shown
Title
Information Extraction from Visually Rich Documents using LLM-based Organization of Documents into Independent Textual Segments
Aniket Bhattacharyya
Anurag Tripathi
Ujjal Das
Archan Karmakar
Amit Pathak
Maneesh Gupta
2
0
0
18 May 2025
Generative Sign-description Prompts with Multi-positive Contrastive Learning for Sign Language Recognition
Siyu Liang
Yunan Li
Wentian Xin
Huizhou Chen
Xujie Liu
Kang Liu
Qiguang Miao
27
0
0
05 May 2025
Robust Asymmetric Heterogeneous Federated Learning with Corrupted Clients
Xiuwen Fang
Mang Ye
Bo Du
FedML
74
1
0
12 Mar 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Min Zhang
LM&MA
AILaw
98
154
0
28 Jan 2025
Capturing Temporal Components for Time Series Classification
Venkata Ragavendra Vavilthota
Ranjith Ramanathan
Sathyanarayanan N. Aakur
28
0
0
20 Jun 2024
Multi-Head Mixture-of-Experts
Xun Wu
Shaohan Huang
Wenhui Wang
Furu Wei
MoE
39
12
0
23 Apr 2024
Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies
Benjue Weng
LM&MA
46
8
0
13 Apr 2024
Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training
Hyesong Choi
Hyejin Park
Kwang Moo Yi
Sungmin Cha
Dongbo Min
39
9
0
12 Apr 2024
Language Model Adaptation to Specialized Domains through Selective Masking based on Genre and Topical Characteristics
Anas Belfathi
Ygor Gallina
Nicolas Hernandez
Richard Dufour
Laura Monceaux
44
1
0
19 Feb 2024
Text-to-Code Generation with Modality-relative Pre-training
Fenia Christopoulou
Guchun Zhang
Gerasimos Lampouras
AI4TS
26
1
0
08 Feb 2024
Surveying the Landscape of Text Summarization with Deep Learning: A Comprehensive Review
Guanghua Wang
Weili Wu
AI4TS
AILaw
38
3
0
13 Oct 2023
Neural Summarization of Electronic Health Records
Koyena Pal
Seyed Ali Bahrainian
Laura Y. Mercurio
Carsten Eickhoff
25
3
0
24 May 2023
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
Jiaao Chen
Aston Zhang
Mu Li
Alexander J. Smola
Diyi Yang
DiffM
32
17
0
10 Apr 2023
CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Haoxuan You
Mandy Guo
Zhecan Wang
Kai-Wei Chang
Jason Baldridge
Jiahui Yu
DiffM
49
13
0
23 Mar 2023
Parameter-Efficient Fine-Tuning Design Spaces
Jiaao Chen
Aston Zhang
Xingjian Shi
Mu Li
Alexander J. Smola
Diyi Yang
42
59
0
04 Jan 2023
Local Learning on Transformers via Feature Reconstruction
P. Pathak
Jingwei Zhang
Dimitris Samaras
ViT
24
5
0
29 Dec 2022
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
Jian Yang
Shuming Ma
Li Dong
Shaohan Huang
Haoyang Huang
Yuwei Yin
Dongdong Zhang
Liqun Yang
Furu Wei
Zhoujun Li
SyDa
AI4CE
32
25
0
20 Dec 2022
Multimodal Vision Transformers with Forced Attention for Behavior Analysis
Tanay Agrawal
Michal Balazia
Philippe Muller
François Brémond
ViT
23
9
0
07 Dec 2022
Unified Multimodal Model with Unlikelihood Training for Visual Dialog
Zihao Wang
Junli Wang
Changjun Jiang
MLLM
29
10
0
23 Nov 2022
Proactive Detractor Detection Framework Based on Message-Wise Sentiment Analysis Over Customer Support Interactions
J. S. Gallo
Jesus Solano
Javier A. García
David Zarruk-Valencia
Alejandro Correa-Bahnsen
19
1
0
08 Nov 2022
Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models
Hao Liu
Xinyang Geng
Lisa Lee
Igor Mordatch
Sergey Levine
Sharan Narang
Pieter Abbeel
KELM
CLL
35
2
0
24 Oct 2022
S2WAT: Image Style Transfer via Hierarchical Vision Transformer using Strips Window Attention
Chi Zhang
Lu Zhou
Lei Wang
Zaiyan Dai
Jun Yang
ViT
34
23
0
22 Oct 2022
Metric-guided Distillation: Distilling Knowledge from the Metric to Ranker and Retriever for Generative Commonsense Reasoning
Xingwei He
Yeyun Gong
Alex Jin
Weizhen Qi
Hang Zhang
Jian Jiao
Bartuer Zhou
Biao Cheng
Sm Yiu
Nan Duan
38
11
0
21 Oct 2022
A Unified View of Masked Image Modeling
Zhiliang Peng
Li Dong
Hangbo Bao
QiXiang Ye
Furu Wei
VLM
56
35
0
19 Oct 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding
Qiming Peng
Yinxu Pan
Wenjin Wang
Bin Luo
Zhenyu Zhang
...
Shi Feng
Yu Sun
Hao Tian
Hua Wu
Haifeng Wang
13
83
0
12 Oct 2022
Generative Language Models for Paragraph-Level Question Generation
Asahi Ushio
Fernando Alva-Manchego
Jose Camacho-Collados
ELM
13
45
0
08 Oct 2022
XDoc: Unified Pre-training for Cross-Format Document Understanding
Jingye Chen
Tengchao Lv
Lei Cui
Changrong Zhang
Furu Wei
50
13
0
06 Oct 2022
Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation
Xingdi Yuan
Tong Wang
Yen-Hsiang Wang
Emery Fine
Rania Abdelghani
Pauline Lucas
Hélène Sauzéon
Pierre-Yves Oudeyer
30
29
0
22 Sep 2022
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding
Wenjin Wang
Zhengjie Huang
Bin Luo
Qianglong Chen
Qiming Peng
...
Weichong Yin
Shi Feng
Yu Sun
Dianhai Yu
Yin Zhang
ViT
35
11
0
18 Sep 2022
Enhancing Pre-trained Models with Text Structure Knowledge for Question Generation
Zichen Wu
Xin Jia
Fanyi Qu
Yunfang Wu
21
4
0
09 Sep 2022
TransPolymer: a Transformer-based language model for polymer property predictions
Changwen Xu
Yuyang Wang
A. Farimani
27
86
0
03 Sep 2022
Learning Better Masking for Better Language Model Pre-training
Dongjie Yang
Zhuosheng Zhang
Hai Zhao
37
15
0
23 Aug 2022
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Wenhui Wang
Hangbo Bao
Li Dong
Johan Bjorck
Zhiliang Peng
...
Kriti Aggarwal
O. Mohammed
Saksham Singhal
Subhojit Som
Furu Wei
MLLM
VLM
ViT
54
629
0
22 Aug 2022
BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers
Zhiliang Peng
Li Dong
Hangbo Bao
QiXiang Ye
Furu Wei
29
306
0
12 Aug 2022
DeepGen: Diverse Search Ad Generation and Real-Time Customization
Konstantin Golobokov
Junyi Chai
Victor Ye Dong
Mandy Gu
Bingyu Chi
Jie Cao
Yulan Yan
Yi Liu
31
8
0
06 Aug 2022
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model
Saleh Soltan
Shankar Ananthakrishnan
Jack G. M. FitzGerald
Rahul Gupta
Wael Hamza
...
Mukund Sridhar
Fabian Triefenbach
Apurv Verma
Gokhan Tur
Premkumar Natarajan
56
82
0
02 Aug 2022
MAR: Masked Autoencoders for Efficient Action Recognition
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Xiang Wang
Yuehuang Wang
Yiliang Lv
Changxin Gao
Nong Sang
32
42
0
24 Jul 2022
PanGu-Coder: Program Synthesis with Function-Level Language Modeling
Fenia Christopoulou
Gerasimos Lampouras
Milan Gritta
Guchun Zhang
Yinpeng Guo
...
Guangtai Liang
Jia Wei
Xin Jiang
Qianxiang Wang
Qun Liu
ELM
SyDa
ALM
45
74
0
22 Jul 2022
Language Models are General-Purpose Interfaces
Y. Hao
Haoyu Song
Li Dong
Shaohan Huang
Zewen Chi
Wenhui Wang
Shuming Ma
Furu Wei
MLLM
30
96
0
13 Jun 2022
E2S2: Encoding-Enhanced Sequence-to-Sequence Pretraining for Language Understanding and Generation
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
49
27
0
30 May 2022
Prototypical Calibration for Few-shot Learning of Language Models
Zhixiong Han
Y. Hao
Li Dong
Yutao Sun
Furu Wei
178
54
0
20 May 2022
Trading Positional Complexity vs. Deepness in Coordinate Networks
Jianqiao Zheng
Sameera Ramasinghe
Xueqian Li
Simon Lucey
31
18
0
18 May 2022
Knowledgeable Salient Span Mask for Enhancing Language Models as Knowledge Base
Cunxiang Wang
Fuli Luo
Yanyang Li
Runxin Xu
Fei Huang
Yue Zhang
KELM
36
2
0
17 Apr 2022
METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals
Payal Bajaj
Chenyan Xiong
Guolin Ke
Xiaodong Liu
Di He
Saurabh Tiwary
Tie-Yan Liu
Paul N. Bennett
Xia Song
Jianfeng Gao
50
32
0
13 Apr 2022
BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model
Hongyi Yuan
Zheng Yuan
Ruyi Gan
Jiaxing Zhang
Yutao Xie
Sheng Yu
LM&MA
33
123
0
08 Apr 2022
SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
Kai-Wei Chang
Wei-Cheng Tseng
Shang-Wen Li
Hung-yi Lee
27
22
0
31 Mar 2022
Towards Interpretable Deep Reinforcement Learning Models via Inverse Reinforcement Learning
Yuansheng Xie
Soroush Vosoughi
Saeed Hassanpour
26
2
0
30 Mar 2022
Beyond Fixation: Dynamic Window Visual Transformer
Pengzhen Ren
Changlin Li
Guangrun Wang
Yun Xiao
Qing Du
Xiaodan Liang
Xiaojun Chang
ViT
28
32
0
24 Mar 2022
A Feasibility Study of Answer-Agnostic Question Generation for Education
Liam Dugan
E. Miltsakaki
Shriyash Upadhyay
Etan Ginsberg
Hannah Gonzalez
Dayheon Choi
Chuning Yuan
Chris Callison-Burch
29
12
0
16 Mar 2022
MetaFormer: A Unified Meta Framework for Fine-Grained Recognition
Qishuai Diao
Yi-Xin Jiang
Bin Wen
Jianxiang Sun
Zehuan Yuan
39
60
0
05 Mar 2022