ResearchTrend.AI
UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

28 February 2020
Hangbo Bao
Li Dong
Furu Wei
Wenhui Wang
Nan Yang
Xiaodong Liu
Yu Wang
Songhao Piao
Jianfeng Gao
Ming Zhou
H. Hon
    AI4CE

Papers citing "UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training"

50 / 219 papers shown
Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding
Chuwei Luo
Guozhi Tang
Qi Zheng
Cong Yao
Lianwen Jin
Chenliang Li
Yang Xue
Luo Si
27
16
0
27 Jun 2022
Language Models are General-Purpose Interfaces
Y. Hao
Haoyu Song
Li Dong
Shaohan Huang
Zewen Chi
Wenhui Wang
Shuming Ma
Furu Wei
MLLM
30
96
0
13 Jun 2022
Learning to Ask Like a Physician
Eric P. Lehman
Vladislav Lialin
K. Y. Legaspi
Anne Janelle R. Sy
Patricia Therese S. Pile
...
Anna Rumshisky
Jenifer Liang
Preethi Raghavan
Leo Anthony Celi
Peter Szolovits
OOD
25
19
0
06 Jun 2022
Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation
Jin Xu
Xiaojiang Liu
Jianhao Yan
Deng Cai
Huayang Li
Jian Li
30
72
0
06 Jun 2022
ZusammenQA: Data Augmentation with Specialized Models for Cross-lingual Open-retrieval Question Answering System
Chia-Chien Hung
Tommaso Green
Robert Litschko
Tornike Tsereteli
Sotaro Takeshita
Marco Bombieri
Goran Glavaš
Simone Paolo Ponzetto
RALM
LRM
11
2
0
30 May 2022
E2S2: Encoding-Enhanced Sequence-to-Sequence Pretraining for Language Understanding and Generation
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
54
27
0
30 May 2022
Your Transformer May Not be as Powerful as You Expect
Shengjie Luo
Shanda Li
Shuxin Zheng
Tie-Yan Liu
Liwei Wang
Di He
70
51
0
26 May 2022
Prototypical Calibration for Few-shot Learning of Language Models
Zhixiong Han
Y. Hao
Li Dong
Yutao Sun
Furu Wei
178
54
0
20 May 2022
Trading Positional Complexity vs. Deepness in Coordinate Networks
Jianqiao Zheng
Sameera Ramasinghe
Xueqian Li
Simon Lucey
31
18
0
18 May 2022
StableMoE: Stable Routing Strategy for Mixture of Experts
Damai Dai
Li Dong
Shuming Ma
Bo Zheng
Zhifang Sui
Baobao Chang
Furu Wei
MoE
21
62
0
18 Apr 2022
Knowledgeable Salient Span Mask for Enhancing Language Models as Knowledge Base
Cunxiang Wang
Fuli Luo
Yanyang Li
Runxin Xu
Fei Huang
Yue Zhang
KELM
36
2
0
17 Apr 2022
METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals
Payal Bajaj
Chenyan Xiong
Guolin Ke
Xiaodong Liu
Di He
Saurabh Tiwary
Tie-Yan Liu
Paul N. Bennett
Xia Song
Jianfeng Gao
50
32
0
13 Apr 2022
Contrastive Demonstration Tuning for Pre-trained Language Models
Xiaozhuan Liang
Ningyu Zhang
Shuyang Cheng
Zhenru Zhang
Chuanqi Tan
Huajun Chen
VLM
ALM
VPVLM
47
9
0
09 Apr 2022
BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model
Hongyi Yuan
Zheng Yuan
Ruyi Gan
Jiaxing Zhang
Yutao Xie
Sheng Yu
LM&MA
33
123
0
08 Apr 2022
Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators
Yu Meng
Chenyan Xiong
Payal Bajaj
Saurabh Tiwary
Paul N. Bennett
Jiawei Han
Xia Song
MoE
39
16
0
07 Apr 2022
Question Generation for Reading Comprehension Assessment by Modeling How and What to Ask
Bilal Ghanem
Lauren Lutz Coleman
Julia Rivard Dexter
Spencer McIntosh von der Ohe
Alona Fyshe
AI4Ed
25
27
0
06 Apr 2022
SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
Kai-Wei Chang
Wei-Cheng Tseng
Shang-Wen Li
Hung-yi Lee
27
22
0
31 Mar 2022
Towards Few-shot Entity Recognition in Document Images: A Label-aware Sequence-to-Sequence Framework
Zilong Wang
Jingbo Shang
36
10
0
30 Mar 2022
Towards Interpretable Deep Reinforcement Learning Models via Inverse Reinforcement Learning
Yuansheng Xie
Soroush Vosoughi
Saeed Hassanpour
26
2
0
30 Mar 2022
Beyond Fixation: Dynamic Window Visual Transformer
Pengzhen Ren
Changlin Li
Guangrun Wang
Yun Xiao
Qing Du
Xiaodan Liang
Xiaojun Chang
ViT
28
32
0
24 Mar 2022
A Feasibility Study of Answer-Agnostic Question Generation for Education
Liam Dugan
E. Miltsakaki
Shriyash Upadhyay
Etan Ginsberg
Hannah Gonzalez
Dayheon Choi
Chuning Yuan
Chris Callison-Burch
29
12
0
16 Mar 2022
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Chen-Yu Lee
Chun-Liang Li
Timothy Dozat
Vincent Perot
Guolong Su
Nan Hua
Joshua Ainslie
Renshen Wang
Yasuhisa Fujii
Tomas Pfister
25
77
0
16 Mar 2022
MetaFormer: A Unified Meta Framework for Fine-Grained Recognition
Qishuai Diao
Yi-Xin Jiang
Bin Wen
Jianxiang Sun
Zehuan Yuan
39
60
0
05 Mar 2022
DeepNet: Scaling Transformers to 1,000 Layers
Hongyu Wang
Shuming Ma
Li Dong
Shaohan Huang
Dongdong Zhang
Furu Wei
MoE
AI4CE
26
156
0
01 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding
Jiapeng Wang
Lianwen Jin
Kai Ding
VLM
35
138
0
28 Feb 2022
NoisyTune: A Little Noise Can Help You Finetune Pretrained Language Models Better
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
Xing Xie
25
58
0
24 Feb 2022
Aggregating Global Features into Local Vision Transformer
Krushi Patel
A. Bur
Fengju Li
Guanghui Wang
ViT
33
34
0
30 Jan 2022
Unified Question Generation with Continual Lifelong Learning
Wei Yuan
Hongzhi Yin
Tieke He
Tong Chen
Qiufeng Wang
Li-zhen Cui
38
11
0
24 Jan 2022
Leaf: Multiple-Choice Question Generation
Kristiyan Vachev
Momchil Hardalov
Georgi Karadzhov
Georgi Georgiev
Ivan Koychev
Preslav Nakov
AI4Ed
30
21
0
22 Jan 2022
Pretrained Language Models for Text Generation: A Survey
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
AI4CE
36
127
0
14 Jan 2022
SeMask: Semantically Masked Transformers for Semantic Segmentation
Jitesh Jain
Anukriti Singh
Nikita Orlov
Zilong Huang
Jiachen Li
Steven Walton
Humphrey Shi
ViT
29
92
0
23 Dec 2021
Learned Queries for Efficient Local Attention
Moab Arar
Ariel Shamir
Amit H. Bermano
ViT
44
29
0
21 Dec 2021
Spiral Language Modeling
Yong Cao
Yukun Feng
Shaohui Kuang
Gu Xu
21
0
0
20 Dec 2021
Diaformer: Automatic Diagnosis via Symptoms Sequence Generation
Junying Chen
Dongfang Li
Qingcai Chen
Wenxiu Zhou
Xin Liu
MedIm
30
30
0
20 Dec 2021
Distilled Dual-Encoder Model for Vision-Language Understanding
Zekun Wang
Wenhui Wang
Haichao Zhu
Ming Liu
Bing Qin
Furu Wei
VLM
FedML
29
30
0
16 Dec 2021
KGR^4: Retrieval, Retrospect, Refine and Rethink for Commonsense Generation
Xin Liu
Dayiheng Liu
Baosong Yang
Haibo Zhang
Junwei Ding
Wenqing Yao
Weihua Luo
Haiying Zhang
Jinsong Su
LRM
32
8
0
15 Dec 2021
Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation
Tianyi Liu
Zuxuan Wu
Wenhan Xiong
Jingjing Chen
Yu-Gang Jiang
VLM
MLLM
32
10
0
10 Dec 2021
FLAVA: A Foundational Language And Vision Alignment Model
Amanpreet Singh
Ronghang Hu
Vedanuj Goswami
Guillaume Couairon
Wojciech Galuba
Marcus Rohrbach
Douwe Kiela
CLIP
VLM
40
690
0
08 Dec 2021
BEVT: BERT Pretraining of Video Transformers
Rui Wang
Dongdong Chen
Zuxuan Wu
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Yu-Gang Jiang
Luowei Zhou
Lu Yuan
ViT
39
203
0
02 Dec 2021
Improving Controllability of Educational Question Generation by Keyword Provision
Ying-Hong Chan
Ho-Lam Chung
Yao-Chung Fan
27
3
0
02 Dec 2021
Exploring Low-Cost Transformer Model Compression for Large-Scale Commercial Reply Suggestions
Vaishnavi Shrivastava
Radhika Gaonkar
Shashank Gupta
Abhishek Jha
14
0
0
27 Nov 2021
Swin Transformer V2: Scaling Up Capacity and Resolution
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
...
Yue Cao
Zheng-Wei Zhang
Li Dong
Furu Wei
B. Guo
ViT
82
1,754
0
18 Nov 2021
VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts
Hangbo Bao
Wenhui Wang
Li Dong
Qiang Liu
Owais Khan Mohammed
Kriti Aggarwal
Subhojit Som
Furu Wei
VLM
MLLM
MoE
20
533
0
03 Nov 2021
EventNarrative: A large-scale Event-centric Dataset for Knowledge Graph-to-Text Generation
Anthony Colas
A. Sadeghian
Yue Wang
D. Wang
20
21
0
30 Oct 2021
s2s-ft: Fine-Tuning Pretrained Transformer Encoders for Sequence-to-Sequence Learning
Hangbo Bao
Li Dong
Wenhui Wang
Nan Yang
Furu Wei
16
11
0
26 Oct 2021
Improving Non-autoregressive Generation with Mixup Training
Ting Jiang
Shaohan Huang
Zihan Zhang
Deqing Wang
Fuzhen Zhuang
Furu Wei
Haizhen Huang
Liangjie Zhang
Qi Zhang
24
8
0
21 Oct 2021
End-to-End Segmentation-based News Summarization
Yang Liu
Chenguang Zhu
Michael Zeng
VLM
46
26
0
15 Oct 2021
EventBERT: A Pre-Trained Model for Event Correlation Reasoning
Yucheng Zhou
Xiubo Geng
Tao Shen
Guodong Long
Daxin Jiang
42
46
0
13 Oct 2021
Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training
Bo Zheng
Li Dong
Shaohan Huang
Saksham Singhal
Wanxiang Che
Ting Liu
Xia Song
Furu Wei
VLM
21
22
0
15 Sep 2021
KFCNet: Knowledge Filtering and Contrastive Learning Network for Generative Commonsense Reasoning
Haonan Li
Yeyun Gong
Jian Jiao
Ruofei Zhang
Timothy Baldwin
Nan Duan
OffRL
60
6
0
14 Sep 2021