Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2104.08006
Cited By
v1
v2 (latest)
ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation
16 April 2021
Weizhen Qi
Yeyun Gong
Yu Yan
Can Xu
Bolun Yao
Bartuer Zhou
Biao Cheng
Daxin Jiang
Jiusheng Chen
Ruofei Zhang
Houqiang Li
Nan Duan
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation"
37 / 37 papers shown
Title
Unified Pre-training for Program Understanding and Generation
Wasi Uddin Ahmad
Saikat Chakraborty
Baishakhi Ray
Kai-Wei Chang
135
769
0
10 Mar 2021
GraphCodeBERT: Pre-training Code Representations with Data Flow
Daya Guo
Shuo Ren
Shuai Lu
Zhangyin Feng
Duyu Tang
...
Dawn Drain
Neel Sundaresan
Jian Yin
Daxin Jiang
M. Zhou
145
1,143
0
17 Sep 2020
A Large-Scale Chinese Short-Text Conversation Dataset
Yida Wang
Pei Ke
Yinhe Zheng
Kaili Huang
Yong Jiang
Xiaoyan Zhu
Minlie Huang
50
136
0
10 Aug 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
832
42,332
0
28 May 2020
MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization
Canwen Xu
Jiaxin Pei
Hongtao Wu
Yiyu Liu
Chenliang Li
MLLM
VLM
47
14
0
26 Apr 2020
CLUE: A Chinese Language Understanding Evaluation Benchmark
Liang Xu
Hai Hu
Xuanwei Zhang
Lu Li
Chenjie Cao
...
Cong Yue
Xinrui Zhang
Zhen-Yi Yang
Kyle Richardson
Zhenzhong Lan
ELM
87
386
0
13 Apr 2020
Learning a Simple and Effective Model for Multi-turn Response Generation with Auxiliary Tasks
Yufan Zhao
Can Xu
Wei Wu
Lei Yu
61
28
0
04 Apr 2020
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation
Yaobo Liang
Nan Duan
Yeyun Gong
Ning Wu
Fenfei Guo
...
Shuguang Liu
Fan Yang
Daniel Fernando Campos
Rangan Majumder
Ming Zhou
ELM
VLM
86
350
0
03 Apr 2020
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
Zhangyin Feng
Daya Guo
Duyu Tang
Nan Duan
Xiaocheng Feng
...
Linjun Shou
Bing Qin
Ting Liu
Daxin Jiang
Ming Zhou
165
2,637
0
19 Feb 2020
Multilingual Denoising Pre-training for Neural Machine Translation
Yinhan Liu
Jiatao Gu
Naman Goyal
Xian Li
Sergey Edunov
Marjan Ghazvininejad
M. Lewis
Luke Zettlemoyer
AI4CE
AIMat
124
1,811
0
22 Jan 2020
ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training
Weizhen Qi
Yu Yan
Yeyun Gong
Dayiheng Liu
Nan Duan
Jiusheng Chen
Ruofei Zhang
Ming Zhou
AI4TS
90
450
0
13 Jan 2020
Unsupervised Cross-lingual Representation Learning at Scale
Alexis Conneau
Kartikay Khandelwal
Naman Goyal
Vishrav Chaudhary
Guillaume Wenzek
Francisco Guzmán
Edouard Grave
Myle Ott
Luke Zettlemoyer
Veselin Stoyanov
226
6,565
0
05 Nov 2019
DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation
Yizhe Zhang
Siqi Sun
Michel Galley
Yen-Chun Chen
Chris Brockett
Xiang Gao
Jianfeng Gao
Jingjing Liu
W. Dolan
VLM
189
1,524
0
01 Nov 2019
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
M. Lewis
Yinhan Liu
Naman Goyal
Marjan Ghazvininejad
Abdel-rahman Mohamed
Omer Levy
Veselin Stoyanov
Luke Zettlemoyer
AIMat
VLM
260
10,848
0
29 Oct 2019
PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable
Siqi Bao
H. He
Fan Wang
Hua Wu
Haifeng Wang
60
270
0
17 Oct 2019
CodeSearchNet Challenge: Evaluating the State of Semantic Code Search
Hamel Husain
Hongqiu Wu
Tiferet Gazit
Miltiadis Allamanis
Marc Brockschmidt
ELM
130
1,085
0
20 Sep 2019
Implicit Deep Latent Variable Models for Text Generation
Le Fang
Chunyuan Li
Jianfeng Gao
Wen Dong
Changyou Chen
DRL
51
64
0
30 Aug 2019
Text Summarization with Pretrained Encoders
Yang Liu
Mirella Lapata
MILM
452
1,451
0
22 Aug 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
668
24,528
0
26 Jul 2019
MASS: Masked Sequence to Sequence Pre-training for Language Generation
Kaitao Song
Xu Tan
Tao Qin
Jianfeng Lu
Tie-Yan Liu
117
966
0
07 May 2019
Audio-Visual Scene-Aware Dialog
Huda AlAmri
Vincent Cartillier
Abhishek Das
Jue Wang
A. Cherian
...
Tim K. Marks
Chiori Hori
Peter Anderson
Stefan Lee
Devi Parikh
VGen
54
194
0
25 Jan 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
95,114
0
11 Oct 2018
Global Encoding for Abstractive Summarization
Junyang Lin
Xu Sun
Shuming Ma
Qi Su
47
146
0
10 May 2018
Query and Output: Generating Words by Querying Distributed Word Representations for Paraphrase Generation
Shuming Ma
Xu Sun
Wei Li
Sujian Li
Wenjie Li
Xuancheng Ren
50
62
0
05 Mar 2018
Personalizing Dialogue Agents: I have a dog, do you have pets too?
Saizheng Zhang
Emily Dinan
Jack Urbanek
Arthur Szlam
Douwe Kiela
Jason Weston
118
1,464
0
22 Jan 2018
DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset
Yanran Li
Hui Su
Xiaoyu Shen
Wenjie Li
Ziqiang Cao
Shuzi Niu
66
1,304
0
11 Oct 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
728
132,199
0
12 Jun 2017
SQuAD: 100,000+ Questions for Machine Comprehension of Text
Pranav Rajpurkar
Jian Zhang
Konstantin Lopyrev
Percy Liang
RALM
296
8,160
0
16 Jun 2016
A Neural Attention Model for Abstractive Sentence Summarization
Alexander M. Rush
S. Chopra
Jason Weston
CVBM
186
2,701
0
02 Sep 2015
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
400
7,968
0
17 Aug 2015
A Neural Conversational Model
Oriol Vinyals
Quoc V. Le
BDL
139
1,768
0
19 Jun 2015
LCSTS: A Large Scale Chinese Short Text Summarization Dataset
Baotian Hu
Qingcai Chen
Fangze Zhu
74
339
0
19 Jun 2015
Teaching Machines to Read and Comprehend
Karl Moritz Hermann
Tomás Kociský
Edward Grefenstette
L. Espeholt
W. Kay
Mustafa Suleyman
Phil Blunsom
347
3,551
0
10 Jun 2015
Neural Responding Machine for Short-Text Conversation
Lifeng Shang
Zhengdong Lu
Hang Li
117
1,146
0
09 Mar 2015
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
437
20,584
0
10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
575
27,325
0
01 Sep 2014
LexRank: Graph-based Lexical Centrality as Salience in Text Summarization
Günes Erkan
Dragomir R. Radev
197
3,097
0
09 Sep 2011
1