ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.02683
  4. Cited By
Unsupervised Pretraining for Sequence to Sequence Learning

Unsupervised Pretraining for Sequence to Sequence Learning

8 November 2016
Prajit Ramachandran
Peter J. Liu
Quoc V. Le
    SSL
    AIMat
ArXivPDFHTML

Papers citing "Unsupervised Pretraining for Sequence to Sequence Learning"

50 / 61 papers shown
Title
Improving Language Model Integration for Neural Machine Translation
Improving Language Model Integration for Neural Machine Translation
Christian Herold
Yingbo Gao
Mohammad Zeineldeen
Hermann Ney
29
2
0
08 Jun 2023
Zero-Shot Learning for Requirements Classification: An Exploratory Study
Zero-Shot Learning for Requirements Classification: An Exploratory Study
Waad Alhoshan
Alessio Ferrari
Liping Zhao
VLM
17
39
0
09 Feb 2023
Uncertainty-DTW for Time Series and Sequences
Uncertainty-DTW for Time Series and Sequences
Lei Wang
Piotr Koniusz
13
33
0
30 Oct 2022
Rethinking Clustering-Based Pseudo-Labeling for Unsupervised
  Meta-Learning
Rethinking Clustering-Based Pseudo-Labeling for Unsupervised Meta-Learning
Xingping Dong
Jianbing Shen
Ling Shao
32
7
0
27 Sep 2022
DBT-DMAE: An Effective Multivariate Time Series Pre-Train Model under
  Missing Data
DBT-DMAE: An Effective Multivariate Time Series Pre-Train Model under Missing Data
Kai Zhang
Qinmin Yang
Chong Li
AI4TS
19
0
0
16 Sep 2022
E2S2: Encoding-Enhanced Sequence-to-Sequence Pretraining for Language
  Understanding and Generation
E2S2: Encoding-Enhanced Sequence-to-Sequence Pretraining for Language Understanding and Generation
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
54
27
0
30 May 2022
Graph Enhanced BERT for Query Understanding
Graph Enhanced BERT for Query Understanding
Juanhui Li
Yao Ma
Weizhen Zeng
Suqi Cheng
Jiliang Tang
Shuaiqiang Wang
Dawei Yin
29
7
0
03 Apr 2022
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Andy Zeng
Maria Attarian
Brian Ichter
K. Choromanski
Adrian S. Wong
...
Michael S. Ryoo
Vikas Sindhwani
Johnny Lee
Vincent Vanhoucke
Peter R. Florence
ReLM
LRM
49
574
0
01 Apr 2022
Survey of Low-Resource Machine Translation
Survey of Low-Resource Machine Translation
Barry Haddow
Rachel Bawden
Antonio Valerio Miceli Barone
Jindvrich Helcl
Alexandra Birch
AIMat
39
150
0
01 Sep 2021
A Survey on Low-Resource Neural Machine Translation
A Survey on Low-Resource Neural Machine Translation
Rui Wang
Xu Tan
Renqian Luo
Tao Qin
Tie-Yan Liu
3DV
38
58
0
09 Jul 2021
Neural Machine Translation for Low-Resource Languages: A Survey
Neural Machine Translation for Low-Resource Languages: A Survey
Surangika Ranathunga
E. Lee
Marjana Prifti Skenduli
Ravi Shekhar
Mehreen Alam
Rishemjit Kaur
40
236
0
29 Jun 2021
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning
  Architectures
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning Architectures
Sushant Singh
A. Mahmood
AI4TS
60
94
0
23 Mar 2021
BERT: A Review of Applications in Natural Language Processing and
  Understanding
BERT: A Review of Applications in Natural Language Processing and Understanding
M. V. Koroteev
VLM
25
196
0
22 Mar 2021
Code Generation from Natural Language with Less Prior and More
  Monolingual Data
Code Generation from Natural Language with Less Prior and More Monolingual Data
Sajad Norouzi
Keyi Tang
Yanshuai Cao
20
19
0
01 Jan 2021
Improving Text Generation with Student-Forcing Optimal Transport
Improving Text Generation with Student-Forcing Optimal Transport
Guoyin Wang
Chunyuan Li
Jianqiao Li
Hao Fu
Yuh-Chen Lin
...
Ruiyi Zhang
Wenlin Wang
Dinghan Shen
Qian Yang
Lawrence Carin
OT
30
17
0
12 Oct 2020
A Mathematical Exploration of Why Language Models Help Solve Downstream
  Tasks
A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks
Nikunj Saunshi
Sadhika Malladi
Sanjeev Arora
25
87
0
07 Oct 2020
Improving the Accuracy of Global Forecasting Models using Time Series
  Data Augmentation
Improving the Accuracy of Global Forecasting Models using Time Series Data Augmentation
Kasun Bandara
Hansika Hewamalage
Yuan-Hao Liu
Yanfei Kang
Christoph Bergmeir
AI4TS
24
115
0
06 Aug 2020
Composed Fine-Tuning: Freezing Pre-Trained Denoising Autoencoders for
  Improved Generalization
Composed Fine-Tuning: Freezing Pre-Trained Denoising Autoencoders for Improved Generalization
Sang Michael Xie
Tengyu Ma
Percy Liang
35
13
0
29 Jun 2020
Recipes for Adapting Pre-trained Monolingual and Multilingual Models to
  Machine Translation
Recipes for Adapting Pre-trained Monolingual and Multilingual Models to Machine Translation
Asa Cooper Stickland
Xian Li
Marjan Ghazvininejad
36
44
0
30 Apr 2020
Pre-trained Models for Natural Language Processing: A Survey
Pre-trained Models for Natural Language Processing: A Survey
Xipeng Qiu
Tianxiang Sun
Yige Xu
Yunfan Shao
Ning Dai
Xuanjing Huang
LM&MA
VLM
243
1,452
0
18 Mar 2020
A Survey on Contextual Embeddings
A Survey on Contextual Embeddings
Qi Liu
Matt J. Kusner
Phil Blunsom
225
146
0
16 Mar 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM
AI4TS
AI4CE
22
138
0
18 Feb 2020
Multilingual Denoising Pre-training for Neural Machine Translation
Multilingual Denoising Pre-training for Neural Machine Translation
Yinhan Liu
Jiatao Gu
Naman Goyal
Xian Li
Sergey Edunov
Marjan Ghazvininejad
M. Lewis
Luke Zettlemoyer
AI4CE
AIMat
52
1,773
0
22 Jan 2020
A Comprehensive Survey of Multilingual Neural Machine Translation
A Comprehensive Survey of Multilingual Neural Machine Translation
Raj Dabre
Chenhui Chu
Anoop Kunchukuttan
LRM
36
33
0
04 Jan 2020
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive
  Summarization
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
Jingqing Zhang
Yao-Min Zhao
Mohammad Saleh
Peter J. Liu
RALM
3DGS
48
2,018
0
18 Dec 2019
FlauBERT: Unsupervised Language Model Pre-training for French
FlauBERT: Unsupervised Language Model Pre-training for French
Hang Le
Loïc Vial
Jibril Frej
Vincent Segonne
Maximin Coavoux
Benjamin Lecouteux
A. Allauzen
Benoît Crabbé
Laurent Besacier
D. Schwab
AI4CE
49
395
0
11 Dec 2019
Neural Machine Translation: A Review and Survey
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
25
312
0
04 Dec 2019
Pretrained Language Models for Document-Level Neural Machine Translation
Pretrained Language Models for Document-Level Neural Machine Translation
Liangyou Li
Xin Jiang
Qun Liu
31
19
0
08 Nov 2019
Domain, Translationese and Noise in Synthetic Data for Neural Machine
  Translation
Domain, Translationese and Noise in Synthetic Data for Neural Machine Translation
Nikolay Bogoychev
Rico Sennrich
16
50
0
06 Nov 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
129
19,529
0
23 Oct 2019
Deep Learning Based Chatbot Models
Deep Learning Based Chatbot Models
Richard Csaky
29
46
0
23 Aug 2019
Denoising based Sequence-to-Sequence Pre-training for Text Generation
Denoising based Sequence-to-Sequence Pre-training for Text Generation
Liang Wang
Wei-Ye Zhao
Ruoyu Jia
Sujian Li
Jingming Liu
VLM
AI4CE
42
37
0
22 Aug 2019
Encoder-Agnostic Adaptation for Conditional Language Generation
Encoder-Agnostic Adaptation for Conditional Language Generation
Zachary M. Ziegler
Luke Melas-Kyriazi
Sebastian Gehrmann
Alexander M. Rush
AI4CE
11
57
0
19 Aug 2019
Towards Making the Most of BERT in Neural Machine Translation
Towards Making the Most of BERT in Neural Machine Translation
Jiacheng Yang
Mingxuan Wang
Hao Zhou
Chengqi Zhao
Yong Yu
Weinan Zhang
Lei Li
CLL
21
156
0
15 Aug 2019
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
S. Rothe
Shashi Narayan
Aliaksei Severyn
SILM
71
433
0
29 Jul 2019
From Caesar Cipher to Unsupervised Learning: A New Method for Classifier
  Parameter Estimation
From Caesar Cipher to Unsupervised Learning: A New Method for Classifier Parameter Estimation
Yu Liu
Li Deng
Jianshu Chen
C. Chen
SSL
26
0
0
06 Jun 2019
Sequence Tagging with Contextual and Non-Contextual Subword
  Representations: A Multilingual Evaluation
Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual Evaluation
Benjamin Heinzerling
Michael Strube
13
35
0
04 Jun 2019
Domain Adaptation of Neural Machine Translation by Lexicon Induction
Domain Adaptation of Neural Machine Translation by Lexicon Induction
Junjie Hu
Mengzhou Xia
Graham Neubig
J. Carbonell
27
75
0
02 Jun 2019
A Generalized Framework of Sequence Generation with Application to
  Undirected Sequence Models
A Generalized Framework of Sequence Generation with Application to Undirected Sequence Models
Elman Mansimov
Alex Jinpeng Wang
Sean Welleck
Kyunghyun Cho
AIMat
28
46
0
29 May 2019
Effective Cross-lingual Transfer of Neural Machine Translation Models
  without Shared Vocabularies
Effective Cross-lingual Transfer of Neural Machine Translation Models without Shared Vocabularies
Yunsu Kim
Yingbo Gao
Hermann Ney
VLM
24
88
0
14 May 2019
Improving Grammatical Error Correction via Pre-Training a Copy-Augmented
  Architecture with Unlabeled Data
Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data
Wei-Ye Zhao
Liang Wang
Kewei Shen
Ruoyu Jia
Jingming Liu
19
210
0
01 Mar 2019
An Embarrassingly Simple Approach for Transfer Learning from Pretrained
  Language Models
An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models
Alexandra Chronopoulou
Christos Baziotis
Alexandros Potamianos
CLL
25
130
0
27 Feb 2019
Cross-lingual Language Model Pretraining
Cross-lingual Language Model Pretraining
Guillaume Lample
Alexis Conneau
25
2,709
0
22 Jan 2019
Transfer learning of language-independent end-to-end ASR with language
  model fusion
Transfer learning of language-independent end-to-end ASR with language model fusion
S. Hariri
Jaejin Cho
M. Baskar
Tatsuya Kawahara
R. Brunner
19
42
0
06 Nov 2018
Bi-Directional Differentiable Input Reconstruction for Low-Resource
  Neural Machine Translation
Bi-Directional Differentiable Input Reconstruction for Low-Resource Neural Machine Translation
Xing Niu
Weijia Xu
Marine Carpuat
19
17
0
02 Nov 2018
On the End-to-End Solution to Mandarin-English Code-switching Speech
  Recognition
On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition
Zhiping Zeng
Yerbolat Khassanov
Van Tung Pham
Haihua Xu
Chng Eng Siong
Haizhou Li
18
92
0
01 Nov 2018
MeanSum: A Neural Model for Unsupervised Multi-document Abstractive
  Summarization
MeanSum: A Neural Model for Unsupervised Multi-document Abstractive Summarization
Eric Chu
Peter J. Liu
20
19
0
12 Oct 2018
Unsupervised Learning via Meta-Learning
Unsupervised Learning via Meta-Learning
Kyle Hsu
Sergey Levine
Chelsea Finn
SSL
OffRL
37
229
0
04 Oct 2018
Semi-Supervised Sequence Modeling with Cross-View Training
Semi-Supervised Sequence Modeling with Cross-View Training
Kevin Clark
Minh-Thang Luong
Christopher D. Manning
Quoc V. Le
SSL
11
333
0
22 Sep 2018
Dissecting Contextual Word Embeddings: Architecture and Representation
Dissecting Contextual Word Embeddings: Architecture and Representation
Matthew E. Peters
Mark Neumann
Luke Zettlemoyer
Wen-tau Yih
35
426
0
27 Aug 2018
12
Next