ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.03382
  4. Cited By
Fast Decoding in Sequence Models using Discrete Latent Variables

Fast Decoding in Sequence Models using Discrete Latent Variables

9 March 2018
Łukasz Kaiser
Aurko Roy
Ashish Vaswani
Niki Parmar
Samy Bengio
Jakob Uszkoreit
Noam M. Shazeer
ArXivPDFHTML

Papers citing "Fast Decoding in Sequence Models using Discrete Latent Variables"

50 / 60 papers shown
Title
Falcon: Faster and Parallel Inference of Large Language Models through Enhanced Semi-Autoregressive Drafting and Custom-Designed Decoding Tree
Falcon: Faster and Parallel Inference of Large Language Models through Enhanced Semi-Autoregressive Drafting and Custom-Designed Decoding Tree
Xiangxiang Gao
Weisheng Xie
Yiwei Xiang
Feng Ji
91
6
0
17 Dec 2024
Disentanglement with Factor Quantized Variational Autoencoders
Disentanglement with Factor Quantized Variational Autoencoders
Gulcin Baykal
M. Kandemir
Gözde B. Ünal
CoGe
DRL
39
0
0
23 Sep 2024
Beyond Next Token Prediction: Patch-Level Training for Large Language Models
Beyond Next Token Prediction: Patch-Level Training for Large Language Models
Chenze Shao
Fandong Meng
Jie Zhou
51
1
0
17 Jul 2024
Balance of Number of Embedding and their Dimensions in Vector
  Quantization
Balance of Number of Embedding and their Dimensions in Vector Quantization
Hang Chen
Sankepally Sainath Reddy
Ziwei Chen
Dianbo Liu
49
1
0
06 Jul 2024
What Have We Achieved on Non-autoregressive Translation?
What Have We Achieved on Non-autoregressive Translation?
Yafu Li
Huajian Zhang
Jianhao Yan
Yongjing Yin
Yue Zhang
38
1
0
21 May 2024
Sign Stitching: A Novel Approach to Sign Language Production
Sign Stitching: A Novel Approach to Sign Language Production
Harry Walsh
Ben Saunders
Richard Bowden
52
3
0
13 May 2024
Non-autoregressive Sequence-to-Sequence Vision-Language Models
Non-autoregressive Sequence-to-Sequence Vision-Language Models
Kunyu Shi
Qi Dong
Luis Goncalves
Zhuowen Tu
Stefano Soatto
VLM
47
3
0
04 Mar 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
47
4
0
29 Feb 2024
Transformer-VQ: Linear-Time Transformers via Vector Quantization
Transformer-VQ: Linear-Time Transformers via Vector Quantization
Albert Mohwald
34
15
0
28 Sep 2023
Neural Markov Jump Processes
Neural Markov Jump Processes
Patrick Seifner
Ramses J. Sanchez
BDL
35
7
0
31 May 2023
SPEC: Summary Preference Decomposition for Low-Resource Abstractive
  Summarization
SPEC: Summary Preference Decomposition for Low-Resource Abstractive Summarization
Yi-Syuan Chen
Yun-Zhu Song
Hong-Han Shuai
33
6
0
24 Mar 2023
Self-Organising Neural Discrete Representation Learning à la Kohonen
Self-Organising Neural Discrete Representation Learning à la Kohonen
Kazuki Irie
Róbert Csordás
Jürgen Schmidhuber
SSL
32
1
0
15 Feb 2023
Choreographer: Learning and Adapting Skills in Imagination
Choreographer: Learning and Adapting Skills in Imagination
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Alexandre Lacoste
Sai Rajeswar
29
22
0
23 Nov 2022
Learning an Artificial Language for Knowledge-Sharing in Multilingual
  Translation
Learning an Artificial Language for Knowledge-Sharing in Multilingual Translation
Danni Liu
Jan Niehues
21
5
0
02 Nov 2022
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for
  Text Generation and Modular Control
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
Xiaochuang Han
Sachin Kumar
Yulia Tsvetkov
45
79
0
31 Oct 2022
Multi-Granularity Optimization for Non-Autoregressive Translation
Multi-Granularity Optimization for Non-Autoregressive Translation
Yafu Li
Leyang Cui
Yongjing Yin
Yue Zhang
37
7
0
20 Oct 2022
JukeDrummer: Conditional Beat-aware Audio-domain Drum Accompaniment
  Generation via Transformer VQ-VAE
JukeDrummer: Conditional Beat-aware Audio-domain Drum Accompaniment Generation via Transformer VQ-VAE
Yueh-Kao Wu
Ching-Yu Chiu
Yi-Hsuan Yang
ViT
21
14
0
12 Oct 2022
Viterbi Decoding of Directed Acyclic Transformer for Non-Autoregressive
  Machine Translation
Viterbi Decoding of Directed Acyclic Transformer for Non-Autoregressive Machine Translation
Chenze Shao
Zhengrui Ma
Yang Feng
42
14
0
11 Oct 2022
Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine
  Translation
Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation
Chenze Shao
Yang Feng
35
20
0
08 Oct 2022
Are disentangled representations all you need to build speaker
  anonymization systems?
Are disentangled representations all you need to build speaker anonymization systems?
Pierre Champion
D. Jouvet
Anthony Larcher
35
20
0
22 Aug 2022
Diffusion-LM Improves Controllable Text Generation
Diffusion-LM Improves Controllable Text Generation
Xiang Lisa Li
John Thickstun
Ishaan Gulrajani
Percy Liang
Tatsunori B. Hashimoto
AI4CE
173
779
0
27 May 2022
Non-Autoregressive Machine Translation: It's Not as Fast as it Seems
Non-Autoregressive Machine Translation: It's Not as Fast as it Seems
Jindvrich Helcl
Barry Haddow
Alexandra Birch
27
20
0
04 May 2022
$\textit{latent}$-GLAT: Glancing at Latent Variables for Parallel Text
  Generation
latent\textit{latent}latent-GLAT: Glancing at Latent Variables for Parallel Text Generation
Yu Bao
Hao Zhou
Shujian Huang
Dongqi Wang
Lihua Qian
Xinyu Dai
Jiajun Chen
Lei Li
31
38
0
05 Apr 2022
Entropy optimized semi-supervised decomposed vector-quantized
  variational autoencoder model based on transfer learning for multiclass text
  classification and generation
Entropy optimized semi-supervised decomposed vector-quantized variational autoencoder model based on transfer learning for multiclass text classification and generation
Shivani Malhotra
Vinay Kumar
A. K. Agarwal
DRL
11
0
0
10 Nov 2021
Discrete Acoustic Space for an Efficient Sampling in Neural
  Text-To-Speech
Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech
Mu Li
Jonas Rohnke
Antonio Bonafonte
Mateusz Lajszczak
Trevor Wood
DRL
25
2
0
24 Oct 2021
KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE using
  Mel-spectrograms
KaraSinger: Score-Free Singing Voice Synthesis with VQ-VAE using Mel-spectrograms
Chien-Feng Liao
Jen-Yu Liu
Yi-Hsuan Yang
27
5
0
08 Oct 2021
Learning Energy-Based Approximate Inference Networks for Structured
  Applications in NLP
Learning Energy-Based Approximate Inference Networks for Structured Applications in NLP
Lifu Tu
BDL
35
0
0
27 Aug 2021
MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive
  Machine Translation
MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive Machine Translation
Pan Xie
Zexian Li
Xiaohui Hu
34
11
0
19 Aug 2021
Non-Autoregressive Translation by Learning Target Categorical Codes
Non-Autoregressive Translation by Learning Target Categorical Codes
Yu Bao
Shujian Huang
Tong Xiao
Dongqi Wang
Xinyu Dai
Jiajun Chen
32
35
0
21 Mar 2021
Deep Generative Modelling: A Comparative Review of VAEs, GANs,
  Normalizing Flows, Energy-Based and Autoregressive Models
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models
Sam Bond-Taylor
Adam Leach
Yang Long
Chris G. Willcocks
VLM
TPM
41
483
0
08 Mar 2021
Fully Non-autoregressive Neural Machine Translation: Tricks of the Trade
Fully Non-autoregressive Neural Machine Translation: Tricks of the Trade
Jiatao Gu
X. Kong
31
135
0
31 Dec 2020
Infusing Sequential Information into Conditional Masked Translation
  Model with Self-Review Mechanism
Infusing Sequential Information into Conditional Masked Translation Model with Self-Review Mechanism
Pan Xie
Zhi Cui
Preslav Nakov
Xiaohui Hu
Jianwei Cui
Bin Wang
154
9
0
19 Oct 2020
Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data
Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data
Michael Cogswell
Jiasen Lu
Rishabh Jain
Stefan Lee
Devi Parikh
Dhruv Batra
VLM
EgoV
36
15
0
24 Jul 2020
Latent Video Transformer
Latent Video Transformer
Ruslan Rakhimov
Denis Volkhonskiy
Alexey Artemov
Denis Zorin
Evgeny Burnaev
VGen
33
119
0
18 Jun 2020
Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine
  Translation
Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Translation
Jungo Kasai
Nikolaos Pappas
Hao Peng
James Cross
Noah A. Smith
41
134
0
18 Jun 2020
Vector-quantized neural networks for acoustic unit discovery in the
  ZeroSpeech 2020 challenge
Vector-quantized neural networks for acoustic unit discovery in the ZeroSpeech 2020 challenge
Benjamin van Niekerk
Leanne Nortje
Herman Kamper
13
115
0
19 May 2020
Variational Transformers for Diverse Response Generation
Variational Transformers for Diverse Response Generation
Zhaojiang Lin
Genta Indra Winata
Peng Xu
Zihan Liu
Pascale Fung
DRL
11
51
0
28 Mar 2020
Non-Autoregressive Dialog State Tracking
Non-Autoregressive Dialog State Tracking
Hung Le
R. Socher
Guosheng Lin
32
52
0
19 Feb 2020
LAVA NAT: A Non-Autoregressive Translation Model with Look-Around
  Decoding and Vocabulary Attention
LAVA NAT: A Non-Autoregressive Translation Model with Look-Around Decoding and Vocabulary Attention
Xiaoya Li
Yuxian Meng
Arianna Yuan
Fei Wu
Jiwei Li
40
12
0
08 Feb 2020
Neural Machine Translation: A Review and Survey
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
20
312
0
04 Dec 2019
Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine
  Translation
Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation
Junliang Guo
Xu Tan
Linli Xu
Tao Qin
Enhong Chen
Tie-Yan Liu
14
85
0
20 Nov 2019
Fast Structured Decoding for Sequence Models
Fast Structured Decoding for Sequence Models
Zhiqing Sun
Zhuohan Li
Haoqing Wang
Zi Lin
Di He
Zhihong Deng
24
122
0
25 Oct 2019
Hint-Based Training for Non-Autoregressive Machine Translation
Hint-Based Training for Non-Autoregressive Machine Translation
Zhuohan Li
Zi Lin
Di He
Fei Tian
Tao Qin
Liwei Wang
Tie-Yan Liu
31
72
0
15 Sep 2019
CTRL: A Conditional Transformer Language Model for Controllable
  Generation
CTRL: A Conditional Transformer Language Model for Controllable Generation
N. Keskar
Bryan McCann
L. Varshney
Caiming Xiong
R. Socher
AI4CE
57
1,236
0
11 Sep 2019
Sequence Generation: From Both Sides to the Middle
Sequence Generation: From Both Sides to the Middle
Long Zhou
Jiajun Zhang
Chengqing Zong
Heng Yu
33
22
0
23 Jun 2019
Retrieving Sequential Information for Non-Autoregressive Neural Machine
  Translation
Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation
Chenze Shao
Yang Feng
Jinchao Zhang
Fandong Meng
Xilin Chen
Jie Zhou
24
42
0
22 Jun 2019
Interpretable Neural Network Decoupling
Interpretable Neural Network Decoupling
Yuchao Li
Rongrong Ji
Shaohui Lin
Baochang Zhang
Chenqian Yan
Yongjian Wu
Feiyue Huang
Ling Shao
37
2
0
04 Jun 2019
Unsupervised Paraphrasing without Translation
Unsupervised Paraphrasing without Translation
Aurko Roy
David Grangier
BDL
LRM
11
61
0
29 May 2019
Quantization-Based Regularization for Autoencoders
Quantization-Based Regularization for Autoencoders
Hanwei Wu
M. Flierl
DRL
11
2
0
27 May 2019
Levenshtein Transformer
Levenshtein Transformer
Jiatao Gu
Changhan Wang
Jake Zhao
49
359
0
27 May 2019
12
Next