Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1711.02281
Cited By
Non-Autoregressive Neural Machine Translation
7 November 2017
Jiatao Gu
James Bradbury
Caiming Xiong
V. Li
R. Socher
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Non-Autoregressive Neural Machine Translation"
50 / 195 papers shown
Title
Spatial Speech Translation: Translating Across Space With Binaural Hearables
Tuochao Chen
Qirui Wang
Runlin He
Shyam Gollakota
31
0
0
25 Apr 2025
Looking beyond the next token
Abitha Thankaraj
Yiding Jiang
J. Zico Kolter
Yonatan Bisk
LRM
57
1
0
15 Apr 2025
Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More
Arvid Frydenlund
LRM
55
0
0
13 Mar 2025
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
Marianne Arriola
Aaron Gokaslan
Justin T Chiu
Zhihan Yang
Zhixuan Qi
Jiaqi Han
S. Sahoo
Volodymyr Kuleshov
DiffM
77
5
0
12 Mar 2025
FourierNAT: A Fourier-Mixing-Based Non-Autoregressive Transformer for Parallel Sequence Generation
Andrew Kiruluta
Eric Lundy
Andreas Lemos
AI4TS
47
0
0
04 Mar 2025
Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions
Yizhe Zhang
Richard He Bai
Zijin Gu
Ruixiang Zhang
Jiatao Gu
Emmanuel Abbe
Samy Bengio
Navdeep Jaitly
LRM
BDL
70
1
0
25 Feb 2025
Energy-Based Diffusion Language Models for Text Generation
Minkai Xu
Tomas Geffner
Karsten Kreis
Weili Nie
Yilun Xu
J. Leskovec
Stefano Ermon
Arash Vahdat
DiffM
49
7
0
28 Oct 2024
Scaling Diffusion Language Models via Adaptation from Autoregressive Models
Shansan Gong
Shivam Agarwal
Yizhe Zhang
Jiacheng Ye
Lin Zheng
...
Peilin Zhao
W. Bi
Jiawei Han
Hao Peng
Lingpeng Kong
AI4CE
78
16
0
23 Oct 2024
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
Jiacheng Ye
Jiahui Gao
Shansan Gong
Lin Zheng
Xin Jiang
Zhiyu Li
Lingpeng Kong
DiffM
LRM
69
15
0
18 Oct 2024
The Mystery of the Pathological Path-star Task for Language Models
Arvid Frydenlund
LRM
27
4
0
17 Oct 2024
DeepOSets: Non-Autoregressive In-Context Learning of Supervised Learning Operators
Shao-Ting Chiu
Junyuan Hong
Ulisses Braga-Neto
BDL
33
0
0
11 Oct 2024
Beyond Next Token Prediction: Patch-Level Training for Large Language Models
Chenze Shao
Fandong Meng
Jie Zhou
51
1
0
17 Jul 2024
CTC-based Non-autoregressive Textless Speech-to-Speech Translation
Qingkai Fang
Zhengrui Ma
Yan Zhou
Min Zhang
Yang Feng
52
0
0
11 Jun 2024
Non-autoregressive Personalized Bundle Generation
Wenchuan Yang
Cheng Yang
Jichao Li
Yuejin Tan
Xin Lu
Chuan Shi
28
0
0
11 Jun 2024
What Have We Achieved on Non-autoregressive Translation?
Yafu Li
Huajian Zhang
Jianhao Yan
Yongjing Yin
Yue Zhang
38
1
0
21 May 2024
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
Minghao Wu
Jiahao Xu
Yulin Yuan
Gholamreza Haffari
Longyue Wang
Weihua Luo
Kaifu Zhang
LLMAG
119
23
0
20 May 2024
A Novel Paradigm Boosting Translation Capabilities of Large Language Models
Jiaxin Guo
Hao Yang
Zongyao Li
Daimeng Wei
Hengchao Shang
Xiaoyu Chen
47
7
0
18 Mar 2024
The pitfalls of next-token prediction
Gregor Bachmann
Vaishnavh Nagarajan
37
63
0
11 Mar 2024
Schema-Aware Multi-Task Learning for Complex Text-to-SQL
Yangjun Wu
Han Wang
34
0
0
09 Mar 2024
Non-autoregressive Sequence-to-Sequence Vision-Language Models
Kunyu Shi
Qi Dong
Luis Goncalves
Zhuowen Tu
Stefano Soatto
VLM
47
3
0
04 Mar 2024
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification
Yifan Peng
Yui Sudo
Muhammad Shakeel
Shinji Watanabe
VLM
46
17
0
20 Feb 2024
Analysis of Levenshtein Transformer's Decoder and Its Variants
Ruiyang Zhou
16
0
0
19 Feb 2024
BiTA: Bi-Directional Tuning for Lossless Acceleration in Large Language Models
Feng-Huei Lin
Hanling Yi
Hongbin Li
Yifan Yang
Xiaotian Yu
Guangming Lu
Rong Xiao
41
3
0
23 Jan 2024
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding
Heming Xia
Zhe Yang
Qingxiu Dong
Peiyi Wang
Yongqi Li
Tao Ge
Tianyu Liu
Wenjie Li
Zhifang Sui
LRM
38
101
0
15 Jan 2024
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
62
1
0
19 Dec 2023
StructRe: Rewriting for Structured Shape Modeling
Jiepeng Wang
Hao Pan
Yang Liu
Xin Tong
Taku Komura
Wenping Wang
41
0
0
29 Nov 2023
PaSS: Parallel Speculative Sampling
Giovanni Monea
Armand Joulin
Edouard Grave
MoE
16
32
0
22 Nov 2023
GEC-DePenD: Non-Autoregressive Grammatical Error Correction with Decoupled Permutation and Decoding
Konstantin Yakovlev
Alexander Podolskiy
A. Bout
Sergey I. Nikolenko
Irina Piontkovskaya
33
4
0
14 Nov 2023
Controlled Text Generation for Black-box Language Models via Score-based Progressive Editor
Sangwon Yu
Changmin Lee
Hojin Lee
Sungroh Yoon
29
0
0
13 Nov 2023
Non-autoregressive Streaming Transformer for Simultaneous Translation
Zhengrui Ma
Shaolei Zhang
Shoutao Guo
Chenze Shao
Min Zhang
Yang Feng
32
13
0
23 Oct 2023
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
54
14
0
23 Aug 2023
f-Divergence Minimization for Sequence-Level Knowledge Distillation
Yuqiao Wen
Zichao Li
Wenyu Du
Lili Mou
32
53
0
27 Jul 2023
XDLM: Cross-lingual Diffusion Language Model for Machine Translation
Linyao Chen
Aosong Feng
Boming Yang
Zihui Li
AI4CE
33
2
0
25 Jul 2023
Revisiting Non-Autoregressive Translation at Scale
Zhihao Wang
Longyue Wang
Jinsong Su
Junfeng Yao
Zhaopeng Tu
36
3
0
25 May 2023
NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders
Livio Baldini Soares
D. Gillick
Jeremy R. Cole
Tom Kwiatkowski
36
1
0
23 May 2023
Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline
Zangwei Zheng
Xiaozhe Ren
Fuzhao Xue
Yang Luo
Xin Jiang
Yang You
42
55
0
22 May 2023
Extrapolating Multilingual Understanding Models as Multilingual Generators
Bohong Wu
Fei Yuan
Hai Zhao
Lei Li
Jingjing Xu
AI4CE
25
2
0
22 May 2023
Accelerating Transformer Inference for Translation via Parallel Decoding
Andrea Santilli
Silvio Severino
Emilian Postolache
Valentino Maiorca
Michele Mancusi
R. Marin
Emanuele Rodolà
41
79
0
17 May 2023
SoundStorm: Efficient Parallel Audio Generation
Zalan Borsos
Matthew Sharifi
Damien Vincent
Eugene Kharitonov
Neil Zeghidour
Marco Tagliasacchi
28
98
0
16 May 2023
Label Dependencies-aware Set Prediction Networks for Multi-label Text Classification
Xinkai Du
Quanjie Han
Yalin Sun
Chao Lv
Sun Maosong
26
2
0
14 Apr 2023
User Adaptive Language Learning Chatbots with a Curriculum
Kun Qian
Ryan Shea
Yu Li
Luke K. Fryer
Zhou Yu
32
12
0
11 Apr 2023
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
Jiaao Chen
Aston Zhang
Mu Li
Alexander J. Smola
Diyi Yang
DiffM
32
17
0
10 Apr 2023
An Overview on Language Models: Recent Developments and Outlook
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
33
42
0
10 Mar 2023
Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding
Jiacheng Li
Longhui Wei
Zongyuan Zhan
Xinfu He
Siliang Tang
Qi Tian
Yueting Zhuang
29
4
0
07 Mar 2023
Efficient Informed Proposals for Discrete Distributions via Newton's Series Approximation
Yue Xiang
Dongyao Zhu
Bowen Lei
Dongkuan Xu
Ruqi Zhang
26
5
0
27 Feb 2023
DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization
Zhiqing Sun
Yiming Yang
DiffM
38
121
0
16 Feb 2023
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
41
57
0
11 Feb 2023
Plan-then-Seam: Towards Efficient Table-to-Text Generation
Liang Li
Ruiying Geng
Chengyang Fang
Bing Li
Can Ma
Binhua Li
Yongbin Li
LMTD
33
2
0
10 Feb 2023
N-Gram Nearest Neighbor Machine Translation
Rui Lv
Junliang Guo
Rui Wang
Xu Tan
Qi Liu
Tao Qin
23
2
0
30 Jan 2023
Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise
Zheng-Wen Lin
Yeyun Gong
Yelong Shen
Tong Wu
Zhihao Fan
Chen Lin
Nan Duan
Weizhu Chen
AI4CE
DiffM
VLM
35
61
0
22 Dec 2022
1
2
3
4
Next