1705.03122
Cited By
v1
v2
v3 (latest)
Convolutional Sequence to Sequence Learning
8 May 2017
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
Papers citing
"Convolutional Sequence to Sequence Learning"
50 / 1,328 papers shown
Title
TransDocs: Optical Character Recognition with word to word translation
Abhishek Bamotra
P. Uppala
40
1
0
15 Apr 2023
Masked Pre-Training of Transformers for Histology Image Analysis
Shuai Jiang
Liesbeth Hondelink
A. Suriawinata
Saeed Hassanpour
MedIm
57
18
0
14 Apr 2023
Best Practices for 2-Body Pose Forecasting
Muhammad Rameez Ur Rahman
Luca Scofano
Edoardo De Matteis
Alessandro Flaborea
Alessio Sampieri
Fabio Galasso
82
10
0
12 Apr 2023
Dynamic Graph Representation Learning with Neural Networks: A Survey
Leshanshui Yang
Sébastien Adam
Clément Chatelain
AI4TS
AI4CE
83
19
0
12 Apr 2023
Multi-Graph Convolution Network for Pose Forecasting
Hongwei Ren
Yuhong Shi
Kewei Liang
3DH
72
1
0
11 Apr 2023
HyperINR: A Fast and Predictive Hypernetwork for Implicit Neural Representations via Knowledge Distillation
Qi Wu
David Bauer
Yuyang Chen
Kwan-Liu Ma
72
16
0
09 Apr 2023
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder
Z. Fu
W. Lam
Qian Yu
Anthony Man-Cho So
Shengding Hu
Zhiyuan Liu
Nigel Collier
AuLLM
69
44
0
08 Apr 2023
Semi-supervised Neural Machine Translation with Consistency Regularization for Low-Resource Languages
Viet H. Pham
Thang M. Pham
Giang Nguyen
Long H. B. Nguyen
D. Dinh
29
0
0
02 Apr 2023
Analysis and Comparison of Two-Level KFAC Methods for Training Deep Neural Networks
Abdoulaye Koroko
A. Anciaux-Sedrakian
I. B. Gharbia
Valérie Garès
M. Haddou
Quang-Huy Tran
60
0
0
31 Mar 2023
Backdoor Attacks with Input-unique Triggers in NLP
Xukun Zhou
Jiwei Li
Tianwei Zhang
Lingjuan Lyu
Muqiao Yang
Jun He
SILM
AAML
48
9
0
25 Mar 2023
MuxFlow: Efficient and Safe GPU Sharing in Large-Scale Production Deep Learning Clusters
Yihao Zhao
Xin Liu
Shufan Liu
Xiang Li
Yibo Zhu
Gang Huang
Xuanzhe Liu
Xin Jin
101
11
0
24 Mar 2023
Integrating Image Features with Convolutional Sequence-to-sequence Network for Multilingual Visual Question Answering
T. M. Thai
Son T. Luu
86
0
0
22 Mar 2023
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
Chaoning Zhang
Chenshuang Zhang
Sheng Zheng
Yu Qiao
Chenghao Li
...
Lik-Hang Lee
Yang Yang
Heng Tao Shen
In So Kweon
Choong Seon Hong
186
170
0
21 Mar 2023
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Muhammad Usama
Junaid Qadir
169
48
0
21 Mar 2023
CerviFormer: A Pap-smear based cervical cancer classification method using cross attention and latent transformer
Bhaswati Singha Deo
M. Pal
P. Panigrahi
A. Pradhan
MedIm
43
25
0
17 Mar 2023
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis
Renrui Zhang
Liuhui Wang
Ziyu Guo
Yali Wang
Peng Gao
Hongsheng Li
Jianbo Shi
3DPC
73
55
0
14 Mar 2023
Learning Transductions and Alignments with RNN Seq2seq Models
Zhengxiang Wang
29
0
0
13 Mar 2023
Convex Bounds on the Softmax Function with Applications to Robustness Verification
Dennis L. Wei
Haoze Wu
Min Wu
Pin-Yu Chen
Clark W. Barrett
E. Farchi
UQCV
AAML
52
9
0
03 Mar 2023
Leveraging Large Text Corpora for End-to-End Speech Summarization
Kohei Matsuura
Takanori Ashihara
Takafumi Moriya
Tomohiro Tanaka
A. Ogawa
Marc Delcroix
Ryo Masumura
49
14
0
02 Mar 2023
Variance-reduced Clipping for Non-convex Optimization
Amirhossein Reisizadeh
Haochuan Li
Subhro Das
Ali Jadbabaie
96
29
0
02 Mar 2023
EPISODE: Episodic Gradient Clipping with Periodic Resampled Corrections for Federated Learning with Heterogeneous Data
M. Crawshaw
Yajie Bao
Mingrui Liu
FedML
87
8
0
14 Feb 2023
Enhancing Multivariate Time Series Classifiers through Self-Attention and Relative Positioning Infusion
Mehryar Abbasi
Parvaneh Saeedi
AI4TS
84
6
0
13 Feb 2023
Protecting Language Generation Models via Invisible Watermarking
Xuandong Zhao
Yu-Xiang Wang
Lei Li
WaLM
105
87
0
06 Feb 2023
KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair
Nan Jiang
Thibaud Lutellier
Xin Peng
Lin Tan
Dan Goldwasser
Xinming Zhang
89
50
0
03 Feb 2023
Learning the Dynamics of Sparsely Observed Interacting Systems
Linus Bleistein
Adeline Fermanian
A. Jannot
Agathe Guilloux
162
5
0
27 Jan 2023
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
115
2
0
26 Jan 2023
PULL: Reactive Log Anomaly Detection Based On Iterative PU Learning
Thorsten Wittkopp
Dominik Scheinert
Philipp Wiesner
Alexander Acker
O. Kao
AI4TS
58
6
0
25 Jan 2023
Variation-Aware Semantic Image Synthesis
Mingle Xu
Jaehwan Lee
Sook Yoon
Hyongsuk Kim
D. Park
77
4
0
25 Jan 2023
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference
Vilém Zouhar
Shehzaad Dhuliawala
Wangchunshu Zhou
Nico Daheim
Tom Kocmi
Yuchen Eleanor Jiang
Mrinmaya Sachan
66
11
0
21 Jan 2023
HanoiT: Enhancing Context-aware Translation via Selective Context
Jian Yang
Yuwei Yin
Shuming Ma
Liqun Yang
Hongcheng Guo
Haoyang Huang
Dongdong Zhang
Yutao Zeng
Zhoujun Li
Furu Wei
85
5
0
17 Jan 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya Zhang
Yixin Chen
Dacheng Tao
OffRL
148
30
0
29 Dec 2022
A Survey of Deep Learning for Mathematical Reasoning
Pan Lu
Liang Qiu
Wenhao Yu
Sean Welleck
Kai-Wei Chang
ReLM
LRM
133
150
0
20 Dec 2022
EIT: Enhanced Interactive Transformer
Tong Zheng
Bei Li
Huiwen Bao
Tong Xiao
Jingbo Zhu
119
2
0
20 Dec 2022
Graph Learning and Its Advancements on Large Language Models: A Holistic Survey
Shaopeng Wei
Yu Zhao
Xingyan Chen
Qing Li
Fuzhen Zhuang
Ji Liu
Fuji Ren
Gang Kou
AI4CE
131
5
0
17 Dec 2022
Spatial-temporal traffic modeling with a fusion graph reconstructed by tensor decomposition
Qin Li
Xu Yang
Yong Wang
Yuankai Wu
Deqiang He
79
10
0
12 Dec 2022
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
Jianhao Yan
Jin Xu
Fandong Meng
Jie Zhou
Yue Zhang
111
4
0
08 Dec 2022
Dynamic Graph Node Classification via Time Augmentation
Jiarui Sun
Mengting Gu
Chin-Chia Michael Yeh
Yujie Fan
Girish Chowdhary
Wei Zhang
35
4
0
07 Dec 2022
AL-iGAN: An Active Learning Framework for Tunnel Geological Reconstruction Based on TBM Operational Data
Hao Wang
Lixue Liu
Xueguan Song
Chao Zhang
Dacheng Tao
GAN
AI4CE
58
0
0
02 Dec 2022
Rephrasing the Reference for Non-Autoregressive Machine Translation
Chenze Shao
Jinchao Zhang
Jie Zhou
Yang Feng
60
5
0
30 Nov 2022
Beyond Ensemble Averages: Leveraging Climate Model Ensembles for Subseasonal Forecasting
Elena Orlova
Haokun Liu
Raphael Rossellini
B. Cash
Rebecca Willett
105
3
0
29 Nov 2022
Learning Regularized Positional Encoding for Molecular Prediction
Xiang Gao
Weihao Gao
Wen Xiao
Zhirui Wang
Chong Wang
Liang Xiang
AI4CE
80
2
0
23 Nov 2022
Exemplar-free Continual Learning of Vision Transformers via Gated Class-Attention and Cascaded Feature Drift Compensation
Marco Cotogni
Fei Yang
C. Cusano
Andrew D. Bagdanov
Joost van de Weijer
CLL
93
0
0
22 Nov 2022
SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields
Ashkan Mirzaei
Tristan Aumentado-Armstrong
Konstantinos G. Derpanis
J. Kelly
Marcus A. Brubaker
Igor Gilitschenski
Alex Levinshtein
103
117
0
22 Nov 2022
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention
Zineng Tang
Jaemin Cho
Jie Lei
Joey Tianyi Zhou
VLM
84
9
0
21 Nov 2022
L-MAE: Masked Autoencoders are Semantic Segmentation Datasets Augmenter
Jiaru Jia
Ming-Yuan Liu
Jiake Xie
Xin Chen
Hong Zhang
Xin Jiang
Aiqing Yang
84
0
0
21 Nov 2022
Impact of visual assistance for automated audio captioning
Wim Boes
Hugo Van hamme
58
1
0
18 Nov 2022
A Copy Mechanism for Handling Knowledge Base Elements in SPARQL Neural Machine Translation
Rose Hirigoyen
Payel Das
Samuel Reyd
54
5
0
18 Nov 2022
Zero-Shot Dynamic Quantization for Transformer Inference
Yousef El-Kurdi
Jerry Quinn
Avirup Sil
MQ
64
1
0
17 Nov 2022
Parameter-Efficient Transformer with Hybrid Axial-Attention for Medical Image Segmentation
Yiyue Hu
Lei Zhang
Nan Mu
Leijun Liu
ViT
MedIm
44
1
0
17 Nov 2022
Improving Factual Consistency in Summarization with Compression-Based Post-Editing
Alexander R. Fabbri
Prafulla Kumar Choubey
Jesse Vig
Chien-Sheng Wu
Caiming Xiong
HILM
KELM
125
18
0
11 Nov 2022