1705.03122
Convolutional Sequence to Sequence Learning
8 May 2017
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
Papers citing "Convolutional Sequence to Sequence Learning"
50 / 1,321 papers shown
A Survey of Transformers
Tianyang Lin
Yuxin Wang
Xiangyang Liu
Xipeng Qiu
ViT
53
1,088
0
08 Jun 2021
Self-supervised and Supervised Joint Training for Resource-rich Machine Translation
Yong Cheng
Wei Wang
Lu Jiang
Wolfgang Macherey
26
17
0
08 Jun 2021
Lexicon Learning for Few-Shot Neural Sequence Modeling
Ekin Akyürek
Jacob Andreas
42
33
0
07 Jun 2021
Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding
Yang Li
Si Si
Gang Li
Cho-Jui Hsieh
Samy Bengio
24
88
0
05 Jun 2021
Motion Planning Transformers: A Motion Planning Framework for Mobile Robots
Jacob J. Johnson
Uday S. Kalra
Ankit Bhatia
Linjun Li
A. H. Qureshi
Michael C. Yip
9
14
0
05 Jun 2021
Scalable Transformers for Neural Machine Translation
Peng Gao
Shijie Geng
Ping Luo
Xiaogang Wang
Jifeng Dai
Hongsheng Li
31
13
0
04 Jun 2021
Deep Probabilistic Time Series Forecasting using Augmented Recurrent Input for Dynamic Systems
Haitao Liu
Changjun Liu
Xiaomo Jiang
Xudong Chen
Shuhua Yang
Xiaofang Wang
BDL
AI4TS
44
2
0
03 Jun 2021
Defending Against Backdoor Attacks in Natural Language Generation
Xiaofei Sun
Xiaoya Li
Yuxian Meng
Xiang Ao
Fei Wu
Jiwei Li
Tianwei Zhang
AAML
SILM
31
47
0
03 Jun 2021
GWLAN: General Word-Level AutocompletioN for Computer-Aided Translation
Huayang Li
Lemao Liu
Guoping Huang
Shuming Shi
ALM
21
20
0
31 May 2021
Fast Nearest Neighbor Machine Translation
Yuxian Meng
Xiaoya Li
Xiayu Zheng
Fei Wu
Xiaofei Sun
Tianwei Zhang
Jiwei Li
LRM
19
49
0
30 May 2021
REAM♯: An Enhancement Approach to Reference-based Evaluation Metrics for Open-domain Dialog Generation
Jun Gao
Wei Bi
Ruifeng Xu
Shuming Shi
6
7
0
30 May 2021
Reinforcement Learning for on-line Sequence Transformation
Grzegorz Rypesc
Lukasz Lepak
Pawel Wawrzyński
OffRL
14
0
0
28 May 2021
THINK: A Novel Conversation Model for Generating Grammatically Correct and Coherent Responses
Bin Sun
Shaoxiong Feng
Yiwei Li
Jiamou Liu
Kan Li
19
3
0
28 May 2021
Graph-Based Deep Learning for Medical Diagnosis and Analysis: Past, Present and Future
David Ahmedt-Aristizabal
M. Armin
Simon Denman
Clinton Fookes
L. Petersson
16
178
0
27 May 2021
TranSmart: A Practical Interactive Machine Translation System
Guoping Huang
Lemao Liu
Xing Wang
Longyue Wang
Huayang Li
Zhaopeng Tu
Chengyang Huang
Shuming Shi
18
32
0
27 May 2021
TreeBERT: A Tree-Based Pre-Trained Model for Programming Language
Xue Jiang
Zhuoran Zheng
Chen Lyu
Liang Li
Lei Lyu
19
89
0
26 May 2021
VANiLLa : Verbalized Answers in Natural Language at Large Scale
Debanjali Biswas
Mohnish Dubey
Md. Rony
Jens Lehmann
6
9
0
24 May 2021
Pretrained Language Models for Text Generation: A Survey
Junyi Li
Tianyi Tang
Wayne Xin Zhao
Ji-Rong Wen
LM&MA
VLM
SyDa
30
185
0
21 May 2021
Relative Positional Encoding for Transformers with Linear Complexity
Antoine Liutkus
Ondřej Cífka
Shih-Lun Wu
Umut Simsekli
Yi-Hsuan Yang
Gaël Richard
38
45
0
18 May 2021
Application of Deep Self-Attention in Knowledge Tracing
Junhao Zeng
Qingchun Zhang
Ning Xie
Bochun Yang
AI4Ed
15
9
0
17 May 2021
Classifying Long Clinical Documents with Pre-trained Transformers
X. Su
Timothy A. Miller
Xiyu Ding
Majid Afshar
Dmitriy Dligach
MedIm
10
6
0
14 May 2021
Global Structure-Aware Drum Transcription Based on Self-Attention Mechanisms
Ryoto Ishizuka
Ryo Nishikimi
Kazuyoshi Yoshii
27
6
0
12 May 2021
Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey
Jinjie Ni
Tom Young
Vlad Pandelea
Fuzhao Xue
Min Zhang
54
268
0
10 May 2021
Dispatcher: A Message-Passing Approach To Language Modelling
A. Cetoli
45
0
0
09 May 2021
Duplex Sequence-to-Sequence Learning for Reversible Machine Translation
Zaixiang Zheng
Hao Zhou
Shujian Huang
Jiajun Chen
Jingjing Xu
Lei Li
29
12
0
07 May 2021
ResMLP: Feedforward networks for image classification with data-efficient training
Hugo Touvron
Piotr Bojanowski
Mathilde Caron
Matthieu Cord
Alaaeldin El-Nouby
...
Gautier Izacard
Armand Joulin
Gabriel Synnaeve
Jakob Verbeek
Hervé Jégou
VLM
36
656
0
07 May 2021
Are Pre-trained Convolutions Better than Pre-trained Transformers?
Yi Tay
Mostafa Dehghani
J. Gupta
Dara Bahri
V. Aribandi
Zhen Qin
Donald Metzler
AI4CE
25
48
0
07 May 2021
Efficient Weight factorization for Multilingual Speech Recognition
Ngoc-Quan Pham
Tuan-Nam Nguyen
S. Stueker
A. Waibel
43
19
0
07 May 2021
Initialization and Regularization of Factorized Neural Layers
M. Khodak
Neil A. Tenenholtz
Lester W. Mackey
Nicolò Fusi
65
56
0
03 May 2021
TE-ESN: Time Encoding Echo State Network for Prediction Based on Irregularly Sampled Time Series Data
Chenxi Sun
Linda Qiao
Moxian Song
Yanxiu Zhou
Yongyue Sun
D. Cai
Hongyan Li
AI4TS
42
23
0
02 May 2021
Automatic Post-Editing for Vietnamese
Thanh Tien Vu
Dai Quoc Nguyen
15
0
0
25 Apr 2021
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Hassan Akbari
Liangzhe Yuan
Rui Qian
Wei-Hong Chuang
Shih-Fu Chang
Huayu Chen
Boqing Gong
ViT
251
577
0
22 Apr 2021
SoT: Delving Deeper into Classification Head for Transformer
Jiangtao Xie
Rui Zeng
Qilong Wang
Ziqi Zhou
P. Li
ViT
34
12
0
22 Apr 2021
RoFormer: Enhanced Transformer with Rotary Position Embedding
Jianlin Su
Yu Lu
Shengfeng Pan
Ahmed Murtadha
Bo Wen
Yunfeng Liu
46
2,190
0
20 Apr 2021
Comparison of Grammatical Error Correction Using Back-Translation Models
Aomi Koyama
Kengo Hotate
Masahiro Kaneko
Mamoru Komachi
15
10
0
16 Apr 2021
Aligning Latent and Image Spaces to Connect the Unconnectable
Ivan Skorokhodov
Grigorii Sotnikov
Mohamed Elhoseiny
DiffM
27
78
0
14 Apr 2021
Transformer-based Methods for Recognizing Ultra Fine-grained Entities (RUFES)
Emanuela Boros
A. Doucet
6
2
0
13 Apr 2021
Temporal Consistency Two-Stream CNN for Human Motion Prediction
Jin Tang
Jin Zhang
Jianqin Yin
3DH
16
17
0
11 Apr 2021
Adversarial Regularization as Stackelberg Game: An Unrolled Optimization Approach
Simiao Zuo
Chen Liang
Haoming Jiang
Xiaodong Liu
Pengcheng He
Jianfeng Gao
Weizhu Chen
T. Zhao
58
9
0
11 Apr 2021
LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring
Anton Mitrofanov
Mariya Korenevskaya
Ivan Podluzhny
Yuri Y. Khokhlov
A. Laptev
A. Andrusenko
A. Ilin
M. Korenevsky
Ivan Medennikov
A. Romanenko
KELM
LRM
11
2
0
06 Apr 2021
FixMyPose: Pose Correctional Captioning and Retrieval
Hyounghun Kim
Abhaysinh Zala
Graham Burri
Joey Tianyi Zhou
36
16
0
04 Apr 2021
Learning Neural Representation of Camera Pose with Matrix Representation of Pose Shift via View Synthesis
Y. Zhu
Ruiqi Gao
Siyuan Huang
Song-Chun Zhu
Ying Nian Wu
SSL
24
9
0
04 Apr 2021
Measuring Linguistic Diversity During COVID-19
Artaches Ambartsoumian
F. Popowich
Benjamin Adams
16
35
0
03 Apr 2021
LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference
Ben Graham
Alaaeldin El-Nouby
Hugo Touvron
Pierre Stock
Armand Joulin
Hervé Jégou
Matthijs Douze
ViT
22
774
0
02 Apr 2021
Embedding API Dependency Graph for Neural Code Generation
Chen Lyu
Ruyun Wang
Hongyu Zhang
Hanwen Zhang
Songlin Hu
GNN
31
20
0
29 Mar 2021
PENELOPIE: Enabling Open Information Extraction for the Greek Language through Machine Translation
D. Papadopoulos
N. Papadakis
N. Matsatsinis
11
8
0
28 Mar 2021
Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation
Shuhao Gu
Yang Feng
Wanying Xie
CLL
AI4CE
25
27
0
25 Mar 2021
Finetuning Pretrained Transformers into RNNs
Jungo Kasai
Hao Peng
Yizhe Zhang
Dani Yogatama
Gabriel Ilharco
Nikolaos Pappas
Yi Mao
Weizhu Chen
Noah A. Smith
44
63
0
24 Mar 2021
Hallucination of speech recognition errors with sequence to sequence learning
Prashant Serai
Vishal Sunder
Eric Fosler-Lussier
21
17
0
23 Mar 2021
GPNAS: A Neural Network Architecture Search Framework Based on Graphical Predictor
Dige Ai
Hong Zhang
AI4CE
11
0
0
19 Mar 2021