Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.03122
Cited By
v1
v2
v3 (latest)
Convolutional Sequence to Sequence Learning
8 May 2017
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Convolutional Sequence to Sequence Learning"
50 / 1,328 papers shown
Title
Taming Pretrained Transformers for Extreme Multi-label Text Classification
Wei-Cheng Chang
Hsiang-Fu Yu
Kai Zhong
Yiming Yang
Inderjit Dhillon
75
20
0
07 May 2019
Comprehensible Context-driven Text Game Playing
Xusen Yin
Jonathan May
70
33
0
06 May 2019
Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning
Jingwen Chen
Yingwei Pan
Yehao Li
Ting Yao
Hongyang Chao
Tao Mei
81
103
0
03 May 2019
Very Deep Self-Attention Networks for End-to-End Speech Recognition
Ngoc-Quan Pham
T. Nguyen
Jan Niehues
Markus Müller
Sebastian Stüker
A. Waibel
91
161
0
30 Apr 2019
Review-Driven Answer Generation for Product-Related Questions in E-Commerce
Shiqian Chen
Chenliang Li
Feng Ji
Wei Zhou
Haiqing Chen
70
50
0
27 Apr 2019
The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision
Jiayuan Mao
Chuang Gan
Pushmeet Kohli
J. Tenenbaum
Jiajun Wu
NAI
171
706
0
26 Apr 2019
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
Yue Cao
Jiarui Xu
Stephen Lin
Fangyun Wei
Han Hu
ISeg
144
1,582
0
25 Apr 2019
Generating Long Sequences with Sparse Transformers
R. Child
Scott Gray
Alec Radford
Ilya Sutskever
142
1,925
0
23 Apr 2019
Multi-Task Learning for Argumentation Mining
Tobias Kahse
41
4
0
23 Apr 2019
MinCall - MinION end2end convolutional deep learning basecaller
N. Miculinic
Marko Ratkovic
M. Šikić
26
11
0
22 Apr 2019
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
598
5,892
0
21 Apr 2019
Dynamic Past and Future for Neural Machine Translation
Zaixiang Zheng
Shujian Huang
Zhaopeng Tu
Xinyu Dai
Jiajun Chen
86
30
0
21 Apr 2019
Mask-Predict: Parallel Decoding of Conditional Masked Language Models
Marjan Ghazvininejad
Omer Levy
Yinhan Liu
Luke Zettlemoyer
MoE
74
35
0
19 Apr 2019
A Systematic Study of Leveraging Subword Information for Learning Word Representations
Yi Zhu
Ivan Vulić
Anna Korhonen
87
30
0
16 Apr 2019
An Empirical Study of Spatial Attention Mechanisms in Deep Networks
Xizhou Zhu
Dazhi Cheng
Zheng Zhang
Stephen Lin
Jifeng Dai
92
416
0
11 Apr 2019
APE at Scale and its Implications on MT Evaluation Biases
Markus Freitag
Isaac Caswell
Scott Roy
ALM
52
8
0
09 Apr 2019
Bilingual-GAN: A Step Towards Parallel Text Generation
Ahmad Rashid
Alan Do-Omri
Md. Akmal Haidar
Qun Liu
Mehdi Rezagholizadeh
85
206
0
09 Apr 2019
Convolutional Temporal Attention Model for Video-based Person Re-identification
Tanzila Rahman
Mrigank Rochan
Yang Wang
28
6
0
09 Apr 2019
Who Needs Words? Lexicon-Free Speech Recognition
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
59
27
0
09 Apr 2019
Semi-Supervised Few-Shot Learning for Dual Question-Answer Extraction
Jue Wang
Ke Chen
Lidan Shou
Sai Wu
S. Mehrotra
RALM
46
3
0
08 Apr 2019
Improving Domain Adaptation Translation with Domain Invariant and Specific Information
Shuhao Gu
Yang Feng
Qun Liu
53
38
0
08 Apr 2019
Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion
Hao Sun
Xu Tan
Jun-Wei Gan
Hongzhi Liu
Sheng Zhao
Tao Qin
Tie-Yan Liu
74
66
0
06 Apr 2019
Modeling Point Clouds with Self-Attention and Gumbel Subset Sampling
Jiancheng Yang
Qiang Zhang
Bingbing Ni
Linguo Li
Jinxian Liu
Mengdie Zhou
Qi Tian
3DPC
95
382
0
06 Apr 2019
Fast Weakly Supervised Action Segmentation Using Mutual Consistency
Yaser Souri
Mohsen Fayyaz
Luca Minciullo
Gianpiero Francesca
Juergen Gall
91
52
0
05 Apr 2019
Convolutional Self-Attention Networks
Baosong Yang
Longyue Wang
Derek F. Wong
Lidia S. Chao
Zhaopeng Tu
71
126
0
05 Apr 2019
Information Aggregation for Multi-Head Attention with Routing-by-Agreement
Jian Li
Baosong Yang
Zi-Yi Dou
Xing Wang
Michael R. Lyu
Zhaopeng Tu
77
46
0
05 Apr 2019
Unifying Human and Statistical Evaluation for Natural Language Generation
Tatsunori B. Hashimoto
Hugh Zhang
Percy Liang
87
225
0
04 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
78
97
0
04 Apr 2019
Extract and Edit: An Alternative to Back-Translation for Unsupervised Neural Machine Translation
Jiawei Wu
Xin Eric Wang
William Yang Wang
82
39
0
04 Apr 2019
Improving Noise Tolerance of Mixed-Signal Neural Networks
M. Klachko
M. Mahmoodi
D. Strukov
39
29
0
02 Apr 2019
A Holistic Representation Guided Attention Network for Scene Text Recognition
L. Yang
Yuyang Deng
Peng Wang
Hui Li
Zhen Li
Yanning Zhang
101
36
0
02 Apr 2019
Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition
Pengfei Zhang
Cuiling Lan
Wenjun Zeng
Junliang Xing
Jianru Xue
Nanning Zheng
3DH
104
446
0
02 Apr 2019
ScriptNet: Neural Static Analysis for Malicious JavaScript Detection
Jack W. Stokes
Rakshit Agrawal
Geoff McDonald
Matthew J. Hausknecht
26
11
0
01 Apr 2019
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
VLM
FaML
177
3,159
0
01 Apr 2019
Learning to Stop in Structured Prediction for Neural Machine Translation
Mingbo Ma
Renjie Zheng
Liang Huang
79
5
0
01 Apr 2019
Linguistic generalization and compositionality in modern artificial neural networks
Marco Baroni
AI4CE
101
149
0
30 Mar 2019
Towards Knowledge-Based Personalized Product Description Generation in E-commerce
Qibin Chen
Junyang Lin
Yichang Zhang
Hongxia Yang
Jingren Zhou
Jie Tang
68
99
0
29 Mar 2019
Hierarchical Pooling Structure for Weakly Labeled Sound Event Detection
Ke-Xin He
Yuhan Shen
Weiqiang Zhang
70
6
0
28 Mar 2019
A Large-Scale Multi-Length Headline Corpus for Analyzing Length-Constrained Headline Generation Model Evaluation
Yuta Hitomi
Yuya Taguchi
Hideaki Tamori
Ko Kikuta
Jiro Nishitoba
Naoaki Okazaki
Kentaro Inui
Manabu Okumura
65
9
0
28 Mar 2019
Recognizing Arrow Of Time In The Short Stories
Fahimeh Hosseini
Hosein Fooladi
Mohammad Reza Samsami
18
0
0
25 Mar 2019
Modelling Sequential Music Track Skips using a Multi-RNN Approach
Christian B. Hansen
Casper Hansen
Stephen Alstrup
J. Simonsen
Christina Lioma
41
9
0
20 Mar 2019
Simple, Fast, Accurate Intent Classification and Slot Labeling for Goal-Oriented Dialogue Systems
Arshit Gupta
John Hewitt
Katrin Kirchhoff
50
38
0
19 Mar 2019
CVIT-MT Systems for WAT-2018
Jerin Philip
Vinay P. Namboodiri
C. V. Jawahar
27
10
0
19 Mar 2019
Neutron: An Implementation of the Transformer Translation Model and its Variants
Hongfei Xu
Qiuhui Liu
76
19
0
18 Mar 2019
Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition
Johannes Michael
R. Labahn
Tobias Grüning
Jochen Zöllner
164
114
0
18 Mar 2019
The Missing Ingredient in Zero-Shot Neural Machine Translation
N. Arivazhagan
Ankur Bapna
Orhan Firat
Roee Aharoni
Melvin Johnson
Wolfgang Macherey
85
117
0
17 Mar 2019
Formality Style Transfer with Hybrid Textual Annotations
Ruochen Xu
Tao Ge
Furu Wei
73
41
0
15 Mar 2019
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement
W. Kool
H. V. Hoof
Max Welling
135
220
0
14 Mar 2019
Context-Aware Learning for Neural Machine Translation
Sébastien Jean
Kyunghyun Cho
64
18
0
12 Mar 2019
Positively Scale-Invariant Flatness of ReLU Neural Networks
Mingyang Yi
Qi Meng
Wei-neng Chen
Zhi-Ming Ma
Tie-Yan Liu
76
18
0
06 Mar 2019
Previous
1
2
3
...
19
20
21
...
25
26
27
Next