ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.03122
  4. Cited By
Convolutional Sequence to Sequence Learning

Convolutional Sequence to Sequence Learning

8 May 2017
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
    AIMat
ArXivPDFHTML

Papers citing "Convolutional Sequence to Sequence Learning"

50 / 1,321 papers shown
Title
MuxFlow: Efficient and Safe GPU Sharing in Large-Scale Production Deep
  Learning Clusters
MuxFlow: Efficient and Safe GPU Sharing in Large-Scale Production Deep Learning Clusters
Yihao Zhao
Xin Liu
Shufan Liu
Xiang Li
Yibo Zhu
Gang Huang
Xuanzhe Liu
Xin Jin
32
11
0
24 Mar 2023
Integrating Image Features with Convolutional Sequence-to-sequence
  Network for Multilingual Visual Question Answering
Integrating Image Features with Convolutional Sequence-to-sequence Network for Multilingual Visual Question Answering
T. M. Thai
Son T. Luu
40
0
0
22 Mar 2023
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to
  GPT-5 All You Need?
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
Chaoning Zhang
Chenshuang Zhang
Sheng Zheng
Yu Qiao
Chenghao Li
...
Lik-Hang Lee
Yang Yang
Heng Tao Shen
In So Kweon
Choong Seon Hong
85
159
0
21 Mar 2023
Transformers in Speech Processing: A Survey
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Junaid Qadir
42
47
0
21 Mar 2023
CerviFormer: A Pap-smear based cervical cancer classification method
  using cross attention and latent transformer
CerviFormer: A Pap-smear based cervical cancer classification method using cross attention and latent transformer
Bhaswati Singha Deo
M. Pal
P. Panigrahi
A. Pradhan
MedIm
36
22
0
17 Mar 2023
Parameter is Not All You Need: Starting from Non-Parametric Networks for
  3D Point Cloud Analysis
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis
Renrui Zhang
Liuhui Wang
Ziyu Guo
Yali Wang
Peng Gao
Hongsheng Li
Jianbo Shi
3DPC
32
52
0
14 Mar 2023
Learning Transductions and Alignments with RNN Seq2seq Models
Learning Transductions and Alignments with RNN Seq2seq Models
Zhengxiang Wang
19
0
0
13 Mar 2023
Convex Bounds on the Softmax Function with Applications to Robustness
  Verification
Convex Bounds on the Softmax Function with Applications to Robustness Verification
Dennis L. Wei
Haoze Wu
Min Wu
Pin-Yu Chen
Clark W. Barrett
E. Farchi
UQCV
AAML
25
8
0
03 Mar 2023
Leveraging Large Text Corpora for End-to-End Speech Summarization
Leveraging Large Text Corpora for End-to-End Speech Summarization
Kohei Matsuura
Takanori Ashihara
Takafumi Moriya
Tomohiro Tanaka
A. Ogawa
Marc Delcroix
Ryo Masumura
27
14
0
02 Mar 2023
Variance-reduced Clipping for Non-convex Optimization
Variance-reduced Clipping for Non-convex Optimization
Amirhossein Reisizadeh
Haochuan Li
Subhro Das
Ali Jadbabaie
23
26
0
02 Mar 2023
EPISODE: Episodic Gradient Clipping with Periodic Resampled Corrections
  for Federated Learning with Heterogeneous Data
EPISODE: Episodic Gradient Clipping with Periodic Resampled Corrections for Federated Learning with Heterogeneous Data
M. Crawshaw
Yajie Bao
Mingrui Liu
FedML
27
8
0
14 Feb 2023
Enhancing Multivariate Time Series Classifiers through Self-Attention
  and Relative Positioning Infusion
Enhancing Multivariate Time Series Classifiers through Self-Attention and Relative Positioning Infusion
Mehryar Abbasi
Parvaneh Saeedi
AI4TS
27
6
0
13 Feb 2023
Protecting Language Generation Models via Invisible Watermarking
Protecting Language Generation Models via Invisible Watermarking
Xuandong Zhao
Yu-Xiang Wang
Lei Li
WaLM
24
82
0
06 Feb 2023
KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program
  Repair
KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair
Nan Jiang
Thibaud Lutellier
Yiling Lou
Lin Tan
Dan Goldwasser
Xinming Zhang
27
43
0
03 Feb 2023
Learning the Dynamics of Sparsely Observed Interacting Systems
Learning the Dynamics of Sparsely Observed Interacting Systems
Linus Bleistein
Adeline Fermanian
A. Jannot
Agathe Guilloux
46
5
0
27 Jan 2023
Open Problems in Applied Deep Learning
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
42
2
0
26 Jan 2023
PULL: Reactive Log Anomaly Detection Based On Iterative PU Learning
PULL: Reactive Log Anomaly Detection Based On Iterative PU Learning
Thorsten Wittkopp
Dominik Scheinert
Philipp Wiesner
Alexander Acker
O. Kao
AI4TS
26
6
0
25 Jan 2023
Variation-Aware Semantic Image Synthesis
Variation-Aware Semantic Image Synthesis
Mingle Xu
Jaehwan Lee
Sook Yoon
Hyongsuk Kim
D. Park
38
3
0
25 Jan 2023
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics
  Without the Reference
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference
Vilém Zouhar
S. Dhuliawala
Wangchunshu Zhou
Nico Daheim
Tom Kocmi
Yuchen Eleanor Jiang
Mrinmaya Sachan
18
9
0
21 Jan 2023
HanoiT: Enhancing Context-aware Translation via Selective Context
HanoiT: Enhancing Context-aware Translation via Selective Context
Jian Yang
Yuwei Yin
Shuming Ma
Liqun Yang
Hongcheng Guo
Haoyang Huang
Dongdong Zhang
Yutao Zeng
Zhoujun Li
Furu Wei
32
5
0
17 Jan 2023
On Transforming Reinforcement Learning by Transformer: The Development
  Trajectory
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya Zhang
Yixin Chen
Dacheng Tao
OffRL
27
25
0
29 Dec 2022
A Survey of Deep Learning for Mathematical Reasoning
A Survey of Deep Learning for Mathematical Reasoning
Pan Lu
Liang Qiu
Wenhao Yu
Sean Welleck
Kai-Wei Chang
ReLM
LRM
46
139
0
20 Dec 2022
EIT: Enhanced Interactive Transformer
EIT: Enhanced Interactive Transformer
Tong Zheng
Bei Li
Huiwen Bao
Tong Xiao
Jingbo Zhu
32
2
0
20 Dec 2022
Graph Learning and Its Advancements on Large Language Models: A Holistic
  Survey
Graph Learning and Its Advancements on Large Language Models: A Holistic Survey
Shaopeng Wei
Yu Zhao
Xingyan Chen
Qing Li
Fuzhen Zhuang
Ji Liu
Fuji Ren
Gang Kou
AI4CE
27
5
0
17 Dec 2022
Spatial-temporal traffic modeling with a fusion graph reconstructed by
  tensor decomposition
Spatial-temporal traffic modeling with a fusion graph reconstructed by tensor decomposition
Qin Li
Xu Yang
Yong Wang
Yuankai Wu
Deqiang He
44
10
0
12 Dec 2022
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
Jianhao Yan
Jin Xu
Fandong Meng
Jie Zhou
Yue Zhang
24
3
0
08 Dec 2022
Dynamic Graph Node Classification via Time Augmentation
Dynamic Graph Node Classification via Time Augmentation
Jiarui Sun
Mengting Gu
Chin-Chia Michael Yeh
Yujie Fan
Girish Chowdhary
Wei Zhang
26
1
0
07 Dec 2022
AL-iGAN: An Active Learning Framework for Tunnel Geological
  Reconstruction Based on TBM Operational Data
AL-iGAN: An Active Learning Framework for Tunnel Geological Reconstruction Based on TBM Operational Data
Hao Wang
Lixue Liu
Xueguan Song
Chao Zhang
Dacheng Tao
GAN
AI4CE
24
0
0
02 Dec 2022
Rephrasing the Reference for Non-Autoregressive Machine Translation
Rephrasing the Reference for Non-Autoregressive Machine Translation
Chenze Shao
Jinchao Zhang
Jie Zhou
Yang Feng
27
5
0
30 Nov 2022
Beyond Ensemble Averages: Leveraging Climate Model Ensembles for
  Subseasonal Forecasting
Beyond Ensemble Averages: Leveraging Climate Model Ensembles for Subseasonal Forecasting
Elena Orlova
Haokun Liu
Raphael Rossellini
B. Cash
Rebecca Willett
26
3
0
29 Nov 2022
Learning Regularized Positional Encoding for Molecular Prediction
Learning Regularized Positional Encoding for Molecular Prediction
Xiang Gao
Weihao Gao
Wen Xiao
Zhirui Wang
Chong Wang
Liang Xiang
AI4CE
27
1
0
23 Nov 2022
Exemplar-free Continual Learning of Vision Transformers via Gated
  Class-Attention and Cascaded Feature Drift Compensation
Exemplar-free Continual Learning of Vision Transformers via Gated Class-Attention and Cascaded Feature Drift Compensation
Marco Cotogni
Fei Yang
C. Cusano
Andrew D. Bagdanov
Joost van de Weijer
CLL
30
0
0
22 Nov 2022
SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural
  Radiance Fields
SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields
Ashkan Mirzaei
Tristan Aumentado-Armstrong
Konstantinos G. Derpanis
J. Kelly
Marcus A. Brubaker
Igor Gilitschenski
Alex Levinshtein
26
110
0
22 Nov 2022
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative
  Latent Attention
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention
Zineng Tang
Jaemin Cho
Jie Lei
Joey Tianyi Zhou
VLM
24
9
0
21 Nov 2022
L-MAE: Masked Autoencoders are Semantic Segmentation Datasets Augmenter
L-MAE: Masked Autoencoders are Semantic Segmentation Datasets Augmenter
Jiaru Jia
Ming-Yu Liu
Jiake Xie
Xin Chen
Hong Zhang
Xin Jiang
Aiqing Yang
43
0
0
21 Nov 2022
Impact of visual assistance for automated audio captioning
Impact of visual assistance for automated audio captioning
Wim Boes
Hugo Van hamme
17
1
0
18 Nov 2022
A Copy Mechanism for Handling Knowledge Base Elements in SPARQL Neural
  Machine Translation
A Copy Mechanism for Handling Knowledge Base Elements in SPARQL Neural Machine Translation
Rose Hirigoyen
Amal Zouaq
Samuel Reyd
26
4
0
18 Nov 2022
Zero-Shot Dynamic Quantization for Transformer Inference
Zero-Shot Dynamic Quantization for Transformer Inference
Yousef El-Kurdi
Jerry Quinn
Avirup Sil
MQ
22
1
0
17 Nov 2022
Parameter-Efficient Transformer with Hybrid Axial-Attention for Medical
  Image Segmentation
Parameter-Efficient Transformer with Hybrid Axial-Attention for Medical Image Segmentation
Yiyue Hu
Lei Zhang
Nan Mu
Leijun Liu
ViT
MedIm
22
1
0
17 Nov 2022
Improving Factual Consistency in Summarization with Compression-Based
  Post-Editing
Improving Factual Consistency in Summarization with Compression-Based Post-Editing
Alexander R. Fabbri
Prafulla Kumar Choubey
Jesse Vig
Chien-Sheng Wu
Caiming Xiong
HILM
KELM
44
17
0
11 Nov 2022
DPCSpell: A Transformer-based Detector-Purificator-Corrector Framework
  for Spelling Error Correction of Bangla and Resource Scarce Indic Languages
DPCSpell: A Transformer-based Detector-Purificator-Corrector Framework for Spelling Error Correction of Bangla and Resource Scarce Indic Languages
Mehedi Hasan Bijoy
Nahid Md Lokman Hossain
Salekul Islam
Swakkhar Shatabda
11
8
0
07 Nov 2022
CodePAD: Sequence-based Code Generation with Pushdown Automaton
CodePAD: Sequence-based Code Generation with Pushdown Automaton
Yihong Dong
Xue Jiang
Yuchen Liu
Ge Li
Zhi Jin
28
6
0
02 Nov 2022
Multi-Viewpoint and Multi-Evaluation with Felicitous Inductive Bias
  Boost Machine Abstract Reasoning Ability
Multi-Viewpoint and Multi-Evaluation with Felicitous Inductive Bias Boost Machine Abstract Reasoning Ability
Qinglai Wei
Diancheng Chen
Beiming Yuan
32
10
0
26 Oct 2022
MemoNet: Memorizing All Cross Features' Representations Efficiently via
  Multi-Hash Codebook Network for CTR Prediction
MemoNet: Memorizing All Cross Features' Representations Efficiently via Multi-Hash Codebook Network for CTR Prediction
P. Zhang
Junlin Zhang
25
3
0
25 Oct 2022
Towards Efficient Dialogue Pre-training with Transferable and
  Interpretable Latent Structure
Towards Efficient Dialogue Pre-training with Transferable and Interpretable Latent Structure
Xueliang Zhao
Lemao Liu
Tingchen Fu
Shuming Shi
Dongyan Zhao
Rui Yan
129
4
0
22 Oct 2022
Collaborative Reasoning on Multi-Modal Semantic Graphs for
  Video-Grounded Dialogue Generation
Collaborative Reasoning on Multi-Modal Semantic Graphs for Video-Grounded Dialogue Generation
Xueliang Zhao
Yuxuan Wang
Chongyang Tao
Chenshuo Wang
Dongyan Zhao
43
6
0
22 Oct 2022
Shift-Reduce Task-Oriented Semantic Parsing with Stack-Transformers
Shift-Reduce Task-Oriented Semantic Parsing with Stack-Transformers
Daniel Fernández-González
39
0
0
21 Oct 2022
Is Encoder-Decoder Redundant for Neural Machine Translation?
Is Encoder-Decoder Redundant for Neural Machine Translation?
Yingbo Gao
Christian Herold
Zijian Yang
Hermann Ney
27
4
0
21 Oct 2022
A Pareto-optimal compositional energy-based model for sampling and
  optimization of protein sequences
A Pareto-optimal compositional energy-based model for sampling and optimization of protein sequences
Natavsa Tagasovska
Nathan C. Frey
Andreas Loukas
I. Hotzel
J. Lafrance-Vanasse
...
A. Rajpal
Richard Bonneau
Kyunghyun Cho
Stephen Ra
Vladimir Gligorijević
18
11
0
19 Oct 2022
Improving Chinese Story Generation via Awareness of Syntactic
  Dependencies and Semantics
Improving Chinese Story Generation via Awareness of Syntactic Dependencies and Semantics
Hen-Hsen Huang
Chen Tang
Tyler Loakman
Frank Guerin
Chenghua Lin
31
12
0
19 Oct 2022
Previous
12345...252627
Next