ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1409.3215
  4. Cited By
Sequence to Sequence Learning with Neural Networks
v1v2v3 (latest)

Sequence to Sequence Learning with Neural Networks

10 September 2014
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Sequence to Sequence Learning with Neural Networks"

50 / 6,208 papers shown
Title
Improve SGD Training via Aligning Mini-batches
Improve SGD Training via Aligning Mini-batches
Xiangrui Li
Deng Pan
X. Li
D. Zhu
20
1
0
23 Feb 2020
Interpretable Crowd Flow Prediction with Spatial-Temporal Self-Attention
Interpretable Crowd Flow Prediction with Spatial-Temporal Self-Attention
Haoxing Lin
Weijia Jia
Yongjian You
Yiping Sun
AI4TS
73
6
0
22 Feb 2020
"Wait, I'm Still Talking!" Predicting the Dialogue Interaction Behavior
  Using Imagine-Then-Arbitrate Model
"Wait, I'm Still Talking!" Predicting the Dialogue Interaction Behavior Using Imagine-Then-Arbitrate Model
Zehao Lin
Shaobo Cui
Guodun Li
Xiaoming Kang
Feng Ji
Fenglin Li
Zhongzhou Zhao
Haiqing Chen
Yin Zhang
78
1
0
22 Feb 2020
Guider láttention dans les modeles de sequence a sequence pour la
  prediction des actes de dialogue
Guider láttention dans les modeles de sequence a sequence pour la prediction des actes de dialogue
Pierre Colombo
E. Chapuis
Matteo Manica
Emmanuel Vignon
Giovanna Varni
Chloé Clavel
3DV
71
2
0
21 Feb 2020
Imputer: Sequence Modelling via Imputation and Dynamic Programming
Imputer: Sequence Modelling via Imputation and Dynamic Programming
William Chan
Chitwan Saharia
Geoffrey E. Hinton
Mohammad Norouzi
Navdeep Jaitly
BDLAI4TS
97
116
0
20 Feb 2020
Guiding attention in Sequence-to-sequence models for Dialogue Act
  prediction
Guiding attention in Sequence-to-sequence models for Dialogue Act prediction
Pierre Colombo
E. Chapuis
Matteo Manica
Emmanuel Vignon
Giovanna Varni
Chloé Clavel
3DV
143
63
0
20 Feb 2020
Balancing Cost and Benefit with Tied-Multi Transformers
Balancing Cost and Benefit with Tied-Multi Transformers
Raj Dabre
Raphaël Rubino
Atsushi Fujita
72
6
0
20 Feb 2020
On Learning Sets of Symmetric Elements
On Learning Sets of Symmetric Elements
Haggai Maron
Or Litany
Gal Chechik
Ethan Fetaya
96
137
0
20 Feb 2020
Learn to Design the Heuristics for Vehicle Routing Problem
Learn to Design the Heuristics for Vehicle Routing Problem
Lei Gao
Mingxiang Chen
Qichang Chen
Ganzhong Luo
Nuoyi Zhu
Zhixin Liu
58
52
0
20 Feb 2020
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
Zhangyin Feng
Daya Guo
Duyu Tang
Nan Duan
Xiaocheng Feng
...
Linjun Shou
Bing Qin
Ting Liu
Daxin Jiang
Ming Zhou
233
2,727
0
19 Feb 2020
Towards Making the Most of Context in Neural Machine Translation
Towards Making the Most of Context in Neural Machine Translation
Zaixiang Zheng
Xiang Yue
Shujian Huang
Jiajun Chen
Alexandra Birch
72
19
0
19 Feb 2020
Studying the Effects of Cognitive Biases in Evaluation of Conversational
  Agents
Studying the Effects of Cognitive Biases in Evaluation of Conversational Agents
Sashank Santhanam
Alireza Karduni
Samira Shaikh
HAI
102
26
0
18 Feb 2020
Comparative Visual Analytics for Assessing Medical Records with Sequence
  Embedding
Comparative Visual Analytics for Assessing Medical Records with Sequence Embedding
Rongchen Guo
Takanori Fujiwara
Yiran Li
K. Lima
S. Sen
N. Tran
K. Ma
36
32
0
18 Feb 2020
SentenceMIM: A Latent Variable Language Model
SentenceMIM: A Latent Variable Language Model
M. Livne
Kevin Swersky
David J. Fleet
VLM
98
6
0
18 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLMAI4TSAI4CE
114
140
0
18 Feb 2020
ResiliNet: Failure-Resilient Inference in Distributed Neural Networks
ResiliNet: Failure-Resilient Inference in Distributed Neural Networks
Ashkan Yousefpour
Brian Q. Nguyen
Siddartha Devic
Guanhua Wang
Aboudy Kreidieh
Hans Lobel
Alexandre M. Bayen
J. Jue
48
2
0
18 Feb 2020
Conditional Self-Attention for Query-based Summarization
Conditional Self-Attention for Query-based Summarization
Yujia Xie
Dinesh Manocha
Yi Mao
Weizhu Chen
78
19
0
18 Feb 2020
On the Discrepancy between Density Estimation and Sequence Generation
On the Discrepancy between Density Estimation and Sequence Generation
Jason D. Lee
Dustin Tran
Orhan Firat
Kyunghyun Cho
44
11
0
17 Feb 2020
Controlling Computation versus Quality for Neural Sequence Models
Controlling Computation versus Quality for Neural Sequence Models
Ankur Bapna
N. Arivazhagan
Orhan Firat
87
30
0
17 Feb 2020
Low-Rank Bottleneck in Multi-head Attention Models
Low-Rank Bottleneck in Multi-head Attention Models
Srinadh Bhojanapalli
Chulhee Yun
A. S. Rawat
Sashank J. Reddi
Sanjiv Kumar
78
97
0
17 Feb 2020
Incorporating BERT into Neural Machine Translation
Incorporating BERT into Neural Machine Translation
Jinhua Zhu
Yingce Xia
Lijun Wu
Di He
Tao Qin
Wen-gang Zhou
Houqiang Li
Tie-Yan Liu
FedMLAIMat
50
360
0
17 Feb 2020
Hybrid Embedded Deep Stacked Sparse Autoencoder with w_LPPD SVM Ensemble
Hybrid Embedded Deep Stacked Sparse Autoencoder with w_LPPD SVM Ensemble
Yongming Li
Yan Lei
Pin Wang
Yuchuan Liu
22
1
0
17 Feb 2020
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic
  Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO
  Framework
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO Framework
C. Sur
118
7
0
16 Feb 2020
Exploring Neural Models for Parsing Natural Language into First-Order
  Logic
Exploring Neural Models for Parsing Natural Language into First-Order Logic
Hrituraj Singh
Milan Aggarwal
Balaji Krishnamurthy
LRM
106
21
0
16 Feb 2020
Learning to Generate Multiple Style Transfer Outputs for an Input
  Sentence
Learning to Generate Multiple Style Transfer Outputs for an Input Sentence
Kevin Qinghong Lin
Ming-Yuan Liu
Ming-Ting Sun
Jan Kautz
40
5
0
16 Feb 2020
Differentiable Top-k Operator with Optimal Transport
Differentiable Top-k Operator with Optimal Transport
Yujia Xie
H. Dai
Minshuo Chen
Bo Dai
T. Zhao
H. Zha
Wei Wei
Tomas Pfister
OT
70
27
0
16 Feb 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image
  Captioning With R-CNN Feature Distribution Composition (FDC)
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC)
C. Sur
54
17
0
15 Feb 2020
Multivariate Probabilistic Time Series Forecasting via Conditioned
  Normalizing Flows
Multivariate Probabilistic Time Series Forecasting via Conditioned Normalizing Flows
Kashif Rasul
Abdul-Saboor Sheikh
Ingmar Schuster
Urs M. Bergmann
Roland Vollgraf
BDLAI4TSAI4CE
137
187
0
14 Feb 2020
A Data Efficient End-To-End Spoken Language Understanding Architecture
A Data Efficient End-To-End Spoken Language Understanding Architecture
Marco Dinarelli
Nikita Kapoor
Bassam Jabaian
Laurent Besacier
3DV
56
20
0
14 Feb 2020
Pre-Training for Query Rewriting in A Spoken Language Understanding
  System
Pre-Training for Query Rewriting in A Spoken Language Understanding System
Zheng Chen
Xing Fan
Yuan Ling
Lambert Mathias
Chenlei Guo
54
23
0
13 Feb 2020
Deep Learning for Source Code Modeling and Generation: Models,
  Applications and Challenges
Deep Learning for Source Code Modeling and Generation: Models, Applications and Challenges
T. H. Le
Hao Chen
Muhammad Ali Babar
VLM
147
155
0
13 Feb 2020
Learning to Compare for Better Training and Evaluation of Open Domain
  Natural Language Generation Models
Learning to Compare for Better Training and Evaluation of Open Domain Natural Language Generation Models
Wangchunshu Zhou
Ke Xu
ELMALM
67
44
0
12 Feb 2020
DeepMutation: A Neural Mutation Tool
DeepMutation: A Neural Mutation Tool
Michele Tufano
J. Kimko
Shiya Wang
Cody Watson
Gabriele Bavota
M. D. Penta
Denys Poshyvanyk
31
21
0
12 Feb 2020
On Layer Normalization in the Transformer Architecture
On Layer Normalization in the Transformer Architecture
Ruibin Xiong
Yunchang Yang
Di He
Kai Zheng
Shuxin Zheng
Chen Xing
Huishuai Zhang
Yanyan Lan
Liwei Wang
Tie-Yan Liu
AI4CE
160
1,006
0
12 Feb 2020
Non-Autoregressive Neural Dialogue Generation
Non-Autoregressive Neural Dialogue Generation
Qinghong Han
Yuxian Meng
Leilei Gan
Jiwei Li
80
14
0
11 Feb 2020
ForecastNet: A Time-Variant Deep Feed-Forward Neural Network
  Architecture for Multi-Step-Ahead Time-Series Forecasting
ForecastNet: A Time-Variant Deep Feed-Forward Neural Network Architecture for Multi-Step-Ahead Time-Series Forecasting
J. Dabrowski
Yifan Zhang
Ashfaqur Rahman
AI4TS
35
36
0
11 Feb 2020
Stability for the Training of Deep Neural Networks and Other Classifiers
Stability for the Training of Deep Neural Networks and Other Classifiers
L. Berlyand
P. Jabin
C. A. Safsten
42
7
0
10 Feb 2020
Exploring Chemical Space using Natural Language Processing Methodologies
  for Drug Discovery
Exploring Chemical Space using Natural Language Processing Methodologies for Drug Discovery
Hakime Öztürk
Arzucan Özgür
P. Schwaller
Teodoro Laino
Elif Özkirimli
98
122
0
10 Feb 2020
On the Communication Latency of Wireless Decentralized Learning
On the Communication Latency of Wireless Decentralized Learning
Navid Naderializadeh
60
3
0
10 Feb 2020
Automating App Review Response Generation
Automating App Review Response Generation
Cuiyun Gao
Jichuan Zeng
Xin Xia
David Lo
Michael R. Lyu
Irwin King
46
36
0
10 Feb 2020
Explainable Deep RDFS Reasoner
Explainable Deep RDFS Reasoner
B. Makni
Ibrahim Abdelaziz
James A. Hendler
18
3
0
10 Feb 2020
A New Perspective for Flexible Feature Gathering in Scene Text
  Recognition Via Character Anchor Pooling
A New Perspective for Flexible Feature Gathering in Scene Text Recognition Via Character Anchor Pooling
Shangbang Long
Yushuo Guan
Kaigui Bian
Cong Yao
90
13
0
10 Feb 2020
Evaluating Sequence-to-Sequence Learning Models for If-Then Program
  Synthesis
Evaluating Sequence-to-Sequence Learning Models for If-Then Program Synthesis
Dhairya Dalal
Byron V. Galbraith
28
6
0
10 Feb 2020
Abstractive Summarization for Low Resource Data using Domain Transfer
  and Data Synthesis
Abstractive Summarization for Low Resource Data using Domain Transfer and Data Synthesis
Ahmed Magooda
Diane Litman
57
17
0
09 Feb 2020
Attend to the beginning: A study on using bidirectional attention for
  extractive summarization
Attend to the beginning: A study on using bidirectional attention for extractive summarization
Ahmed Magooda
C. Marcjan
39
2
0
09 Feb 2020
A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model
  for Vehicle Routing Problems
A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model for Vehicle Routing Problems
Bo Peng
Jiahai Wang
Zizhen Zhang
61
76
0
09 Feb 2020
Spatial-Temporal Multi-Cue Network for Continuous Sign Language
  Recognition
Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition
Hao Zhou
Wen-gang Zhou
Yun Zhou
Houqiang Li
NoLa
82
201
0
08 Feb 2020
Time-aware Large Kernel Convolutions
Time-aware Large Kernel Convolutions
Vasileios Lioutas
Yuhong Guo
AI4TS
97
29
0
08 Feb 2020
LAVA NAT: A Non-Autoregressive Translation Model with Look-Around
  Decoding and Vocabulary Attention
LAVA NAT: A Non-Autoregressive Translation Model with Look-Around Decoding and Vocabulary Attention
Xiaoya Li
Yuxian Meng
Arianna Yuan
Leilei Gan
Jiwei Li
104
12
0
08 Feb 2020
Description Based Text Classification with Reinforcement Learning
Description Based Text Classification with Reinforcement Learning
Duo Chai
Wei Wu
Qinghong Han
Leilei Gan
Jiwei Li
VLM
181
68
0
08 Feb 2020
Previous
123...686970...123124125
Next