ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1409.3215
  4. Cited By
Sequence to Sequence Learning with Neural Networks
v1v2v3 (latest)

Sequence to Sequence Learning with Neural Networks

10 September 2014
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Sequence to Sequence Learning with Neural Networks"

50 / 6,200 papers shown
Title
Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs
Chongjun Tu
Peng Ye
Dongzhan Zhou
Lei Bai
Gang Yu
Tao Chen
Wanli Ouyang
133
0
0
13 Mar 2025
Through the Magnifying Glass: Adaptive Perception Magnification for Hallucination-Free VLM Decoding
Shunqi Mao
Chaoyi Zhang
Weidong Cai
MLLM
464
1
0
13 Mar 2025
Isolated Channel Vision Transformers: From Single-Channel Pretraining to Multi-Channel Finetuning
Wenyi Lian
Joakim Lindblad
Patrick Micke
Natasa Sladoje
102
1
0
12 Mar 2025
MinGRU-Based Encoder for Turbo Autoencoder Frameworks
Rick Fritschek
Rafael F. Schaefer
109
0
0
11 Mar 2025
LATMOS: Latent Automaton Task Model from Observation Sequences
Weixiao Zhan
Qiyue Dong
Eduardo Sebastián
Nikolay Atanasov
90
0
0
11 Mar 2025
A Deep-Learning Iterative Stacked Approach for Prediction of Reactive Dissolution in Porous Media
Marcos Cirne
Hannah Menke
A. Abdellatif
Julien Maes
Florian Doster
A. Elsheikh
AI4CE
79
0
0
11 Mar 2025
Stick to Facts: Towards Fidelity-oriented Product Description Generation
Zhangming Chan
Preslav Nakov
Yongliang Wang
Jia-Nan Li
Qing Cui
Kun Gai
Dongyan Zhao
Rui Yan
197
24
0
11 Mar 2025
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Yingfeng Luo
Tong Zheng
Yongyu Mu
Yangqiu Song
Qinghong Zhang
...
Ziqiang Xu
Peinan Feng
Xiaoqian Liu
Tong Xiao
Jingbo Zhu
AI4CE
512
3
0
09 Mar 2025
Malware Detection at the Edge with Lightweight LLMs: A Performance Evaluation
Christian Rondanini
B. Carminati
E. Ferrari
Antonio Gaudiano
Ashish Kundu
116
0
0
06 Mar 2025
Deep Causal Behavioral Policy Learning: Applications to Healthcare
Jonas Knecht
Anna Zink
Jonathan Kolstad
Maya Petersen
CML
121
0
0
05 Mar 2025
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
Rui Zhao
Weijia Mao
Mike Zheng Shou
114
1
0
05 Mar 2025
BioD2C: A Dual-level Semantic Consistency Constraint Framework for Biomedical VQA
Zhengyang Ji
Shang Gao
Li Liu
Yifan Jia
Yutao Yue
58
0
0
04 Mar 2025
Fast 3D point clouds retrieval for Large-scale 3D Place Recognition
Fast 3D point clouds retrieval for Large-scale 3D Place Recognition
Chahine-Nicolas Zede
Laurent Carrafa
Valérie Gouet-Brunet
3DPC
168
0
0
28 Feb 2025
Learning to Substitute Components for Compositional Generalization
Learning to Substitute Components for Compositional Generalization
Zechao Li
Gangwei Jiang
Chenwang Wu
Ying Wei
Defu Lian
Enhong Chen
114
0
0
28 Feb 2025
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
Shaharukh Khan
Ayush Tarun
Ali Faraz
Palash Kamble
Vivek Dahiya
Praveen Kumar Pokala
Ashish Kulkarni
Chandra Khatri
Abhinav Ravi
Shubham Agarwal
441
1
0
27 Feb 2025
Chemical knowledge-informed framework for privacy-aware retrosynthesis learning
Chemical knowledge-informed framework for privacy-aware retrosynthesis learning
Guikun Chen
Xu Zhang
Yue Yang
Yong Liu
Yi Yang
Wenguan Wang
88
0
0
26 Feb 2025
Introduction to Sequence Modeling with Transformers
Introduction to Sequence Modeling with Transformers
Joni-Kristian Kämäräinen
83
1
0
26 Feb 2025
SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models
SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models
Yuxuan Zhang
CLLALM
185
1
0
25 Feb 2025
Forecasting Local Ionospheric Parameters Using Transformers
Forecasting Local Ionospheric Parameters Using Transformers
D. J. Alford-Lago
C. Curtis
Alexander T. Ihler
Katherine A. Zawdie
Douglas P. Drob
90
0
0
24 Feb 2025
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences
Yangshijie Zhang
AAML
86
0
0
24 Feb 2025
CSTRL: Context-Driven Sequential Transfer Learning for Abstractive Radiology Report Summarization
CSTRL: Context-Driven Sequential Transfer Learning for Abstractive Radiology Report Summarization
Mst. Fahmida Sultana Naznin
Adnan Ibney Faruq
Mostafa Rifat Tazwar
Md Jobayer
Md. Mehedi Hasan Shawon
Md Rakibul Hasan
MedIm
68
0
0
21 Feb 2025
Quantum Recurrent Neural Networks with Encoder-Decoder for Time-Dependent Partial Differential Equations
Quantum Recurrent Neural Networks with Encoder-Decoder for Time-Dependent Partial Differential Equations
Yuan Chen
Abdul Khaliq
Khaled M. Furati
AI4CE
207
0
0
20 Feb 2025
Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning
Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning
Caihua Liu
Xu Li
Wenjing Xue
Wei Tang
Xia Feng
80
0
0
20 Feb 2025
Language Models Can Predict Their Own Behavior
Language Models Can Predict Their Own Behavior
Dhananjay Ashok
Jonathan May
ReLMAI4TSLRM
124
2
0
18 Feb 2025
Investigating Inference-time Scaling for Chain of Multi-modal Thought: A Preliminary Study
Investigating Inference-time Scaling for Chain of Multi-modal Thought: A Preliminary Study
Yujie Lin
Ante Wang
Moye Chen
Jingyao Liu
Hao Liu
Jinsong Su
Xinyan Xiao
LRM
143
3
0
17 Feb 2025
A Robust Attack: Displacement Backdoor Attack
A Robust Attack: Displacement Backdoor Attack
Yong Li
Han Gao
AAML
84
0
0
14 Feb 2025
Spatiotemporal Graph Neural Networks in short term load forecasting: Does adding Graph Structure in Consumption Data Improve Predictions?
Spatiotemporal Graph Neural Networks in short term load forecasting: Does adding Graph Structure in Consumption Data Improve Predictions?
Quoc Viet Nguyen
Joaquín Delgado Fernández
Sergio Potenciano Menci
AI4TS
99
0
0
14 Feb 2025
Theoretical Benefit and Limitation of Diffusion Language Model
Theoretical Benefit and Limitation of Diffusion Language Model
Guhao Feng
Yihan Geng
Jian Guan
Wei Wu
Liwei Wang
Di He
DiffM
154
2
0
13 Feb 2025
Enhancing LLM Character-Level Manipulation via Divide and Conquer
Enhancing LLM Character-Level Manipulation via Divide and Conquer
Zhen Xiong
Yujun Cai
Bryan Hooi
Nanyun Peng
Kai-Wei Chang
Zhecheng Li
162
0
0
12 Feb 2025
Comprehensive Framework for Evaluating Conversational AI Chatbots
Comprehensive Framework for Evaluating Conversational AI Chatbots
Shailja Gupta
Rajesh Ranjan
Surya Narayan Singh
70
0
0
10 Feb 2025
What makes a good feedforward computational graph?
What makes a good feedforward computational graph?
Alex Vitvitskyi
J. G. Araújo
Marc Lackenby
Petar Velickovic
131
3
0
10 Feb 2025
Do we really have to filter out random noise in pre-training data for language models?
Do we really have to filter out random noise in pre-training data for language models?
Jinghan Ru
Yuxin Xie
Xianwei Zhuang
Yuguo Yin
Zhihui Guo
Zhiming Liu
Qianli Ren
Yuexian Zou
193
6
0
10 Feb 2025
A comparison of translation performance between DeepL and Supertext
A comparison of translation performance between DeepL and Supertext
Alex Flückiger
Chantal Amrhein
Tim Graf
Frédéric Odermatt
Martin Pömsl
Philippe Schläpfer
Florian Schottmann
Samuel Läubli
ELM
128
0
0
04 Feb 2025
Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities
Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities
Rebecca Mobbs
Dimitrios Makris
Vasileios Argyriou
67
0
0
02 Feb 2025
A Hardware-Efficient Photonic Tensor Core: Accelerating Deep Neural Networks with Structured Compression
A Hardware-Efficient Photonic Tensor Core: Accelerating Deep Neural Networks with Structured Compression
Shupeng Ning
Hanqing Zhu
Chenghao Feng
Jiaqi Gu
David Z. Pan
Ray T. Chen
71
0
0
01 Feb 2025
HadamRNN: Binary and Sparse Ternary Orthogonal RNNs
HadamRNN: Binary and Sparse Ternary Orthogonal RNNs
Armand Foucault
Franck Mamalet
François Malgouyres
MQ
300
0
0
28 Jan 2025
Abstractive Text Summarization for Bangla Language Using NLP and Machine Learning Approaches
Asif Ahammad Miazee
Tonmoy Roy
Md Robiul Islam
Yeamin Safat
CVBM
49
0
0
28 Jan 2025
State-space models are accurate and efficient neural operators for dynamical systems
State-space models are accurate and efficient neural operators for dynamical systems
Zheyuan Hu
Nazanin Ahmadi Daryakenari
Qianli Shen
Kenji Kawaguchi
George Karniadakis
MambaAI4CE
237
19
0
28 Jan 2025
Can summarization approximate simplification? A gold standard comparison
Giacomo Magnifico
Eduard Barbu
67
0
0
28 Jan 2025
Data re-uploading in Quantum Machine Learning for time series: application to traffic forecasting
Data re-uploading in Quantum Machine Learning for time series: application to traffic forecasting
Nikolaos Schetakis
Paolo Bonfini
Negin Alisoltani
Konstantinos Blazakis
Symeon I. Tsintzos
Alexis Askitopoulos
Davit Aghamalyan
Panagiotis Fafoutellis
Eleni I. Vlahogianni
94
1
0
22 Jan 2025
Reliable Text-to-SQL with Adaptive Abstention
Reliable Text-to-SQL with Adaptive Abstention
Kaiwen Chen
Yueting Chen
Xiaohui Yu
Nick Koudas
RALM
90
2
0
18 Jan 2025
The Theater Stage as Laboratory: Review of Real-Time Comedy LLM Systems for Live Performance
The Theater Stage as Laboratory: Review of Real-Time Comedy LLM Systems for Live Performance
Piotr Wojciech Mirowski
Boyd Branch
Kory W. Mathewson
65
0
0
14 Jan 2025
The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering
The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering
Anupam Pandey
Deepjyoti Bodo
Arpan Phukan
Asif Ekbal
150
0
0
13 Jan 2025
TTS-Transducer: End-to-End Speech Synthesis with Neural Transducer
TTS-Transducer: End-to-End Speech Synthesis with Neural Transducer
Vladimir Bataev
Subhankar Ghosh
Vitaly Lavrukhin
Jason Chun Lok Li
AI4TS
118
1
0
10 Jan 2025
On Creating A Brain-To-Text Decoder
On Creating A Brain-To-Text Decoder
Zenon Lamprou
Yashar Moshfeghi
82
0
0
10 Jan 2025
Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation
Alireza Salemi
Cheng-rong Li
Mingyang Zhang
Qiaozhu Mei
Weize Kong
Tao Chen
Zhuowan Li
Michael Bendersky
Hamed Zamani
LRMRALMReLM
110
9
0
07 Jan 2025
Understanding How Nonlinear Layers Create Linearly Separable Features for Low-Dimensional Data
Alec S. Xu
Can Yaras
Peng Wang
Q. Qu
95
1
0
04 Jan 2025
Exploring the Implicit Semantic Ability of Multimodal Large Language Models: A Pilot Study on Entity Set Expansion
Hebin Wang
Yangning Li
Hai-Tao Zheng
Hai-Tao Zheng
Wenhao Jiang
Hong-Gee Kim
145
0
0
03 Jan 2025
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language
Jeong Hun Yeo
Chae Won Kim
Hyunjun Kim
Hyeongseop Rha
Seunghee Han
Wen-Huang Cheng
Y. Ro
164
3
0
03 Jan 2025
PsychAdapter: Adapting LLM Transformers to Reflect Traits, Personality and Mental Health
PsychAdapter: Adapting LLM Transformers to Reflect Traits, Personality and Mental Health
Huy-Hien Vu
Huy Anh Nguyen
Adithya Ganesan
Swanie Juhng
Oscar Kjell
...
Margaret L. Kern
Ryan L. Boyd
L. Ungar
H. Andrew Schwartz
J. Eichstaedt
161
0
0
03 Jan 2025
Previous
123456...122123124
Next