Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1610.09038
Cited By
Professor Forcing: A New Algorithm for Training Recurrent Networks
27 October 2016
Alex Lamb
Anirudh Goyal
Ying Zhang
Saizheng Zhang
Aaron Courville
Yoshua Bengio
GAN
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Professor Forcing: A New Algorithm for Training Recurrent Networks"
50 / 291 papers shown
Title
Low-Resource Neural Machine Translation Using Recurrent Neural Networks and Transfer Learning: A Case Study on English-to-Igbo
Ocheme Anthony Ekle
Biswarup Das
29
0
0
24 Apr 2025
DiffAD: A Unified Diffusion Modeling Approach for Autonomous Driving
Tao Wang
Cong Zhang
Xingguang Qu
Kun Li
W. Liu
C. Huang
56
0
0
15 Mar 2025
Robust Latent Matters: Boosting Image Generation with Sampling Error Synthesis
Kai Qiu
X. Li
Jason Kuen
H. Chen
Xiaohao Xu
Jiuxiang Gu
Yinyi Luo
Bhiksha Raj
Zhe-nan Lin
Marios Savvides
62
0
0
11 Mar 2025
Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model
M. Li
Rui Wang
Lei Sun
Y. Bai
Xiangxiang Chu
59
0
0
08 Mar 2025
Unlocking Efficient, Scalable, and Continual Knowledge Editing with Basis-Level Representation Fine-Tuning
Tianci Liu
R. Li
Yunzhe Qi
Hui Liu
X. Tang
...
Qingyu Yin
Monica Cheng
Jun Huan
Haoyu Wang
Jing Gao
KELM
46
2
0
01 Mar 2025
Fast and Accurate Blind Flexible Docking
Zizhuo Zhang
Lijun Wu
Kaiyuan Gao
Jiangchao Yao
Tao Qin
Bo Han
36
0
0
20 Feb 2025
Taming Teacher Forcing for Masked Autoregressive Video Generation
Deyu Zhou
Quan Sun
Yuang Peng
Kun Yan
Runpei Dong
...
Zheng Ge
Nan Duan
Xiangyu Zhang
L. Ni
H. Shum
VGen
54
6
0
21 Jan 2025
DriveLM: Driving with Graph Visual Question Answering
Chonghao Sima
Katrin Renz
Kashyap Chitta
L. Chen
Hanxue Zhang
Chengen Xie
Jens Beißwenger
Ping Luo
Andreas Geiger
Hongyang Li
90
162
0
17 Jan 2025
ORID: Organ-Regional Information Driven Framework for Radiology Report Generation
Tiancheng Gu
Kaicheng Yang
Xiang An
Ziyong Feng
Dongnan Liu
Weidong Cai
74
1
0
20 Nov 2024
SeriesGAN: Time Series Generation via Adversarial and Autoregressive Learning
MohammadReza EskandariNasab
S. M. Hamdi
S. F. Boubrahimi
GAN
AI4TS
33
0
0
28 Oct 2024
Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series
Ilan Naiman
Nimrod Berman
Itai Pemper
Idan Arbiv
Gal Fadlon
Omri Azencot
32
11
0
25 Oct 2024
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens
Zhepeng Cen
Yao Liu
Siliang Zeng
Pratik Chaudhar
Huzefa Rangwala
George Karypis
Rasool Fakoor
SyDa
AIFin
34
3
0
18 Oct 2024
End-to-end Planner Training for Language Modeling
Nathan Cornille
Florian Mai
Jingyuan Sun
Marie-Francine Moens
23
0
0
16 Oct 2024
Extra Global Attention Designation Using Keyword Detection in Sparse Transformer Architectures
Evan Lucas
Dylan Kangas
Timothy C Havens
24
0
0
11 Oct 2024
ChronoGAN: Supervised and Embedded Generative Adversarial Networks for Time Series Generation
MohammadReza EskandariNasab
S. M. Hamdi
S. F. Boubrahimi
GAN
AI4TS
23
1
0
21 Sep 2024
Inference acceleration for large language models using "stairs" assisted greedy generation
Domas Grigaliunas
M. Lukoševičius
26
0
0
29 Jul 2024
Hierarchically Disentangled Recurrent Network for Factorizing System Dynamics of Multi-scale Systems: An application on Hydrological Systems
Rahul Ghosh
Zac McEachran
Arvind Renganathan
Kelly Lindsay
Somya Sharma
M. Steinbach
John L. Nieber
Christopher J. Duffy
Vipin Kumar
AI4CE
BDL
39
0
0
29 Jul 2024
Reinforced Decoder: Towards Training Recurrent Neural Networks for Time Series Forecasting
Qi Sima
Xinze Zhang
Yukun Bao
Siyue Yang
Liang Shen
AI4TS
37
1
0
14 Jun 2024
Defining error accumulation in ML atmospheric simulators
R. Parthipan
Mohit Anand
Hannah M. Christensen
J. S. Hosking
Damon J. Wischik
29
1
0
23 May 2024
End-to-End Real-World Polyphonic Piano Audio-to-Score Transcription with Hierarchical Decoding
Wei Zeng
Xian He
Ye Wang
19
0
0
22 May 2024
PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference
Dongjie Yang
Xiaodong Han
Yan Gao
Yao Hu
Shilin Zhang
Hai Zhao
36
50
0
21 May 2024
ACEGEN: Reinforcement learning of generative chemical agents for drug discovery
Albert Bou
Morgan Thomas
Sebastian Dittert
Carles Navarro Ramírez
Maciej Majewski
...
Mazen Ahmad
Vincent Moens
Woody Sherman
Simone Sciabola
Gianni de Fabritiis
40
4
0
07 May 2024
Efficient Sample-Specific Encoder Perturbations
Yassir Fathullah
Mark J. F. Gales
21
0
0
01 May 2024
Understanding attention-based encoder-decoder networks: a case study with chess scoresheet recognition
Sergio Y. Hayashi
N. Hirata
43
0
0
23 Apr 2024
Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation
Jingxuan Wei
Linzhuang Sun
Yichong Leng
Xu Tan
Bihui Yu
Ruifeng Guo
43
3
0
23 Apr 2024
MASSM: An End-to-End Deep Learning Framework for Multi-Anatomy Statistical Shape Modeling Directly From Images
Janmesh Ukey
Tushar Kataria
Shireen Elhabian
MedIm
25
1
0
16 Mar 2024
The pitfalls of next-token prediction
Gregor Bachmann
Vaishnavh Nagarajan
37
59
0
11 Mar 2024
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection Attacks
Dario Pasquini
Martin Strohmeier
Carmela Troncoso
AAML
26
21
0
06 Mar 2024
Neural machine translation of clinical procedure codes for medical diagnosis and uncertainty quantification
Pei-Hung Chung
Shuhan He
Norawit Kijpaisalratana
Abdel-badih el Ariss
Byung-Jun Yoon
11
0
0
07 Feb 2024
DySLIM: Dynamics Stable Learning by Invariant Measure for Chaotic Systems
Yair Schiff
Zhong Yi Wan
Jeffrey B. Parker
Stephan Hoyer
Volodymyr Kuleshov
Fei Sha
Leonardo Zepeda-Núnez
28
11
0
06 Feb 2024
A Multi-step Loss Function for Robust Learning of the Dynamics in Model-based Reinforcement Learning
Abdelhakim Benechehab
Albert Thomas
Giuseppe Paolo
Maurizio Filippone
Balázs Kégl
NoLa
15
1
0
05 Feb 2024
SutraNets: Sub-series Autoregressive Networks for Long-Sequence, Probabilistic Forecasting
Shane Bergsma
Timothy J. Zeyl
Lei Guo
AI4TS
30
3
0
22 Dec 2023
DSS: Synthesizing long Digital Ink using Data augmentation, Style encoding and Split generation
A. Timofeev
Anastasiia Fadeeva
A. Afonin
C. Musat
Andrii Maksai
50
2
0
29 Nov 2023
Multilingual Mathematical Autoformalization
Albert Q. Jiang
Wenda Li
M. Jamnik
AI4CE
29
19
0
07 Nov 2023
Time-series Generation by Contrastive Imitation
Daniel Jarrett
Ioana Bica
M. Schaar
AI4TS
13
24
0
02 Nov 2023
Multi-Path Long-Term Vessel Trajectories Forecasting with Probabilistic Feature Fusion for Problem Shifting
Gabriel Spadon
Jay Kumar
Derek Eden
Josh van Berkel
Tom Foster
Amílcar Soares
Ronan Fablet
Stan Matwin
Ronald Pelot
37
4
0
29 Oct 2023
Kernel-Elastic Autoencoder for Molecular Design
Haote Li
Yu Shee
B. Allen
F. Maschietto
Victor S. Batista
19
5
0
12 Oct 2023
FABind: Fast and Accurate Protein-Ligand Binding
Qizhi Pei
Kaiyuan Gao
Lijun Wu
Jinhua Zhu
Yingce Xia
Shufang Xie
Tao Qin
Kun He
Tie-Yan Liu
Rui Yan
34
19
0
10 Oct 2023
Generative Modeling of Regular and Irregular Time Series Data via Koopman VAEs
Ilan Naiman
N. Benjamin Erichson
Pu Ren
Lbnl Michael W. Mahoney ICSI
Omri Azencot
AI4TS
19
18
0
04 Oct 2023
Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation
Kai Huang
Hanyu Yin
Heng Huang
Wei Gao
25
11
0
22 Sep 2023
Quantitative Analysis of Forecasting Models:In the Aspect of Online Political Bias
S. Tripuraneni
Sadia Kamal
A. Bagavathi
11
1
0
11 Sep 2023
Fully Embedded Time-Series Generative Adversarial Networks
Joe Beck
S. Chakraborty
GAN
TTA
AI4TS
22
2
0
30 Aug 2023
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Haohe Liu
Yiitan Yuan
Xubo Liu
Xinhao Mei
Qiuqiang Kong
Qiao Tian
Yuping Wang
Wenwu Wang
Yuxuan Wang
Mark D. Plumbley
DiffM
25
221
0
10 Aug 2023
Bird's-Eye-View Scene Graph for Vision-Language Navigation
Ruitao Liu
Xiaohan Wang
Wenguan Wang
Yi Yang
15
48
0
09 Aug 2023
Scaling Data Generation in Vision-and-Language Navigation
Zun Wang
Jialu Li
Yicong Hong
Yi Wang
Qi Wu
Mohit Bansal
Stephen Gould
Hao Tan
Yu Qiao
LM&Ro
32
54
0
28 Jul 2023
Adversarial Conversational Shaping for Intelligent Agents
Piotr Tarasiewicz
Sultan Kenjeyev
Ilana Sebag
Shehab Alshehabi
GAN
11
0
0
20 Jul 2023
On the Constrained Time-Series Generation Problem
Andrea Coletta
Sriram Gopalakrishnan
Daniel Borrajo
Svitlana Vyetrenko
DiffM
AI4TS
18
34
0
04 Jul 2023
Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation
Zihao Yue
Anwen Hu
Liang Zhang
Qin Jin
24
2
0
23 Jun 2023
PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model
Yizhe Zhang
Jiatao Gu
Zhuofeng Wu
Shuangfei Zhai
J. Susskind
Navdeep Jaitly
DiffM
30
24
0
05 Jun 2023
The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation
Saurabh Saxena
Charles Herrmann
Junhwa Hur
Abhishek Kar
Mohammad Norouzi
Deqing Sun
David J. Fleet
DiffM
33
78
0
02 Jun 2023
1
2
3
4
5
6
Next