Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.03474
Cited By
Recurrent Highway Networks
12 July 2016
J. Zilly
R. Srivastava
Jan Koutník
Jürgen Schmidhuber
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Recurrent Highway Networks"
50 / 76 papers shown
Title
Hadamard product in deep learning: Introduction, Advances and Challenges
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
V. Cevher
AAML
98
1
0
17 Apr 2025
Deep Companion Learning: Enhancing Generalization Through Historical Consistency
Ruizhao Zhu
Venkatesh Saligrama
FedML
40
0
0
26 Jul 2024
DDNAS: Discretized Differentiable Neural Architecture Search for Text Classification
Kuan-Yu Chen
Cheng Li
Kuo-Jung Lee
28
1
0
12 Jul 2023
Unit Scaling: Out-of-the-Box Low-Precision Training
Charlie Blake
Douglas Orr
Carlo Luschi
MQ
24
7
0
20 Mar 2023
State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions
Cheng Wang
Carolin (Haas) Lawrence
Mathias Niepert
21
3
0
10 Dec 2022
An Improved Time Feedforward Connections Recurrent Neural Networks
Jin Wang
Yongsong Zou
Se-Jung Lim
AI4TS
27
3
0
03 Nov 2022
Rethinking skip connection model as a learnable Markov chain
Dengsheng Chen
Jie Hu
Wenwen Qiang
Xiaoming Wei
Enhua Wu
BDL
27
1
0
30 Sep 2022
Entangled Residual Mappings
Mathias Lechner
Ramin Hasani
Z. Babaiee
Radu Grosu
Daniela Rus
T. Henzinger
Sepp Hochreiter
14
5
0
02 Jun 2022
Dependency-based Mixture Language Models
Zhixian Yang
Xiaojun Wan
49
2
0
19 Mar 2022
Pay Attention to Evolution: Time Series Forecasting with Deep Graph-Evolution Learning
Gabriel Spadon
linda Qiao
Bruno Brandoli
Stan Matwin
Jose F. Rodrigues-Jr
Jimeng Sun
AI4TS
31
57
0
28 Aug 2020
NAS-Bench-NLP: Neural Architecture Search Benchmark for Natural Language Processing
Nikita Klyuchnikov
I. Trofimov
Ekaterina Artemova
Mikhail Salnikov
M. Fedorov
Evgeny Burnaev
VLM
21
101
0
12 Jun 2020
Self-Enhanced GNN: Improving Graph Neural Networks Using Model Outputs
Han Yang
Xiao Yan
XINYAN DAI
Yongqiang Chen
James Cheng
13
36
0
18 Feb 2020
Encoding word order in complex embeddings
Benyou Wang
Donghao Zhao
Christina Lioma
Qiuchi Li
Peng Zhang
J. Simonsen
19
111
0
27 Dec 2019
Explicit Sparse Transformer: Concentrated Attention Through Explicit Selection
Guangxiang Zhao
Junyang Lin
Zhiyuan Zhang
Xuancheng Ren
Qi Su
Xu Sun
22
108
0
25 Dec 2019
Single Headed Attention RNN: Stop Thinking With Your Head
Stephen Merity
27
68
0
26 Nov 2019
Compressive Transformers for Long-Range Sequence Modelling
Jack W. Rae
Anna Potapenko
Siddhant M. Jayakumar
Timothy Lillicrap
RALM
VLM
KELM
13
621
0
13 Nov 2019
Stabilizing Transformers for Reinforcement Learning
Emilio Parisotto
H. F. Song
Jack W. Rae
Razvan Pascanu
Çağlar Gülçehre
...
Aidan Clark
Seb Noury
M. Botvinick
N. Heess
R. Hadsell
OffRL
22
360
0
13 Oct 2019
Searching for A Robust Neural Architecture in Four GPU Hours
Xuanyi Dong
Yezhou Yang
20
647
0
10 Oct 2019
Knowledge Distillation For Recurrent Neural Network Language Modeling With Trust Regularization
Yangyang Shi
M. Hwang
X. Lei
Haoyu Sheng
34
25
0
08 Apr 2019
Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning
Frederik Benzing
M. Gauy
Asier Mujika
A. Martinsson
Angelika Steger
23
22
0
11 Feb 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
38
3,679
0
09 Jan 2019
Symbolic inductive bias for visually grounded learning of spoken language
Grzegorz Chrupała
27
28
0
21 Dec 2018
Graph Neural Networks: A Review of Methods and Applications
Jie Zhou
Ganqu Cui
Shengding Hu
Zhengyan Zhang
Cheng Yang
Zhiyuan Liu
Lifeng Wang
Changcheng Li
Maosong Sun
AI4CE
GNN
33
5,406
0
20 Dec 2018
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
Songlin Yang
Shawn Tan
Alessandro Sordoni
Aaron Courville
32
323
0
22 Oct 2018
Trellis Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
25
145
0
15 Oct 2018
Batch-normalized Recurrent Highway Networks
Chi Zhang
Thang Nguyen
Shagan Sah
R. Ptucha
A. Loui
C. Salvaggio
22
8
0
26 Sep 2018
Interstellar: Using Halide's Scheduling Language to Analyze DNN Accelerators
Xuan S. Yang
Mingyu Gao
Qiaoyi Liu
Jeff Setter
Jing Pu
...
Kaidi Cao
Heonjae Ha
Priyanka Raina
Christos Kozyrakis
M. Horowitz
24
226
0
10 Sep 2018
Direct Output Connection for a High-Rank Language Model
Sho Takase
Jun Suzuki
Masaaki Nagata
18
36
0
30 Aug 2018
Pyramidal Recurrent Unit for Language Modeling
Sachin Mehta
Rik Koncel-Kedziorski
Mohammad Rastegari
Hannaneh Hajishirzi
21
10
0
27 Aug 2018
Multimodal Language Analysis with Recurrent Multistage Fusion
Paul Pu Liang
Liu Ziyin
Amir Zadeh
Louis-Philippe Morency
30
198
0
12 Aug 2018
Recent Advances in Deep Learning: An Overview
Matiur Rahman Minar
Jibon Naher
VLM
24
116
0
21 Jul 2018
The streaming rollout of deep networks - towards fully model-parallel execution
Volker Fischer
Jan M. Köhler
Thomas Pfeil
24
16
0
13 Jun 2018
Hierarchical Attention-Based Recurrent Highway Networks for Time Series Prediction
Yunzhe Tao
Lin Ma
Weizhong Zhang
Jian-Dong Liu
Wei Liu
Q. Du
AI4TS
19
25
0
02 Jun 2018
Approximating Real-Time Recurrent Learning with Random Kronecker Factors
Asier Mujika
Florian Meier
Angelika Steger
19
60
0
28 May 2018
Long-Term Human Motion Prediction by Modeling Motion Context and Enhancing Motion Dynamic
Yongyi Tang
Lin Ma
Wei Liu
Weishi Zheng
30
147
0
07 May 2018
Noisin: Unbiased Regularization for Recurrent Neural Networks
Adji Bousso Dieng
Rajesh Ranganath
Jaan Altosaar
David M. Blei
22
22
0
03 May 2018
Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling
Liyuan Liu
Xiang Ren
Jingbo Shang
Jian-wei Peng
Jiawei Han
25
44
0
20 Apr 2018
PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning
Yunbo Wang
Zhifeng Gao
Mingsheng Long
Jianmin Wang
Philip S. Yu
20
470
0
17 Apr 2018
Can recurrent neural networks warp time?
Corentin Tallec
Yann Ollivier
CLL
AI4CE
17
135
0
23 Mar 2018
An Analysis of Neural Language Modeling at Multiple Scales
Stephen Merity
N. Keskar
R. Socher
24
170
0
22 Mar 2018
From Nodes to Networks: Evolving Recurrent Neural Networks
Aditya Rawal
Risto Miikkulainen
18
53
0
12 Mar 2018
Learning Longer-term Dependencies in RNNs with Auxiliary Losses
Trieu H. Trinh
Andrew M. Dai
Thang Luong
Quoc V. Le
35
179
0
01 Mar 2018
High Order Recurrent Neural Networks for Acoustic Modelling
C. Zhang
P. Woodland
ODL
41
16
0
22 Feb 2018
Exploring Architectures, Data and Units For Streaming End-to-End Speech Recognition with RNN-Transducer
Kanishka Rao
Hasim Sak
Rohit Prabhavalkar
AI4TS
24
346
0
02 Jan 2018
Deep Learning Scaling is Predictable, Empirically
Joel Hestness
Sharan Narang
Newsha Ardalani
G. Diamos
Heewoo Jun
Hassan Kianinejad
Md. Mostofa Ali Patwary
Yang Yang
Yanqi Zhou
63
716
0
01 Dec 2017
A
4
N
T
A^{4}NT
A
4
NT
: Author Attribute Anonymity by Adversarial Training of Neural Machine Translation
Rakshith Shetty
Bernt Schiele
Mario Fritz
44
95
0
06 Nov 2017
Wider and Deeper, Cheaper and Faster: Tensorized LSTMs for Sequence Learning
Zhen He
Shaobing Gao
Liang Xiao
Daxue Liu
Hangen He
David Barber
AIMat
37
64
0
05 Nov 2017
Neural Language Modeling by Jointly Learning Syntax and Lexicon
Songlin Yang
Zhouhan Lin
Chin-Wei Huang
Aaron Courville
43
178
0
02 Nov 2017
Language Modeling with Highway LSTM
Gakuto Kurata
Bhuvana Ramabhadran
G. Saon
A. Sethy
AI4TS
21
38
0
19 Sep 2017
Learning Intrinsic Sparse Structures within Long Short-Term Memory
W. Wen
Yuxiong He
Samyam Rajbhandari
Minjia Zhang
Wenhan Wang
Fang Liu
Bin Hu
Yiran Chen
H. Li
MQ
35
140
0
15 Sep 2017
1
2
Next