Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.02515
Cited By
Self-Normalizing Neural Networks
8 June 2017
G. Klambauer
Thomas Unterthiner
Andreas Mayr
Sepp Hochreiter
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Self-Normalizing Neural Networks"
50 / 867 papers shown
Title
Trajectory Normalized Gradients for Distributed Optimization
Jianqiao Wangni
Ke Li
Jianbo Shi
Jitendra Malik
11
2
0
24 Jan 2019
Disentangling Video with Independent Prediction
William F. Whitney
Rob Fergus
CML
CoGe
OCL
DRL
38
1
0
17 Jan 2019
Is it Time to Swish? Comparing Deep Learning Activation Functions Across NLP tasks
Steffen Eger
Paul Youssef
Iryna Gurevych
LLMSV
17
76
0
09 Jan 2019
LiSHT: Non-Parametric Linearly Scaled Hyperbolic Tangent Activation Function for Neural Networks
Swalpa Kumar Roy
Suvojit Manna
S. Dubey
B. B. Chaudhuri
13
49
0
01 Jan 2019
Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks
Mohamed Yousef
K. Hussain
U. S. Mohammed
3DV
18
124
0
31 Dec 2018
Supervised Domain Enablement Attention for Personalized Domain Classification
Joo-Kyung Kim
Young-Bum Kim
33
10
0
18 Dec 2018
NIPS - Not Even Wrong? A Systematic Review of Empirically Complete Demonstrations of Algorithmic Effectiveness in the Machine Learning and Artificial Intelligence Literature
Franz J. Király
Bilal A. Mateen
R. Sonabend
18
10
0
18 Dec 2018
Flatten-T Swish: a thresholded ReLU-Swish-like activation function for deep learning
Hock Hung Chieng
Noorhaniza Wahid
P. Ong
Sai Raj Kishore Perla
16
41
0
15 Dec 2018
Evolutionary Neural Architecture Search for Image Restoration
Gerard Jacques van Wyk
Anna Sergeevna Bosman
14
34
0
14 Dec 2018
Guided Dropout
Rohit Keshari
Richa Singh
Mayank Vatsa
BDL
26
37
0
10 Dec 2018
Generalized Batch Normalization: Towards Accelerating Deep Neural Networks
Xiaoyong Yuan
Zheng Feng
Matthew Norton
Xiaolin Li
6
24
0
08 Dec 2018
Attention Boosted Sequential Inference Model
Guanyu Li
Pengfei Zhang
Caiyan Jia
21
3
0
05 Dec 2018
ECC: Platform-Independent Energy-Constrained Deep Neural Network Compression via a Bilinear Regression Model
Haichuan Yang
Yuhao Zhu
Ji Liu
28
40
0
05 Dec 2018
EENMF: An End-to-End Neural Matching Framework for E-Commerce Sponsored Search
Wenjin Wu
Guojun Liu
Hui Ye
Chenshuang Zhang
Tianshu Wu
Daorui Xiao
Wei Lin
Xiaoyu Zhu
27
8
0
04 Dec 2018
SwishNet: A Fast Convolutional Neural Network for Speech, Music and Noise Classification and Segmentation
Md Shamim Hussain
M. A. Haque
23
48
0
01 Dec 2018
The SWAG Algorithm; a Mathematical Approach that Outperforms Traditional Deep Learning. Theory and Implementation
S. Safaei
Vahid Safaei
Solmazi Safaei
Zerotti Woods
H. Arabnia
Juan B. Gutierrez
20
0
0
28 Nov 2018
SOC: hunting the underground inside story of the ethereum Social-network Opinion and Comment
TonTon Hsien-De Huang
Po-Wei Hong
Ying-Tse Lee
Yi-Lun Wang
Chi-Leong Lok
Hung-Yu kao
25
2
0
27 Nov 2018
Neural Non-Stationary Spectral Kernel
Sami Remes
Markus Heinonen
Samuel Kaski
BDL
16
9
0
27 Nov 2018
Driver Behavior Recognition via Interwoven Deep Convolutional Neural Nets with Multi-stream Inputs
Chaoyun Zhang
Rui Li
Woojin Kim
Daesub Yoon
P. Patras
31
49
0
22 Nov 2018
Regularizing by the Variance of the Activations' Sample-Variances
Etai Littwin
Lior Wolf
VLM
15
12
0
21 Nov 2018
Unsupervised Multimodal Representation Learning across Medical Images and Reports
T. Hsu
W. Weng
Willie Boag
Matthew B. A. McDermott
Peter Szolovits
SSL
22
35
0
21 Nov 2018
A Deep Neural Network for Unsupervised Anomaly Detection and Diagnosis in Multivariate Time Series Data
Chuxu Zhang
Dongjin Song
Yuncong Chen
Xinyang Feng
C. Lumezanu
Wei Cheng
Jingchao Ni
Bo Zong
Haifeng Chen
Nitesh V. Chawla
AI4TS
13
691
0
20 Nov 2018
Higher-order Network for Action Recognition
Jie Shao
Xiangyang Xue
11
0
0
19 Nov 2018
Deep Determinantal Point Processes
Mike Gartrell
Elvis Dohmatob
Jon Alberdi
8
4
0
17 Nov 2018
SGR: Self-Supervised Spectral Graph Representation Learning
Anton Tsitsulin
Davide Mottin
Panagiotis Karras
A. Bronstein
Emmanuel Müller
SSL
12
6
0
15 Nov 2018
Modality Attention for End-to-End Audio-visual Speech Recognition
Pan Zhou
Wenwen Yang
Wei Chen
Yanfeng Wang
Jia Jia
24
69
0
13 Nov 2018
Activation Functions: Comparison of trends in Practice and Research for Deep Learning
S. Bodenstedt
Dominik Rivoir
A. Gachagan
S. T. Mees
9
1,268
0
08 Nov 2018
Linear Memory Networks
Xi Chen
Ali Ghadirzadeh
Mårten Björkman
KELM
20
8
0
08 Nov 2018
Quasi-random sampling for multivariate distributions via generative neural networks
Marius Hofert
Avinash Prasad
Mu Zhu
14
14
0
01 Nov 2018
A Streamlined Encoder/Decoder Architecture for Melody Extraction
Tsung-Han Hsieh
Li Su
Yi-Hsuan Yang
17
52
0
30 Oct 2018
MPNA: A Massively-Parallel Neural Array Accelerator with Dataflow Optimization for Convolutional Neural Networks
Muhammad Abdullah Hanif
Rachmad Vidya Wicaksana Putra
Muhammad Tanvir
R. Hafiz
Semeen Rehman
Muhammad Shafique
9
17
0
30 Oct 2018
A Methodology for Automatic Selection of Activation Functions to Design Hybrid Deep Neural Networks
Alberto Marchisio
Muhammad Abdullah Hanif
Semeen Rehman
Maurizio Martina
Muhammad Shafique
27
11
0
27 Oct 2018
Batch Normalization Sampling
Zhaodong Chen
Lei Deng
Guoqi Li
Jiawei Sun
Xing Hu
Xin Ma
Yuan Xie
16
0
0
25 Oct 2018
Single-Image SVBRDF Capture with a Rendering-Aware Deep Network
Valentin Deschaintre
M. Aittala
F. Durand
G. Drettakis
Adrien Bousseau
3DH
6
260
0
23 Oct 2018
GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning
Jacky Liang
Viktor Makoviychuk
Ankur Handa
N. Chentanez
Miles Macklin
Dieter Fox
AI4CE
18
182
0
12 Oct 2018
Weighted Sigmoid Gate Unit for an Activation Function of Deep Neural Network
Masayuki Tanaka
27
54
0
03 Oct 2018
DeepCMB: Lensing Reconstruction of the Cosmic Microwave Background with Deep Neural Networks
J. Caldeira
W. L. K. Wu
Brian D. Nord
Camille Avestruz
Shubhendu Trivedi
K. Story
25
63
0
02 Oct 2018
Unsupervised Emergence of Spatial Structure from Sensorimotor Prediction
Alban Laflaquière
Michael Garcia Ortiz
4
3
0
02 Oct 2018
Aggregation of binary feature descriptors for compact scene model representation in large scale structure-from-motion applications
J. Komorowski
Tomasz Trzciñski
3DPC
3DV
9
0
0
28 Sep 2018
SConE: Siamese Constellation Embedding Descriptor for Image Matching
Tomasz Trzciñski
J. Komorowski
Lukasz Dabala
K. Czarnota
Grzegorz Kurzejamski
Simon Lynen
14
10
0
28 Sep 2018
Dynamical Isometry is Achieved in Residual Networks in a Universal Way for any Activation Function
W. Tarnowski
P. Warchol
Stanislaw Jastrzebski
Jacek Tabor
M. Nowak
16
37
0
24 Sep 2018
Orthogonally Decoupled Variational Gaussian Processes
Hugh Salimbeni
Ching-An Cheng
Byron Boots
M. Deisenroth
11
43
0
24 Sep 2018
Design Space Exploration of Neural Network Activation Function Circuits
Tao Yang
Yadong Wei
Zhaopeng Tu
Haolun Zeng
Michel A. Kinsy
Nanning Zheng
Pengju Ren
17
50
0
22 Sep 2018
Automatic Program Synthesis of Long Programs with a Learned Garbage Collector
Amit Zohar
Lior Wolf
8
78
0
12 Sep 2018
Efficient and Robust Parallel DNN Training through Model Parallelism on Multi-GPU Platform
Chi-Chung Chen
Chia-Lin Yang
Hsiang-Yun Cheng
25
100
0
08 Sep 2018
Embedding Multimodal Relational Data for Knowledge Base Completion
Pouya Pezeshkpour
Liyan Chen
Sameer Singh
12
125
0
05 Sep 2018
Diverse and Coherent Paragraph Generation from Images
Moitreya Chatterjee
A. Schwing
19
66
0
03 Sep 2018
Data Dropout: Optimizing Training Data for Convolutional Neural Networks
Tianyang Wang
Jun Huan
Bo Li
OOD
14
56
0
01 Sep 2018
Dropout with Tabu Strategy for Regularizing Deep Neural Networks
Zongjie Ma
A. Sattar
Jun Zhou
Qingliang Chen
Kaile Su
14
6
0
29 Aug 2018
An Attention-Gated Convolutional Neural Network for Sentence Classification
Yang Liu
Lixin Ji
Ruiyang Huang
Tuosiyu Ming
Chao Gao
Jianpeng Zhang
12
38
0
22 Aug 2018
Previous
1
2
3
...
14
15
16
17
18
Next