ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.08375
  4. Cited By
Deep Learning using Rectified Linear Units (ReLU)
v1v2 (latest)

Deep Learning using Rectified Linear Units (ReLU)

22 March 2018
Abien Fred Agarap
ArXiv (abs)PDFHTML

Papers citing "Deep Learning using Rectified Linear Units (ReLU)"

45 / 45 papers shown
Title
HiLAB: A Hybrid Inverse-Design Framework
HiLAB: A Hybrid Inverse-Design Framework
Reza Marzban
Hamed Abiri
Raphael Pestourie
Ali Adibi
73
0
0
23 May 2025
COUNTDOWN: Contextually Sparse Activation Filtering Out Unnecessary Weights in Down Projection
COUNTDOWN: Contextually Sparse Activation Filtering Out Unnecessary Weights in Down Projection
Jaewon Cheon
Pilsung Kang
68
0
0
23 May 2025
HOME-3: High-Order Momentum Estimator with Third-Power Gradient for Convex and Smooth Nonconvex Optimization
HOME-3: High-Order Momentum Estimator with Third-Power Gradient for Convex and Smooth Nonconvex Optimization
Wei Zhang
Arif Hassan Zidan
Afrar Jahin
Wei Zhang
Tianming Liu
ODL
73
0
0
16 May 2025
RGB-Event Fusion with Self-Attention for Collision Prediction
RGB-Event Fusion with Self-Attention for Collision Prediction
Pietro Bonazzi
Christian Vogt
Michael Jost
Haotong Qin
Lyes Khacef
Federico Paredes-Valles
Michele Magno
72
0
0
07 May 2025
A Theory of Machine Understanding via the Minimum Description Length Principle
A Theory of Machine Understanding via the Minimum Description Length Principle
Canlin Zhang
Xiuwen Liu
115
0
0
01 Apr 2025
Adaptive Rank Allocation: Speeding Up Modern Transformers with RaNA Adapters
Adaptive Rank Allocation: Speeding Up Modern Transformers with RaNA Adapters
Roberto Garcia
Jerry Liu
Daniel Sorvisto
Sabri Eyuboglu
143
0
0
23 Mar 2025
Informer in Algorithmic Investment Strategies on High Frequency Bitcoin Data
Informer in Algorithmic Investment Strategies on High Frequency Bitcoin Data
Filip Stefaniuk
Robert Ślepaczuk
AIFin
164
0
0
23 Mar 2025
Predicting Practically? Domain Generalization for Predictive Analytics in Real-world Environments
Hanyu Duan
Yi Yang
Ahmed Abbasi
Kar Yan Tam
OOD
159
0
0
05 Mar 2025
VesselSAM: Leveraging SAM for Aortic Vessel Segmentation with LoRA and Atrous Attention
VesselSAM: Leveraging SAM for Aortic Vessel Segmentation with LoRA and Atrous Attention
Adnan Iltaf
Rayan Merghani Ahmed
Bin Li
Bin Li
Shoujun Zhou
113
0
0
25 Feb 2025
LESA: Learnable LLM Layer Scaling-Up
LESA: Learnable LLM Layer Scaling-Up
Yifei Yang
Zouying Cao
Xinbei Ma
Yao Yao
L. Qin
Zhongfu Chen
Hai Zhao
144
0
0
20 Feb 2025
Hypencoder: Hypernetworks for Information Retrieval
Hypencoder: Hypernetworks for Information Retrieval
Julian Killingback
Hansi Zeng
Hamed Zamani
166
1
0
07 Feb 2025
Analyzing Spatio-Temporal Dynamics of Dissolved Oxygen for the River Thames using Superstatistical Methods and Machine Learning
Analyzing Spatio-Temporal Dynamics of Dissolved Oxygen for the River Thames using Superstatistical Methods and Machine Learning
Hankun He
Takuya Boehringer
Benjamin Schäfer
Kate Heppell
Christian Beck
188
4
0
10 Jan 2025
Most Influential Subset Selection: Challenges, Promises, and Beyond
Most Influential Subset Selection: Challenges, Promises, and Beyond
Yuzheng Hu
Pingbang Hu
Han Zhao
Jiaqi W. Ma
TDI
188
8
0
10 Jan 2025
ROSE: Revolutionizing Open-Set Dense Segmentation with Patch-Wise Perceptual Large Multimodal Model
Kunyang Han
Yibo Hu
Mengxue Qu
Hailin Shi
Yao Zhao
Y. X. Wei
MLLMVLM3DV
201
1
0
29 Nov 2024
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Jongseong Bae
Junwoo Ha
Ha Young Kim
136
1
0
25 Nov 2024
Label Distribution Shift-Aware Prediction Refinement for Test-Time Adaptation
Label Distribution Shift-Aware Prediction Refinement for Test-Time Adaptation
M-U Jang
Hye Won Chung
TTA
508
0
0
20 Nov 2024
Online Item Cold-Start Recommendation with Popularity-Aware Meta-Learning
Online Item Cold-Start Recommendation with Popularity-Aware Meta-Learning
Yihao Luo
Yuezihan Jiang
Yinjie Jiang
Gaode Chen
Jiadong Wang
Kaigui Bian
Peiyi Li
Qi Zhang
63
0
0
18 Nov 2024
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
Yuqi Luo
Chenyang Song
Xu Han
Yuxiao Chen
Chaojun Xiao
Zhiyuan Liu
Maosong Sun
116
5
0
04 Nov 2024
ELBOing Stein: Variational Bayes with Stein Mixture Inference
ELBOing Stein: Variational Bayes with Stein Mixture Inference
Ola Rønning
Eric T. Nalisnick
Christophe Ley
Padhraic Smyth
Thomas Hamelryck
BDL
90
1
0
30 Oct 2024
Lambda-Skip Connections: the architectural component that prevents Rank Collapse
Lambda-Skip Connections: the architectural component that prevents Rank Collapse
Federico Arangath Joseph
Jerome Sieber
Melanie Zeilinger
Carmen Amo Alonso
187
0
0
14 Oct 2024
Scaling Laws for Predicting Downstream Performance in LLMs
Scaling Laws for Predicting Downstream Performance in LLMs
Yangyi Chen
Binxuan Huang
Yifan Gao
Zhengyang Wang
Jingfeng Yang
Heng Ji
LRM
100
9
0
11 Oct 2024
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
Jiyeon Kim
Hyunji Lee
Hyowon Cho
Joel Jang
Hyeonbin Hwang
Seungpil Won
Youbin Ahn
Dohaeng Lee
Minjoon Seo
KELM
378
5
0
02 Oct 2024
Amortized Bayesian Multilevel Models
Amortized Bayesian Multilevel Models
Daniel Habermann
Marvin Schmitt
Lars Kühmichel
Andreas Bulling
Stefan T. Radev
Paul-Christian Bürkner
206
4
0
23 Aug 2024
Towards Zero-Shot Multimodal Machine Translation
Towards Zero-Shot Multimodal Machine Translation
Matthieu Futeral
Cordelia Schmid
Benoît Sagot
Rachel Bawden
79
4
0
18 Jul 2024
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models
Daking Rai
Yilun Zhou
Shi Feng
Abulhair Saparov
Ziyu Yao
156
32
0
02 Jul 2024
Bayesian RG Flow in Neural Network Field Theories
Bayesian RG Flow in Neural Network Field Theories
Jessica N. Howard
Marc S. Klinger
Anindita Maiti
A. G. Stapleton
97
2
0
27 May 2024
SoK: Leveraging Transformers for Malware Analysis
SoK: Leveraging Transformers for Malware Analysis
Pradip Kunwar
Kshitiz Aryal
Maanak Gupta
Mahmoud Abdelsalam
Elisa Bertino
129
0
0
27 May 2024
Self-Expansion of Pre-trained Models with Mixture of Adapters for Continual Learning
Self-Expansion of Pre-trained Models with Mixture of Adapters for Continual Learning
Huiyi Wang
Haodong Lu
Lina Yao
Dong Gong
KELMCLL
92
11
0
27 Mar 2024
Streaming Sequence Transduction through Dynamic Compression
Streaming Sequence Transduction through Dynamic Compression
Weiting Tan
Yunmo Chen
Tongfei Chen
Guanghui Qin
Haoran Xu
Heidi C. Zhang
Benjamin Van Durme
Philipp Koehn
128
2
0
02 Feb 2024
OnDev-LCT: On-Device Lightweight Convolutional Transformers towards federated learning
OnDev-LCT: On-Device Lightweight Convolutional Transformers towards federated learning
Chu Myaet Thwal
Minh N. H. Nguyen
Ye Lin Tun
Seongjin Kim
My T. Thai
Choong Seon Hong
99
5
0
22 Jan 2024
Teaching Robots to Build Simulations of Themselves
Teaching Robots to Build Simulations of Themselves
Yuhang Hu
Jiong Lin
Hod Lipson
SSL
123
4
0
20 Nov 2023
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
Xuzhe Dang
Stefan Edelkamp
129
4
0
06 Nov 2023
TinyFormer: Efficient Transformer Design and Deployment on Tiny Devices
TinyFormer: Efficient Transformer Design and Deployment on Tiny Devices
Jianlei Yang
Jiacheng Liao
Fanding Lei
Meichen Liu
Junyi Chen
Lingkun Long
Han Wan
Bei Yu
Weisheng Zhao
MoE
105
2
0
03 Nov 2023
Revealing CNN Architectures via Side-Channel Analysis in Dataflow-based Inference Accelerators
Revealing CNN Architectures via Side-Channel Analysis in Dataflow-based Inference Accelerators
Hansika Weerasena
Prabhat Mishra
FedML
100
4
0
01 Nov 2023
Unsupervised Denoising for Signal-Dependent and Row-Correlated Imaging Noise
Unsupervised Denoising for Signal-Dependent and Row-Correlated Imaging Noise
Benjamin Salmon
Alexander Krull
72
1
0
11 Oct 2023
APTx: better activation function than MISH, SWISH, and ReLU's variants used in deep learning
APTx: better activation function than MISH, SWISH, and ReLU's variants used in deep learning
Ravin Kumar
61
5
0
10 Sep 2022
Uncertainty-Driven Action Quality Assessment
Uncertainty-Driven Action Quality Assessment
Caixia Zhou
Yaping Huang
66
10
0
29 Jul 2022
Highly-scalable, physics-informed GANs for learning solutions of
  stochastic PDEs
Highly-scalable, physics-informed GANs for learning solutions of stochastic PDEs
Liu Yang
Sean Treichler
Thorsten Kurth
Keno Fischer
D. Barajas-Solano
...
Valentin Churavy
A. Tartakovsky
Michael Houston
P. Prabhat
George Karniadakis
AI4CE
76
38
0
29 Oct 2019
A Neural Network Architecture Combining Gated Recurrent Unit (GRU) and
  Support Vector Machine (SVM) for Intrusion Detection in Network Traffic Data
A Neural Network Architecture Combining Gated Recurrent Unit (GRU) and Support Vector Machine (SVM) for Intrusion Detection in Network Traffic Data
Abien Fred Agarap
57
217
0
10 Sep 2017
Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning
  Algorithms
Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms
Han Xiao
Kashif Rasul
Roland Vollgraf
283
8,904
0
25 Aug 2017
Parametric Exponential Linear Unit for Deep Convolutional Neural
  Networks
Parametric Exponential Linear Unit for Deep Convolutional Neural Networks
Ludovic Trottier
Philippe Giguère
B. Chaib-draa
65
200
0
30 May 2016
Semantically Conditioned LSTM-based Natural Language Generation for
  Spoken Dialogue Systems
Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems
Tsung-Hsien Wen
Milica Gasic
N. Mrksic
Pei-hao Su
David Vandyke
S. Young
101
949
0
07 Aug 2015
Attention-Based Models for Speech Recognition
Attention-Based Models for Speech Recognition
J. Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio
127
2,607
0
24 Jun 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.9K
150,115
0
22 Dec 2014
Deep Learning using Linear Support Vector Machines
Deep Learning using Linear Support Vector Machines
Yichuan Tang
95
894
0
02 Jun 2013
1