Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation

15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville

Papers citing "Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"

50 / 1,519 papers shown
Magnitude Attention-based Dynamic Pruning
Jihye Back
Namhyuk Ahn
Jang-Hyun Kim
82
2
0
08 Jun 2023
Learning Probabilistic Symmetrization for Architecture Agnostic Equivariance
Jinwoo Kim
Tien Dat Nguyen
Ayhan Suleymanzade
Hyeokjun An
Seunghoon Hong
102
24
0
05 Jun 2023
Encoding Time-Series Explanations through Self-Supervised Model Behavior Consistency
Owen Queen
Thomas Hartvigsen
Teddy Koker
Huan He
Theodoros Tsiligkaridis
Marinka Zitnik
AI4TS
98
21
0
03 Jun 2023
Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition
H. Haresamudram
Irfan Essa
Thomas Ploetz
102
8
0
01 Jun 2023
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Ji Lin
Jiaming Tang
Haotian Tang
Shang Yang
Wei-Ming Chen
Wei-Chen Wang
Guangxuan Xiao
Xingyu Dang
Chuang Gan
Song Han
EDL, MQ
241
588
0
01 Jun 2023
Continual Task Allocation in Meta-Policy Network via Sparse Prompting
Yijun Yang
Tianyi Zhou
Jing Jiang
Guodong Long
Yuhui Shi
CLL, OffRL
95
9
0
29 May 2023
Learning to Quantize Vulnerability Patterns and Match to Locate Statement-Level Vulnerabilities
Michael Fu
Trung Le
Van Nguyen
Chakkrit Tantithamthavorn
Dinh Q. Phung
72
3
0
26 May 2023
Inventing art styles with no artistic training data
Nilin Abrahamsen
Jiahao Yao
GAN
103
3
0
19 May 2023
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning
Alexander H. Liu
Heng-Jui Chang
Michael Auli
Wei-Ning Hsu
James R. Glass
92
26
0
17 May 2023
GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples
T. Gao
Chengzhong Xu
Le Zhang
Hui Kong
121
4
0
13 May 2023
Accelerator-Aware Training for Transducer-Based Speech Recognition
Suhaila M. Shakiah
Rupak Vignesh Swaminathan
Hieu Duy Nguyen
Raviteja Chinta
Tariq Afzal
Nathan Susanj
Athanasios Mouchtaris
Grant P. Strimel
Ariya Rastrow
54
1
0
12 May 2023
Domain Agnostic Image-to-image Translation using Low-Resolution Conditioning
M. Abid
Arman Afrasiyabi
Ihsen Hedhli
Jean-François Lalonde
Christian Gagné
VLM
71
0
0
08 May 2023
Input Layer Binarization with Bit-Plane Encoding
Lorenzo Vorabbi
Davide Maltoni
Stefano Santi
MQ
60
6
0
04 May 2023
Guaranteed Quantization Error Computation for Neural Network Model Compression
Wesley Cooke
Zihao Mo
Weiming Xiang
36
5
0
26 Apr 2023
Binary stochasticity enabled highly efficient neuromorphic deep learning achieves better-than-software accuracy
Yang Li
Wei Wang
Ming Wang
C. Dou
Zhengyu Ma
...
Guanhua Yang
Feng Zhang
Ling Li
Daniele Ielmini
Ming-Yuan Liu
30
5
0
25 Apr 2023
Learning Task-Specific Strategies for Accelerated MRI
Zihui Wu
Tianwei Yin
Yu Sun
R. Frost
A. Kouwe
Adrian Dalca
Katherine Bouman
70
4
0
25 Apr 2023
Efficient Halftoning via Deep Reinforcement Learning
Haitian Jiang
Dongliang Xiong
Xiaowen Jiang
Li Ding
Liang Chen
Kai Huang
67
3
0
24 Apr 2023
Bridging Discrete and Backpropagation: Straight-Through and Beyond
Liyuan Liu
Chengyu Dong
Xiaodong Liu
Bin Yu
Jianfeng Gao
BDL
92
23
0
17 Apr 2023
Revisiting Single-gated Mixtures of Experts
Amelie Royer
I. Karmanov
Andrii Skliar
B. Bejnordi
Tijmen Blankevoort
MoE, MoMe
75
6
0
11 Apr 2023
RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors
Ruixia Wu
Zheng-Peng Duan
Chunle Guo
Zhi Chai
Chongyi Li
75
93
0
08 Apr 2023
GradMDM: Adversarial Attack on Dynamic Networks
Jianhong Pan
Lin Geng Foo
Qichen Zheng
Zhipeng Fan
Hossein Rahmani
Qiuhong Ke
Jing Liu
AAML
90
7
0
01 Apr 2023
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer
Sheng Xu
Yanjing Li
Mingbao Lin
Penglei Gao
Guodong Guo
Jinhu Lu
Baochang Zhang
MQ
98
24
0
01 Apr 2023
Randomly Initialized Subnetworks with Iterative Weight Recycling
Matt Gorbett
L. D. Whitley
77
4
0
28 Mar 2023
Forget-free Continual Learning with Soft-Winning SubNetworks
Haeyong Kang
Jaehong Yoon
Sultan Rizky Hikmawan Madjid
Sung Ju Hwang
Chang D. Yoo
CLL
105
4
0
27 Mar 2023
A Dynamic Multi-Scale Voxel Flow Network for Video Prediction
Xiaotao Hu
Zhewei Huang
Ailin Huang
Jun Xu
Shuchang Zhou
VGen
102
71
0
17 Mar 2023
Gated Compression Layers for Efficient Always-On Models
Haiguang Li
T. Thormundsson
I. Poupyrev
N. Gillian
85
2
0
15 Mar 2023
Efficient Transformer-based 3D Object Detection with Dynamic Token Halting
Mao Ye
Gregory P. Meyer
Yuning Chai
Qiang Liu
78
9
0
09 Mar 2023
Optimal ANN-SNN Conversion for High-accuracy and Ultra-low-latency Spiking Neural Networks
Tong Bu
Wei Fang
Jianhao Ding
Penglin Dai
Zhaofei Yu
Tiejun Huang
157
210
0
08 Mar 2023
Robustness-preserving Lifelong Learning via Dataset Condensation
Jinghan Jia
Yihua Zhang
Dogyoon Song
Sijia Liu
Alfred Hero
DD
67
5
0
07 Mar 2023
MetaGrad: Adaptive Gradient Quantization with Hypernetworks
Kaixin Xu
Alina Hui Xiu Lee
Ziyuan Zhao
Zhe Wang
Min-man Wu
Weisi Lin
MQ
86
1
0
04 Mar 2023
Fixed-point quantization aware training for on-device keyword-spotting
Sashank Macha
Om Oza
Alex Escott
Francesco Calivá
Robert M. Armitano
S. Cheekatmalla
S. Parthasarathi
Yuzong Liu
MQ
49
4
0
04 Mar 2023
MERF: Memory-Efficient Radiance Fields for Real-time View Synthesis in Unbounded Scenes
Christian Reiser
Richard Szeliski
Dor Verbin
Pratul P. Srinivasan
B. Mildenhall
Andreas Geiger
Jonathan T. Barron
Peter Hedman
149
235
0
23 Feb 2023
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
Edoardo Ponti
MoMe, OOD
161
80
0
22 Feb 2023
Entity-Level Text-Guided Image Manipulation
Yikai Wang
Jianan Wang
Guansong Lu
Hang Xu
Zhenguo Li
Wei Zhang
Yanwei Fu
VGen
73
3
0
22 Feb 2023
Learning a Consensus Sub-Network with Polarization Regularization and One Pass Training
Xiaoying Zhi
Varun Babbar
P. Sun
Fran Silavong
Ruibo Shi
Sean J. Moran
Sean Moran
151
1
0
17 Feb 2023
Unsupervised Hashing with Similarity Distribution Calibration
KamWoh Ng
Xiatian Zhu
Jiun Tian Hoe
Chee Seng Chan
Tianyu Zhang
Yi-Zhe Song
Tao Xiang
63
2
0
15 Feb 2023
Towards Optimal Compression: Joint Pruning and Quantization
Ben Zandonati
Glenn Bucagu
Adrian Alan Pol
M. Pierini
Olya Sirkin
Tal Kopetz
MQ
95
3
0
15 Feb 2023
SEAM: Searching Transferable Mixed-Precision Quantization Policy through Large Margin Regularization
Chen Tang
Kai Ouyang
Zenghao Chai
Yunpeng Bai
Yuan Meng
Zhi Wang
Wenwu Zhu
MQ
79
9
0
14 Feb 2023
Reliability Assurance for Deep Neural Network Architectures Against Numerical Defects
Linyi Li
Yuhao Zhang
Luyao Ren
Yingfei Xiong
Tao Xie
63
9
0
13 Feb 2023
BEST: BERT Pre-Training for Sign Language Recognition with Coupling Tokenization
Weichao Zhao
Hezhen Hu
Wen-gang Zhou
Jiaxin Shi
Houqiang Li
SLR
74
33
0
10 Feb 2023
Learning Discretized Neural Networks under Ricci Flow
Jun Chen
Han Chen
Mengmeng Wang
Guang Dai
Ivor W. Tsang
Yang Liu
85
4
0
07 Feb 2023
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
OffRL
82
18
0
06 Feb 2023
Self-Compressing Neural Networks
Szabolcs Cséfalvay
J. Imber
56
3
0
30 Jan 2023
Understanding INT4 Quantization for Transformer Models: Latency Speedup, Composability, and Failure Cases
Xiaoxia Wu
Cheng-rong Li
Reza Yazdani Aminabadi
Z. Yao
Yuxiong He
MQ
77
25
0
27 Jan 2023
Deep Quantum Error Correction
Yoni Choukroun
Lior Wolf
70
11
0
27 Jan 2023
Masked Vector Quantization
David D. Nguyen
David Leibowitz
Surya Nepal
S. Kanhere
MQ
47
0
0
16 Jan 2023
RedBit: An End-to-End Flexible Framework for Evaluating the Accuracy of Quantized CNNs
A. M. Ribeiro-dos-Santos
João Dinis Ferreira
O. Mutlu
G. Falcão
MQ
97
2
0
15 Jan 2023
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
97
617
0
10 Jan 2023
Dynamic Grained Encoder for Vision Transformers
Lin Song
Songyang Zhang
Songtao Liu
Zeming Li
Xuming He
Hongbin Sun
Jian Sun
Nanning Zheng
ViT
87
34
0
10 Jan 2023
AdaEnsemble: Learning Adaptively Sparse Structured Ensemble Network for Click-Through Rate Prediction
Yachen Yan
Liubo Li
84
3
0
06 Jan 2023