ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1308.3432
  4. Cited By
Estimating or Propagating Gradients Through Stochastic Neurons for
  Conditional Computation

Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation

15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville
ArXiv (abs)PDFHTML

Papers citing "Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"

50 / 1,517 papers shown
Title
Low-Precision Mixed-Computation Models for Inference on Edge
Low-Precision Mixed-Computation Models for Inference on Edge
Seyedarmin Azizi
M. Nazemi
M. Kamal
Massoud Pedram
MQ
79
3
0
03 Dec 2023
Learning High-Order Relationships of Brain Regions
Learning High-Order Relationships of Brain Regions
Weikang Qiu
Huangrui Chu
Selena Wang
Haolan Zuo
Xiaoxiao Li
Yize Zhao
Rex Ying
112
6
0
02 Dec 2023
Harnessing Discrete Representations For Continual Reinforcement Learning
Harnessing Discrete Representations For Continual Reinforcement Learning
Edan Meyer
Adam White
Marlos C. Machado
OffRL
72
5
0
02 Dec 2023
Mixed-Precision Quantization for Federated Learning on
  Resource-Constrained Heterogeneous Devices
Mixed-Precision Quantization for Federated Learning on Resource-Constrained Heterogeneous Devices
Huancheng Chen
H. Vikalo
FedMLMQ
120
7
0
29 Nov 2023
Implicit-explicit Integrated Representations for Multi-view Video
  Compression
Implicit-explicit Integrated Representations for Multi-view Video Compression
Chen Zhu
Guo Lu
Bing He
Rong Xie
Li Song
54
7
0
29 Nov 2023
E-ViLM: Efficient Video-Language Model via Masked Video Modeling with
  Semantic Vector-Quantized Tokenizer
E-ViLM: Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer
Jacob Zhiyuan Fang
Skyler Zheng
Vasu Sharma
Robinson Piramuthu
VLM
161
0
0
28 Nov 2023
Text2Tree: Aligning Text Representation to the Label Tree Hierarchy for
  Imbalanced Medical Classification
Text2Tree: Aligning Text Representation to the Label Tree Hierarchy for Imbalanced Medical Classification
Jiahuan Yan
Haojun Gao
Zhang Kai
Weize Liu
Benlin Liu
Jian Wu
Jintai Chen
53
4
0
28 Nov 2023
Learning to Skip for Language Modeling
Learning to Skip for Language Modeling
Dewen Zeng
Nan Du
Tao Wang
Yuanzhong Xu
Tao Lei
Zhifeng Chen
Claire Cui
71
12
0
26 Nov 2023
CRISP: Hybrid Structured Sparsity for Class-aware Model Pruning
CRISP: Hybrid Structured Sparsity for Class-aware Model Pruning
Shivam Aggarwal
Kuluhan Binici
Tulika Mitra
VLM
62
4
0
24 Nov 2023
Compact 3D Gaussian Representation for Radiance Field
Compact 3D Gaussian Representation for Radiance Field
J. Lee
Daniel Rho
Xiangyu Sun
Jong Hwan Ko
Eunbyung Park
3DGS
114
204
0
22 Nov 2023
Differentiable Sampling of Categorical Distributions Using the
  CatLog-Derivative Trick
Differentiable Sampling of Categorical Distributions Using the CatLog-Derivative Trick
Lennert De Smet
Emanuele Sansone
Pedro Zuidberg Dos Martires
78
13
0
21 Nov 2023
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive
  Review
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review
M. Lê
Pierre Wolinski
Julyan Arbel
91
10
0
20 Nov 2023
Low-Precision Floating-Point for Efficient On-Board Deep Neural Network
  Processing
Low-Precision Floating-Point for Efficient On-Board Deep Neural Network Processing
Cédric Gernigon
Silviu-Ioan Filip
Olivier Sentieys
Clément Coggiola
Mickael Bruno
MQ
58
8
0
18 Nov 2023
Interpretable Reinforcement Learning for Robotics and Continuous Control
Interpretable Reinforcement Learning for Robotics and Continuous Control
Rohan R. Paleja
Letian Chen
Yaru Niu
Andrew Silva
Zhaoxin Li
...
K. Chang
H. E. Tseng
Yan Wang
S. Nageshrao
Matthew C. Gombolay
80
7
0
16 Nov 2023
Adversarially Robust Spiking Neural Networks Through Conversion
Adversarially Robust Spiking Neural Networks Through Conversion
Ozan Özdenizci
Robert Legenstein
AAML
86
10
0
15 Nov 2023
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized
  Representation
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation
Jiangzong Wang
Pengcheng Li
Xulong Zhang
Ning Cheng
Jing Xiao
81
0
0
14 Nov 2023
Explainable History Distillation by Marked Temporal Point Process
Explainable History Distillation by Marked Temporal Point Process
Sishun Liu
Ke Deng
Yan Wang
Xiuzhen Zhang
71
0
0
13 Nov 2023
Pruning random resistive memory for optimizing analogue AI
Pruning random resistive memory for optimizing analogue AI
Yi Li
Song-jian Wang
Yaping Zhao
Shaocong Wang
Woyu Zhang
...
Xiaoxin Xu
Dashan Shang
Qi Liu
Kwang-Ting Cheng
Ming-Yuan Liu
43
1
0
13 Nov 2023
AccEPT: An Acceleration Scheme for Speeding Up Edge Pipeline-parallel
  Training
AccEPT: An Acceleration Scheme for Speeding Up Edge Pipeline-parallel Training
Yuhao Chen
Yuxuan Yan
Qianqian Yang
Yuanchao Shu
Shibo He
Zhiguo Shi
Jiming Chen
84
0
0
10 Nov 2023
Real-Time Neural Rasterization for Large Scenes
Real-Time Neural Rasterization for Large Scenes
Jeffrey Yunfan Liu
Yun Chen
Ze Yang
Jingkang Wang
S. Manivasagam
R. Urtasun
AI4TSAI4CE
106
35
0
09 Nov 2023
RepQ: Generalizing Quantization-Aware Training for Re-Parametrized
  Architectures
RepQ: Generalizing Quantization-Aware Training for Re-Parametrized Architectures
Anastasiia Prutianova
Alexey Zaytsev
Chung-Kuei Lee
Fengyu Sun
Ivan Koryakovskiy
MQ
66
0
0
09 Nov 2023
Reducing the Side-Effects of Oscillations in Training of Quantized YOLO
  Networks
Reducing the Side-Effects of Oscillations in Training of Quantized YOLO Networks
Kartik Gupta
Akshay Asthana
MQ
38
8
0
09 Nov 2023
Recursion in Recursion: Two-Level Nested Recursion for Length
  Generalization with Scalability
Recursion in Recursion: Two-Level Nested Recursion for Length Generalization with Scalability
Jishnu Ray Chowdhury
Cornelia Caragea
78
5
0
08 Nov 2023
AFPQ: Asymmetric Floating Point Quantization for LLMs
AFPQ: Asymmetric Floating Point Quantization for LLMs
Yijia Zhang
Sicheng Zhang
Shijie Cao
Dayou Du
Jianyu Wei
Ting Cao
Ningyi Xu
MQ
58
5
0
03 Nov 2023
CoPriv: Network/Protocol Co-Optimization for Communication-Efficient
  Private Inference
CoPriv: Network/Protocol Co-Optimization for Communication-Efficient Private Inference
Wenxuan Zeng
Meng Li
Haichuan Yang
Wen-jie Lu
Runsheng Wang
Ru Huang
94
7
0
03 Nov 2023
Attacking Graph Neural Networks with Bit Flips: Weisfeiler and Lehman Go
  Indifferent
Attacking Graph Neural Networks with Bit Flips: Weisfeiler and Lehman Go Indifferent
Lorenz Kummer
Samir Moustafa
Nils N. Kriege
Wilfried N. Gansterer
GNNAAML
86
0
0
02 Nov 2023
Copilot4D: Learning Unsupervised World Models for Autonomous Driving via
  Discrete Diffusion
Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion
Lunjun Zhang
Yuwen Xiong
Ze Yang
Sergio Casas
Rui Hu
R. Urtasun
106
60
0
02 Nov 2023
Fully Quantized Always-on Face Detector Considering Mobile Image Sensors
Fully Quantized Always-on Face Detector Considering Mobile Image Sensors
Haechang Lee
Wongi Jeong
Dongil Ryu
Hyunwoo Je
Albert No
Kijeong Kim
Se Young Chun
CVBM
59
0
0
02 Nov 2023
Learn to Categorize or Categorize to Learn? Self-Coding for Generalized
  Category Discovery
Learn to Categorize or Categorize to Learn? Self-Coding for Generalized Category Discovery
Sarah Rastegar
Hazel Doughty
Cees G. M. Snoek
128
17
0
30 Oct 2023
Differentiable Learning of Generalized Structured Matrices for Efficient
  Deep Neural Networks
Differentiable Learning of Generalized Structured Matrices for Efficient Deep Neural Networks
Changwoo Lee
Hun-Seok Kim
59
3
0
29 Oct 2023
Improving Compositional Generalization Using Iterated Learning and
  Simplicial Embeddings
Improving Compositional Generalization Using Iterated Learning and Simplicial Embeddings
Yi Ren
Samuel Lavoie
Mikhail Galkin
Danica J. Sutherland
Aaron Courville
86
16
0
28 Oct 2023
Med-DANet V2: A Flexible Dynamic Architecture for Efficient Medical
  Volumetric Segmentation
Med-DANet V2: A Flexible Dynamic Architecture for Efficient Medical Volumetric Segmentation
Haoran Shen
Yifu Zhang
Wenxuan Wang
Chen Chen
Jing Liu
Shanshan Song
Jiangyun Li
MedIm
67
0
0
28 Oct 2023
Scale-Adaptive Feature Aggregation for Efficient Space-Time Video
  Super-Resolution
Scale-Adaptive Feature Aggregation for Efficient Space-Time Video Super-Resolution
Zhewei Huang
Ailin Huang
Xiaotao Hu
Chen Hu
Jun Xu
Shuchang Zhou
89
8
0
26 Oct 2023
Codebook Features: Sparse and Discrete Interpretability for Neural
  Networks
Codebook Features: Sparse and Discrete Interpretability for Neural Networks
Alex Tamkin
Mohammad Taufeeque
Noah D. Goodman
87
29
0
26 Oct 2023
QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models
QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models
Elias Frantar
Dan Alistarh
MQMoE
89
29
0
25 Oct 2023
SpikingJelly: An open-source machine learning infrastructure platform
  for spike-based intelligence
SpikingJelly: An open-source machine learning infrastructure platform for spike-based intelligence
Wei Fang
Yanqing Chen
Jianhao Ding
Zhaofei Yu
T. Masquelier
Ding Chen
Liwei Huang
Huihui Zhou
Guoqi Li
Yonghong Tian
116
238
0
25 Oct 2023
On the Interplay between Fairness and Explainability
On the Interplay between Fairness and Explainability
Stephanie Brandl
Emanuele Bugliarello
Ilias Chalkidis
FaML
99
5
0
25 Oct 2023
VMAF Re-implementation on PyTorch: Some Experimental Results
VMAF Re-implementation on PyTorch: Some Experimental Results
Kirill Aistov
Maxim Koroteev
148
2
0
24 Oct 2023
Graph Deep Learning for Time Series Forecasting
Graph Deep Learning for Time Series Forecasting
Andrea Cini
Ivan Marisca
Daniele Zambon
Cesare Alippi
AI4TSAI4CE
128
16
0
24 Oct 2023
Towards Hybrid-grained Feature Interaction Selection for Deep Sparse
  Network
Towards Hybrid-grained Feature Interaction Selection for Deep Sparse Network
Fuyuan Lyu
Xing Tang
Dugang Liu
Chen Ma
Weihong Luo
Liang Chen
Xiuqiang He
Xue Liu
100
2
0
23 Oct 2023
Projected Stochastic Gradient Descent with Quantum Annealed Binary
  Gradients
Projected Stochastic Gradient Descent with Quantum Annealed Binary Gradients
Maximilian Krahn
Michele Sasdelli
Fengyi Yang
Vladislav Golyanik
Arno Solin
Tat-Jun Chin
Tolga Birdal
MQ
177
2
0
23 Oct 2023
SpVOS: Efficient Video Object Segmentation with Triple Sparse
  Convolution
SpVOS: Efficient Video Object Segmentation with Triple Sparse Convolution
Weihao Lin
Tao Chen
Chong Yu
VOS
80
3
0
23 Oct 2023
Hierarchical Vector Quantized Transformer for Multi-class Unsupervised
  Anomaly Detection
Hierarchical Vector Quantized Transformer for Multi-class Unsupervised Anomaly Detection
Ruiying Lu
YuJie Wu
Long Tian
Dongsheng Wang
Bo Chen
Xiyang Liu
Ruimin Hu
113
43
0
22 Oct 2023
Calibrating Neural Simulation-Based Inference with Differentiable
  Coverage Probability
Calibrating Neural Simulation-Based Inference with Differentiable Coverage Probability
Maciej Falkiewicz
Naoya Takeishi
Imahn Shekhzadeh
Antoine Wehenkel
Arnaud Delaunoy
Gilles Louppe
Alexandros Kalousis
81
8
0
20 Oct 2023
DIG-MILP: a Deep Instance Generator for Mixed-Integer Linear Programming
  with Feasibility Guarantee
DIG-MILP: a Deep Instance Generator for Mixed-Integer Linear Programming with Feasibility Guarantee
Haoyu Wang
Jialin Liu
Xiaohan Chen
Xinshang Wang
Pan Li
Wotao Yin
63
6
0
20 Oct 2023
BitNet: Scaling 1-bit Transformers for Large Language Models
BitNet: Scaling 1-bit Transformers for Large Language Models
Hongyu Wang
Shuming Ma
Li Dong
Shaohan Huang
Huaijie Wang
Lingxiao Ma
Fan Yang
Ruiping Wang
Yi Wu
Furu Wei
MQ
78
119
0
17 Oct 2023
Tracking and Mapping in Medical Computer Vision: A Review
Tracking and Mapping in Medical Computer Vision: A Review
Adam Schmidt
Omid Mohareri
S. DiMaio
Michael C. Yip
Septimiu E. Salcudean
129
37
0
17 Oct 2023
TEQ: Trainable Equivalent Transformation for Quantization of LLMs
TEQ: Trainable Equivalent Transformation for Quantization of LLMs
Wenhua Cheng
Yiyang Cai
Kaokao Lv
Haihao Shen
MQ
99
7
0
17 Oct 2023
Hamming Encoder: Mining Discriminative k-mers for Discrete Sequence
  Classification
Hamming Encoder: Mining Discriminative k-mers for Discrete Sequence Classification
Junjie Dong
Mudi Jiang
Lianyu Hu
Zengyou He
58
0
0
16 Oct 2023
STORM: Efficient Stochastic Transformer based World Models for
  Reinforcement Learning
STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning
Weipu Zhang
Gang Wang
Jian Sun
Yetian Yuan
Gao Huang
104
45
0
14 Oct 2023
Previous
123...91011...293031
Next