ResearchTrend.AI
Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation

15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville

Papers citing "Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"

50 / 1,869 papers shown
Gaussian Weight Sampling for Scalable, Efficient and Stable Pseudo-Quantization Training
Myeonghwan Ahn
Sungjoo Yoo
MQ
17
0
0
16 May 2025
An Introduction to Discrete Variational Autoencoders
Alan Jeffares
Liyuan Liu
DRL
BDL
CML
41
0
0
15 May 2025
Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs
Jingcheng Niu
Xingdi Yuan
Tong Wang
Hamidreza Saghir
Amir H. Abdi
27
0
0
14 May 2025
Efficient Mixed Precision Quantization in Graph Neural Networks
Samir Moustafa
Nils M. Kriege
Wilfried Gansterer
GNN
MQ
35
0
0
14 May 2025
Analog Foundation Models
Julian Büchel
Iason Chalas
Giovanni Acampa
An Chen
Omobayode Fagbohungbe
Sidney Tsai
Kaoutar El Maghraoui
Manuel Le Gallo
Abbas Rahimi
Abu Sebastian
MQ
35
0
0
14 May 2025
Differentiable Channel Selection in Self-Attention For Person Re-Identification
Yancheng Wang
Nebojsa Jojic
Yingzhen Yang
29
0
0
13 May 2025
Resource-Efficient Language Models: Quantization for Fast and Accessible Inference
Tollef Emil Jørgensen
MQ
54
0
0
13 May 2025
Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding
Dianwen Ng
Kun Zhou
Yi-Wen Chao
Zhiwei Xiong
B. Ma
E. Chng
33
0
0
12 May 2025
Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach
Minting Pan
Yitao Zheng
Jiajian Li
Yunbo Wang
Xiaokang Yang
OffRL
48
0
0
10 May 2025
Diffusion Model Quantization: A Review
Qian Zeng
Chenggong Hu
Mingli Song
Jie Song
MQ
45
0
0
08 May 2025
Input-Specific and Universal Adversarial Attack Generation for Spiking Neural Networks in the Spiking Domain
Spyridon Raptis
Haralampos-G. Stratigopoulos
AAML
28
0
0
07 May 2025
Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques
Sanjay Surendranath Girija
Shashank Kapoor
Lakshit Arora
Dipen Pradhan
Aman Raj
Ankit Shetgaonkar
57
0
0
05 May 2025
HybridGS: High-Efficiency Gaussian Splatting Data Compression using Dual-Channel Sparse Representation and Point Cloud Encoder
Qi Yang
Le Yang
G. Van der Auwera
Zhu Li
31
0
0
03 May 2025
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs
Hongyu Wang
Shuming Ma
Furu Wei
MQ
51
1
0
25 Apr 2025
POET: Prompt Offset Tuning for Continual Human Action Adaptation
Prachi Garg
Joseph K J
V. Balasubramanian
Necati Cihan Camgöz
Chengde Wan
Kenrick Kin
Weiguang Si
Shugao Ma
Fernando de la Torre
66
0
0
25 Apr 2025
Precision Neural Network Quantization via Learnable Adaptive Modules
Wenqiang Zhou
Zhendong Yu
X. Liu
Jiaming Yang
Rong Xiao
Tao Wang
Chenwei Tang
Jiancheng Lv
MQ
51
0
0
24 Apr 2025
DRAWER: Digital Reconstruction and Articulation With Environment Realism
Hongchi Xia
Entong Su
Marius Memmel
Arhan Jain
Raymond Yu
Numfor Mbiziwo-Tiapo
Ali Farhadi
Abhishek Gupta
Shenlong Wang
Wei-Chiu Ma
VGen
30
1
0
21 Apr 2025
TTRD3: Texture Transfer Residual Denoising Dual Diffusion Model for Remote Sensing Image Super-Resolution
Yide Liu
Haijiang Sun
Xiaowen Zhang
Qiaoyuan Liu
Zhouchang Chen
Chongzhuo Xiao
DiffM
39
0
0
17 Apr 2025
Learning Compatible Multi-Prize Subnetworks for Asymmetric Retrieval
Yushuai Sun
Zikun Zhou
D. Jiang
Yaowei Wang
Jun Yu
Guangming Lu
Wenjie Pei
34
0
0
16 Apr 2025
Dense Backpropagation Improves Training for Sparse Mixture-of-Experts
Ashwinee Panda
Vatsal Baherwani
Zain Sarwar
Benjamin Thérien
Supriyo Chakraborty
Tom Goldstein
MoE
42
0
0
16 Apr 2025
Optimizing Compound Retrieval Systems
Harrie Oosterhuis
R. Jagerman
Zhen Qin
Xuanhui Wang
36
0
0
16 Apr 2025
Quantization Error Propagation: Revisiting Layer-Wise Post-Training Quantization
Yamato Arai
Yuma Ichikawa
MQ
34
0
0
13 Apr 2025
Slow Thinking for Sequential Recommendation
Junjie Zhang
Beichen Zhang
Wenqi Sun
Hongyu Lu
Wayne Xin Zhao
Yu Chen
Zhicheng Dou
OffRL
LRM
39
0
0
13 Apr 2025
DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation
Wangbo Zhao
Yizeng Han
Jiasheng Tang
Kai Wang
Hao Luo
Yibing Song
Gao Huang
Fan Wang
Yang You
74
0
0
09 Apr 2025
Adapting World Models with Latent-State Dynamics Residuals
JB Lanier
Kyungmin Kim
Armin Karamzade
Yifei Liu
Ankita Sinha
Kat He
Davide Corsi
Roy Fox
46
0
0
03 Apr 2025
Push-Grasp Policy Learning Using Equivariant Models and Grasp Score Optimization
Boce Hu
Heng Tian
Dian Wang
Haojie Huang
Xupeng Zhu
Robin Walters
Robert W. Platt
39
0
0
03 Apr 2025
Moment Quantization for Video Temporal Grounding
Xiaolong Sun
Le Wang
Sanping Zhou
Liushuai Shi
Kun Xia
Mengnan Liu
Yabing Wang
Gang Hua
MQ
31
0
0
03 Apr 2025
AdaRank: Adaptive Rank Pruning for Enhanced Model Merging
Chanhyuk Lee
Jiho Choi
Chanryeol Lee
Donggyun Kim
Seunghoon Hong
MoMe
55
0
0
28 Mar 2025
Harnessing uncertainty when learning through Equilibrium Propagation in neural networks
Jonathan Peters
Philippe Talatchian
42
0
0
28 Mar 2025
Boosting Large Language Models with Mask Fine-Tuning
M. Zhang
Yue Bai
Huan Wang
Yizhou Wang
Qihua Dong
Y. Fu
CLL
53
0
0
27 Mar 2025
Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework
Soham Sane
MoE
67
0
0
26 Mar 2025
Faster Parameter-Efficient Tuning with Token Redundancy Reduction
Kwonyoung Kim
Jungin Park
Jin-Hwa Kim
Hyeongjun Kwon
Kwanghoon Sohn
70
0
0
26 Mar 2025
TC-GS: Tri-plane based compression for 3D Gaussian Splatting
Taorui Wang
Zitong Yu
Yong Xu
3DGS
76
0
0
26 Mar 2025
QUAD: Quantization and Parameter-Efficient Tuning of LLM with Activation Decomposition
Yuxuan Hu
Xiaodong Chen
C. Li
H. Chen
J. Zhang
MQ
60
0
0
25 Mar 2025
CODA: Repurposing Continuous VAEs for Discrete Tokenization
Zeyu Liu
Zanlin Ni
Yeguo Hua
Xin Deng
Xiao Ma
Cheng Zhong
Gao Huang
47
0
0
22 Mar 2025
Optimized Minimal 3D Gaussian Splatting
J. Lee
J. Ko
Eunbyung Park
3DGS
44
0
0
21 Mar 2025
Offline Model-Based Optimization: Comprehensive Review
Minsu Kim
Jiayao Gu
Ye Yuan
Taeyoung Yun
Ziqiang Liu
Yoshua Bengio
Can Chen
OffRL
67
2
0
21 Mar 2025
Efficient ANN-Guided Distillation: Aligning Rate-based Features of Spiking Neural Networks through Hybrid Block-wise Replacement
Shu Yang
Chao Yu
Lei Liu
Hanzhi Ma
Aili Wang
Erping Li
44
0
0
20 Mar 2025
Input-Triggered Hardware Trojan Attack on Spiking Neural Networks
Spyridon Raptis
Paul Kling
Ioannis Kaskampas
Ihsen Alouani
Haralampos-G. Stratigopoulos
AAML
49
1
0
20 Mar 2025
Towards Automated Semantic Interpretability in Reinforcement Learning via Vision-Language Models
Zhaoxin Li
Zhang Xi-Jia
Batuhan Altundas
Letian Chen
Rohan R. Paleja
Matthew C. Gombolay
OffRL
46
0
0
20 Mar 2025
Natural Quantization of Neural Networks
Richard Barney
Djamil Lakhdar-Hamina
Victor Galitski
MQ
45
0
0
19 Mar 2025
Cube: A Roblox View of 3D Intelligence
Foundation AI Team Roblox
Kiran Bhat
Nishchaie Khanna
Karun Channa
Tinghui Zhou
...
Kyle Price
Steve Han
Yiqing Wang
A. Singh
David Baszucki
63
0
0
19 Mar 2025
Efficient Personalization of Quantized Diffusion Model without Backpropagation
H. Seo
Wongi Jeong
Kyungryeol Lee
Se Young Chun
DiffM
MQ
78
0
0
19 Mar 2025
PARQ: Piecewise-Affine Regularized Quantization
Lisa Jin
Jianhao Ma
Zechun Liu
Andrey Gromov
Aaron Defazio
Lin Xiao
MQ
43
0
0
19 Mar 2025
Robust Machine Unlearning for Quantized Neural Networks via Adaptive Gradient Reweighting with Similar Labels
Yujia Tong
Yuze Wang
Jingling Yuan
Chuang Hu
NoLa
71
0
0
18 Mar 2025
ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning
Baohao Liao
Christian Herold
Seyyed Hadi Hashemi
Stefan Vasilev
Shahram Khadivi
Christof Monz
MQ
44
0
0
17 Mar 2025
Stabilizing Quantization-Aware Training by Implicit-Regularization on Hessian Matrix
Junbiao Pang
Tianyang Cai
44
1
0
14 Mar 2025
TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
Xuying Zhang
Yutong Liu
Yangguang Li
Renrui Zhang
Y. Liu
...
Wanli Ouyang
Zhiwei Xiong
Peng Gao
Qibin Hou
Ming-Ming Cheng
127
3
0
13 Mar 2025
LUMOS: Language-Conditioned Imitation Learning with World Models
Iman Nematollahi
Branton DeMoss
Akshay L Chandra
Nick Hawes
Wolfram Burgard
Ingmar Posner
OffRL
43
0
0
13 Mar 2025
Generative Binary Memory: Pseudo-Replay Class-Incremental Learning on Binarized Embeddings
Yanis Basso-Bert
Anca Molnos
Romain Lemaire
William Guicquero
Antoine Dupret
BDL
61
0
0
13 Mar 2025