ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1308.3432
  4. Cited By
Estimating or Propagating Gradients Through Stochastic Neurons for
  Conditional Computation

Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation

15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville
ArXiv (abs)PDFHTML

Papers citing "Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"

50 / 1,511 papers shown
Title
Visual-Instructed Degradation Diffusion for All-in-One Image Restoration
Visual-Instructed Degradation Diffusion for All-in-One Image Restoration
Wenyang Luo
Haina Qin
Zewen Chen
L. xilinx Wang
Dandan Zheng
Yuming Li
Yufan Liu
B. Li
Weiming Hu
28
0
0
20 Jun 2025
Distribution Parameter Actor-Critic: Shifting the Agent-Environment Boundary for Diverse Action Spaces
Distribution Parameter Actor-Critic: Shifting the Agent-Environment Boundary for Diverse Action Spaces
Jiamin He
A. Rupam Mahmood
Martha White
24
0
0
19 Jun 2025
Enhancing Vector Quantization with Distributional Matching: A Theoretical and Empirical Study
Enhancing Vector Quantization with Distributional Matching: A Theoretical and Empirical Study
Xianghong Fang
Litao Guo
Hengchao Chen
Yuxuan Zhang
XiaofanXia
...
Yexin Liu
Hao Wang
Harry Yang
Yuan Yuan
Qiang Sun
MQ
31
0
0
18 Jun 2025
Quantizing Small-Scale State-Space Models for Edge AI
Quantizing Small-Scale State-Space Models for Edge AI
Leo Zhao
Tristan Torchet
Melika Payvand
Laura Kriener
Filippo Moro
MQ
28
0
0
14 Jun 2025
The Effect of Stochasticity in Score-Based Diffusion Sampling: a KL Divergence Analysis
The Effect of Stochasticity in Score-Based Diffusion Sampling: a KL Divergence Analysis
Bernardo P. Schaeffer
Ricardo M. S. Rosa
Glauco Valle
DiffM
19
0
0
13 Jun 2025
Vision Generalist Model: A Survey
Vision Generalist Model: A Survey
Ziyi Wang
Yongming Rao
Shuofeng Sun
Xinrun Liu
Yi Wei
...
Zuyan Liu
Yanbo Wang
Hongmin Liu
Jie Zhou
Jiwen Lu
72
0
0
11 Jun 2025
Towards Reasonable Concept Bottleneck Models
Nektarios Kalampalikis
Kavya Gupta
Georgi Vitanov
Isabel Valera
LRM
103
0
0
05 Jun 2025
A Smooth Sea Never Made a Skilled SAILOR\texttt{SAILOR}SAILOR: Robust Imitation via Learning to Search
A. Jain
Vibhakar Mohta
Subin Kim
Atiksh Bhardwaj
Juntao Ren
Yunhai Feng
Sanjiban Choudhury
Gokul Swamy
OffRL
126
0
0
05 Jun 2025
BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing
BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing
Masaya Kawamura
Takuya Hasumi
Yuma Shirahata
Ryuichi Yamamoto
MQ
56
0
0
04 Jun 2025
Unifying Uniform and Binary-coding Quantization for Accurate Compression of Large Language Models
Unifying Uniform and Binary-coding Quantization for Accurate Compression of Large Language Models
Seungcheol Park
Jeongin Bae
Beomseok Kwon
Minjun Kim
Byeongwook Kim
S. Kwon
U. Kang
Dongsoo Lee
MQ
149
0
0
04 Jun 2025
SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling
SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling
Anhao Zhao
Fanghua Ye
Yingqi Fan
Junlong Tong
Zhiwei Fei
Hui Su
Xiaoyu Shen
70
0
0
04 Jun 2025
Learning Binarized Representations with Pseudo-positive Sample Enhancement for Efficient Graph Collaborative Filtering
Learning Binarized Representations with Pseudo-positive Sample Enhancement for Efficient Graph Collaborative Filtering
Yankai Chen
Yue Que
Xinni Zhang
Chen Ma
Irwin King
75
0
0
03 Jun 2025
Simple, Good, Fast: Self-Supervised World Models Free of Baggage
Simple, Good, Fast: Self-Supervised World Models Free of Baggage
Jan Robine
Marc Höftmann
Stefan Harmeling
DRLOCL
71
1
0
03 Jun 2025
AUTOCIRCUIT-RL: Reinforcement Learning-Driven LLM for Automated Circuit Topology Generation
AUTOCIRCUIT-RL: Reinforcement Learning-Driven LLM for Automated Circuit Topology Generation
Prashanth Vijayaraghavan
Luyao Shi
Ehsan Degan
Vandana Mukherjee
Xin Zhang
72
0
0
03 Jun 2025
Unified Scaling Laws for Compressed Representations
Unified Scaling Laws for Compressed Representations
Andrei Panferov
Alexandra Volkova
Ionut-Vlad Modoranu
Vage Egiazarian
M. Safaryan
Dan Alistarh
59
0
0
02 Jun 2025
MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation
MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation
Yakun Song
Jiawei Chen
Xiaobin Zhuang
Chenpeng Du
Ziyang Ma
...
Dongya Jia
Zhuo Chen
Yuping Wang
Yuxuan Wang
Xie Chen
38
0
0
31 May 2025
SwitchCodec: A High-Fidelity Nerual Audio Codec With Sparse Quantization
SwitchCodec: A High-Fidelity Nerual Audio Codec With Sparse Quantization
Jin Wang
Wenbin Jiang
Xiangbo Wang
34
0
0
30 May 2025
LittleBit: Ultra Low-Bit Quantization via Latent Factorization
LittleBit: Ultra Low-Bit Quantization via Latent Factorization
Banseok Lee
Dongkyu Kim
Youngcheon You
Youngmin Kim
MQ
31
0
0
30 May 2025
Learning Interpretable Differentiable Logic Networks for Tabular Regression
Learning Interpretable Differentiable Logic Networks for Tabular Regression
C. Yue
N. Jha
166
0
0
29 May 2025
Merge-Friendly Post-Training Quantization for Multi-Target Domain Adaptation
Merge-Friendly Post-Training Quantization for Multi-Target Domain Adaptation
Juncheol Shin
Minsang Seok
Seonggon Kim
Eunhyeok Park
MQMoMe
36
0
0
29 May 2025
Model Immunization from a Condition Number Perspective
Model Immunization from a Condition Number Perspective
Amber Yijia Zheng
Cedar Site Bai
Brian Bullins
Raymond A. Yeh
MedIm
21
0
0
29 May 2025
Effective and Efficient One-pass Compression of Speech Foundation Models Using Sparsity-aware Self-pinching Gates
Effective and Efficient One-pass Compression of Speech Foundation Models Using Sparsity-aware Self-pinching Gates
Haoning Xu
Zhaoqing Li
Youjun Chen
Huimeng Wang
Guinan Li
Mengzhe Geng
Chengxi Deng
Xunying Liu
61
0
0
28 May 2025
Highly Efficient and Effective LLMs with Multi-Boolean Architectures
Highly Efficient and Effective LLMs with Multi-Boolean Architectures
Ba-Hien Tran
Van Minh Nguyen
MQ
63
0
0
28 May 2025
Reward-Independent Messaging for Decentralized Multi-Agent Reinforcement Learning
Reward-Independent Messaging for Decentralized Multi-Agent Reinforcement Learning
Naoto Yoshida
Tadahiro Taniguchi
29
0
0
28 May 2025
MoRE: A Mixture of Low-Rank Experts for Adaptive Multi-Task Learning
MoRE: A Mixture of Low-Rank Experts for Adaptive Multi-Task Learning
Dacao Zhang
Kun Zhang
Shimao Chu
Le Wu
Xin Li
Si Wei
MoEALMOffRL
41
0
0
28 May 2025
Pioneering 4-Bit FP Quantization for Diffusion Models: Mixup-Sign Quantization and Timestep-Aware Fine-Tuning
Pioneering 4-Bit FP Quantization for Diffusion Models: Mixup-Sign Quantization and Timestep-Aware Fine-Tuning
Maosen Zhao
Pengtao Chen
Chong Yu
Yan Wen
Xudong Tan
Tao Chen
MQ
44
1
0
27 May 2025
Spotlight-TTS: Spotlighting the Style via Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech
Spotlight-TTS: Spotlighting the Style via Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech
Nam-Gyu Kim
Deok-Hyeon Cho
Seung-Bin Kim
Seong-Whan Lee
72
0
0
27 May 2025
Tokenizing Electron Cloud in Protein-Ligand Interaction Learning
Tokenizing Electron Cloud in Protein-Ligand Interaction Learning
H. Lin
Odin Zhang
Jia Xu
Yunfan Liu
Zheng Cheng
Lirong Wu
Yufei Huang
Zhifeng Gao
Stan Z. Li
53
0
0
25 May 2025
Lightweight Embeddings with Graph Rewiring for Collaborative Filtering
Lightweight Embeddings with Graph Rewiring for Collaborative Filtering
Xurong Liang
Tong Chen
Wei Yuan
Hongzhi Yin
37
0
0
25 May 2025
Joint-stochastic-approximation Autoencoders with Application to Semi-supervised Learning
Joint-stochastic-approximation Autoencoders with Application to Semi-supervised Learning
Wenbo He
Zhijian Ou
DRLBDL
40
0
0
24 May 2025
Why Do Some Inputs Break Low-Bit LLM Quantization?
Why Do Some Inputs Break Low-Bit LLM Quantization?
Ting-Yun Chang
Muru Zhang
Jesse Thomason
Robin Jia
MQ
29
0
0
24 May 2025
Bridging Supervised Learning and Reinforcement Learning in Math Reasoning
Bridging Supervised Learning and Reinforcement Learning in Math Reasoning
Huayu Chen
Kaiwen Zheng
Qinsheng Zhang
Ganqu Cui
Yin Cui
Haotian Ye
Tsung-Yi Lin
Ming-Yu Liu
Jun Zhu
Haoxiang Wang
OffRLLRM
258
3
0
23 May 2025
Beyond Discreteness: Finite-Sample Analysis of Straight-Through Estimator for Quantization
Halyun Jeong
Jack Xin
Penghang Yin
MQ
41
0
0
23 May 2025
A Principled Bayesian Framework for Training Binary and Spiking Neural Networks
James A. Walker
M. Khajehnejad
Adeel Razi
BDL
147
0
0
23 May 2025
FPQVAR: Floating Point Quantization for Visual Autoregressive Model with FPGA Hardware Co-design
FPQVAR: Floating Point Quantization for Visual Autoregressive Model with FPGA Hardware Co-design
Renjie Wei
Songqiang Xu
Qingyu Guo
Meng Li
MQ
91
0
0
22 May 2025
Quartet: Native FP4 Training Can Be Optimal for Large Language Models
Quartet: Native FP4 Training Can Be Optimal for Large Language Models
Roberto L. Castro
Andrei Panferov
Soroush Tabesh
Oliver Sieberling
Jiale Chen
Mahdi Nikdan
Saleh Ashkboos
Dan Alistarh
MQ
113
0
0
20 May 2025
VesselGPT: Autoregressive Modeling of Vascular Geometry
VesselGPT: Autoregressive Modeling of Vascular Geometry
Paula Feldman
Martin Sinnona
Viviana Siless
C. Delrieux
Emmanuel Iarussi
AI4CE
82
0
0
19 May 2025
Gaussian Weight Sampling for Scalable, Efficient and Stable Pseudo-Quantization Training
Gaussian Weight Sampling for Scalable, Efficient and Stable Pseudo-Quantization Training
Myeonghwan Ahn
Sungjoo Yoo
MQ
103
0
0
16 May 2025
Adversarially Robust Spiking Neural Networks with Sparse Connectivity
Adversarially Robust Spiking Neural Networks with Sparse Connectivity
Mathias Schmolli
Maximilian Baronig
Robert Legenstein
Ozan Özdenizci
AAML
47
0
0
16 May 2025
An Introduction to Discrete Variational Autoencoders
An Introduction to Discrete Variational Autoencoders
Alan Jeffares
Liyuan Liu
DRLBDLCML
60
0
0
15 May 2025
Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs
Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs
Jingcheng Niu
Xingdi Yuan
Tong Wang
Hamidreza Saghir
Amir H. Abdi
79
0
0
14 May 2025
Efficient Mixed Precision Quantization in Graph Neural Networks
Efficient Mixed Precision Quantization in Graph Neural Networks
Samir Moustafa
Nils M. Kriege
Wilfried Gansterer
GNNMQ
75
0
0
14 May 2025
Analog Foundation Models
Analog Foundation Models
Julian Büchel
Iason Chalas
Giovanni Acampa
An Chen
Omobayode Fagbohungbe
Sidney Tsai
Kaoutar El Maghraoui
Manuel Le Gallo
Abbas Rahimi
Abu Sebastian
MQ
118
0
0
14 May 2025
Differentiable Channel Selection in Self-Attention For Person Re-Identification
Differentiable Channel Selection in Self-Attention For Person Re-Identification
Yancheng Wang
Nebojsa Jojic
Yingzhen Yang
73
0
0
13 May 2025
Resource-Efficient Language Models: Quantization for Fast and Accessible Inference
Resource-Efficient Language Models: Quantization for Fast and Accessible Inference
Tollef Emil Jørgensen
MQ
99
0
0
13 May 2025
Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding
Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding
Dianwen Ng
Kun Zhou
Yi-Wen Chao
Zhiwei Xiong
B. Ma
Eng Siong Chng
93
0
0
12 May 2025
Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach
Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach
Minting Pan
Yitao Zheng
Jiajian Li
Yunbo Wang
Xiaokang Yang
OffRL
130
0
0
10 May 2025
Diffusion Model Quantization: A Review
Diffusion Model Quantization: A Review
Qian Zeng
Chenggong Hu
Mingli Song
Jie Song
MQ
102
0
0
08 May 2025
Input-Specific and Universal Adversarial Attack Generation for Spiking Neural Networks in the Spiking Domain
Input-Specific and Universal Adversarial Attack Generation for Spiking Neural Networks in the Spiking Domain
Spyridon Raptis
Haralampos-G. Stratigopoulos
AAML
78
0
0
07 May 2025
Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques
Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques
Sanjay Surendranath Girija
Shashank Kapoor
Lakshit Arora
Dipen Pradhan
Aman Raj
Ankit Shetgaonkar
162
0
0
05 May 2025
1234...293031
Next