arXiv:1308.3432
Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation
15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville
Papers citing
"Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"
50 / 1,511 papers shown
Oscillation-Reduced MXFP4 Training for Vision Transformers
Yuxiang Chen
Haocheng Xi
Jun Zhu
Jianfei Chen
MQ
120
3
0
28 Feb 2025
Protein Structure Tokenization: Benchmarking and New Recipe
Xinyu Yuan
Zichen Wang
Marcus Collins
Huzefa Rangwala
62
1
0
28 Feb 2025
Vector-Quantized Vision Foundation Models for Object-Centric Learning
Rongzhen Zhao
V. Wang
Arno Solin
Joni Pajarinen
OCL
VLM
567
1
0
27 Feb 2025
Preference-Based Gradient Estimation for ML-Guided Approximate Combinatorial Optimization
Arman Mielke
Uwe Bauknecht
Thilo Strauss
Mathias Niepert
156
0
0
26 Feb 2025
Binary Neural Networks for Large Language Model: A Survey
Liangdong Liu
Zhitong Zheng
Cong Wang
TianHuang Su
ZhenYu Yang
MQ
145
0
0
26 Feb 2025
Iterative Counterfactual Data Augmentation
Mitchell Plyler
Min Chi
161
0
0
25 Feb 2025
Optimizing Singular Spectrum for Large Language Model Compression
Dengjie Li
Tiancheng Shen
Yao Zhou
Baisong Yang
Zhongying Liu
Masheng Yang
Guohao Li
Yibo Yang
Yujie Zhong
Ming-Hsuan Yang
83
1
0
24 Feb 2025
Interleaved Block-based Learned Image Compression with Feature Enhancement and Quantization Error Compensation
Shiqi Jiang
Hui Yuan
Shuai Li
R. Hamzaoui
Xu Wang
Junyan Huo
99
0
0
24 Feb 2025
Towards Accurate Binary Spiking Neural Networks: Learning with Adaptive Gradient Modulation Mechanism
Yu Liang
Wenjie Wei
A. Belatreche
Honglin Cao
Zijian Zhou
Shuai Wang
Malu Zhang
Yue Yang
MQ
97
0
0
21 Feb 2025
MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures
Jiayu Qin
Jianchao Tan
Kai Zhang
Xunliang Cai
Wei Wang
77
0
0
19 Feb 2025
Self-Supervised Transformers as Iterative Solution Improvers for Constraint Satisfaction
Yudong Xu
Wenhao Li
Scott Sanner
Elias Boutros Khalil
103
0
0
18 Feb 2025
Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models?
Jacob Nielsen
Peter Schneider-Kamp
Lukas Galke
MQ
100
1
0
17 Feb 2025
ADMN: A Layer-Wise Adaptive Multimodal Network for Dynamic Input Noise and Compute Resources
Jason Wu
Kang Yang
Lance M. Kaplan
Mani B. Srivastava
71
0
0
11 Feb 2025
The Case for Cleaner Biosignals: High-fidelity Neural Compressor Enables Transfer from Cleaner iEEG to Noisier EEG
Francesco Stefano Carzaniga
Gary Tom Hoppeler
Michael Hersche
Kaspar Anton Schindler
Abbas Rahimi
81
0
0
10 Feb 2025
Membership Inference Risks in Quantized Models: A Theoretical and Empirical Study
Eric Aubinais
Philippe Formont
Pablo Piantanida
Elisabeth Gassiat
114
1
0
10 Feb 2025
PrismAvatar: Real-time animated 3D neural head avatars on edge devices
Prashant Raina
Felix Taubner
Mathieu Tuli
Eu Wern Teh
Kevin Ferreira
3DH
102
1
0
10 Feb 2025
LoCA: Location-Aware Cosine Adaptation for Parameter-Efficient Fine-Tuning
Zhekai Du
Yinjie Min
Jingjing Li
Ke Lu
Changliang Zou
Liuhua Peng
Tingjin Chu
Mingming Gong
476
2
0
05 Feb 2025
Choose Your Model Size: Any Compression by a Single Gradient Descent
Martin Genzel
Patrick Putzky
Pengfei Zhao
Siyang Song
Mattes Mollenhauer
Robert Seidel
Stefan Dietzel
Thomas Wollmann
110
0
0
03 Feb 2025
Nearly Lossless Adaptive Bit Switching
Haiduo Huang
Zhenhua Liu
Tian Xia
Wenzhe Zhao
Pengju Ren
MQ
103
0
0
03 Feb 2025
Compact Rule-Based Classifier Learning via Gradient Descent
Javier Fumanal-Idocin
Raquel Fernandez-Peralta
Javier Andreu-Perez
100
0
0
03 Feb 2025
Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity
Alessandro Pierro
Steven Abreu
Jonathan Timcheck
Philipp Stratmann
Andreas Wild
S. Shrestha
111
0
0
03 Feb 2025
Optimizing Large Language Model Training Using FP4 Quantization
Ruizhe Wang
Yeyun Gong
Xiao Liu
Guoshuai Zhao
Ziyue Yang
Baining Guo
Zhengjun Zha
Peng Cheng
MQ
201
12
0
28 Jan 2025
HadamRNN: Binary and Sparse Ternary Orthogonal RNNs
Armand Foucault
Franck Mamalet
François Malgouyres
MQ
304
0
0
28 Jan 2025
BILLNET: A Binarized Conv3D-LSTM Network with Logic-gated residual architecture for hardware-efficient video inference
Van Thien Nguyen
William Guicquero
Gilles Sicard
3DV
MQ
147
2
0
24 Jan 2025
Channel-wise Parallelizable Spiking Neuron with Multiplication-free Dynamics and Large Temporal Receptive Fields
Peng Xue
Wei Fang
Zhengyu Ma
Zihan Huang
Zhaokun Zhou
Yonghong Tian
T. Masquelier
Huihui Zhou
163
1
0
24 Jan 2025
SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling
Shengshi Yao
Jincheng Dai
Xiaoqi Qin
Sixian Wang
Siye Wang
K. Niu
Ping Zhang
139
0
0
22 Jan 2025
mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework
Bingyi Liu
Jian Teng
Hongfei Xue
Enshu Wang
Chuanhui Zhu
Pu Wang
Libing Wu
162
0
0
21 Jan 2025
Sparse Binary Representation Learning for Knowledge Tracing
Yahya Badran
Christine Preisach
79
0
0
20 Jan 2025
MOGNET: A Mux-residual quantized Network leveraging Online-Generated weights
Van Thien Nguyen
William Guicquero
Gilles Sicard
MQ
155
1
0
17 Jan 2025
Histogram-Equalized Quantization for logic-gated Residual Neural Networks
Van Thien Nguyen
William Guicquero
Gilles Sicard
MQ
130
2
0
10 Jan 2025
Hyperbolic Binary Neural Network
Jun Chen
Jingyang Xiang
Tianxin Huang
Xiangrui Zhao
Yong Liu
MQ
49
0
0
08 Jan 2025
Fairness in Reinforcement Learning with Bisimulation Metrics
S. Rezaei-Shoshtari
Hanna Yurchyk
Scott Fujimoto
Doina Precup
David Meger
162
0
0
03 Jan 2025
Forget Vectors at Play: Universal Input Perturbations Driving Machine Unlearning in Image Classification
Changchang Sun
Ren Wang
Yihua Zhang
Jinghan Jia
Jiancheng Liu
Gaowen Liu
Sijia Liu
Yan Yan
AAML
MU
172
0
0
21 Dec 2024
Improving Quantization-aware Training of Low-Precision Network via Block Replacement on Full-Precision Counterpart
Chengting Yu
Shu Yang
Fengzhao Zhang
Hanzhi Ma
Aili Wang
Er-ping Li
MQ
115
4
0
20 Dec 2024
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
Hong Chen
Zihan Wang
Xianrui Li
Xingwu Sun
Fangyi Chen
Jiang Liu
Jiadong Wang
Bhiksha Raj
Zicheng Liu
Emad Barsoum
VLM
290
10
0
14 Dec 2024
Video Seal: Open and Efficient Video Watermarking
Pierre Fernandez
Hady ElSahar
I. Zeki Yalniz
Alexandre Mourachko
VLM
161
8
0
12 Dec 2024
On Evaluating the Durability of Safeguards for Open-Weight LLMs
Xiangyu Qi
Boyi Wei
Nicholas Carlini
Yangsibo Huang
Tinghao Xie
Luxi He
Matthew Jagielski
Milad Nasr
Prateek Mittal
Peter Henderson
AAML
137
22
0
10 Dec 2024
GAQAT: gradient-adaptive quantization-aware training for domain generalization
Jiacheng Jiang
Yuan Meng
Chen Tang
Han Yu
Qun Li
Zhi Wang
Wenwu Zhu
MQ
84
0
0
07 Dec 2024
ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models
Xubing Ye
Yukang Gan
Yixiao Ge
Xiao Zhang
Yansong Tang
167
11
0
30 Nov 2024
Complexity Experts are Task-Discriminative Learners for Any Image Restoration
Eduard Zamfir
Zongwei Wu
Nancy Mehta
Yuedong Tan
Danda Pani Paudel
Yulun Zhang
Radu Timofte
MoE
538
5
0
27 Nov 2024
Deep End-to-end Adaptive k-Space Sampling, Reconstruction, and Registration for Dynamic MRI
George Yiasemis
Jan-Jakob Sonke
Jonas Teuwen
153
0
0
27 Nov 2024
Noise Adaptor: Enhancing Low-Latency Spiking Neural Networks through Noise-Injected Low-Bit ANN Conversion
Chen Li
Bipin Rajendran
113
0
0
26 Nov 2024
Tree Transformers are an Ineffective Model of Syntactic Constituency
Michael Ginn
106
0
0
25 Nov 2024
Representation Collapsing Problems in Vector Quantization
Wenhao Zhao
Qiran Zou
Rushi Shah
Dianbo Liu
109
2
0
25 Nov 2024
Efficient Ternary Weight Embedding Model: Bridging Scalability and Performance
Jiayi Chen
Chen Wu
Shanghang Zhang
Nan Li
Lefei Zhang
Qi Zhang
115
0
0
23 Nov 2024
Exploring the Robustness and Transferability of Patch-Based Adversarial Attacks in Quantized Neural Networks
Amira Guesmi
B. Ouni
Mohamed Bennai
AAML
153
0
0
22 Nov 2024
Quantization without Tears
Minghao Fu
Hao Yu
Jie Shao
Junjie Zhou
Ke Zhu
Jianxin Wu
MQ
201
3
0
21 Nov 2024
Llama Guard 3-1B-INT4: Compact and Efficient Safeguard for Human-AI Conversations
Igor Fedorov
Kate Plawiak
Lemeng Wu
Tarek Elgamal
Naveen Suda
...
Bilge Soran
Zacharie Delpierre Coudert
Rachad Alao
Raghuraman Krishnamoorthi
Vikas Chandra
106
5
0
18 Nov 2024
Bi-Mamba: Towards Accurate 1-Bit State Space Models
Shengkun Tang
Liqun Ma
Haoyang Li
Mingjie Sun
Zhiqiang Shen
Mamba
127
3
0
18 Nov 2024
Complexity-Aware Training of Deep Neural Networks for Optimal Structure Discovery
Valentin Frank Ingmar Guenter
Athanasios Sideris
CVBM
41
0
0
14 Nov 2024