Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1308.3432
Cited By
Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation
15 August 2013
Yoshua Bengio
Nicholas Léonard
Aaron Courville
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation"
50 / 1,869 papers shown
Title
Zero-Shot Generalization of Vision-Based RL Without Data Augmentation
Sumeet Batra
Gaurav Sukhatme
OffRL
DRL
31
2
0
09 Oct 2024
S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning
Weihao Lin
Shengji Tang
Chong Yu
Peng Ye
Tao Chen
18
0
0
09 Oct 2024
DDRN:a Data Distribution Reconstruction Network for Occluded Person Re-Identification
Zhaoyong Wang
Yujie Liu
Mingyue Li
Wenxin Zhang
Zongmin Li
21
0
0
09 Oct 2024
JPEG Inspired Deep Learning
Ahmed H. Salamah
Kaixiang Zheng
Yiwen Liu
En-Hui Yang
37
0
0
09 Oct 2024
Restructuring Vector Quantization with the Rotation Trick
Christopher Fifty
Ronald G. Junkins
Dennis Duan
Aniketh Iger
Jerry W. Liu
Ehsan Amid
Sebastian Thrun
Christopher Ré
LLMSV
45
11
0
08 Oct 2024
Continuous Approximations for Improving Quantization Aware Training of LLMs
He Li
Jianhang Hong
Yuanzhuo Wu
Snehal Adbol
Zonglin Li
MQ
29
1
0
06 Oct 2024
Dynamic Diffusion Transformer
Wangbo Zhao
Yizeng Han
Jiasheng Tang
Kai Wang
Yibing Song
Gao Huang
Fan Wang
Yang You
77
13
0
04 Oct 2024
Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization
Tung M. Luu
Thanh Nguyen
Tee Joshua Tian Jin
Sungwoon Kim
Chang D. Yoo
AAML
28
0
0
04 Oct 2024
ARB-LLM: Alternating Refined Binarizations for Large Language Models
Zhiteng Li
Xinyu Yan
Tianao Zhang
Haotong Qin
Dong Xie
Jiang Tian
Zhongchao Shi
Linghe Kong
Yulun Zhang
Xiaokang Yang
MQ
37
2
0
04 Oct 2024
Remember and Recall: Associative-Memory-based Trajectory Prediction
Hang Guo
Yuzhen Zhang
Tianci Gao
Junning Su
Pei Lv
Mingliang Xu
32
0
0
03 Oct 2024
FedPeWS: Personalized Warmup via Subnetworks for Enhanced Heterogeneous Federated Learning
Nurbek Tastan
Samuel Horváth
Martin Takáč
Karthik Nandakumar
FedML
62
0
0
03 Oct 2024
Constraint Guided Model Quantization of Neural Networks
Quinten Van Baelen
P. Karsmakers
MQ
28
0
0
30 Sep 2024
CycleBNN: Cyclic Precision Training in Binary Neural Networks
Federico Fontana
Romeo Lanzino
Anxhelo Diko
G. Foresti
Luigi Cinque
MQ
39
0
0
28 Sep 2024
Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation
Chaomin Shen
Yaomin Huang
Haokun Zhu
Jinsong Fan
Guixu Zhang
34
0
0
27 Sep 2024
Learning Quantized Adaptive Conditions for Diffusion Models
Yuchen Liang
Yuchuan Tian
Lei Yu
Huao Tang
Jie Hu
Xiangzhong Fang
Hanting Chen
DiffM
34
0
0
26 Sep 2024
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
Gongfan Fang
Hongxu Yin
Saurav Muralidharan
Greg Heinrich
Jeff Pool
Jan Kautz
Pavlo Molchanov
Xinchao Wang
37
3
0
26 Sep 2024
Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection
Alireza Ganjdanesh
Yan Kang
Yuchen Liu
Richard Y. Zhang
Zhe Lin
Heng Huang
DiffM
40
2
0
23 Sep 2024
A Diagonal Structured State Space Model on Loihi 2 for Efficient Streaming Sequence Processing
Svea Marie Meyer
Philipp Weidel
Philipp Plank
L. Campos-Macias
Sumit Bam Shrestha
Philipp Stratmann
M. R
41
4
0
23 Sep 2024
R-AIF: Solving Sparse-Reward Robotic Tasks from Pixels with Active Inference and World Models
Viet Dung Nguyen
Zhizhuo Yang
Christopher L. Buckley
Alexander Ororbia
39
2
0
21 Sep 2024
Audio Codec Augmentation for Robust Collaborative Watermarking of Speech Synthesis
Lauri Juvela
Xin Eric Wang
34
3
0
20 Sep 2024
Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing
Seongmin Hong
Jaehyeok Bae
Jongho Lee
S. Chun
26
0
0
18 Sep 2024
LASERS: LAtent Space Encoding for Representations with Sparsity for Generative Modeling
Xin Li
Anand Sarwate
37
0
0
16 Sep 2024
MesonGS: Post-training Compression of 3D Gaussians via Efficient Attribute Transformation
Shuzhao Xie
Weixiang Zhang
Chen Tang
Yunpeng Bai
Rongwei Lu
Shijia Ge
Zhi Wang
3DGS
46
11
0
15 Sep 2024
Robust Training of Neural Networks at Arbitrary Precision and Sparsity
Chengxi Ye
Grace Chu
Yanfeng Liu
Yichi Zhang
Lukasz Lew
Andrew G. Howard
MQ
29
2
0
14 Sep 2024
S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
Yuezhou Hu
Jun-Jie Zhu
Jianfei Chen
45
0
0
13 Sep 2024
Efficient and Reliable Vector Similarity Search Using Asymmetric Encoding with NAND-Flash for Many-Class Few-Shot Learning
Hao-Wei Chiang
Chi-Tse Huang
Hsiang-Yun Cheng
P. Tseng
Ming-Hsiu Lee
An-Yeu
Wu
20
0
0
12 Sep 2024
NVRC: Neural Video Representation Compression
Ho Man Kwan
Ge Gao
Fan Zhang
Andrew Gower
David Bull
31
11
0
11 Sep 2024
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation
Wei Shen
Chuheng Zhang
OffRL
41
6
0
11 Sep 2024
FreeAugment: Data Augmentation Search Across All Degrees of Freedom
Tom Bekor
Niv Nayman
Lihi Zelnik-Manor
ViT
46
0
0
07 Sep 2024
Sparsifying Parametric Models with L0 Regularization
N. Botteghi
Urban Fasel
42
0
0
05 Sep 2024
Learning in Order! A Sequential Strategy to Learn Invariant Features for Multimodal Sentiment Analysis
Xianbing Zhao
Lizhen Qu
Tao Feng
Jianfei Cai
Buzhou Tang
53
0
0
05 Sep 2024
GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs
Maxim Zhelnin
Viktor Moskvoretskii
Egor Shvetsov
Egor Venediktov
Mariya Krylova
Aleksandr Zuev
Evgeny Burnaev
32
2
0
27 Aug 2024
1-Bit FQT: Pushing the Limit of Fully Quantized Training to 1-bit
Chang Gao
Jianfei Chen
Kang Zhao
Jiaqi Wang
Liping Jing
MQ
41
2
0
26 Aug 2024
Recurrent Neural Networks Learn to Store and Generate Sequences using Non-Linear Representations
Róbert Csordás
Christopher Potts
Christopher D. Manning
Atticus Geiger
GAN
36
16
0
20 Aug 2024
DisMix: Disentangling Mixtures of Musical Instruments for Source-level Pitch and Timbre Manipulation
Yin-Jyun Luo
K. Cheuk
Woosung Choi
Toshimitsu Uesaka
Keisuke Toyama
...
Chieh-Hsin Lai
Yuhta Takida
Wei-Hsiang Liao
Simon Dixon
Yuki Mitsufuji
CoGe
49
2
0
20 Aug 2024
Obtaining Optimal Spiking Neural Network in Sequence Learning via CRNN-SNN Conversion
Jiahao Su
Kang You
Zekai Xu
Weizhi Xu
Zhezhi He
26
0
0
18 Aug 2024
Vanilla Gradient Descent for Oblique Decision Trees
Subrat Prasad Panda
B. Genest
Arvind Easwaran
Ponnuthurai Nagaratnam Suganthan
OffRL
26
1
0
17 Aug 2024
Task-Aware Dynamic Transformer for Efficient Arbitrary-Scale Image Super-Resolution
Tianyi Xu
Yiji Zhou
Xiaotao Hu
Kai Zhang
Anran Zhang
Xingye Qiu
Jun Xu
43
0
0
16 Aug 2024
An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation
Peiming Guo
Sinuo Liu
Yanzhao Zhang
Dingkun Long
Pengjun Xie
Meishan Zhang
Hao Fei
DiffM
47
1
0
16 Aug 2024
Battery GraphNets : Relational Learning for Lithium-ion Batteries(LiBs) Life Estimation
Sakhinana Sagar Srinivas
Rajat Kumar Sarkar
Venkataramana Runkana
34
0
0
14 Aug 2024
Root Cause Attribution of Delivery Risks via Causal Discovery with Reinforcement Learning
Shi Bo
Minheng Xiao
39
7
0
11 Aug 2024
SAMSA: Efficient Transformer for Many Data Modalities
Minh Lenhat
Viet Anh Nguyen
Khoa Nguyen
Duong Duc Hieu
Dao Huu Hung
Truong-Son Hy
57
0
0
10 Aug 2024
Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields
J. Lee
Daniel Rho
Xiangyu Sun
Jong Hwan Ko
Eunbyung Park
3DGS
51
10
0
07 Aug 2024
GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
Sungho Lee
Marco A. Martínez-Ramírez
Wei-Hsiang Liao
Stefan Uhlich
Giorgio Fabbro
Kyogu Lee
Yuki Mitsufuji
49
5
0
06 Aug 2024
HQOD: Harmonious Quantization for Object Detection
Long Huang
Zhiwei Dong
Song-Lu Chen
Ruiyao Zhang
Shutong Ti
Feng Chen
Xu-Cheng Yin
MQ
29
0
0
05 Aug 2024
STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs
Peijie Dong
Lujun Li
Dayou Du
Yuhan Chen
Zhenheng Tang
...
Wei Xue
Wenhan Luo
Qi-fei Liu
Yi-Ting Guo
Xiaowen Chu
MQ
58
4
0
03 Aug 2024
UniMoT: Unified Molecule-Text Language Model with Discrete Token Representation
Jiayuan Zhu
Yunli Qi
Yongqiang Chen
Quanming Yao
29
7
0
01 Aug 2024
Tamper-Resistant Safeguards for Open-Weight LLMs
Rishub Tamirisa
Bhrugu Bharathi
Long Phan
Andy Zhou
Alice Gatti
...
Andy Zou
Dawn Song
Bo Li
Dan Hendrycks
Mantas Mazeika
AAML
MU
53
42
0
01 Aug 2024
MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction
Li Duan
Junseok Lee
Yeonguk Yu
G. Aragon-Camarasa
Kyoobin Lee
49
4
0
31 Jul 2024
On the Perturbed States for Transformed Input-robust Reinforcement Learning
Tung M. Luu
Haeyong Kang
Matthew Groh
Thanh Nguyen
Chang D. Yoo
OOD
AAML
OffRL
31
0
0
31 Jul 2024
Previous
1
2
3
4
5
...
36
37
38
Next