Categorical Reparameterization with Gumbel-Softmax

3 November 2016
Eric Jang
S. Gu
Ben Poole
    BDL

Papers citing "Categorical Reparameterization with Gumbel-Softmax"

50 / 3,025 papers shown
Bayesian Graph Contrastive Learning
Arman Hasanzadeh
Mohammadreza Armandpour
Ehsan Hajiramezanali
Mingyuan Zhou
N. Duffield
K. Narayanan
UQCV, BDL, SSL
53
5
0
15 Dec 2021
Sparse Interventions in Language Models with Differentiable Masking
Nicola De Cao
Leon Schmid
Dieuwke Hupkes
Ivan Titov
77
29
0
13 Dec 2021
Sparse Structure Learning via Graph Neural Networks for Inductive Document Classification
Yinhua Piao
Sangseon Lee
Dohoon Lee
Sun Kim
87
35
0
13 Dec 2021
Self-Supervised Modality-Aware Multiple Granularity Pre-Training for RGB-Infrared Person Re-Identification
Lin Wan
Qianyan Jing
Zongyuan Sun
Chuan Zhang
Zhihang Li
Yehansen Chen
SSL
96
5
0
12 Dec 2021
Curvature-guided dynamic scale networks for Multi-view Stereo
Khang Truong Giang
Soohwan Song
Sungho Jo
3DV
107
31
0
11 Dec 2021
Discourse-Aware Soft Prompting for Text Generation
Marjan Ghazvininejad
Vladimir Karpukhin
Vera Gor
Asli Celikyilmaz
69
6
0
10 Dec 2021
Latent Space Explanation by Intervention
Itai Gat
Guy Lorberbom
Idan Schwartz
Tamir Hazan
BDL
57
14
0
09 Dec 2021
DiPS: Differentiable Policy for Sketching in Recommender Systems
Aritra Ghosh
Saayan Mitra
Andrew Lan
BDL, OffRL
59
2
0
08 Dec 2021
Cross-domain User Preference Learning for Cold-start Recommendation
Huiling Zhou
Jie Liu
Zhikang Li
Jin Yu
Hongxia Yang
59
0
0
07 Dec 2021
Unsupervised Learning of Compositional Scene Representations from Multiple Unspecified Viewpoints
Jinyang Yuan
Bin Li
Xiangyang Xue
CoGe, OCL
104
11
0
07 Dec 2021
FedDAG: Federated DAG Structure Learning
Erdun Gao
Junjia Chen
Li Shen
Tongliang Liu
Biwei Huang
H. Bondell
FedML
96
17
0
07 Dec 2021
Enhanced Exploration in Neural Feature Selection for Deep Click-Through Rate Prediction Models via Ensemble of Gating Layers
L. Guan
Xia Xiao
Ming-yue Chen
Youlong Cheng
62
1
0
07 Dec 2021
Interpretable Image Classification with Differentiable Prototypes Assignment
Dawid Rymarczyk
Lukasz Struski
Michal Górszczak
K. Lewandowska
Jacek Tabor
Bartosz Zieliński
87
102
0
06 Dec 2021
AdaSTE: An Adaptive Straight-Through Estimator to Train Binary Neural Networks
Huu Le
R. Høier
Che-Tsung Lin
Christopher Zach
84
17
0
06 Dec 2021
Achieving Forgetting Prevention and Knowledge Transfer in Continual Learning
Zixuan Ke
Bing-Quan Liu
Nianzu Ma
Hu Xu
Lei Shu
CLL
227
127
0
05 Dec 2021
Interactive Disentanglement: Learning Concepts by Interacting with their Prototype Representations
Wolfgang Stammer
Marius Memmel
P. Schramowski
Kristian Kersting
183
27
0
04 Dec 2021
Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation
Minghui Hu
Yujie Wang
Tat-Jen Cham
Jianfei Yang
P. N. Suganthan
DiffM
63
43
0
03 Dec 2021
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Dave Zhenyu Chen
Qirui Wu
Matthias Nießner
Angel X. Chang
83
32
0
02 Dec 2021
AutoGEL: An Automated Graph Neural Network with Explicit Link Information
Zhiling Wang
Shimin Di
Lei Chen
GNN, AI4CE
94
40
0
02 Dec 2021
Visual-Semantic Transformer for Scene Text Recognition
Xin Tang
Yongquan Lai
Ying Liu
Yuanyuan Fu
Rui Fang
ViT
68
9
0
02 Dec 2021
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation
Todor Davchev
Oleg O. Sushkov
Jean-Baptiste Regli
S. Schaal
Y. Aytar
Markus Wulfmeier
Jonathan Scholz
57
18
0
01 Dec 2021
Human-Object Interaction Detection via Weak Supervision
Mert Kilickaya
A. Smeulders
68
6
0
01 Dec 2021
What to Learn, and How: Toward Effective Learning from Rationales
Samuel Carton
Surya Kanoria
Chenhao Tan
137
25
0
30 Nov 2021
Improvement in Machine Translation with Generative Adversarial Networks
J. Ahn
H. Madhu
V. Nguyen
GAN, AI4CE
32
2
0
30 Nov 2021
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling
Xumin Yu
Lulu Tang
Yongming Rao
Tiejun Huang
Jie Zhou
Jiwen Lu
3DPC
206
692
0
29 Nov 2021
Self-supervised Feature-Gate Coupling for Dynamic Network Pruning
Mengnan Shi
Chang-rui Liu
Jianbin Jiao
QiXiang Ye
82
1
0
29 Nov 2021
SQUID: Deep Feature In-Painting for Unsupervised Anomaly Detection
Tiange Xiang
Yixiao Zhang
Yongyi Lu
Alan Yuille
Chaoyi Zhang
Weidong (Tom) Cai
Zongwei Zhou
UQCV
115
44
0
26 Nov 2021
Learning source-aware representations of music in a discrete latent space
Jinsung Kim
Yeong-Seok Jeong
Woosung Choi
Jaehwa Chung
Soonyoung Jung
BDL, DRL
58
0
0
26 Nov 2021
NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition
Hao Liu
Xinghua Jiang
Xin Li
Zhimin Bao
Deqiang Jiang
Bo Ren
ViT
79
16
0
25 Nov 2021
Sparse is Enough in Scaling Transformers
Sebastian Jaszczur
Aakanksha Chowdhery
Afroz Mohiuddin
Lukasz Kaiser
Wojciech Gajewski
Henryk Michalewski
Jonni Kanerva
MoE
71
102
0
24 Nov 2021
Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes
Sam Bond-Taylor
P. Hessey
Hiroshi Sasaki
T. Breckon
Chris G. Willcocks
DiffM
126
72
0
24 Nov 2021
Towards Empirical Sandwich Bounds on the Rate-Distortion Function
Yibo Yang
Stephan Mandt
114
25
0
23 Nov 2021
Variational Learning for Unsupervised Knowledge Grounded Dialogs
Mayank Mishra
Dhiraj Madan
Gaurav Pandey
Danish Contractor
77
3
0
23 Nov 2021
Efficient Video Transformers with Spatial-Temporal Token Selection
Junke Wang
Xitong Yang
Hengduo Li
Li Liu
Zuxuan Wu
Yu-Gang Jiang
ViT
68
67
0
23 Nov 2021
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification
L. Pan
Longbo Huang
Tengyu Ma
Huazhe Xu
OffRL, OnRL
125
55
0
22 Nov 2021
L-Verse: Bidirectional Generation Between Image and Text
Taehoon Kim
Gwangmo Song
Sihaeng Lee
Sangyun Kim
Yewon Seo
Soonyoung Lee
S. Kim
Honglak Lee
Kyunghoon Bae
166
26
0
22 Nov 2021
Maximum Mean Discrepancy for Generalization in the Presence of Distribution and Missingness Shift
Liwen Ouyang
Aaron Key
77
11
0
19 Nov 2021
Toward Compact Parameter Representations for Architecture-Agnostic Neural Network Compression
Yuezhou Sun
Wenlong Zhao
Lijun Zhang
Xiao Liu
Hui Guan
Matei A. Zaharia
89
0
0
19 Nov 2021
FBNetV5: Neural Architecture Search for Multiple Tasks in One Run
Bichen Wu
Chaojian Li
Hang Zhang
Xiaoliang Dai
Peizhao Zhang
Matthew Yu
Jialiang Wang
Yingyan Lin
Peter Vajda
ViT
151
24
0
19 Nov 2021
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Arun Babu
Changhan Wang
Andros Tjandra
Kushal Lakhotia
Qiantong Xu
...
Yatharth Saraf
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
SSL
116
713
0
17 Nov 2021
Towards Interpretable and Reliable Reading Comprehension: A Pipeline Model with Unanswerability Prediction
Kosuke Nishida
Kyosuke Nishida
Itsumi Saito
Sen Yoshida
80
7
0
17 Nov 2021
A Survey on Adversarial Attacks for Malware Analysis
Kshitiz Aryal
Maanak Gupta
Mahmoud Abdelsalam
AAML
114
53
0
16 Nov 2021
Learning Augmentation Distributions using Transformed Risk Minimization
Evangelos Chatzipantazis
Stefanos Pertigkiozoglou
Kostas Daniilidis
Yan Sun
81
15
0
16 Nov 2021
Joint Unsupervised and Supervised Training for Multilingual ASR
Junwen Bai
Yue Liu
Yu Zhang
Ankur Bapna
Nikhil Siddhartha
K. Sim
Tara N. Sainath
88
59
0
15 Nov 2021
Modular Networks Prevent Catastrophic Interference in Model-Based Multi-Task Reinforcement Learning
Robin Schiewer
Laurenz Wiskott
24
3
0
15 Nov 2021
Linear, or Non-Linear, That is the Question!
Taeyong Kong
Taeri Kim
Jinsung Jeon
Jeongwhan Choi
Yeon-Chang Lee
Noseong Park
Sang-Wook Kim
86
62
0
14 Nov 2021
Bi-Discriminator Class-Conditional Tabular GAN
Mohammad Esmaeilpour
Nourhene Chaalia
Adel Abusitta
François-Xavier Devailly
Wissem Maazoun
P. Cardinal
76
12
0
12 Nov 2021
Dynamic Iterative Refinement for Efficient 3D Hand Pose Estimation
John Yang
Yash Bhalgat
Simyung Chang
Fatih Porikli
Nojun Kwak
3DH
58
6
0
11 Nov 2021
Learning Generalized Gumbel-max Causal Mechanisms
Guy Lorberbom
Daniel D. Johnson
Chris J. Maddison
Daniel Tarlow
Tamir Hazan
CML
72
20
0
11 Nov 2021
Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication
Lukasz Kuciński
Tomasz Korbak
P. Kołodziej
Piotr Miłoś
111
20
0
11 Nov 2021