ResearchTrend.AI
Categorical Reparameterization with Gumbel-Softmax

3 November 2016
Eric Jang
S. Gu
Ben Poole
    BDL

Papers citing "Categorical Reparameterization with Gumbel-Softmax"

50 / 3,025 papers shown
Title
Heterogeneous Knowledge for Augmented Modular Reinforcement Learning
Lorenz Wolf
Mirco Musolesi
OffRL
77
0
0
01 Jun 2023
Learning Transformer Programs
Dan Friedman
Alexander Wettig
Danqi Chen
89
36
0
01 Jun 2023
Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition
H. Haresamudram
Irfan Essa
Thomas Ploetz
102
8
0
01 Jun 2023
Joint Learning of Label and Environment Causal Independence for Graph Out-of-Distribution Generalization
Shurui Gui
Meng Liu
Xiner Li
Youzhi Luo
Shuiwang Ji
CML OOD
96
30
0
01 Jun 2023
Differentiable Tree Operations Promote Compositional Generalization
Paul Soulos
J. E. Hu
Kate McCurdy
Yunmo Chen
Roland Fernandez
P. Smolensky
Jianfeng Gao
AI4CE
67
7
0
01 Jun 2023
Nonparametric Identifiability of Causal Representations from Unknown Interventions
Julius von Kügelgen
M. Besserve
Wendong Liang
Luigi Gresele
Armin Kekić
Elias Bareinboim
David M. Blei
Bernhard Schölkopf
CML
186
65
0
01 Jun 2023
DiffInDScene: Diffusion-based High-Quality 3D Indoor Scene Generation
Xiaoliang Ju
Zhaoyang Huang
Yijin Li
Guofeng Zhang
Yu Qiao
Hongsheng Li
111
7
0
01 Jun 2023
Scalable Learning of Latent Language Structure With Logical Offline Cycle Consistency
Mayank Agarwal
Ramón Fernández Astudillo
Tahira Naseem
Subhajit Chaudhury
Pavan Kapanipathi
Salim Roukos
Alexander G. Gray
OffRL
65
0
0
31 May 2023
Beam Tree Recursive Cells
Jishnu Ray Chowdhury
Cornelia Caragea
91
6
0
31 May 2023
Speaking the Language of Your Listener: Audience-Aware Adaptation via Plug-and-Play Theory of Mind
Ece Takmaz
Nicolo' Brandizzi
Mario Giulianelli
Sandro Pezzelle
Raquel Fernández
70
7
0
31 May 2023
Neural Markov Jump Processes
Patrick Seifner
Ramses J. Sanchez
BDL
83
8
0
31 May 2023
NetHack is Hard to Hack
Ulyana Piterbarg
Lerrel Pinto
Rob Fergus
60
7
0
30 May 2023
Graph-based Time Series Clustering for End-to-End Hierarchical Forecasting
Andrea Cini
Danilo Mandic
Cesare Alippi
AI4TS
64
10
0
30 May 2023
Joint Optimization of Class-Specific Training- and Test-Time Data Augmentation in Segmentation
Zeju Li
Konstantinos Kamnitsas
Qi Dou
C. Qin
Ben Glocker
68
6
0
30 May 2023
Autoencoding Conditional Neural Processes for Representation Learning
Victor Prokhorov
Ivan Titov
N. Siddharth
BDL
72
0
0
29 May 2023
Shift-Robust Molecular Relational Learning with Causal Substructure
Namkyeong Lee
Kanghoon Yoon
Gyoung S. Na
Sein Kim
Chanyoung Park
108
16
0
29 May 2023
Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach
Yudi Zhang
Yali Du
Erdun Gao
Ziyan Wang
Jun Wang
Meng Fang
Mykola Pechenizkiy
CML
109
18
0
28 May 2023
Attention Schema in Neural Agents
Dianbo Liu
Samuele Bolotta
He Zhu
Yoshua Bengio
G. Dumas
63
5
0
27 May 2023
DynaShare: Task and Instance Conditioned Parameter Sharing for Multi-Task Learning
E. Rahimian
Golara Javadi
Frederick Tung
Gabriel L. Oliveira
MoE
81
3
0
26 May 2023
GC-Flow: A Graph-Based Flow Network for Effective Clustering
Tianchun Wang
F. Mirzazadeh
Xinming Zhang
Jing Chen
BDL
91
7
0
26 May 2023
COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models
Jinqi Xiao
Miao Yin
Yu Gong
Xiao Zang
Jian Ren
Bo Yuan
VLM ViT
134
9
0
26 May 2023
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks
Kai Zhang
Jun Yu
Eashan Adhikarla
Rong Zhou
Zhilin Yan
...
Xun Chen
Yong Chen
Quanzheng Li
Hongfang Liu
Lichao Sun
LM&MA MedIm
110
186
0
26 May 2023
GRAtt-VIS: Gated Residual Attention for Auto Rectifying Video Instance Segmentation
Tanveer Hannan
Rajat Koner
Maximilian Bernhard
Suprosanna Shit
Bjoern Menze
Volker Tresp
Matthias Schubert
Thomas Seidl
54
4
0
26 May 2023
MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Shiyue Zhang
Shijie Wu
Ozan Irsoy
Steven Lu
Joey Tianyi Zhou
Mark Dredze
David S. Rosenberg
82
10
0
26 May 2023
Kernel Density Matrices for Probabilistic Deep Learning
Fabio A. González
Raúl Ramos-Pollán
Joseph A. Gallego-Mejia
32
2
0
26 May 2023
Differentiable Random Partition Models
Thomas M. Sutter
Alain Ryser
Joram Liebeskind
Julia E. Vogt
97
3
0
26 May 2023
A Score-Based Model for Learning Neural Wavefunctions
Xuan Zhang
Shenglong Xu
Shuiwang Ji
DiffM
66
1
0
25 May 2023
Martian time-series unraveled: A multi-scale nested approach with factorial variational autoencoders
Ali Siahkoohi
Rudy Morel
Randall Balestriero
Erwan Allys
G. Sainton
Taichi Kawamura
Maarten V. de Hoop
145
2
0
25 May 2023
Constrained Probabilistic Mask Learning for Task-specific Undersampled MRI Reconstruction
Tobias Weber
Michael Ingrisch
Bernd Bischl
David Rügamer
77
2
0
25 May 2023
PQA: Exploring the Potential of Product Quantization in DNN Hardware Acceleration
Ahmed F. AbouElhamayed
Angela Cui
Javier Fernandez-Marques
Nicholas D. Lane
Mohamed S. Abdelfattah
MQ
82
6
0
25 May 2023
Differentiable Clustering with Perturbed Spanning Forests
Lawrence Stewart
Francis R. Bach
Felipe Llinares-López
Quentin Berthet
104
11
0
25 May 2023
Pre-training Multi-party Dialogue Models with Latent Discourse Inference
Yiyang Li
Xinting Huang
Wei Bi
Hai Zhao
72
6
0
24 May 2023
SmartTrim: Adaptive Tokens and Attention Pruning for Efficient Vision-Language Models
Zekun Wang
Jingchang Chen
Wangchunshu Zhou
Haichao Zhu
Jiafeng Liang
Liping Shan
Ming Liu
Dongliang Xu
Qing Yang
Bing Qin
VLM
102
5
0
24 May 2023
Decoupled Rationalization with Asymmetric Learning Rates: A Flexible Lipschitz Restraint
Wei Liu
Jun Wang
Yining Qi
Rui Li
Yang Qiu
Yuankai Zhang
Jie Han
Yixiong Zou
99
14
0
23 May 2023
Scaling Speech Technology to 1,000+ Languages
Vineel Pratap
Andros Tjandra
Bowen Shi
Paden Tomasello
Arun Babu
...
Yossi Adi
Xiaohui Zhang
Wei-Ning Hsu
Alexis Conneau
Michael Auli
VLM
169
361
0
22 May 2023
DEGREE: Decomposition Based Explanation For Graph Neural Networks
Qizhang Feng
Ninghao Liu
Fan Yang
Ruixiang Tang
Mengnan Du
Helen Zhou
99
25
0
22 May 2023
More Perspectives Mean Better: Underwater Target Recognition and Localization with Multimodal Data via Symbiotic Transformer and Multiview Regression
Shipei Liu
Xiaoya Fan
Guowei Wu
74
0
0
22 May 2023
Towards Tracing Code Provenance with Code Watermarking
Wei Li
Borui Yang
Yujie Sun
Suyu Chen
Ziyun Song
Liyao Xiang
Xinbing Wang
Cheng Zhou
WaLM
97
6
0
21 May 2023
Infor-Coef: Information Bottleneck-based Dynamic Token Downsampling for Compact and Efficient language model
Wenxin Tan
57
1
0
21 May 2023
Joint Feature and Differentiable $k$-NN Graph Learning using Dirichlet Energy
Lei Xu
Lei Chen
Rong Wang
Feiping Nie
Xuelong Li
87
1
0
21 May 2023
Information Screening whilst Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling
Shengqiong Wu
Hao Fei
Yixin Cao
Lidong Bing
Tat-Seng Chua
93
35
0
19 May 2023
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization
Mengqi Huang
Zhendong Mao
Zhuowei Chen
Yongdong Zhang
MQ
132
41
0
19 May 2023
Attributable and Scalable Opinion Summarization
Tom Hosking
Hao Tang
Mirella Lapata
71
9
0
19 May 2023
Characterizing tradeoffs between teaching via language and demonstrations in multi-agent systems
Dhara Yu
Noah D. Goodman
Jesse Mu
LLMAG
65
1
0
19 May 2023
Balancing Test Accuracy and Security in Computerized Adaptive Testing
Wanyong Feng
Aritra Ghosh
S. Sireci
Andrew Lan
83
5
0
18 May 2023
Less Can Be More: Unsupervised Graph Pruning for Large-scale Dynamic Graphs
Jintang Li
Sheng Tian
Ruofan Wu
Liang Zhu
Welong Zhao
Changhua Meng
Liang Chen
Zibin Zheng
Hongzhi Yin
118
10
0
18 May 2023
Content-Adaptive Downsampling in Convolutional Neural Networks
Robin Hesse
Simone Schaub-Meyer
Stefan Roth
106
5
0
16 May 2023
Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks
Minyoung Huh
Brian Cheung
Pulkit Agrawal
Phillip Isola
MQ
62
55
0
15 May 2023
Predicting COVID-19 pandemic by spatio-temporal graph neural networks: A New Zealand's study
V. Nguyen
Truong-Son Hy
Long Tran-Thanh
N. Nghiem
93
8
0
12 May 2023
Exploring the Rate-Distortion-Complexity Optimization in Neural Image Compression
Yixin Gao
Runsen Feng
Zongyu Guo
Zhibo Chen
69
6
0
12 May 2023