Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.01144
Cited By
v1
v2
v3
v4
v5 (latest)
Categorical Reparameterization with Gumbel-Softmax
3 November 2016
Eric Jang
S. Gu
Ben Poole
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Categorical Reparameterization with Gumbel-Softmax"
50 / 3,025 papers shown
Title
Heterogeneous Knowledge for Augmented Modular Reinforcement Learning
Lorenz Wolf
Mirco Musolesi
OffRL
77
0
0
01 Jun 2023
Learning Transformer Programs
Dan Friedman
Alexander Wettig
Danqi Chen
89
36
0
01 Jun 2023
Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition
H. Haresamudram
Irfan Essa
Thomas Ploetz
102
8
0
01 Jun 2023
Joint Learning of Label and Environment Causal Independence for Graph Out-of-Distribution Generalization
Shurui Gui
Meng Liu
Xiner Li
Youzhi Luo
Shuiwang Ji
CML
OOD
96
30
0
01 Jun 2023
Differentiable Tree Operations Promote Compositional Generalization
Paul Soulos
J. E. Hu
Kate McCurdy
Yunmo Chen
Roland Fernandez
P. Smolensky
Jianfeng Gao
AI4CE
67
7
0
01 Jun 2023
Nonparametric Identifiability of Causal Representations from Unknown Interventions
Julius von Kügelgen
M. Besserve
Wendong Liang
Luigi Gresele
Armin Kekić
Elias Bareinboim
David M. Blei
Bernhard Schölkopf
CML
186
65
0
01 Jun 2023
DiffInDScene: Diffusion-based High-Quality 3D Indoor Scene Generation
Xiaoliang Ju
Zhaoyang Huang
Yijin Li
Guofeng Zhang
Yu Qiao
Hongsheng Li
111
7
0
01 Jun 2023
Scalable Learning of Latent Language Structure With Logical Offline Cycle Consistency
Mayank Agarwal
Ramón Fernández Astudillo
Tahira Naseem
Subhajit Chaudhury
Pavan Kapanipathi
Salim Roukos
Alexander G. Gray
OffRL
65
0
0
31 May 2023
Beam Tree Recursive Cells
Jishnu Ray Chowdhury
Cornelia Caragea
91
6
0
31 May 2023
Speaking the Language of Your Listener: Audience-Aware Adaptation via Plug-and-Play Theory of Mind
Ece Takmaz
Nicolo' Brandizzi
Mario Giulianelli
Sandro Pezzelle
Raquel Fernández
70
7
0
31 May 2023
Neural Markov Jump Processes
Patrick Seifner
Ramses J. Sanchez
BDL
83
8
0
31 May 2023
NetHack is Hard to Hack
Ulyana Piterbarg
Lerrel Pinto
Rob Fergus
60
7
0
30 May 2023
Graph-based Time Series Clustering for End-to-End Hierarchical Forecasting
Andrea Cini
Danilo Mandic
Cesare Alippi
AI4TS
64
10
0
30 May 2023
Joint Optimization of Class-Specific Training- and Test-Time Data Augmentation in Segmentation
Zeju Li
Konstantinos Kamnitsas
Qi Dou
C. Qin
Ben Glocker
68
6
0
30 May 2023
Autoencoding Conditional Neural Processes for Representation Learning
Victor Prokhorov
Ivan Titov
N. Siddharth
BDL
72
0
0
29 May 2023
Shift-Robust Molecular Relational Learning with Causal Substructure
Namkyeong Lee
Kanghoon Yoon
Gyoung S. Na
Sein Kim
Chanyoung Park
108
16
0
29 May 2023
Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach
Yudi Zhang
Yali Du
Erdun Gao
Ziyan Wang
Jun Wang
Meng Fang
Mykola Pechenizkiy
CML
109
18
0
28 May 2023
Attention Schema in Neural Agents
Dianbo Liu
Samuele Bolotta
He Zhu
Yoshua Bengio
G. Dumas
63
5
0
27 May 2023
DynaShare: Task and Instance Conditioned Parameter Sharing for Multi-Task Learning
E. Rahimian
Golara Javadi
Frederick Tung
Gabriel L. Oliveira
MoE
81
3
0
26 May 2023
GC-Flow: A Graph-Based Flow Network for Effective Clustering
Tianchun Wang
F. Mirzazadeh
Xinming Zhang
Jing Chen
BDL
91
7
0
26 May 2023
COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models
Jinqi Xiao
Miao Yin
Yu Gong
Xiao Zang
Jian Ren
Bo Yuan
VLM
ViT
134
9
0
26 May 2023
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks
Kai Zhang
Jun Yu
Eashan Adhikarla
Rong Zhou
Zhilin Yan
...
Xun Chen
Yong Chen
Quanzheng Li
Hongfang Liu
Lichao Sun
LM&MA
MedIm
110
186
0
26 May 2023
GRAtt-VIS: Gated Residual Attention for Auto Rectifying Video Instance Segmentation
Tanveer Hannan
Rajat Koner
Maximilian Bernhard
Suprosanna Shit
Bjoern Menze
Volker Tresp
Matthias Schubert
Thomas Seidl
54
4
0
26 May 2023
MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
Shiyue Zhang
Shijie Wu
Ozan Irsoy
Steven Lu
Joey Tianyi Zhou
Mark Dredze
David S. Rosenberg
82
10
0
26 May 2023
Kernel Density Matrices for Probabilistic Deep Learning
Fabio A. González
Raúl Ramos-Pollán
Joseph A. Gallego-Mejia
32
2
0
26 May 2023
Differentiable Random Partition Models
Thomas M. Sutter
Alain Ryser
Joram Liebeskind
Julia E. Vogt
97
3
0
26 May 2023
A Score-Based Model for Learning Neural Wavefunctions
Xuan Zhang
Shenglong Xu
Shuiwang Ji
DiffM
66
1
0
25 May 2023
Martian time-series unraveled: A multi-scale nested approach with factorial variational autoencoders
Ali Siahkoohi
Rudy Morel
Randall Balestriero
Erwan Allys
G. Sainton
Taichi Kawamura
Maarten V. de Hoop
145
2
0
25 May 2023
Constrained Probabilistic Mask Learning for Task-specific Undersampled MRI Reconstruction
Tobias Weber
Michael Ingrisch
Bernd Bischl
David Rügamer
77
2
0
25 May 2023
PQA: Exploring the Potential of Product Quantization in DNN Hardware Acceleration
Ahmed F. AbouElhamayed
Angela Cui
Javier Fernandez-Marques
Nicholas D. Lane
Mohamed S. Abdelfattah
MQ
82
6
0
25 May 2023
Differentiable Clustering with Perturbed Spanning Forests
Lawrence Stewart
Francis R. Bach
Felipe Llinares-López
Quentin Berthet
104
11
0
25 May 2023
Pre-training Multi-party Dialogue Models with Latent Discourse Inference
Yiyang Li
Xinting Huang
Wei Bi
Hai Zhao
72
6
0
24 May 2023
SmartTrim: Adaptive Tokens and Attention Pruning for Efficient Vision-Language Models
Zekun Wang
Jingchang Chen
Wangchunshu Zhou
Haichao Zhu
Jiafeng Liang
Liping Shan
Ming Liu
Dongliang Xu
Qing Yang
Bing Qin
VLM
102
5
0
24 May 2023
Decoupled Rationalization with Asymmetric Learning Rates: A Flexible Lipschitz Restraint
Wei Liu
Jun Wang
Yining Qi
Rui Li
Yang Qiu
Yuankai Zhang
Jie Han
Yixiong Zou
99
14
0
23 May 2023
Scaling Speech Technology to 1,000+ Languages
Vineel Pratap
Andros Tjandra
Bowen Shi
Paden Tomasello
Arun Babu
...
Yossi Adi
Xiaohui Zhang
Wei-Ning Hsu
Alexis Conneau
Michael Auli
VLM
169
361
0
22 May 2023
DEGREE: Decomposition Based Explanation For Graph Neural Networks
Qizhang Feng
Ninghao Liu
Fan Yang
Ruixiang Tang
Mengnan Du
Helen Zhou
99
25
0
22 May 2023
More Perspectives Mean Better: Underwater Target Recognition and Localization with Multimodal Data via Symbiotic Transformer and Multiview Regression
Shipei Liu
Xiaoya Fan
Guowei Wu
74
0
0
22 May 2023
Towards Tracing Code Provenance with Code Watermarking
Wei Li
Borui Yang
Yujie Sun
Suyu Chen
Ziyun Song
Liyao Xiang
Xinbing Wang
Cheng Zhou
WaLM
97
6
0
21 May 2023
Infor-Coef: Information Bottleneck-based Dynamic Token Downsampling for Compact and Efficient language model
Wenxin Tan
57
1
0
21 May 2023
Joint Feature and Differentiable
k
k
k
-NN Graph Learning using Dirichlet Energy
Lei Xu
Lei Chen
Rong Wang
Feiping Nie
Xuelong Li
87
1
0
21 May 2023
Information Screening whilst Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling
Shengqiong Wu
Hao Fei
Yixin Cao
Lidong Bing
Tat-Seng Chua
93
35
0
19 May 2023
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization
Mengqi Huang
Zhendong Mao
Zhuowei Chen
Yongdong Zhang
MQ
132
41
0
19 May 2023
Attributable and Scalable Opinion Summarization
Tom Hosking
Hao Tang
Mirella Lapata
71
9
0
19 May 2023
Characterizing tradeoffs between teaching via language and demonstrations in multi-agent systems
Dhara Yu
Noah D. Goodman
Jesse Mu
LLMAG
65
1
0
19 May 2023
Balancing Test Accuracy and Security in Computerized Adaptive Testing
Wanyong Feng
Aritra Ghosh
S. Sireci
Andrew Lan
83
5
0
18 May 2023
Less Can Be More: Unsupervised Graph Pruning for Large-scale Dynamic Graphs
Jintang Li
Sheng Tian
Ruofan Wu
Liang Zhu
Welong Zhao
Changhua Meng
Liang Chen
Zibin Zheng
Hongzhi Yin
118
10
0
18 May 2023
Content-Adaptive Downsampling in Convolutional Neural Networks
Robin Hesse
Simone Schaub-Meyer
Stefan Roth
106
5
0
16 May 2023
Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks
Minyoung Huh
Brian Cheung
Pulkit Agrawal
Phillip Isola
MQ
62
55
0
15 May 2023
Predicting COVID-19 pandemic by spatio-temporal graph neural networks: A New Zealand's study
V. Nguyen
Truong-Son Hy
Long Tran-Thanh
N. Nghiem
93
8
0
12 May 2023
Exploring the Rate-Distortion-Complexity Optimization in Neural Image Compression
Yixin Gao
Runsen Feng
Zongyu Guo
Zhibo Chen
69
6
0
12 May 2023
Previous
1
2
3
...
17
18
19
...
59
60
61
Next