ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.01144
  4. Cited By
Categorical Reparameterization with Gumbel-Softmax
v1v2v3v4v5 (latest)

Categorical Reparameterization with Gumbel-Softmax

3 November 2016
Eric Jang
S. Gu
Ben Poole
    BDL
ArXiv (abs)PDFHTML

Papers citing "Categorical Reparameterization with Gumbel-Softmax"

50 / 3,025 papers shown
Title
Adversarial Augmentation Training Makes Action Recognition Models More
  Robust to Realistic Video Distribution Shifts
Adversarial Augmentation Training Makes Action Recognition Models More Robust to Realistic Video Distribution Shifts
Kiyoon Kim
Shreyank N. Gowda
Panagiotis Eustratiadis
Antreas Antoniou
Robert B Fisher
110
2
0
21 Jan 2024
OrchMoE: Efficient Multi-Adapter Learning with Task-Skill Synergy
OrchMoE: Efficient Multi-Adapter Learning with Task-Skill Synergy
Haowen Wang
Tao Sun
Kaixiang Ji
Jian Wang
Cong Fan
Jinjie Gu
37
1
0
19 Jan 2024
Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through
  Text Reconstruction
Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction
Qingyun Wang
Zixuan Zhang
Hongxiang Li
Xuan Liu
Jiawei Han
Huimin Zhao
Heng Ji
141
1
0
18 Jan 2024
Improving Local Training in Federated Learning via Temperature Scaling
Improving Local Training in Federated Learning via Temperature Scaling
Kichang Lee
Songkuk Kim
Jeonggil Ko
FedML
80
1
0
18 Jan 2024
Towards Generative Abstract Reasoning: Completing Raven's Progressive
  Matrix via Rule Abstraction and Selection
Towards Generative Abstract Reasoning: Completing Raven's Progressive Matrix via Rule Abstraction and Selection
Fan Shi
Bin Li
Xiangyang Xue
ReLMLRM
80
3
0
18 Jan 2024
FREED++: Improving RL Agents for Fragment-Based Molecule Generation by
  Thorough Reproduction
FREED++: Improving RL Agents for Fragment-Based Molecule Generation by Thorough Reproduction
Alexander Telepov
Artem Tsypin
Kuzma Khrabrov
Sergey Yakukhnov
Pavel Strashnov
...
Egor Rumiantsev
Daniel Ezhov
Manvel Avetisian
Olga Popova
Artur Kadurin
74
5
0
18 Jan 2024
Adaptive Self-training Framework for Fine-grained Scene Graph Generation
Adaptive Self-training Framework for Fine-grained Scene Graph Generation
Kibum Kim
Kanghoon Yoon
Yeonjun In
Jinyoung Moon
Donghyun Kim
Chanyoung Park
88
9
0
18 Jan 2024
Convex and Bilevel Optimization for Neuro-Symbolic Inference and
  Learning
Convex and Bilevel Optimization for Neuro-Symbolic Inference and Learning
Charles Dickens
Changyu Gao
Connor Pryor
Stephen J. Wright
Lise Getoor
120
3
0
17 Jan 2024
AgentMixer: Multi-Agent Correlated Policy Factorization
AgentMixer: Multi-Agent Correlated Policy Factorization
Zhiyuan Li
Wenshuai Zhao
Lijun Wu
Joni Pajarinen
OffRL
84
2
0
16 Jan 2024
Optimization of Discrete Parameters Using the Adaptive Gradient Method
  and Directed Evolution
Optimization of Discrete Parameters Using the Adaptive Gradient Method and Directed Evolution
Andrei Beinarovich
Sergey Stepanov
Alexander Zaslavsky
82
0
0
12 Jan 2024
Cross-Attention Watermarking of Large Language Models
Cross-Attention Watermarking of Large Language Models
Folco Bertini Baldassini
H. Nguyen
Ching-Chung Chang
Isao Echizen
WaLM
48
2
0
12 Jan 2024
Video Super-Resolution Transformer with Masked Inter&Intra-Frame
  Attention
Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention
Xingyu Zhou
Leheng Zhang
Xiaorui Zhao
Keze Wang
Leida Li
Shuhang Gu
SupR
92
10
0
12 Jan 2024
Unifying Graph Contrastive Learning via Graph Message Augmentation
Unifying Graph Contrastive Learning via Graph Message Augmentation
Ziyan Zhang
Bo Jiang
Jin Tang
Bin Luo
81
1
0
08 Jan 2024
Denoising Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors
Denoising Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors
Wasu Top Piriyakulkij
Yingheng Wang
Volodymyr Kuleshov
DiffM
145
1
0
05 Jan 2024
Bayesian Intrinsic Groupwise Image Registration: Unsupervised
  Disentanglement of Anatomy and Geometry
Bayesian Intrinsic Groupwise Image Registration: Unsupervised Disentanglement of Anatomy and Geometry
Xinzhe Luo
Xin Wang
Linda Shapiro
Chun Yuan
Jianfeng Feng
Xiahai Zhuang
79
0
0
04 Jan 2024
ReFusion: Improving Natural Language Understanding with
  Computation-Efficient Retrieval Representation Fusion
ReFusion: Improving Natural Language Understanding with Computation-Efficient Retrieval Representation Fusion
Shangyu Wu
Ying Xiong
Yufei Cui
Xue Liu
Buzhou Tang
Tei-Wei Kuo
Chun Jason Xue
96
2
0
04 Jan 2024
Unsupervised Object-Centric Learning from Multiple Unspecified
  Viewpoints
Unsupervised Object-Centric Learning from Multiple Unspecified Viewpoints
Jinyang Yuan
Tonglin Chen
Zhimeng Shen
Bin Li
Xiangyang Xue
OCL
68
4
0
03 Jan 2024
Modular Learning of Deep Causal Generative Models for High-dimensional
  Causal Inference
Modular Learning of Deep Causal Generative Models for High-dimensional Causal Inference
Md Musfiqur Rahman
Murat Kocaoglu
OOD
85
3
0
02 Jan 2024
On Discprecncies between Perturbation Evaluations of Graph Neural
  Network Attributions
On Discprecncies between Perturbation Evaluations of Graph Neural Network Attributions
Razieh Rezaei
Alireza Dizaji
Ashkan Khakzar
Anees Kazi
Nassir Navab
Daniel Rueckert
45
0
0
01 Jan 2024
HQ-VAE: Hierarchical Discrete Representation Learning with Variational
  Bayes
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes
Yuhta Takida
Yukara Ikemiya
Takashi Shibuya
Kazuki Shimada
Woosung Choi
...
Naoki Murata
Toshimitsu Uesaka
Kengo Uchida
Wei-Hsiang Liao
Yuki Mitsufuji
BDL
112
13
0
31 Dec 2023
Mitigating Degree Biases in Message Passing Mechanism by Utilizing
  Community Structures
Mitigating Degree Biases in Message Passing Mechanism by Utilizing Community Structures
Van Thuy Hoang
O-Joun Lee
50
7
0
28 Dec 2023
Deep Learning for Efficient GWAS Feature Selection
Deep Learning for Efficient GWAS Feature Selection
Kexuan Li
48
0
0
22 Dec 2023
Contextual Feature Selection with Conditional Stochastic Gates
Contextual Feature Selection with Conditional Stochastic Gates
Ram Dyuthi Sristi
Ofir Lindenbaum
Shira Lifshitz
Maria Lavzin
Jackie Schiller
Zhengchao Wan
Hadas Benisty
58
2
0
21 Dec 2023
Adapt & Align: Continual Learning with Generative Models Latent Space
  Alignment
Adapt & Align: Continual Learning with Generative Models Latent Space Alignment
Kamil Deja
Bartosz Cywiński
Jan Rybarczyk
Tomasz Trzciñski
CLLDRL
58
0
0
21 Dec 2023
Sign Language Production with Latent Motion Transformer
Sign Language Production with Latent Motion Transformer
Pan Xie
Taiying Peng
Yao Du
Qipeng Zhang
SLR
71
5
0
20 Dec 2023
Model-Based Control with Sparse Neural Dynamics
Model-Based Control with Sparse Neural Dynamics
Ziang Liu
Genggeng Zhou
Jeff He
Tobia Marcucci
Fei-Fei Li
Jiajun Wu
Yunzhu Li
AI4CE
92
18
0
20 Dec 2023
SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete
  Diffusion Process
SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process
Meng Wang
Henghui Ding
Jun Hao Liew
Jiajun Liu
Yao-Min Zhao
Yunchao Wei
DiffM
109
19
0
19 Dec 2023
GCNext: Towards the Unity of Graph Convolutions for Human Motion
  Prediction
GCNext: Towards the Unity of Graph Convolutions for Human Motion Prediction
Xinshun Wang
Qiongjie Cui
Chong Chen
Mengyuan Liu
3DH
66
11
0
19 Dec 2023
Disentangling continuous and discrete linguistic signals in
  transformer-based sentence embeddings
Disentangling continuous and discrete linguistic signals in transformer-based sentence embeddings
Vivi Nastase
Paola Merlo
109
0
0
18 Dec 2023
Efficiency-oriented approaches for self-supervised speech representation
  learning
Efficiency-oriented approaches for self-supervised speech representation learning
Luis Lugo
Valentin Vielzeuf
SSL
64
1
0
18 Dec 2023
Adaptive Computation Modules: Granular Conditional Computation For
  Efficient Inference
Adaptive Computation Modules: Granular Conditional Computation For Efficient Inference
Bartosz Wójcik
Alessio Devoto
Karol Pustelnik
Pasquale Minervini
Simone Scardapane
86
6
0
15 Dec 2023
Exploration of visual prompt in Grounded pre-trained open-set detection
Exploration of visual prompt in Grounded pre-trained open-set detection
Qibo Chen
Weizhong Jin
Shuchang Li
Mengdi Liu
Li Yu
Jian Jiang
Xiaozheng Wang
VLM
28
0
0
14 Dec 2023
ViLA: Efficient Video-Language Alignment for Video Question Answering
ViLA: Efficient Video-Language Alignment for Video Question Answering
Xijun Wang
Junbang Liang
Chun-Kai Wang
Kenan Deng
Yu Lou
Ming-Chyuan Lin
Shan Yang
116
15
0
13 Dec 2023
A Survey of Text Watermarking in the Era of Large Language Models
A Survey of Text Watermarking in the Era of Large Language Models
Aiwei Liu
Leyi Pan
Yijian Lu
Jingjing Li
Xuming Hu
Xi Zhang
Lijie Wen
Irwin King
Hui Xiong
Philip S. Yu
WaLM
117
66
0
13 Dec 2023
Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs
  for Embodied AI
Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI
Kai Huang
Boyuan Yang
Wei Gao
68
1
0
13 Dec 2023
Building Universal Foundation Models for Medical Image Analysis with
  Spatially Adaptive Networks
Building Universal Foundation Models for Medical Image Analysis with Spatially Adaptive Networks
Lingxiao Luo
Xuanzhong Chen
Bingda Tang
Xinsheng Chen
Rong Han
Chengpeng Hu
Yujiang Li
Ting Chen
MedIm
72
2
0
12 Dec 2023
RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos
RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos
Tanveer Hannan
Md. Mohaiminul Islam
Thomas Seidl
Gedas Bertasius
90
4
0
11 Dec 2023
Concrete Subspace Learning based Interference Elimination for Multi-task
  Model Fusion
Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion
Anke Tang
Li Shen
Yong Luo
Liang Ding
Han Hu
Bo Du
Dacheng Tao
MoMe
96
22
0
11 Dec 2023
Transformer-based Selective Super-Resolution for Efficient Image
  Refinement
Transformer-based Selective Super-Resolution for Efficient Image Refinement
Tianyi Zhang
Kishore Kasichainula
Yaoxin Zhuo
Baoxin Li
Jae-sun Seo
Yu Cao
48
7
0
10 Dec 2023
Exploring Sparsity in Graph Transformers
Exploring Sparsity in Graph Transformers
Chuang Liu
Yibing Zhan
Xueqi Ma
Liang Ding
Dapeng Tao
Hongzhi Zhang
Wenbin Hu
Bo Du
94
7
0
09 Dec 2023
Mastering Complex Coordination through Attention-based Dynamic Graph
Mastering Complex Coordination through Attention-based Dynamic Graph
Guangchong Zhou
Zhiwei Xu
Zeren Zhang
Guoliang Fan
GNN
89
0
0
07 Dec 2023
Enhancing the Rationale-Input Alignment for Self-explaining
  Rationalization
Enhancing the Rationale-Input Alignment for Self-explaining Rationalization
Wei Liu
Yining Qi
Jun Wang
Zhiying Deng
Yuankai Zhang
Chengwei Wang
Ruixuan Li
80
11
0
07 Dec 2023
Language Model Alignment with Elastic Reset
Language Model Alignment with Elastic Reset
Michael Noukhovitch
Samuel Lavoie
Florian Strub
Aaron Courville
KELM
161
27
0
06 Dec 2023
Balanced Marginal and Joint Distributional Learning via Mixture
  Cramer-Wold Distance
Balanced Marginal and Joint Distributional Learning via Mixture Cramer-Wold Distance
SeungHwan An
Sungchul Hong
Jong-June Jeon
64
0
0
06 Dec 2023
Customizable Combination of Parameter-Efficient Modules for Multi-Task
  Learning
Customizable Combination of Parameter-Efficient Modules for Multi-Task Learning
Haowen Wang
Tao Sun
Cong Fan
Jinjie Gu
MoE
74
7
0
06 Dec 2023
Towards Causal Representations of Climate Model Data
Towards Causal Representations of Climate Model Data
Julien Boussard
Chandni Nagda
Julia Kaltenborn
C. E. E. Lange
Philippe Brouillard
Yaniv Gurwicz
Peer Nowack
David Rolnick
76
5
0
05 Dec 2023
Constrained Twin Variational Auto-Encoder for Intrusion Detection in IoT
  Systems
Constrained Twin Variational Auto-Encoder for Intrusion Detection in IoT Systems
Phai Vu Dinh
Quang-Uy Nguyen
D. Hoang
Diep N. Nguyen
Son Pham Bao
E. Dutkiewicz
AAML
62
6
0
05 Dec 2023
Learn2Extend: Extending sequences by retaining their statistical
  properties with mixture models
Learn2Extend: Extending sequences by retaining their statistical properties with mixture models
Dimitris Vartziotis
George Dasoulas
Florian Pausinger
121
0
0
03 Dec 2023
Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters
Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters
James Seale Smith
Yen-Chang Hsu
Z. Kira
Yilin Shen
Hongxia Jin
DiffM
112
6
0
30 Nov 2023
Improving the Robustness of Quantized Deep Neural Networks to White-Box
  Attacks using Stochastic Quantization and Information-Theoretic Ensemble
  Training
Improving the Robustness of Quantized Deep Neural Networks to White-Box Attacks using Stochastic Quantization and Information-Theoretic Ensemble Training
Saurabh Farkya
Aswin Raghavan
Avi Ziskind
70
0
0
30 Nov 2023
Previous
123...101112...596061
Next