ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.01144
  4. Cited By
Categorical Reparameterization with Gumbel-Softmax
v1v2v3v4v5 (latest)

Categorical Reparameterization with Gumbel-Softmax

3 November 2016
Eric Jang
S. Gu
Ben Poole
    BDL
ArXiv (abs)PDFHTML

Papers citing "Categorical Reparameterization with Gumbel-Softmax"

50 / 3,025 papers shown
Title
GNN-based Probabilistic Supply and Inventory Predictions in Supply Chain
  Networks
GNN-based Probabilistic Supply and Inventory Predictions in Supply Chain Networks
Hyung-il Ahn
Young Chol Song
Santiago Olivar
Hershel Mehta
Naveen Tewari
28
3
0
11 Apr 2024
Sketch-Plan-Generalize: Learning and Planning with Neuro-Symbolic Programmatic Representations for Inductive Spatial Concepts
Sketch-Plan-Generalize: Learning and Planning with Neuro-Symbolic Programmatic Representations for Inductive Spatial Concepts
Namasivayam Kalithasan
Sachit Sachdeva
H. Singh
Vishal Bindal
Arnav Tuli
Gurarmaan Singh Panjeta
Divyanshu Aggarwal
Rohan Paul
Rohan Paul
Parag Singla
79
0
0
11 Apr 2024
Matching 2D Images in 3D: Metric Relative Pose from Metric
  Correspondences
Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
Axel Barroso-Laguna
Sowmya P. Munukutla
V. Prisacariu
Eric Brachmann
3DV
70
14
0
09 Apr 2024
Deep Learning-Based Out-of-distribution Source Code Data Identification:
  How Far Have We Gone?
Deep Learning-Based Out-of-distribution Source Code Data Identification: How Far Have We Gone?
Van Nguyen
Xingliang Yuan
Tingmin Wu
Surya Nepal
M. Grobler
Carsten Rudolph
86
1
0
09 Apr 2024
Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning
Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning
Andrei Semenov
Vladimir Ivanov
Aleksandr Beznosikov
Alexander Gasnikov
73
6
0
04 Apr 2024
Dynamic Neural Control Flow Execution: An Agent-Based Deep Equilibrium
  Approach for Binary Vulnerability Detection
Dynamic Neural Control Flow Execution: An Agent-Based Deep Equilibrium Approach for Binary Vulnerability Detection
Litao Li
Steven H. H. Ding
Andrew Walenstein
P. Charland
Benjamin C. M. Fung
65
0
0
03 Apr 2024
VideoDistill: Language-aware Vision Distillation for Video Question
  Answering
VideoDistill: Language-aware Vision Distillation for Video Question Answering
Bo Zou
Chao Yang
Yu Qiao
Chengbin Quan
Youjian Zhao
VGen
89
1
0
01 Apr 2024
A Review of Modern Recommender Systems Using Generative Models
  (Gen-RecSys)
A Review of Modern Recommender Systems Using Generative Models (Gen-RecSys)
Yashar Deldjoo
Zhankui He
Julian McAuley
Anton Korikov
Scott Sanner
Arnau Ramisa
René Vidal
M. Sathiamoorthy
Atoosa Kasirzadeh
Silvia Milano
VLM
152
61
0
31 Mar 2024
From Similarity to Superiority: Channel Clustering for Time Series
  Forecasting
From Similarity to Superiority: Channel Clustering for Time Series Forecasting
Jialin Chen
J. E. Lenssen
Aosong Feng
Weihua Hu
Matthias Fey
Leandros Tassiulas
J. Leskovec
Rex Ying
AI4TS
86
16
0
31 Mar 2024
LLMs are Good Action Recognizers
LLMs are Good Action Recognizers
Haoxuan Qu
Yujun Cai
Jun Liu
107
21
0
31 Mar 2024
Transformer based Pluralistic Image Completion with Reduced Information
  Loss
Transformer based Pluralistic Image Completion with Reduced Information Loss
Qiankun Liu
Yuqi Jiang
Zhentao Tan
DongDong Chen
Ying Fu
Qi Chu
Gang Hua
Nenghai Yu
ViT
114
12
0
31 Mar 2024
D-PAD: Deep-Shallow Multi-Frequency Patterns Disentangling for Time
  Series Forecasting
D-PAD: Deep-Shallow Multi-Frequency Patterns Disentangling for Time Series Forecasting
Xiaobing Yuan
Ling Chen
AI4TS
77
2
0
26 Mar 2024
Intrinsic Subgraph Generation for Interpretable Graph based Visual
  Question Answering
Intrinsic Subgraph Generation for Interpretable Graph based Visual Question Answering
Pascal Tilli
Ngoc Thang Vu
80
1
0
26 Mar 2024
Parametric PDE Control with Deep Reinforcement Learning and
  Differentiable L0-Sparse Polynomial Policies
Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies
N. Botteghi
Urban Fasel
AI4CE
108
6
0
22 Mar 2024
CLIP-VQDiffusion : Langauge Free Training of Text To Image generation
  using CLIP and vector quantized diffusion model
CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusion model
S. Han
Joohee Kim
DiffMCLIP
76
2
0
22 Mar 2024
Auto-Train-Once: Controller Network Guided Automatic Network Pruning
  from Scratch
Auto-Train-Once: Controller Network Guided Automatic Network Pruning from Scratch
Xidong Wu
Shangqian Gao
Zeyu Zhang
Zhenzhen Li
Runxue Bao
Yanfu Zhang
Xiaoqian Wang
Heng-Chiao Huang
68
11
0
21 Mar 2024
Predictive, scalable and interpretable knowledge tracing on structured
  domains
Predictive, scalable and interpretable knowledge tracing on structured domains
Hanqi Zhou
Robert Bamler
Charley M. Wu
Álvaro Tejero-Cantero
AI4Ed
67
10
0
19 Mar 2024
HCPM: Hierarchical Candidates Pruning for Efficient Detector-Free
  Matching
HCPM: Hierarchical Candidates Pruning for Efficient Detector-Free Matching
Ying Chen
Yong-Jin Liu
Kai Wu
Qiang Nie
Shang Xu
Huifang Ma
Bing Wang
Chengjie Wang
VLM
66
1
0
19 Mar 2024
Non-negative Contrastive Learning
Non-negative Contrastive Learning
Yifei Wang
Qi Zhang
Yaoyu Guo
Yisen Wang
94
12
0
19 Mar 2024
CASPER: Causality-Aware Spatiotemporal Graph Neural Networks for
  Spatiotemporal Time Series Imputation
CASPER: Causality-Aware Spatiotemporal Graph Neural Networks for Spatiotemporal Time Series Imputation
Baoyu Jing
Dawei Zhou
Kan Ren
Carl Yang
CMLAI4TS
103
10
0
18 Mar 2024
Language Evolution with Deep Learning
Language Evolution with Deep Learning
Mathieu Rita
Paul Michel
Rahma Chaabouni
Olivier Pietquin
Emmanuel Dupoux
Florian Strub
65
3
0
18 Mar 2024
SSCAE -- Semantic, Syntactic, and Context-aware natural language
  Adversarial Examples generator
SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generator
J. Asl
Mohammad H. Rafiei
Manar Alohaly
Daniel Takabi
AAMLSILM
35
3
0
18 Mar 2024
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT
  Adaptation
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation
Wangbo Zhao
Jiasheng Tang
Yizeng Han
Yibing Song
Kai Wang
Gao Huang
F. Wang
Yang You
128
12
0
18 Mar 2024
HyperVQ: MLR-based Vector Quantization in Hyperbolic Space
HyperVQ: MLR-based Vector Quantization in Hyperbolic Space
Nabarun Goswami
Yusuke Mukuta
Tatsuya Harada
128
4
0
18 Mar 2024
Concept-Best-Matching: Evaluating Compositionality in Emergent
  Communication
Concept-Best-Matching: Evaluating Compositionality in Emergent Communication
Boaz Carmeli
Yonatan Belinkov
Ron Meir
63
4
0
17 Mar 2024
Generation is better than Modification: Combating High Class Homophily
  Variance in Graph Anomaly Detection
Generation is better than Modification: Combating High Class Homophily Variance in Graph Anomaly Detection
Rui Zhang
Dawei Cheng
Xin Liu
Jie Yang
Ouyang Yi
Xian Wu
Yefeng Zheng
65
3
0
15 Mar 2024
Quiet-STaR: Language Models Can Teach Themselves to Think Before
  Speaking
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
E. Zelikman
Georges Harik
Yijia Shao
Varuna Jayasiri
Nick Haber
Noah D. Goodman
LLMAGReLMLRM
140
151
0
14 Mar 2024
Exploiting Structural Consistency of Chest Anatomy for Unsupervised
  Anomaly Detection in Radiography Images
Exploiting Structural Consistency of Chest Anatomy for Unsupervised Anomaly Detection in Radiography Images
Tiange Xiang
Yixiao Zhang
Yongyi Lu
Alan Yuille
Chaoyi Zhang
Weidong Cai
Zongwei Zhou
89
3
0
13 Mar 2024
An Efficient End-to-End Approach to Noise Invariant Speech Features via
  Multi-Task Learning
An Efficient End-to-End Approach to Noise Invariant Speech Features via Multi-Task Learning
Heitor R. Guimarães
Arthur Pimentel
Anderson R. Avila
Mehdi Rezagholizadeh
Boxing Chen
Tiago H. Falk
119
1
0
13 Mar 2024
Unleashing the Power of Meta-tuning for Few-shot Generalization Through
  Sparse Interpolated Experts
Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated Experts
Shengzhuang Chen
Jihoon Tack
Yunqiao Yang
Yee Whye Teh
Jonathan Richard Schwarz
Ying Wei
MoE
125
4
0
13 Mar 2024
Learning-driven Physically-aware Large-scale Circuit Gate Sizing
Learning-driven Physically-aware Large-scale Circuit Gate Sizing
Yuyang Ye
Peng Xu
Lizheng Ren
Tinghuan Chen
Hao Yan
Bei Yu
L. Shi
AI4CE
80
0
0
13 Mar 2024
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Sainbayar Sukhbaatar
O. Yu. Golovneva
Vasu Sharma
Hu Xu
Xi Lin
...
Jacob Kahn
Shang-Wen Li
Wen-tau Yih
Jason Weston
Xian Li
MoMeOffRLMoE
98
69
0
12 Mar 2024
Conditional computation in neural networks: principles and research
  trends
Conditional computation in neural networks: principles and research trends
Simone Scardapane
Alessandro Baiocchi
Alessio Devoto
V. Marsocci
Pasquale Minervini
Jary Pomponi
104
2
0
12 Mar 2024
A Logical Pattern Memory Pre-trained Model for Entailment Tree
  Generation
A Logical Pattern Memory Pre-trained Model for Entailment Tree Generation
Li Yuan
Yi Cai
Haopeng Ren
Jiexin Wang
LRM
69
5
0
11 Mar 2024
UniTable: Towards a Unified Framework for Table Recognition via
  Self-Supervised Pretraining
UniTable: Towards a Unified Framework for Table Recognition via Self-Supervised Pretraining
Sheng-Hsuan Peng
Aishwarya Chakravarthy
Seongmin Lee
Xiaojing Wang
Rajarajeswari Balasubramaniyan
Duen Horng Chau
LMTD
78
1
0
07 Mar 2024
Deep-Learned Compression for Radio-Frequency Signal Classification
Deep-Learned Compression for Radio-Frequency Signal Classification
Armani Rodriguez
Yagna Kaasaragadda
S. Kokalj-Filipovic
54
1
0
05 Mar 2024
VQSynery: Robust Drug Synergy Prediction With Vector Quantization
  Mechanism
VQSynery: Robust Drug Synergy Prediction With Vector Quantization Mechanism
Jiawei Wu
Mingyuan Yan
Dianbo Liu
65
2
0
05 Mar 2024
Modality-Aware and Shift Mixer for Multi-modal Brain Tumor Segmentation
Modality-Aware and Shift Mixer for Multi-modal Brain Tumor Segmentation
Zhongzhen Huang
Linda Wei
Shaoting Zhang
Xiaofan Zhang
155
0
0
04 Mar 2024
CET2: Modelling Topic Transitions for Coherent and Engaging
  Knowledge-Grounded Conversations
CET2: Modelling Topic Transitions for Coherent and Engaging Knowledge-Grounded Conversations
Lin Xu
Qixian Zhou
Jinlan Fu
See-Kiong Ng
70
0
0
04 Mar 2024
Improving out-of-distribution generalization in graphs via hierarchical
  semantic environments
Improving out-of-distribution generalization in graphs via hierarchical semantic environments
Yinhua Piao
Sangseon Lee
Yijingxiu Lu
Sun Kim
OOD
87
2
0
04 Mar 2024
Neural Graph Generator: Feature-Conditioned Graph Generation using
  Latent Diffusion Models
Neural Graph Generator: Feature-Conditioned Graph Generation using Latent Diffusion Models
Iakovos Evdaimon
Giannis Nikolentzos
Michail Chatzianastasis
Hadi Abdine
Michalis Vazirgiannis
DiffM
64
4
0
03 Mar 2024
LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth
  Limited Optical Signal Acquisition
LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition
Lingfeng Liu
Dong Ni
Hangjie Yuan
ViT
95
0
0
03 Mar 2024
Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral
  Pedestrian Detection
Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection
Taeheon Kim
Sebin Shin
Youngjoon Yu
Hak Gu Kim
Y. Ro
80
7
0
02 Mar 2024
Accelerating Greedy Coordinate Gradient via Probe Sampling
Accelerating Greedy Coordinate Gradient via Probe Sampling
Yiran Zhao
Wenyue Zheng
Tianle Cai
Xuan Long Do
Kenji Kawaguchi
Anirudh Goyal
Michael Shieh
96
2
0
02 Mar 2024
Hierarchical Indexing for Retrieval-Augmented Opinion Summarization
Hierarchical Indexing for Retrieval-Augmented Opinion Summarization
Tom Hosking
Hao Tang
Mirella Lapata
104
5
0
01 Mar 2024
Automated Efficient Estimation using Monte Carlo Efficient Influence
  Functions
Automated Efficient Estimation using Monte Carlo Efficient Influence Functions
Raj Agrawal
Sam Witty
Andy Zane
Eli Bingham
104
2
0
29 Feb 2024
Multi-objective Differentiable Neural Architecture Search
Multi-objective Differentiable Neural Architecture Search
R. Sukthanker
Arber Zela
B. Staffler
Samuel Dooley
Josif Grabocka
Frank Hutter
171
1
0
28 Feb 2024
Downstream Task Guided Masking Learning in Masked Autoencoders Using Multi-Level Optimization
Downstream Task Guided Masking Learning in Masked Autoencoders Using Multi-Level Optimization
Han Guo
Ramtin Hosseini
Ruiyi Zhang
Sai Ashish Somayajula
Ranak Roy Chowdhury
Rajesh K. Gupta
Pengtao Xie
98
0
0
28 Feb 2024
MCF-VC: Mitigate Catastrophic Forgetting in Class-Incremental Learning
  for Multimodal Video Captioning
MCF-VC: Mitigate Catastrophic Forgetting in Class-Incremental Learning for Multimodal Video Captioning
Huiyu Xiong
Lanxiao Wang
Heqian Qiu
Taijin Zhao
Benliu Qiu
Hongliang Li
CLL
86
1
0
27 Feb 2024
FaultProfIT: Hierarchical Fault Profiling of Incident Tickets in
  Large-scale Cloud Systems
FaultProfIT: Hierarchical Fault Profiling of Incident Tickets in Large-scale Cloud Systems
Junjie Huang
Jinyang Liu
Zhuangbin Chen
Zhihan Jiang
Yichen Li
Jiazhen Gu
Cong Feng
Zengyin Yang
Yongqiang Yang
Michael R. Lyu
56
10
0
27 Feb 2024
Previous
123...8910...596061
Next