ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.01144
  4. Cited By
Categorical Reparameterization with Gumbel-Softmax

Categorical Reparameterization with Gumbel-Softmax

3 November 2016
Eric Jang
S. Gu
Ben Poole
    BDL
ArXivPDFHTML

Papers citing "Categorical Reparameterization with Gumbel-Softmax"

50 / 1,152 papers shown
Title
Learning Slice-Aware Representations with Mixture of Attentions
Learning Slice-Aware Representations with Mixture of Attentions
Cheng Wang
Sungjin Lee
Sunghyun Park
Han Li
Young-Bum Kim
R. Sarikaya
29
2
0
04 Jun 2021
BERTTune: Fine-Tuning Neural Machine Translation with BERTScore
BERTTune: Fine-Tuning Neural Machine Translation with BERTScore
Inigo Jauregi Unanue
Jacob Parnell
Massimo Piccardi
26
32
0
04 Jun 2021
DynamicViT: Efficient Vision Transformers with Dynamic Token
  Sparsification
DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
Yongming Rao
Wenliang Zhao
Benlin Liu
Jiwen Lu
Jie Zhou
Cho-Jui Hsieh
ViT
34
670
0
03 Jun 2021
Implicit MLE: Backpropagating Through Discrete Exponential Family
  Distributions
Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions
Mathias Niepert
Pasquale Minervini
Luca Franceschi
32
81
0
03 Jun 2021
Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion
  Detection
Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection
Lixing Zhu
Gabriele Pergola
Lin Gui
Deyu Zhou
Yulan He
49
145
0
02 Jun 2021
The Out-of-Distribution Problem in Explainability and Search Methods for
  Feature Importance Explanations
The Out-of-Distribution Problem in Explainability and Search Methods for Feature Importance Explanations
Peter Hase
Harry Xie
Joey Tianyi Zhou
OODD
LRM
FAtt
29
91
0
01 Jun 2021
Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit
  Assignment
Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment
Tianze Zhou
Fubiao Zhang
Kun Shao
Kai Li
Wenhan Huang
...
Hangyu Mao
Bin Wang
Dong Li
Wulong Liu
Jianye Hao
37
16
0
01 Jun 2021
Text Summarization with Latent Queries
Text Summarization with Latent Queries
Yumo Xu
Mirella Lapata
RALM
HILM
28
10
0
31 May 2021
CogView: Mastering Text-to-Image Generation via Transformers
CogView: Mastering Text-to-Image Generation via Transformers
Ming Ding
Zhuoyi Yang
Wenyi Hong
Wendi Zheng
Chang Zhou
...
Junyang Lin
Xu Zou
Zhou Shao
Hongxia Yang
Jie Tang
ViT
VLM
54
766
0
26 May 2021
EXoN: EXplainable encoder Network
EXoN: EXplainable encoder Network
SeungHwan An
Hosik Choi
Jong-June Jeon
BDL
DRL
34
5
0
23 May 2021
Multi-Agent Deep Reinforcement Learning using Attentive Graph Neural
  Architectures for Real-Time Strategy Games
Multi-Agent Deep Reinforcement Learning using Attentive Graph Neural Architectures for Real-Time Strategy Games
Won Joon Yun
Sungwon Yi
Joongheon Kim
20
10
0
21 May 2021
Correlated Input-Dependent Label Noise in Large-Scale Image
  Classification
Correlated Input-Dependent Label Noise in Large-Scale Image Classification
Mark Collier
Basil Mustafa
Efi Kokiopoulou
Rodolphe Jenatton
Jesse Berent
NoLa
184
53
0
19 May 2021
Dependent Multi-Task Learning with Causal Intervention for Image
  Captioning
Dependent Multi-Task Learning with Causal Intervention for Image Captioning
Wenqing Chen
Jidong Tian
Caoyun Fan
Hao He
Yaohui Jin
CML
27
6
0
18 May 2021
Parallel Attention Network with Sequence Matching for Video Grounding
Parallel Attention Network with Sequence Matching for Video Grounding
Hao Zhang
Aixin Sun
Wei Jing
Liangli Zhen
Qiufeng Wang
Rick Siow Mong Goh
23
40
0
18 May 2021
Semi-Supervised Variational Reasoning for Medical Dialogue Generation
Semi-Supervised Variational Reasoning for Medical Dialogue Generation
Dongdong Li
Zhaochun Ren
Pengjie Ren
Zhumin Chen
M. Fan
Jun Ma
Maarten de Rijke
BDL
DRL
OffRL
MedIm
32
48
0
13 May 2021
Connecting What to Say With Where to Look by Modeling Human Attention
  Traces
Connecting What to Say With Where to Look by Modeling Human Attention Traces
Zihang Meng
Licheng Yu
Ning Zhang
Tamara L. Berg
Babak Damavandi
Vikas Singh
Amy Bearman
40
25
0
12 May 2021
MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation
MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation
Ahmad Rashid
Vasileios Lioutas
Mehdi Rezagholizadeh
AAML
13
36
0
12 May 2021
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
Yikang Shen
Chun-Fu Chen
Quanfu Fan
Ximeng Sun
Kate Saenko
A. Oliva
Rogerio Feris
38
47
0
11 May 2021
Rationalization through Concepts
Rationalization through Concepts
Diego Antognini
Boi Faltings
FAtt
27
19
0
11 May 2021
SUPR-GAN: SUrgical PRediction GAN for Event Anticipation in Laparoscopic
  and Robotic Surgery
SUPR-GAN: SUrgical PRediction GAN for Event Anticipation in Laparoscopic and Robotic Surgery
Yutong Ban
Guy Rosman
J. Eckhoff
Thomas M. Ward
Daniel A. Hashimoto
Taisei Kondo
Hidekazu Iwaki
O. Meireles
Daniela Rus
22
13
0
10 May 2021
gComm: An environment for investigating generalization in Grounded
  Language Acquisition
gComm: An environment for investigating generalization in Grounded Language Acquisition
Rishi Hazra
Sonu Dixit
31
0
0
09 May 2021
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise
  Rollouts
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts
Weinan Zhang
Xihuai Wang
Jian Shen
Ming Zhou
27
35
0
07 May 2021
Walk in the Cloud: Learning Curves for Point Clouds Shape Analysis
Walk in the Cloud: Learning Curves for Point Clouds Shape Analysis
Tiange Xiang
Chaoyi Zhang
Yang Song
Jianhui Yu
Weidong (Tom) Cai
3DPC
151
283
0
04 May 2021
Recovering Barabási-Albert Parameters of Graphs through
  Disentanglement
Recovering Barabási-Albert Parameters of Graphs through Disentanglement
Cristina Guzman
Daphna Keidar
Tristan Meynier
Andreas Opedal
Niklas Stoehr
21
0
0
03 May 2021
Effective Sparsification of Neural Networks with Global Sparsity
  Constraint
Effective Sparsification of Neural Networks with Global Sparsity Constraint
Xiao Zhou
Weizhong Zhang
Hang Xu
Tong Zhang
21
61
0
03 May 2021
Unsupervised Layered Image Decomposition into Object Prototypes
Unsupervised Layered Image Decomposition into Object Prototypes
Tom Monnier
Elliot Vincent
Jean Ponce
Mathieu Aubry
OCL
16
53
0
29 Apr 2021
MarioNette: Self-Supervised Sprite Learning
MarioNette: Self-Supervised Sprite Learning
Dmitriy Smirnov
Michael Gharbi
Matthew Fisher
Vitor Campagnolo Guizilini
Alexei A. Efros
Justin Solomon
SSL
OCL
77
37
0
29 Apr 2021
Graph Decoupling Attention Markov Networks for Semi-supervised Graph
  Node Classification
Graph Decoupling Attention Markov Networks for Semi-supervised Graph Node Classification
Jie Chen
Shouzhen Chen
Mingyuan Bai
Jian Pu
Junping Zhang
Junbin Gao
39
21
0
28 Apr 2021
Text Generation with Deep Variational GAN
Text Generation with Deep Variational GAN
M. Hossam
Trung Le
Michael Papasimeon
Viet Huynh
Dinh Q. Phung
GAN
DRL
25
5
0
27 Apr 2021
Heterogeneous-Agent Trajectory Forecasting Incorporating Class
  Uncertainty
Heterogeneous-Agent Trajectory Forecasting Incorporating Class Uncertainty
Boris Ivanovic
Kuan-Hui Lee
P. Tokmakov
Blake Wulfe
R. McAllister
Adrien Gaidon
Marco Pavone
24
35
0
26 Apr 2021
Framing Unpacked: A Semi-Supervised Interpretable Multi-View Model of
  Media Frames
Framing Unpacked: A Semi-Supervised Interpretable Multi-View Model of Media Frames
Shima Khanehzar
Trevor Cohn
Gosia Mikołajczak
A. Turpin
Lea Frermann
22
11
0
22 Apr 2021
Deep Learning for Click-Through Rate Estimation
Deep Learning for Click-Through Rate Estimation
Weinan Zhang
Jiarui Qin
Wei Guo
Ruiming Tang
Xiuqiang He
3DV
HAI
33
110
0
21 Apr 2021
Differentiable Model Compression via Pseudo Quantization Noise
Differentiable Model Compression via Pseudo Quantization Noise
Alexandre Défossez
Yossi Adi
Gabriel Synnaeve
DiffM
MQ
26
48
0
20 Apr 2021
On the Influence of Masking Policies in Intermediate Pre-training
On the Influence of Masking Policies in Intermediate Pre-training
Qinyuan Ye
Belinda Z. Li
Sinong Wang
Benjamin Bolte
Hao Ma
Wen-tau Yih
Xiang Ren
Madian Khabsa
13
12
0
18 Apr 2021
Geometry-Free View Synthesis: Transformers and no 3D Priors
Geometry-Free View Synthesis: Transformers and no 3D Priors
Robin Rombach
Patrick Esser
Bjorn Ommer
ViT
22
93
0
15 Apr 2021
Gradient-based Adversarial Attacks against Text Transformers
Gradient-based Adversarial Attacks against Text Transformers
Chuan Guo
Alexandre Sablayrolles
Hervé Jégou
Douwe Kiela
SILM
106
230
0
15 Apr 2021
Hierarchical Adaptive Pooling by Capturing High-order Dependency for
  Graph Representation Learning
Hierarchical Adaptive Pooling by Capturing High-order Dependency for Graph Representation Learning
Ning Liu
Songlei Jian
Dongsheng Li
Yiming Zhang
Zhiquan Lai
Hongzuo Xu
39
31
0
13 Apr 2021
Direct Differentiable Augmentation Search
Direct Differentiable Augmentation Search
Aoming Liu
Zehao Huang
Zhiwu Huang
Naiyan Wang
33
33
0
09 Apr 2021
Generating Multi-type Temporal Sequences to Mitigate Class-imbalanced
  Problem
Generating Multi-type Temporal Sequences to Mitigate Class-imbalanced Problem
Lun Jiang
N. S. Sadghiani
Zhuo Tao
Andrew Cohen
27
0
0
07 Apr 2021
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised
  Pre-Training
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
Wei-Ning Hsu
Anuroop Sriram
Alexei Baevski
Tatiana Likhomanenko
Qiantong Xu
...
Jacob Kahn
Ann Lee
R. Collobert
Gabriel Synnaeve
Michael Auli
SSL
25
237
0
02 Apr 2021
Storchastic: A Framework for General Stochastic Automatic
  Differentiation
Storchastic: A Framework for General Stochastic Automatic Differentiation
Emile van Krieken
Jakub M. Tomczak
A. T. Teije
ODL
OffRL
31
15
0
01 Apr 2021
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network
  for Video Reasoning over Traffic Events
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
Li Xu
He Huang
Jun Liu
ViT
LRM
17
83
0
29 Mar 2021
Prototype-based Personalized Pruning
Prototype-based Personalized Pruning
Jang-Hyun Kim
Simyung Chang
Sungrack Yun
Nojun Kwak
28
4
0
25 Mar 2021
Structured Co-reference Graph Attention for Video-grounded Dialogue
Structured Co-reference Graph Attention for Video-grounded Dialogue
Junyeong Kim
Sunjae Yoon
Dahyun Kim
Chang D. Yoo
26
26
0
24 Mar 2021
AdaSGN: Adapting Joint Number and Model Size for Efficient
  Skeleton-Based Action Recognition
AdaSGN: Adapting Joint Number and Model Size for Efficient Skeleton-Based Action Recognition
Lei Shi
Yifan Zhang
Jian Cheng
Hanqing Lu
30
46
0
22 Mar 2021
Grey-box Adversarial Attack And Defence For Sentiment Classification
Grey-box Adversarial Attack And Defence For Sentiment Classification
Ying Xu
Xu Zhong
Antonio Jimeno Yepes
Jey Han Lau
VLM
AAML
16
53
0
22 Mar 2021
QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small
  Object Detection
QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection
Chenhongyi Yang
Zehao Huang
Naiyan Wang
ObjD
32
225
0
16 Mar 2021
Gumbel-Attention for Multi-modal Machine Translation
Gumbel-Attention for Multi-modal Machine Translation
Pengbo Liu
Hailong Cao
Tiejun Zhao
26
23
0
16 Mar 2021
Get Your Vitamin C! Robust Fact Verification with Contrastive Evidence
Get Your Vitamin C! Robust Fact Verification with Contrastive Evidence
Tal Schuster
Adam Fisch
Regina Barzilay
39
228
0
15 Mar 2021
Searching by Generating: Flexible and Efficient One-Shot NAS with
  Architecture Generator
Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator
Sian-Yao Huang
W. Chu
14
25
0
12 Mar 2021
Previous
123...141516...222324
Next