ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1903.06059
  4. Cited By
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for
  Sampling Sequences Without Replacement

Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement

14 March 2019
W. Kool
H. V. Hoof
Max Welling
ArXivPDFHTML

Papers citing "Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement"

50 / 51 papers shown
Title
Variational Prefix Tuning for Diverse and Accurate Code Summarization Using Pre-trained Language Models
Variational Prefix Tuning for Diverse and Accurate Code Summarization Using Pre-trained Language Models
Junda Zhao
Yuliang Song
Eldan Cohen
21
0
0
14 May 2025
Optimizing Temperature for Language Models with Multi-Sample Inference
Optimizing Temperature for Language Models with Multi-Sample Inference
Weihua Du
Yiming Yang
Sean Welleck
62
2
0
07 Feb 2025
FedGrAINS: Personalized SubGraph Federated Learning with Adaptive Neighbor Sampling
FedGrAINS: Personalized SubGraph Federated Learning with Adaptive Neighbor Sampling
Emir Ceyani
Han Xie
Baturalp Buyukates
Carl Yang
Salman Avestimehr
FedML
97
0
0
22 Jan 2025
GraphXForm: Graph transformer for computer-aided molecular design
GraphXForm: Graph transformer for computer-aided molecular design
Jonathan Pirnay
Jan G. Rittig
Alexander B. Wolf
Martin Grohe
Jakob Burger
Alexander Mitsos
D. G. Grimm
AI4CE
58
1
0
03 Nov 2024
Self-evolving Agents with reflective and memory-augmented abilities
Self-evolving Agents with reflective and memory-augmented abilities
Xuechen Liang
Yangfan He
Yinghui Xia
Xinyuan Song
Jianhui Wang
...
Keqin Li
Jiaqi Chen
Jinsong Yang
Siyuan Chen
Tianyu Shi
LLMAG
KELM
CLL
41
2
0
01 Sep 2024
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
Zanlin Ni
Yulin Wang
Renping Zhou
Rui Lu
Jiayi Guo
Jinyi Hu
Zhiyuan Liu
Yuan Yao
Gao Huang
37
7
0
31 Aug 2024
Coupling without Communication and Drafter-Invariant Speculative Decoding
Coupling without Communication and Drafter-Invariant Speculative Decoding
Majid Daliri
Christopher Musco
A. Suresh
51
1
0
15 Aug 2024
Informed Correctors for Discrete Diffusion Models
Informed Correctors for Discrete Diffusion Models
Yixiu Zhao
Jiaxin Shi
Lester W. Mackey
Scott W. Linderman
Lester Mackey
Scott Linderman
54
9
0
30 Jul 2024
Towards Model-Agnostic Posterior Approximation for Fast and Accurate
  Variational Autoencoders
Towards Model-Agnostic Posterior Approximation for Fast and Accurate Variational Autoencoders
Yaniv Yacoby
Weiwei Pan
Finale Doshi-Velez
DRL
29
0
0
13 Mar 2024
Effective Bayesian Causal Inference via Structural Marginalisation and Autoregressive Orders
Effective Bayesian Causal Inference via Structural Marginalisation and Autoregressive Orders
Christian Toth
Christian Knoll
Franz Pernkopf
Robert Peharz
CML
52
1
0
22 Feb 2024
Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding
Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding
Yuu Jinnai
Ukyo Honda
Tetsuro Morimura
Peinan Zhang
34
6
0
10 Jan 2024
Unsupervised Extractive Summarization with Learnable Length Control
  Strategies
Unsupervised Extractive Summarization with Learnable Length Control Strategies
Renlong Jie
Xiaojun Meng
Xin Jiang
Qun Liu
32
1
0
12 Dec 2023
Everything Perturbed All at Once: Enabling Differentiable Graph Attacks
Everything Perturbed All at Once: Enabling Differentiable Graph Attacks
Haoran Liu
Bokun Wang
Jianling Wang
Xiangjue Dong
Tianbao Yang
James Caverlee
AAML
36
3
0
29 Aug 2023
Order-based Structure Learning with Normalizing Flows
Order-based Structure Learning with Normalizing Flows
Hamidreza Kamkari
Vahid Balazadeh Meresht
Vahid Zehtab
Rahul G. Krishnan
CML
33
1
0
14 Aug 2023
Policy-Based Self-Competition for Planning Problems
Policy-Based Self-Competition for Planning Problems
Jonathan Pirnay
Q. Göttl
Jakob Burger
D. G. Grimm
36
3
0
07 Jun 2023
Multi-Start Team Orienteering Problem for UAS Mission Re-Planning with
  Data-Efficient Deep Reinforcement Learning
Multi-Start Team Orienteering Problem for UAS Mission Re-Planning with Data-Efficient Deep Reinforcement Learning
Dong Ho Lee
Jaemyung Ahn
22
6
0
02 Mar 2023
A Reparameterized Discrete Diffusion Model for Text Generation
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
41
57
0
11 Feb 2023
Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text
  Generation via Concentrating Attention
Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention
Wenhao Li
Xiaoyuan Yi
Jinyi Hu
Maosong Sun
Xing Xie
44
0
0
14 Nov 2022
Eliciting Knowledge from Large Pre-Trained Models for Unsupervised
  Knowledge-Grounded Conversation
Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded Conversation
Yanyang Li
Jianqiao Zhao
M. Lyu
Liwei Wang
24
15
0
03 Nov 2022
Truncation Sampling as Language Model Desmoothing
Truncation Sampling as Language Model Desmoothing
John Hewitt
Christopher D. Manning
Percy Liang
BDL
44
76
0
27 Oct 2022
Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models
Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models
Luke Vilnis
Yury Zemlyanskiy
Patrick C. Murray
Alexandre Passos
Sumit Sanghai
62
9
0
18 Oct 2022
Is margin all you need? An extensive empirical study of active learning
  on tabular data
Is margin all you need? An extensive empirical study of active learning on tabular data
Dara Bahri
Heinrich Jiang
Tal Schuster
Afshin Rostamizadeh
LMTD
51
10
0
07 Oct 2022
SIMPLE: A Gradient Estimator for $k$-Subset Sampling
SIMPLE: A Gradient Estimator for kkk-Subset Sampling
Kareem Ahmed
Zhe Zeng
Mathias Niepert
Mathias Niepert
BDL
48
24
0
04 Oct 2022
Optimal Transport-based Identity Matching for Identity-invariant Facial
  Expression Recognition
Optimal Transport-based Identity Matching for Identity-invariant Facial Expression Recognition
D. Kim
B. Song
OT
40
10
0
25 Sep 2022
Variational Open-Domain Question Answering
Variational Open-Domain Question Answering
Valentin Liévin
Andreas Geert Motzfeldt
Ida Riis Jensen
Ole Winther
OOD
BDL
36
8
0
23 Sep 2022
Sparse Graph Learning from Spatiotemporal Time Series
Sparse Graph Learning from Spatiotemporal Time Series
Andrea Cini
Daniele Zambon
Cesare Alippi
CML
AI4TS
43
18
0
26 May 2022
Unbiased and Efficient Sampling of Dependency Trees
Unbiased and Efficient Sampling of Dependency Trees
Milovs Stanojević
25
3
0
25 May 2022
A Call for Clarity in Beam Search: How It Works and When It Stops
A Call for Clarity in Beam Search: How It Works and When It Stops
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
26
6
0
11 Apr 2022
Learning Group Importance using the Differentiable Hypergeometric
  Distribution
Learning Group Importance using the Differentiable Hypergeometric Distribution
Thomas M. Sutter
Laura Manduchi
Alain Ryser
Julia E. Vogt
44
7
0
03 Mar 2022
Accelerated Intravascular Ultrasound Imaging using Deep Reinforcement
  Learning
Accelerated Intravascular Ultrasound Imaging using Deep Reinforcement Learning
Tristan S. W. Stevens
Nishith Chennakeshava
F. D. Bruijn
M. Pekar
Ruud J. G. van Sloun
26
4
0
24 Jan 2022
Towards Interactive Language Modeling
Towards Interactive Language Modeling
Maartje ter Hoeve
Evgeny Kharitonov
Dieuwke Hupkes
Emmanuel Dupoux
26
4
0
14 Dec 2021
Leveraging Recursive Gumbel-Max Trick for Approximate Inference in
  Combinatorial Spaces
Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces
Kirill Struminsky
Artyom Gadetsky
D. Rakitin
Danil Karpushkin
Dmitry Vetrov
BDL
27
9
0
28 Oct 2021
A Review of the Gumbel-max Trick and its Extensions for Discrete
  Stochasticity in Machine Learning
A Review of the Gumbel-max Trick and its Extensions for Discrete Stochasticity in Machine Learning
Iris A. M. Huijben
W. Kool
Max B. Paulus
Ruud J. G. van Sloun
28
94
0
04 Oct 2021
Conditional Poisson Stochastic Beam Search
Conditional Poisson Stochastic Beam Search
Clara Meister
Afra Amini
Tim Vieira
Ryan Cotterell
24
10
0
22 Sep 2021
Pixel Contrastive-Consistent Semi-Supervised Semantic Segmentation
Pixel Contrastive-Consistent Semi-Supervised Semantic Segmentation
Yuanyi Zhong
Bodi Yuan
Hong Wu
Zhiqiang Yuan
Jian Peng
Yu-xiong Wang
38
167
0
20 Aug 2021
Differentiable Subset Pruning of Transformer Heads
Differentiable Subset Pruning of Transformer Heads
Jiaoda Li
Ryan Cotterell
Mrinmaya Sachan
45
53
0
10 Aug 2021
Straight to the Gradient: Learning to Use Novel Tokens for Neural Text
  Generation
Straight to the Gradient: Learning to Use Novel Tokens for Neural Text Generation
Xiang Lin
Simeng Han
Chenyu You
20
24
0
14 Jun 2021
NWT: Towards natural audio-to-video generation with representation
  learning
NWT: Towards natural audio-to-video generation with representation learning
Rayhane Mama
Marc S. Tyndel
Hashiam Kadhim
Cole Clifford
Ragavan Thurairatnam
VGen
29
12
0
08 Jun 2021
Storchastic: A Framework for General Stochastic Automatic
  Differentiation
Storchastic: A Framework for General Stochastic Automatic Differentiation
Emile van Krieken
Jakub M. Tomczak
A. T. Teije
ODL
OffRL
31
15
0
01 Apr 2021
Argmax Flows and Multinomial Diffusion: Learning Categorical
  Distributions
Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions
Emiel Hoogeboom
Didrik Nielsen
P. Jaini
Patrick Forré
Max Welling
DiffM
207
395
0
10 Feb 2021
Multi-Decoder Attention Model with Embedding Glimpse for Solving Vehicle
  Routing Problems
Multi-Decoder Attention Model with Embedding Glimpse for Solving Vehicle Routing Problems
Liang Xin
Wen Song
Zhiguang Cao
Jie Zhang
31
147
0
19 Dec 2020
Computer-Generated Music for Tabletop Role-Playing Games
Computer-Generated Music for Tabletop Role-Playing Games
Lucas N. Ferreira
Levi H. S. Lelis
E. Whitehead
32
41
0
16 Aug 2020
Natural Language Processing Advancements By Deep Learning: A Survey
Natural Language Processing Advancements By Deep Learning: A Survey
A. Torfi
Rouzbeh A. Shirvani
Yaser Keneshloo
Nader Tavvaf
Edward A. Fox
AI4CE
VLM
83
215
0
02 Mar 2020
Predictive Sampling with Forecasting Autoregressive Models
Predictive Sampling with Forecasting Autoregressive Models
Auke Wiggers
Emiel Hoogeboom
BDL
27
16
0
23 Feb 2020
Incremental Sampling Without Replacement for Sequence Models
Incremental Sampling Without Replacement for Sequence Models
Kensen Shi
David Bieber
Charles Sutton
VLM
SyDa
BDL
21
24
0
21 Feb 2020
Differentiable Top-k Operator with Optimal Transport
Differentiable Top-k Operator with Optimal Transport
Yujia Xie
H. Dai
Minshuo Chen
Bo Dai
T. Zhao
H. Zha
Wei Wei
Tomas Pfister
OT
32
27
0
16 Feb 2020
Estimating Gradients for Discrete Random Variables by Sampling without
  Replacement
Estimating Gradients for Discrete Random Variables by Sampling without Replacement
W. Kool
H. V. Hoof
Max Welling
BDL
31
49
0
14 Feb 2020
Adversarial Filters of Dataset Biases
Adversarial Filters of Dataset Biases
Ronan Le Bras
Swabha Swayamdipta
Chandra Bhagavatula
Rowan Zellers
Matthew E. Peters
Ashish Sabharwal
Yejin Choi
36
220
0
10 Feb 2020
Paraphrase Generation with Latent Bag of Words
Paraphrase Generation with Latent Bag of Words
Yao Fu
Yansong Feng
John P. Cunningham
BDL
25
91
0
07 Jan 2020
Classical Structured Prediction Losses for Sequence to Sequence Learning
Classical Structured Prediction Losses for Sequence to Sequence Learning
Sergey Edunov
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
AIMat
56
185
0
14 Nov 2017
12
Next