Fast Best-of-N Decoding via Speculative Rejection

26 October 2024
Hanshi Sun
Momin Haider
Ruiqi Zhang
Huitao Yang
Jiahao Qiu
Ming Yin
Mengdi Wang
Peter L. Bartlett
Andrea Zanette
    BDL

Papers citing "Fast Best-of-N Decoding via Speculative Rejection"

24 / 24 papers shown
$\texttt{SPECS}$: Faster Test-Time Scaling through Speculative Drafts
Mert Cemri
Nived Rajaraman
Rishabh Tiwari
Xiaoxuan Liu
Kurt Keutzer
Ion Stoica
Kannan Ramchandran
Ahmad Beirami
Ziteng Sun
LRM
12
0
0
15 Jun 2025
Augmenting Large Language Models with Static Code Analysis for Automated Code Quality Improvements
Seyed Moein Abtahi
Akramul Azim
119
0
0
12 Jun 2025
Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning
Xiangning Yu
Zhuohan Wang
Linyi Yang
Haoxuan Li
Anjie Liu
Xiao Xue
Jun Wang
Mengyue Yang
ReLM LRM ELM
77
0
0
11 Jun 2025
From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?
Zhanke Zhou
Xiao Feng
Zhaocheng Zhu
Jiangchao Yao
Sanmi Koyejo
Bo Han
LRM
15
0
0
09 Jun 2025
How Far Are We from Optimal Reasoning Efficiency?
Jiaxuan Gao
Shu Yan
Qixin Tan
Lu Yang
Shusheng Xu
Wei Fu
Zhiyu Mei
Kaifeng Lyu
Yi Wu
LRM
19
0
0
08 Jun 2025
Kinetics: Rethinking Test-Time Scaling Laws
Ranajoy Sadhukhan
Zhuoming Chen
Haizhong Zheng
Yang Zhou
Emma Strubell
Beidi Chen
103
0
0
05 Jun 2025
AdaRewriter: Unleashing the Power of Prompting-based Conversational Query Reformulation via Test-Time Adaptation
Yilong Lai
Jialong Wu
Zhenglin Wang
Deyu Zhou
47
0
0
02 Jun 2025
Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition
Hanting Chen
Yasheng Wang
Kai Han
Dong Li
Lin Li
...
Hailin Hu
Yehui Tang
Dacheng Tao
Xinghao Chen
Yunhe Wang
LRM
93
0
0
28 May 2025
Pretraining Language Models to Ponder in Continuous Space
Boyi Zeng
Shixiang Song
Siyuan Huang
Yixuan Wang
He Li
Ziwei He
Xinbing Wang
Zhiyu Li
Zhouhan Lin
LRM
83
0
0
27 May 2025
System-1.5 Reasoning: Traversal in Language and Latent Spaces with Dynamic Shortcuts
Xiaoqiang Wang
Suyuchen Wang
Yun Zhu
Bang Liu
ReLM LRM
117
0
0
25 May 2025
ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning
Ziqing Qiao
Yongheng Deng
Jiali Zeng
Dong Wang
Lai Wei
Fandong Meng
Jie Zhou
Ju Ren
Yaoxue Zhang
LRM
112
3
0
08 May 2025
Semantic Probabilistic Control of Language Models
Kareem Ahmed
Catarina G Belém
Padhraic Smyth
Sameer Singh
115
1
0
04 May 2025
Efficient Reasoning Models: A Survey
Sicheng Feng
Gongfan Fang
Xinyin Ma
Xinchao Wang
ReLM LRM
416
13
0
15 Apr 2025
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Yang Sui
Yu-Neng Chuang
Guanchu Wang
Jiamu Zhang
Tianyi Zhang
...
Hongyi Liu
Andrew Wen
Shaochen Zhong
Hanjie Chen
OffRL ReLM LRM
202
100
0
20 Mar 2025
DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models
Yi Shen
Jing Zhang
Jieyun Huang
Shuming Shi
Wenjing Zhang
Jiangze Yan
Rongjia Du
Ning Wang
Kai Wang
Shiguo Lian
LRM
132
54
0
06 Mar 2025
Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?
Zhiyuan Zeng
Qinyuan Cheng
Zhangyue Yin
Yunhua Zhou
Xipeng Qiu
LRM
178
20
0
17 Feb 2025
InSTA: Towards Internet-Scale Training For Agents
Brandon Trabucco
Gunnar Sigurdsson
Robinson Piramuthu
Ruslan Salakhutdinov
ALM
193
4
0
10 Feb 2025
Iterative Deepening Sampling as Efficient Test-Time Scaling
Weizhe Chen
Sven Koenig
B. Dilkina
LRM ReLM
151
1
0
08 Feb 2025
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
Hanshi Sun
Li-Wen Chang
Yiyuan Ma
Wenlei Bao
Ningxin Zheng
Xin Liu
Harry Dong
Yuejie Chi
Beidi Chen
VLM
165
21
0
28 Oct 2024
Variational Best-of-N Alignment
Afra Amini
Tim Vieira
Ryan Cotterell
BDL
105
23
0
08 Jul 2024
Cascade Reward Sampling for Efficient Decoding-Time Alignment
Bolian Li
Yifan Wang
A. Grama
Ruqi Zhang
AI4TS
138
15
0
24 Jun 2024
DPO Meets PPO: Reinforced Token Optimization for RLHF
Han Zhong
Zikang Shan
Guhao Feng
Wei Xiong
Xinle Cheng
Li Zhao
Di He
Jiang Bian
Liwei Wang
155
72
0
29 Apr 2024
Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators
Yann Dubois
Balázs Galambosi
Percy Liang
Tatsunori Hashimoto
ALM
167
403
0
06 Apr 2024
Theoretical guarantees on the best-of-n alignment policy
Ahmad Beirami
Alekh Agarwal
Jonathan Berant
Alex D'Amour
Jacob Eisenstein
Chirag Nagpal
A. Suresh
124
60
0
03 Jan 2024