arXiv:2503.11586
Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs with Semantic Space
14 March 2025
Zhiliang Chen
Xinyuan Niu
Chuan-Sheng Foo
Bryan Kian Hsiang Low
Papers citing
"Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs with Semantic Space"
34 / 34 papers shown
Title
Group-robust Sample Reweighting for Subpopulation Shifts via Influence Functions
Rui Qiao
Zhaoxuan Wu
Jingtan Wang
Pang Wei Koh
Bryan Kian Hsiang Low
OOD
94
2
0
10 Mar 2025
InfoQuest: Evaluating Multi-Turn Dialogue Agents for Open-Ended Conversations with Hidden Context
Bryan L. M. de Oliveira
Luana G. B. Martins
Bruno Brandão
Luckeciano C. Melo
ELM
371
1
0
17 Feb 2025
Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks
Gregory Kang Ruey Lau
Wenyang Hu
Diwen Liu
Jizhuo Chen
Szu Hui Ng
Bryan Kian Hsiang Low
LRM
AI4CE
108
8
0
12 Dec 2024
Tree Search for Language Model Agents
Jing Yu Koh
Stephen Marcus McAleer
Daniel Fried
Ruslan Salakhutdinov
LM&Ro
LLMAG
LRM
95
73
0
01 Jul 2024
A Complete Survey on LLM-based AI Chatbots
Sumit Kumar Dam
Choong Seon Hong
Yu Qiao
Chaoning Zhang
77
60
0
17 Jun 2024
Prototypical Reward Network for Data-Efficient RLHF
Jinghan Zhang
Xiting Wang
Yiqiao Jin
Changyu Chen
Xinhao Zhang
Kunpeng Liu
ALM
64
19
0
06 Jun 2024
Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars
Zhaoxuan Wu
Xiaoqiang Lin
Zhongxiang Dai
Wenyang Hu
Yao Shu
See-Kiong Ng
Patrick Jaillet
Bryan Kian Hsiang Low
38
11
0
25 May 2024
DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning
Zijian Zhou
Xiaoqiang Lin
Xinyi Xu
Alok Prakash
Daniela Rus
Bryan Kian Hsiang Low
52
4
0
22 May 2024
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Jinhyuk Lee
Zhuyun Dai
Xiaoqi Ren
Blair Chen
Daniel Cer
...
Aditya Kusupati
Prateek Jain
Siddhartha Reddy Jonnalagadda
Ming-Wei Chang
Iftekhar Naim
RALM
VLM
SyDa
78
46
0
29 Mar 2024
Improving the Reliability of Large Language Models by Leveraging Uncertainty-Aware In-Context Learning
Yuchen Yang
Houqiang Li
Yanfeng Wang
Yu Wang
46
25
0
07 Oct 2023
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models
Andy Zhou
Kai Yan
Michal Shlapentokh-Rothman
Haohan Wang
Yu-Xiong Wang
LRM
AI4CE
LM&Ro
LLMAG
80
210
0
06 Oct 2023
Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers
Xiaoqiang Lin
Zhaoxuan Wu
Zhongxiang Dai
Wenyang Hu
Yao Shu
See-Kiong Ng
Patrick Jaillet
Bryan Kian Hsiang Low
54
11
0
02 Oct 2023
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Tianle Li
Siyuan Zhuang
...
Zi Lin
Eric P. Xing
Joseph E. Gonzalez
Ion Stoica
Hao Zhang
67
213
0
21 Sep 2023
Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
Tianbao Xie
Siheng Zhao
Chen Henry Wu
Yitao Liu
Qian Luo
Victor Zhong
Yanchao Yang
Tao Yu
LM&Ro
72
60
0
20 Sep 2023
Large Language Models as Optimizers
Chengrun Yang
Xuezhi Wang
Yifeng Lu
Hanxiao Liu
Quoc V. Le
Denny Zhou
Xinyun Chen
ODL
63
414
0
07 Sep 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
382
3,942
0
29 May 2023
Reasoning with Language Model is Planning with World Model
Shibo Hao
Yi Gu
Haodi Ma
Joshua Jiahua Hong
Zhen Wang
D. Wang
Zhiting Hu
ReLM
LRM
LLMAG
129
573
0
24 May 2023
Large Language Models as Commonsense Knowledge for Large-Scale Task Planning
Zirui Zhao
W. Lee
David Hsu
LRM
LLMAG
LM&Ro
70
218
0
23 May 2023
Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning
Xiao Yu
Maximillian Chen
Zhou Yu
LLMAG
LM&Ro
88
40
0
23 May 2023
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Shunyu Yao
Dian Yu
Jeffrey Zhao
Izhak Shafran
Thomas Griffiths
Yuan Cao
Karthik Narasimhan
LM&Ro
LRM
AI4CE
136
1,936
0
17 May 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
871
12,916
0
04 Mar 2022
Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration
Desik Rengarajan
G. Vaidya
Akshay Sarvesh
D. Kalathil
S. Shakkottai
OffRL
52
57
0
09 Feb 2022
On the Evolution of the MCTS Upper Confidence Bounds for Trees by Means of Evolutionary Algorithms in the Game of Carcassonne
E. López
G. Simpson
44
6
0
17 Dec 2021
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
56
72
0
04 Jul 2020
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
Dmitry Lepikhin
HyoukJoong Lee
Yuanzhong Xu
Dehao Chen
Orhan Firat
Yanping Huang
M. Krikun
Noam M. Shazeer
Zhifeng Chen
MoE
89
1,162
0
30 Jun 2020
A Simple Framework for Contrastive Learning of Visual Representations
Ting Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
358
18,739
0
13 Feb 2020
Overcoming Limitations of Mixture Density Networks: A Sampling and Fitting Framework for Multimodal Future Prediction
Osama Makansi
Eddy Ilg
Özgün Çiçek
Thomas Brox
106
191
0
09 Jun 2019
Experience Replay for Continual Learning
David Rolnick
Arun Ahuja
Jonathan Richard Schwarz
Timothy Lillicrap
Greg Wayne
CLL
116
1,158
0
28 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.7K
94,770
0
11 Oct 2018
Improving Hearthstone AI by Combining MCTS and Supervised Learning Algorithms
M. Świechowski
T. Tajmajer
Andrzej Janusz
BDL
78
59
0
14 Aug 2018
DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset
Yanran Li
Hui Su
Xiaoyu Shen
Wenjie Li
Ziqiang Cao
Shuzi Niu
55
1,300
0
11 Oct 2017
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Balaji Lakshminarayanan
Alexander Pritzel
Charles Blundell
UQCV
BDL
822
5,811
0
05 Dec 2016
Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models
Ashwin K. Vijayakumar
Michael Cogswell
Ramprasaath R. Selvaraju
Q. Sun
Stefan Lee
David J. Crandall
Dhruv Batra
89
554
0
07 Oct 2016
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
125
12,227
0
19 Dec 2013