Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.08152
Cited By
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
15 August 2024
Huajian Xin
Z. Z. Ren
Junxiao Song
Zhihong Shao
Wanjia Zhao
Haocheng Wang
Bo Liu
Liyue Zhang
Xuan Lu
Qiushi Du
Wenjun Gao
Qihao Zhu
Dejian Yang
Zhibin Gou
Z. F. Wu
Fuli Luo
Chong Ruan
AIMat
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search"
18 / 18 papers shown
Title
MPS-Prover: Advancing Stepwise Theorem Proving by Multi-Perspective Search and Data Curation
Zhenwen Liang
Linfeng Song
Yang Li
Tao Yang
Feng Zhang
Haitao Mi
Dong Yu
LRM
12
0
0
16 May 2025
Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving
Qi Liu
Xinhao Zheng
Renqiu Xia
Xingzhi Qi
Qinxiang Cao
Junchi Yan
AIMat
50
0
0
07 May 2025
Benchmarking LLMs' Swarm intelligence
Kai Ruan
Mowen Huang
Ji-Rong Wen
Hao Sun
51
0
0
07 May 2025
CombiBench: Benchmarking LLM Capability for Combinatorial Mathematics
Jiaheng Liu
Xiaohan Lin
Jonas Bayer
Yael Dillies
Weijie Jiang
...
Zhengfeng Yang
J. Zhang
Lihong Zhi
J. Li
Zhengying Liu
104
0
0
06 May 2025
FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models
Zhouliang Yu
Ruotian Peng
Keyi Ding
Y. K. Li
Zhongyuan Peng
...
Huajian Xin
Yifan Jiang
Yandong Wen
Ge Zhang
Weiyang Liu
LRM
128
0
0
05 May 2025
APE-Bench I: Towards File-level Automated Proof Engineering of Formal Math Libraries
Huajian Xin
Luming Li
Xiaoran Jin
Jacques Fleuriot
Wenda Li
AIMat
52
0
0
27 Apr 2025
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models
Wenxuan Huang
Bohan Jia
Zijie Zhai
Shaosheng Cao
Zheyu Ye
Fei Zhao
Zhe Xu
Yao Hu
Shaohui Lin
MU
OffRL
LRM
MLLM
ReLM
VLM
59
41
0
09 Mar 2025
Simplifying Formal Proof-Generating Models with ChatGPT and Basic Searching Techniques
Sangjun Han
Taeil Hur
Youngmi Hur
Kathy Sangkyung Lee
Myungyoon Lee
Hyojae Lim
126
0
0
20 Feb 2025
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning
Yibo Yan
Shen Wang
Jiahao Huo
Jingheng Ye
Zhendong Chu
Xuming Hu
Philip S. Yu
Carla P. Gomes
B. Selman
Qingsong Wen
LRM
127
9
0
05 Feb 2025
Alchemy: Amplifying Theorem-Proving Capability through Symbolic Mutation
Shaonan Wu
Shuai Lu
Y. Gong
Nan Duan
Ping Wei
AIMat
45
0
0
21 Oct 2024
Herald: A Natural Language Annotated Lean 4 Dataset
Guoxiong Gao
Yutong Wang
Jiedong Jiang
Qi Gao
Zihan Qin
Tianyi Xu
Bin Dong
83
3
0
09 Oct 2024
LeanAgent: Lifelong Learning for Formal Theorem Proving
Adarsh Kumarappan
Mo Tiwari
Peiyang Song
Robert Joseph George
Chaowei Xiao
Anima Anandkumar
CLL
LLMAG
LRM
72
8
0
08 Oct 2024
miniCTX: Neural Theorem Proving with (Long-)Contexts
Jiewen Hu
Thomas Zhu
Sean Welleck
AIMat
68
6
0
05 Aug 2024
Natural Language Reinforcement Learning
Xidong Feng
Bo Liu
Mengyue Yang
Ziyan Wang
Girish A. Koushiks
Yali Du
Ying Wen
Jun Wang
OffRL
35
3
0
11 Feb 2024
LEGO-Prover: Neural Theorem Proving with Growing Libraries
Haiming Wang
Huajian Xin
Chuanyang Zheng
Lin Li
Zhengying Liu
...
Enze Xie
Jian Yin
Zhenguo Li
Heng Liao
Xiaodan Liang
LRM
39
63
0
01 Oct 2023
Formal Mathematics Statement Curriculum Learning
Stanislas Polu
Jesse Michael Han
Kunhao Zheng
Mantas Baksys
Igor Babuschkin
Ilya Sutskever
AIMat
84
116
0
03 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
367
8,495
0
28 Jan 2022
Reward-Free Exploration for Reinforcement Learning
Chi Jin
A. Krishnamurthy
Max Simchowitz
Tiancheng Yu
OffRL
112
194
0
07 Feb 2020
1