Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.10428
Cited By
v1
v2
v3
v4 (latest)
Dynamic Chain-of-Thought: Towards Adaptive Deep Reasoning
7 February 2025
Libo Wang
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Dynamic Chain-of-Thought: Towards Adaptive Deep Reasoning"
15 / 15 papers shown
Title
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM
VLM
OffRL
AI4TS
LRM
373
1,692
0
22 Jan 2025
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
Xingyu Chen
Jiahao Xu
Tian Liang
Zhiwei He
Jianhui Pang
...
Zizhuo Zhang
Rui Wang
Zhaopeng Tu
Haitao Mi
Dong Yu
LRM
ReLM
169
168
0
30 Dec 2024
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning
Yuxiang Zhang
Yuqi Yang
Jiangming Shu
Yuhang Wang
Jinlin Xiao
Jitao Sang
ALM
VLM
OffRL
LRM
106
5
0
22 Dec 2024
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?
Zhen Huang
Haoyang Zou
Xuefeng Li
Yixiu Liu
Yuxiang Zheng
Ethan Chern
Shijie Xia
Yiwei Qin
Weizhe Yuan
Pengfei Liu
VLM
106
52
0
25 Nov 2024
O1 Replication Journey: A Strategic Progress Report -- Part 1
Yiwei Qin
Xuefeng Li
Haoyang Zou
Yixiu Liu
Shijie Xia
...
Yixin Ye
Weizhe Yuan
Hector Liu
Yuezun Li
Pengfei Liu
VLM
87
88
0
08 Oct 2024
Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Tianyang Zhong
Zhengliang Liu
Yi Pan
Yutong Zhang
Yifan Zhou
...
Dinggang Shen
Andrea Sikora
Xiaoming Zhai
Dajiang Zhu
Tianming Liu
ReLM
LRM
AI4CE
ELM
VLM
73
99
0
27 Sep 2024
From Text to Transformation: A Comprehensive Review of Large Language Models' Versatility
Pravneet Kaur
Gautam Siddharth Kashyap
Ankit Kumar
Md. Tabrez Nafis
Sandeep Kumar
Vikrant Shokeen
LM&MA
74
55
0
25 Feb 2024
From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities
Chaochao Lu
Chao Qian
Guodong Zheng
Hongxing Fan
Hongzhi Gao
...
Yuxi Chen
Zaibin Zhang
Zhelun Shi
Zhen-fei Yin
Zhipin Wang
49
15
0
26 Jan 2024
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Damai Dai
Chengqi Deng
Chenggang Zhao
R. X. Xu
Huazuo Gao
...
Panpan Huang
Fuli Luo
Chong Ruan
Zhifang Sui
W. Liang
MoE
90
292
0
11 Jan 2024
The Impact of Reasoning Step Length on Large Language Models
Mingyu Jin
Qinkai Yu
Dong Shu
Haiyan Zhao
Wenyue Hua
Yanda Meng
Yongfeng Zhang
Jundong Li
ReLM
LRM
108
102
0
10 Jan 2024
Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective
Guhao Feng
Bohang Zhang
Yuntian Gu
Haotian Ye
Di He
Liwei Wang
LRM
100
248
0
24 May 2023
Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting
Miles Turpin
Julian Michael
Ethan Perez
Sam Bowman
ReLM
LRM
80
431
0
07 May 2023
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
817
9,576
0
28 Jan 2022
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
517
19,065
0
20 Jul 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
713
131,652
0
12 Jun 2017
1