2505.19815
Cited By
Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective
26 May 2025
Junnan Liu
Hongwei Liu
Linchen Xiao
Shudong Liu
Taolin Zhang
Zihan Ma
Songyang Zhang
Kai Chen
Author Contacts:
zhangsongyang@pjlab.org.cn
chenkai@pjlab.org.cn
LRM
Papers citing
"Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective"
8 / 58 papers shown
Title
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
94,891
0
11 Oct 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
499
19,065
0
20 Jul 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
701
131,652
0
12 Jun 2017
Prototypical Networks for Few-shot Learning
Jake C. Snell
Kevin Swersky
R. Zemel
300
8,134
0
15 Mar 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
823
11,909
0
09 Mar 2017
Learning to learn by gradient descent by gradient descent
Marcin Andrychowicz
Misha Denil
Sergio Gomez Colmenarejo
Matthew W. Hoffman
David Pfau
Tom Schaul
Brendan Shillingford
Nando de Freitas
110
2,006
0
14 Jun 2016
Memory Networks
Jason Weston
S. Chopra
Antoine Bordes
GNN
KELM
147
1,706
0
15 Oct 2014
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
127
12,231
0
19 Dec 2013