ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.19092
  4. Cited By
Reinforced Latent Reasoning for LLM-based Recommendation

Reinforced Latent Reasoning for LLM-based Recommendation

25 May 2025
Yang Zhang
Wenxin Xu
Xiaoyan Zhao
Wenjie Wang
Fuli Feng
Xiangnan He
Tat-Seng Chua
    OffRL
    LRM
ArXivPDFHTML

Papers citing "Reinforced Latent Reasoning for LLM-based Recommendation"

8 / 8 papers shown
Title
Slow Thinking for Sequential Recommendation
Slow Thinking for Sequential Recommendation
Junjie Zhang
Beichen Zhang
Wenqi Sun
Hongyu Lu
Wayne Xin Zhao
Yu Chen
Ji-Rong Wen
OffRL
LRM
70
1
0
13 Apr 2025
Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation
Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation
Jiakai Tang
Sunhao Dai
Teng Shi
Jun Xu
X. Chen
Wen Chen
Wu Jian
Yuning Jiang
LRM
126
8
0
28 Mar 2025
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
DiJia Su
Hanlin Zhu
Yingchen Xu
Jiantao Jiao
Yuandong Tian
Qinqing Zheng
LRM
85
21
0
05 Feb 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM
VLM
OffRL
AI4TS
LRM
303
1,503
0
22 Jan 2025
Leveraging LLM Reasoning Enhances Personalized Recommender Systems
Leveraging LLM Reasoning Enhances Personalized Recommender Systems
Zhe Wang
Adam Kraft
Long Jin
Chenwei Cai
Anahita Hosseini
Yuhua Ru
Zemin Zhang
Lichan Hong
Ed H. Chi
Xinyang Yi
LRM
51
11
0
22 Jul 2024
Think before you speak: Training Language Models With Pause Tokens
Think before you speak: Training Language Models With Pause Tokens
Sachin Goyal
Ziwei Ji
A. S. Rawat
A. Menon
Sanjiv Kumar
Vaishnavh Nagarajan
LRM
82
114
0
03 Oct 2023
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
616
9,009
0
28 Jan 2022
Variational Dropout and the Local Reparameterization Trick
Variational Dropout and the Local Reparameterization Trick
Diederik P. Kingma
Tim Salimans
Max Welling
BDL
187
1,500
0
08 Jun 2015
1