arXiv:2506.11108
History-Aware Cross-Attention Reinforcement: Self-Supervised Multi Turn and Chain-of-Thought Fine-Tuning with vLLM
8 June 2025
Andrew Kiruluta
Andreas Lemos
Priscilla Burity
LRM
Papers citing "History-Aware Cross-Attention Reinforcement: Self-Supervised Multi Turn and Chain-of-Thought Fine-Tuning with vLLM": no papers