ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.08484
  4. Cited By
Self-Play Q-learners Can Provably Collude in the Iterated Prisoner's Dilemma
v1v2 (latest)

Self-Play Q-learners Can Provably Collude in the Iterated Prisoner's Dilemma

13 December 2023
Quentin Bertrand
Juan Agustin Duque
Emilio Calvano
Gauthier Gidel
ArXiv (abs)PDFHTML

Papers citing "Self-Play Q-learners Can Provably Collude in the Iterated Prisoner's Dilemma"

1 / 1 papers shown
Title
Advantage Alignment Algorithms
Advantage Alignment Algorithms
Juan Agustin Duque
Milad Aghajohari
Tim Cooijmans
Tianyu Zhang
Rameswar Panda
Gauthier Gidel
Aaron Courville
86
2
0
20 Jun 2024
1