ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.12491
  4. Cited By
Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse Reinforcement Learning
v1v2 (latest)

Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse Reinforcement Learning

16 October 2024
Jared Joselowitz
Arjun Jagota
Satyapriya Krishna
Sonali Parbhoo
Nyal Patel
Satyapriya Krishna
Sonali Parbhoo
ArXiv (abs)PDFHTML

Papers citing "Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse Reinforcement Learning"

1 / 1 papers shown
Title
Effective Red-Teaming of Policy-Adherent Agents
Effective Red-Teaming of Policy-Adherent Agents
Itay Nakash
George Kour
Koren Lazar
Matan Vetzler
Guy Uziel
Ateret Anaby-Tavor
AAML
95
0
0
11 Jun 2025
1