ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.21528
  4. Cited By
Preemptive Detection and Steering of LLM Misalignment via Latent Reachability

Preemptive Detection and Steering of LLM Misalignment via Latent Reachability

25 September 2025
Sathwik Karnik
Somil Bansal
    LLMSV
ArXiv (abs)PDFHTML

Papers citing "Preemptive Detection and Steering of LLM Misalignment via Latent Reachability"

2 / 2 papers shown
Title
RepV: Safety-Separable Latent Spaces for Scalable Neurosymbolic Plan Verification
RepV: Safety-Separable Latent Spaces for Scalable Neurosymbolic Plan Verification
Yunhao Yang
N. Bhatt
Pranay Samineni
Rohan Siva
Zhanyang Wang
Ufuk Topcu
4
0
0
30 Oct 2025
From Refusal to Recovery: A Control-Theoretic Approach to Generative AI Guardrails
From Refusal to Recovery: A Control-Theoretic Approach to Generative AI Guardrails
Ravi Pandya
Madison Bland
D. Nguyen
Changliu Liu
J. F. Fisac
Andrea V. Bajcsy
32
0
0
15 Oct 2025
1