ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.05410
  4. Cited By
Reasoning Models Don't Always Say What They Think

Reasoning Models Don't Always Say What They Think

8 May 2025
Yanda Chen
Joe Benton
Ansh Radhakrishnan
Jonathan Uesato
Carson E. Denison
John Schulman
Arushi Somani
Peter Hase
Misha Wagner
Fabien Roger
Vlad Mikulik
Samuel R. Bowman
Jan Leike
Jared Kaplan
E. Perez
    ReLM
    LRM
ArXivPDFHTML

Papers citing "Reasoning Models Don't Always Say What They Think"

13 / 13 papers shown
Title
Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning
Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning
Xinghao Chen
Anhao Zhao
Heming Xia
Xuan Lu
Hanlin Wang
Yanjun Chen
Wei Zhang
Jian Wang
W. Li
Xiaoyu Shen
ReLM
LRM
5
0
0
22 May 2025
Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas
Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas
Yu Ying Chiu
Zhilin Wang
Sharan Maiya
Yejin Choi
Kyle Fish
Sydney Levine
Evan Hubinger
7
0
0
20 May 2025
Detection and Mitigation of Hallucination in Large Reasoning Models: A Mechanistic Perspective
Detection and Mitigation of Hallucination in Large Reasoning Models: A Mechanistic Perspective
Zhongxiang Sun
Qipeng Wang
Haoyu Wang
Xiao Zhang
Jun Xu
HILM
LRM
14
0
0
19 May 2025
Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem Features
Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem Features
Alex Heyman
Joel Zylberberg
ReLM
HILM
LRM
14
0
0
17 May 2025
Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies
Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies
Xiaoliang Luo
Xinyi Xu
Michael Ramscar
Bradley C. Love
35
0
0
13 May 2025
Assessing the Chemical Intelligence of Large Language Models
Assessing the Chemical Intelligence of Large Language Models
Nicholas T. Runcie
Charlotte M. Deane
Fergus Imrie
ELM
LRM
48
0
0
12 May 2025
Large Language Model-driven Security Assistant for Internet of Things via Chain-of-Thought
Large Language Model-driven Security Assistant for Internet of Things via Chain-of-Thought
Mingfei Zeng
Ming Xie
Xixi Zheng
Chunhai Li
Chuan Zhang
Liehuang Zhu
36
0
0
08 May 2025
A Mathematical Philosophy of Explanations in Mechanistic Interpretability -- The Strange Science Part I.i
A Mathematical Philosophy of Explanations in Mechanistic Interpretability -- The Strange Science Part I.i
Kola Ayonrinde
Louis Jaburi
MILM
98
1
0
01 May 2025
Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks
Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks
Yixin Cao
Shibo Hong
Xuzhao Li
Jiahao Ying
Yubo Ma
...
Juanzi Li
Aixin Sun
Xuanjing Huang
Tat-Seng Chua
Tianwei Zhang
ALM
ELM
98
2
0
26 Apr 2025
Continuum-Interaction-Driven Intelligence: Human-Aligned Neural Architecture via Crystallized Reasoning and Fluid Generation
Continuum-Interaction-Driven Intelligence: Human-Aligned Neural Architecture via Crystallized Reasoning and Fluid Generation
Pengcheng Zhou
Zhiqiang Nie
Haochen Li
53
0
0
12 Apr 2025
Right Prediction, Wrong Reasoning: Uncovering LLM Misalignment in RA Disease Diagnosis
Right Prediction, Wrong Reasoning: Uncovering LLM Misalignment in RA Disease Diagnosis
Umakanta Maharana
Sarthak Verma
Avarna Agarwal
Prakashini Mruthyunjaya
Dwarikanath Mahapatra
Sakir Ahmed
Murari Mandal
233
0
0
09 Apr 2025
I'm Sorry Dave: How the old world of personnel security can inform the new world of AI insider risk
I'm Sorry Dave: How the old world of personnel security can inform the new world of AI insider risk
Paul Martin
Sarah Mercer
270
0
0
26 Mar 2025
Implicit Bias-Like Patterns in Reasoning Models
Implicit Bias-Like Patterns in Reasoning Models
Messi H.J. Lee
Calvin K. Lai
LRM
61
0
0
14 Mar 2025
1