ResearchTrend.AI
ER-REASON: A Benchmark Dataset for LLM-Based Clinical Reasoning in the Emergency Room
Versions: v1, v2 (latest)


28 May 2025
Nikita Mehandru
Niloufar Golchini
David Bamman
Travis Zack
Melanie F. Molina
Ahmed Alaa
    ELM

Papers citing "ER-REASON: A Benchmark Dataset for LLM-Based Clinical Reasoning in the Emergency Room"

4 / 4 papers shown
Medical Large Language Model Benchmarks Should Prioritize Construct Validity
Ahmed M. Alaa
Thomas Hartvigsen
Niloufar Golchini
Shiladitya Dutta
Frances Dean
Inioluwa Deborah Raji
Travis Zack
AI4MH, ELM, LM&MA
84
6
0
12 Mar 2025
The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer
Marthe Ballon
Andres Algaba
Vincent Ginis
LRM, ReLM
104
17
0
24 Feb 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM, VLM, OffRL, AI4TS, LRM
384
2,022
0
22 Jan 2025
Evaluating Transparent Reasoning in Large Language Models for Accountable Critical Tasks
Bowen Wang
Jiuyang Chang
Yiming Qian
Guoxin Chen
LRM, LM&MA, ELM
111
6
0
04 Aug 2024