ResearchTrend.AI
A Course Shared Task on Evaluating LLM Output for Clinical Questions


31 July 2024
Yufang Hou
Thy Thy Tran
Doan Nam Long Vu
Yiwen Cao
Kai Li
Lukas Rohde
Iryna Gurevych
    LM&MA
    ELM

Papers citing "A Course Shared Task on Evaluating LLM Output for Clinical Questions"

2 / 2 papers shown
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
Yufang Hou
Alessandra Pascale
Javier Carnerero-Cano
T. Tchrakian
Radu Marinescu
Elizabeth M. Daly
Inkit Padhi
P. Sattigeri
51
6
0
19 Jun 2024
Explainable Automated Fact-Checking for Public Health Claims
Neema Kotonya
Francesca Toni
218
251
0
19 Oct 2020