Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.04709
Cited By
A Comparative Study of Open-Source Large Language Models, GPT-4 and Claude 2: Multiple-Choice Test Taking in Nephrology
9 August 2023
Sean Wu
Michael Koo
L. Blum
A. Black
Liyo Kao
Fabien Scalzo
Ira Kurtz
LM&MA
ELM
AI4MH
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Comparative Study of Open-Source Large Language Models, GPT-4 and Claude 2: Multiple-Choice Test Taking in Nephrology"
20 / 20 papers shown
Title
OpenTCM: A GraphRAG-Empowered LLM-based System for Traditional Chinese Medicine Knowledge Retrieval and Diagnosis
Jinglin He
Yunqi Guo
Lai Kwan Lam
Waikei Leung
Lixing He
Yuanan Jiang
Chi Chiu Wang
Guoliang Xing
Hongkai Chen
34
0
0
28 Apr 2025
Can AI Master Construction Management (CM)? Benchmarking State-of-the-Art Large Language Models on CM Certification Exams
Ruoxin Xiong
Yanyu Wang
Suat Gunhan
Yimin Zhu
Charles Berryman
ELM
33
0
0
04 Apr 2025
Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic Scoring
Xuansheng Wu
Padmaja Pravin Saraf
Gyeong-Geon Lee
Ehsan Latif
Ninghao Liu
Xiaoming Zhai
60
4
0
24 Feb 2025
Explainable CTR Prediction via LLM Reasoning
Xiaohan Yu
Li Zhang
C. L. Philip Chen
OffRL
LRM
69
1
0
03 Dec 2024
The Evolution and Future Perspectives of Artificial Intelligence Generated Content
Chengzhang Zhu
Luobin Cui
Ying Tang
Jiacun Wang
92
1
0
02 Dec 2024
Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs
Afia Anjum
Maksim E. Eren
V. Setlur
Boian Alexandrov
Manish Bhattarai
29
2
0
02 Aug 2024
Automated Review Generation Method Based on Large Language Models
Shican Wu
Xiao Ma
Dehui Luo
Lulu Li
Xiangcheng Shi
...
Ran Luo
Chunlei Pei
Zhijian Zhao
Zhi-Jian Zhao
Jinlong Gong
77
0
0
30 Jul 2024
Adversarial Databases Improve Success in Retrieval-based Large Language Models
Sean Wu
Michael Koo
Li Yo Kao
Andy Black
L. Blum
Fabien Scalzo
Ira Kurtz
RALM
38
0
0
19 Jul 2024
Thought-Like-Pro: Enhancing Reasoning of Large Language Models through Self-Driven Prolog-based Chain-of-Thought
Xiaoyu Tan
Yongxin Deng
Xihe Qiu
Weidi Xu
Chao Qu
Wei Chu
Yinghui Xu
Yuan Qi
LRM
AI4CE
LM&Ro
40
2
0
18 Jul 2024
Lynx: An Open Source Hallucination Evaluation Model
Selvan Sunitha Ravi
B. Mielczarek
Anand Kannappan
Douwe Kiela
Rebecca Qian
VLM
RALM
HILM
56
17
0
11 Jul 2024
Why does in-context learning fail sometimes? Evaluating in-context learning on open and closed questions
Xiang Li
Haoran Tang
Siyu Chen
Ziwei Wang
Ryan Chen
Marcin Abram
LRM
31
1
0
02 Jul 2024
Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning Enhancement in RLHF and Effective-Merged LLMs
Chen Zheng
Ke Sun
Xun Zhou
MoE
51
0
0
12 Jun 2024
More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model Trustworthiness
Aaron Jiaxun Li
Satyapriya Krishna
Himabindu Lakkaraju
45
3
0
29 Apr 2024
A Multi-Faceted Evaluation Framework for Assessing Synthetic Data Generated by Large Language Models
Yefeng Yuan
Yuhong Liu
Liang Cheng
SyDa
ELM
23
2
0
20 Apr 2024
Latxa: An Open Language Model and Evaluation Suite for Basque
Julen Etxaniz
Oscar Sainz
Naiara Pérez
Itziar Aldabe
German Rigau
Eneko Agirre
Aitor Ormazabal
Mikel Artetxe
A. Soroa
ELM
44
22
0
29 Mar 2024
Automating psychological hypothesis generation with AI: when large language models meet causal graph
Song Tong
Kai Mao
Zhen Huang
Yukun Zhao
Kaiping Peng
27
16
0
22 Feb 2024
Can LLMs perform structured graph reasoning?
Palaash Agrawal
Shavak Vasania
Cheston Tan
LRM
26
2
0
02 Feb 2024
A Study on Large Language Models' Limitations in Multiple-Choice Question Answering
Aisha Khatun
Daniel G. Brown
ELM
27
13
0
15 Jan 2024
LLMs-Healthcare : Current Applications and Challenges of Large Language Models in various Medical Specialties
Ummara Mumtaz
Awais Ahmed
Summaya Mumtaz
AI4MH
LM&MA
37
11
0
28 Oct 2023
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
284
31,267
0
16 Jan 2013
1