Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.06146
Cited By
PubMedQA: A Dataset for Biomedical Research Question Answering
13 September 2019
Qiao Jin
Bhuwan Dhingra
Zhengping Liu
William W. Cohen
Xinghua Lu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PubMedQA: A Dataset for Biomedical Research Question Answering"
50 / 527 papers shown
Title
Diversifying Knowledge Enhancement of Biomedical Language Models using Adapter Modules and Knowledge Graphs
Juraj Vladika
Alexander Fichtl
Florian Matthes
KELM
30
1
0
21 Dec 2023
MedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models
Yan Cai
Linlin Wang
Ye Wang
Gerard de Melo
Ya Zhang
Yanfeng Wang
Liang He
AI4MH
ELM
LM&MA
50
17
0
20 Dec 2023
UniDCP: Unifying Multiple Medical Vision-language Tasks via Dynamic Cross-modal Learnable Prompts
Chenlu Zhan
Yufei Zhang
Yu Lin
Gaoang Wang
Hongwei Wang
VLM
MedIm
33
5
0
18 Dec 2023
Catwalk: A Unified Language Model Evaluation Framework for Many Datasets
Dirk Groeneveld
Anas Awadalla
Iz Beltagy
Akshita Bhagia
Ian H. Magnusson
Hao Peng
Oyvind Tafjord
Pete Walsh
Kyle Richardson
Jesse Dodge
122
1
0
15 Dec 2023
RJUA-QA: A Comprehensive QA Dataset for Urology
Shiwei Lyu
Chenfei Chi
Hongbo Cai
Lei Shi
Xiaoyan Yang
...
Xiaowei Ma
Yue Shen
Jinjie Gu
Wei Xue
Yiran Huang
LM&MA
26
3
0
15 Dec 2023
Large language models in healthcare and medical domain: A review
Zabir Al Nazi
Wei Peng
LM&MA
34
132
0
12 Dec 2023
PaperQA: Retrieval-Augmented Generative Agent for Scientific Research
Jakub Lála
Odhran O'Donoghue
Aleksandar Shtedritski
Sam Cox
Samuel G. Rodriques
Andrew D. White
RALM
82
76
0
08 Dec 2023
From Beginner to Expert: Modeling Medical Knowledge into General LLMs
Qiang Li
Xiaoyan Yang
Haowen Wang
Qin Wang
Lei Liu
...
Wangshu Zhang
Teng Xu
Jinjie Gu
Jing Zheng
Guannan Zhang
LM&MA
ELM
AI4MH
19
14
0
02 Dec 2023
Hashmarks: Privacy-Preserving Benchmarks for High-Stakes AI Evaluation
P. Bricman
24
0
0
01 Dec 2023
Explanatory Argument Extraction of Correct Answers in Resident Medical Exams
Iakes Goenaga
Aitziber Atutxa
Koldo Gojenola
Maite Oronoz
Rodrigo Agerri
ELM
70
8
0
01 Dec 2023
Should we be going MAD? A Look at Multi-Agent Debate Strategies for LLMs
Andries P. Smit
Paul Duckworth
Nathan Grinsztajn
Thomas D. Barrett
Arnu Pretorius
22
20
0
29 Nov 2023
Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine
Harsha Nori
Yin Tat Lee
Sheng Zhang
Dean Carignan
Richard Edgar
...
Hoifung Poon
Tao Qin
Naoto Usuyama
Chris White
Eric Horvitz
LM&MA
AI4MH
MedIm
ELM
35
294
0
28 Nov 2023
MEDITRON-70B: Scaling Medical Pretraining for Large Language Models
Zeming Chen
Alejandro Hernández Cano
Angelika Romanou
Antoine Bonnet
Kyle Matoba
...
Axel Marmet
Syrielle Montariol
Mary-Anne Hartley
Martin Jaggi
Antoine Bosselut
LM&MA
AI4MH
MedIm
43
179
0
27 Nov 2023
Walking a Tightrope -- Evaluating Large Language Models in High-Risk Domains
Chia-Chien Hung
Wiem Ben-Rim
Lindsay Frost
Lars Bruckner
Carolin (Haas) Lawrence
AILaw
ALM
ELM
25
9
0
25 Nov 2023
Minimizing Factual Inconsistency and Hallucination in Large Language Models
Muneeswaran Irulandi
Shreya Saxena
Siva Prasad
M. V. Sai Prakash
Advaith Shankar
V. Varun
Vishal Vaddina
Saisubramaniam Gopalakrishnan
HILM
32
5
0
23 Nov 2023
LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms
Aditi Jha
Sam Havens
Jeremey Dohmann
Alex Trott
Jacob P. Portes
ALM
19
11
0
22 Nov 2023
nach0: Multimodal Natural and Chemical Languages Foundation Model
M. Livne
Z. Miftahutdinov
E. Tutubalina
Maksim Kuznetsov
Daniil Polykovskiy
...
Aastha Jhunjhunwala
Anthony Costa
Alex Aliper
Alán Aspuru-Guzik
Alex Zhavoronkov
AI4CE
27
12
0
21 Nov 2023
AcademicGPT: Empowering Academic Research
Shufa Wei
Xiaolong Xu
Xianbiao Qi
Xi Yin
Jun Xia
...
Chihao Dai
Lihua Wang
Xiaohui Liu
Lei Zhang
Yutao Xie
LM&MA
44
3
0
21 Nov 2023
Taiyi: A Bilingual Fine-Tuned Large Language Model for Diverse Biomedical Tasks
Ling Luo
Jinzhong Ning
Yingwen Zhao
Zhijun Wang
Zeyuan Ding
...
Yuqi Liu
Zhihao Yang
Jian Wang
Yuanyuan Sun
Hongfei Lin
LM&MA
99
51
0
20 Nov 2023
MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning
Xiangru Tang
Anni Zou
Zhuosheng Zhang
Ziming Li
Yilun Zhao
Xingyao Zhang
Arman Cohan
Mark B. Gerstein
LRM
LM&MA
26
136
0
16 Nov 2023
DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation
Yiqing Xie
Sheng Zhang
Hao Cheng
Pengfei Liu
Zelalem Gero
Cliff Wong
Tristan Naumann
Hoifung Poon
Carolyn Rose
MedIm
21
4
0
16 Nov 2023
What if you said that differently?: How Explanation Formats Affect Human Feedback Efficacy and User Perception
Chaitanya Malaviya
Subin Lee
Dan Roth
Mark Yatskar
37
1
0
16 Nov 2023
AuthentiGPT: Detecting Machine-Generated Text via Black-Box Language Models Denoising
Zhen Guo
Shangdi Yu
DeLMO
29
10
0
13 Nov 2023
A Survey of Large Language Models in Medicine: Progress, Application, and Challenge
Hongjian Zhou
Fenglin Liu
Boyang Gu
Xinyu Zou
Jinfa Huang
...
Yefeng Zheng
Lei A. Clifton
Zheng Li
Fenglin Liu
David A. Clifton
LM&MA
33
107
0
09 Nov 2023
Interactive Multi-fidelity Learning for Cost-effective Adaptation of Language Model with Sparse Human Supervision
Jiaxin Zhang
Zhuohang Li
Kamalika Das
Kumar Sricharan
31
2
0
31 Oct 2023
Making Large Language Models Better Data Creators
Dong-Ho Lee
Jay Pujara
Mohit Sewak
Ryen W. White
S. Jauhar
ALM
SyDa
16
23
0
31 Oct 2023
BioInstruct: Instruction Tuning of Large Language Models for Biomedical Natural Language Processing
Hieu Tran
Zhichao Yang
Zonghai Yao
Hong-ye Yu
ALM
LM&MA
40
23
0
30 Oct 2023
Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific Literature
Alejandro Lozano
Scott L. Fleming
Chia-Chun Chiang
Nigam Shah
ELM
RALM
31
32
0
24 Oct 2023
MindLLM: Pre-training Lightweight Large Language Model from Scratch, Evaluations and Domain Applications
Yizhe Yang
Huashan Sun
Jiawei Li
Runheng Liu
Yinghao Li
Yuhang Liu
Heyan Huang
Yang Gao
ALM
LRM
16
8
0
24 Oct 2023
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions
Junchao Wu
Shu Yang
Runzhe Zhan
Yulin Yuan
Derek F. Wong
Lidia S. Chao
DeLMO
32
22
0
23 Oct 2023
Language Models Hallucinate, but May Excel at Fact Verification
Jian Guan
Jesse Dodge
David Wadden
Minlie Huang
Hao Peng
LRM
HILM
34
28
0
23 Oct 2023
AlpaCare:Instruction-tuned Large Language Models for Medical Application
Xinlu Zhang
Chenxin Tian
Xianjun Yang
Lichang Chen
Zekun Li
Linda R. Petzold
LM&MA
32
59
0
23 Oct 2023
PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain
Wei-wei Zhu
Xiaoling Wang
Huanran Zheng
Mosha Chen
Buzhou Tang
ELM
LM&MA
21
33
0
22 Oct 2023
MedEval: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation
Zexue He
Yu-Xiang Wang
An Yan
Yao Liu
Eric Y. Chang
Amilcare Gentili
Julian McAuley
Chun-Nan Hsu
ELM
83
14
0
21 Oct 2023
MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter
Zhiyuan Liu
Sihang Li
Yancheng Luo
Hao Fei
Yixin Cao
Kenji Kawaguchi
Xiang Wang
Tat-Seng Chua
30
81
0
19 Oct 2023
Watermarking LLMs with Weight Quantization
Linyang Li
Botian Jiang
Pengyu Wang
Ke Ren
Hang Yan
Xipeng Qiu
MQ
WaLM
15
11
0
17 Oct 2023
BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology
Odhran O'Donoghue
Aleksandar Shtedritski
John Ginger
Ralph Abboud
Ali E. Ghareeb
Justin Booth
Samuel G. Rodriques
22
17
0
16 Oct 2023
Tokenizer Choice For LLM Training: Negligible or Crucial?
Mehdi Ali
Michael Fromm
Klaudia Thellmann
Richard Rutmann
Max Lübbering
...
Malte Ostendorff
Samuel Weinbach
R. Sifa
Stefan Kesselheim
Nicolas Flores-Herr
23
47
0
12 Oct 2023
Towards Mitigating Hallucination in Large Language Models via Self-Reflection
Ziwei Ji
Tiezheng Yu
Yan Xu
Nayeon Lee
Etsuko Ishii
Pascale Fung
HILM
11
57
0
10 Oct 2023
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature
Guangsheng Bao
Yanbin Zhao
Zhiyang Teng
Linyi Yang
Yue Zhang
28
130
0
08 Oct 2023
A Comprehensive Evaluation of Large Language Models on Benchmark Biomedical Text Processing Tasks
Fangshuo Liao
Md Tahmid Rahman Laskar
Cruz Barnum
Jimmy X. Huang
AI4MH
LM&MA
35
68
0
06 Oct 2023
RA-DIT: Retrieval-Augmented Dual Instruction Tuning
Xi Lin
Xilun Chen
Mingda Chen
Weijia Shi
Maria Lomeli
...
Jacob Kahn
Gergely Szilvasy
Mike Lewis
Luke Zettlemoyer
Scott Yih
RALM
47
129
0
02 Oct 2023
Synthetic Data Generation in Low-Resource Settings via Fine-Tuning of Large Language Models
Jean Kaddour
Qi Liu
SyDa
35
2
0
02 Oct 2023
Self-Specialization: Uncovering Latent Expertise within Large Language Models
Junmo Kang
Hongyin Luo
Yada Zhu
Jacob A. Hansen
James R. Glass
David D. Cox
Alan Ritter
Rogerio Feris
Leonid Karlinsky
ALM
MoMe
27
4
0
29 Sep 2023
Using Weak Supervision and Data Augmentation in Question Answering
Chumki Basu
Binyuan Hui
Allen McIntosh
Wei Wang
J. Wullert
OOD
49
0
0
28 Sep 2023
ChatCounselor: A Large Language Models for Mental Health Support
June M. Liu
Donghao Li
He Cao
Tianhe Ren
Zeyi Liao
Jiamin Wu
AI4MH
34
34
0
27 Sep 2023
Graph Neural Prompting with Large Language Models
Yijun Tian
Huan Song
Zichen Wang
Haozhu Wang
Ziqing Hu
Fang Wang
Nitesh V. Chawla
Panpan Xu
AI4CE
37
44
0
27 Sep 2023
Unlocking Model Insights: A Dataset for Automated Model Card Generation
Shruti Singh
Hitesh Lodwal
Husain Malwat
Rakesh Thakur
Mayank Singh
SyDa
24
3
0
22 Sep 2023
Adapting Large Language Models via Reading Comprehension
Daixuan Cheng
Shaohan Huang
Furu Wei
CLL
SyDa
AI4CE
32
64
0
18 Sep 2023
A Statistical Turing Test for Generative Models
Hayden Helm
Carey E. Priebe
Weiwei Yang
DeLMO
26
7
0
16 Sep 2023
Previous
1
2
3
...
10
11
7
8
9
Next