arXiv:1909.06146
PubMedQA: A Dataset for Biomedical Research Question Answering
13 September 2019
Qiao Jin
Bhuwan Dhingra
Zhengping Liu
William W. Cohen
Xinghua Lu
Papers citing
"PubMedQA: A Dataset for Biomedical Research Question Answering"
50 / 525 papers shown
Title
ExpertQA: Expert-Curated Questions and Attributed Answers
Chaitanya Malaviya
Subin Lee
Sihao Chen
Elizabeth Sieber
Mark Yatskar
Dan Roth
ELM
HILM
25
50
0
14 Sep 2023
Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical Domain
Yanrui Du
Sendong Zhao
Yuhan Chen
Ming Ma
Huaqin Wu
Haifeng Wang
Bing Qin
38
3
0
08 Sep 2023
Aligning Large Language Models for Clinical Tasks
Supun Manathunga
Isuru Hettigoda
LM&MA
ELM
AI4MH
33
10
0
06 Sep 2023
Augmenting Black-box LLMs with Medical Textbooks for Biomedical Question Answering
Yubo Wang
Xueguang Ma
Wenhu Chen
LM&MA
AI4MH
47
9
0
05 Sep 2023
Examining User-Friendly and Open-Sourced Large GPT Models: A Survey on Language, Multimodal, and Scientific GPT Models
Kaiyuan Gao
Su He
Zhenyu He
Jiacheng Lin
Qizhi Pei
Jie Shao
Wei Zhang
LM&MA
SyDa
35
4
0
27 Aug 2023
MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records
Scott L. Fleming
Alejandro Lozano
W. Haberkorn
Jenelle A. Jindal
E. Reis
...
Jonathan H. Chen
Keith Morse
Emma Brunskill
Jason Alan Fries
N. Shah
LM&MA
28
53
0
27 Aug 2023
SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research
Liangtai Sun
Yang Han
Zihan Zhao
Da Ma
Zhe-Wei Shen
Baocai Chen
Lu Chen
Kai Yu
ELM
45
70
0
25 Aug 2023
PaniniQA: Enhancing Patient Education Through Interactive Question Answering
Pengshan Cai
Zonghai Yao
Fei Liu
Dakuo Wang
Meghan Reilly
...
Yi Cao
Alok Kapoor
Adarsha S. Bajracharya
D. Berlowitz
Hongfeng Yu
38
18
0
07 Aug 2023
Teaching Smaller Language Models To Generalise To Unseen Compositional Questions
Tim Hartill
N. Tan
Michael Witbrock
Patricia J. Riddle
ReLM
KELM
LRM
34
2
0
02 Aug 2023
ArcGPT: A Large Language Model Tailored for Real-world Archival Applications
Shitou Zhang
Jingrui Hou
Siyuan Peng
Z. Li
Qibiao Hu
P. Wang
KELM
RALM
LLMAG
32
3
0
27 Jul 2023
Towards Generalist Biomedical AI
Tao Tu
Shekoofeh Azizi
Danny Driess
M. Schaekermann
Mohamed Amin
...
Yossi Matias
K. Singhal
Peter R. Florence
Alan Karthikesalingam
Vivek Natarajan
LM&MA
MedIm
AI4MH
40
243
0
26 Jul 2023
Several categories of Large Language Models (LLMs): A Short Survey
Saurabh Pahune
Manoj Chandrasekharan
AILaw
25
14
0
05 Jul 2023
CARE-MI: Chinese Benchmark for Misinformation Evaluation in Maternity and Infant Care
Tong Xiang
Liangzhi Li
Wangyue Li
Min‐Jun Bai
Lu Wei
Bowen Wang
Noa Garcia
36
5
0
04 Jul 2023
Transformers in Healthcare: A Survey
Subhash Nerella
S. Bandyopadhyay
Jiaqing Zhang
Miguel Contreras
Scott Siegel
...
Jessica Sena
B. Shickel
A. Bihorac
Kia Khezeli
Parisa Rashidi
MedIm
AI4CE
21
25
0
30 Jun 2023
Confidence-Calibrated Ensemble Dense Phrase Retrieval
William Yang
Noah Bergam
A. Jain
Nima Sheikhoslami
17
0
0
28 Jun 2023
SciMRC: Multi-perspective Scientific Machine Reading Comprehension
Xiao Zhang
Heqi Zheng
Yuxiang Nie
Heyan Huang
Xian-Ling Mao
44
1
0
25 Jun 2023
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models
Shizhe Diao
Rui Pan
Hanze Dong
Kashun Shum
Jipeng Zhang
Wei Xiong
Tong Zhang
ALM
20
63
0
21 Jun 2023
Opportunities and Challenges for ChatGPT and Large Language Models in Biomedicine and Health
Shubo Tian
Qiao Jin
Lana Yeganova
Po-Ting Lai
Qingqing Zhu
...
Donald C. Comeau
R. Islamaj
Aadit Kapoor
Xin Gao
Zhiyong Lu
LM&MA
MedIm
AI4MH
109
210
0
15 Jun 2023
Gradient Ascent Post-training Enhances Language Model Generalization
Dongkeun Yoon
Joel Jang
Sungdong Kim
Minjoon Seo
VLM
AI4CE
23
3
0
12 Jun 2023
FedSecurity: Benchmarking Attacks and Defenses in Federated Learning and Federated LLMs
Shanshan Han
Baturalp Buyukates
Zijian Hu
Han Jin
Weizhao Jin
...
Qifan Zhang
Yuhui Zhang
Carlee Joe-Wong
Salman Avestimehr
Chaoyang He
SILM
31
12
0
08 Jun 2023
Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers
Israt Jahan
Md Tahmid Rahman Laskar
Chun Peng
J. Huang
LM&MA
MedIm
AI4MH
43
30
0
07 Jun 2023
Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today
Zhuo Wang
R. Li
Bowen Dong
Jie Wang
Xiuxing Li
...
C. Mao
Wei Zhang
L. Dong
Jing Gao
Jianyong Wang
LM&MA
ELM
AI4MH
28
19
0
02 Jun 2023
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Guilherme Penedo
Quentin Malartic
Daniel Hesslow
Ruxandra-Aimée Cojocaru
Alessandro Cappelli
Hamza Alobeidli
B. Pannier
Ebtesam Almazrouei
Julien Launay
27
751
0
01 Jun 2023
FERMAT: An Alternative to Accuracy for Numerical Reasoning
Jasivan Sivakumar
N. Moosavi
ReLM
LRM
37
3
0
27 May 2023
DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text
Xianjun Yang
Wei Cheng
Yue Wu
Linda R. Petzold
William Yang Wang
Haifeng Chen
DeLMO
30
84
0
27 May 2023
Scientific Fact-Checking: A Survey of Resources and Approaches
Juraj Vladika
Florian Matthes
HILM
36
42
0
26 May 2023
Few-shot Unified Question Answering: Tuning Models or Prompts?
Srijan Bansal
Semih Yavuz
Bo Pang
Meghana Moorthy Bhat
Yingbo Zhou
28
2
0
23 May 2023
Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding
Yu Zhang
Hao Cheng
Zhihong Shen
Xiaodong Liu
Yejiang Wang
Jianfeng Gao
32
13
0
23 May 2023
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
Seungone Kim
Se June Joo
Doyoung Kim
Joel Jang
Seonghyeon Ye
Jamin Shin
Minjoon Seo
ALM
RALM
LRM
23
96
0
23 May 2023
A Study of Generative Large Language Model for Medical Research and Healthcare
C.A.I. Peng
Xi Yang
Aokun Chen
Kaleb E. Smith
Nima M. Pournejatian
...
W. Hogan
E. Shenkman
Yi Guo
Jiang Bian
Yonghui Wu
LM&MA
ELM
AI4MH
155
244
0
22 May 2023
BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance
Karel D'Oosterlinck
François Remy
Johannes Deleu
Thomas Demeester
Chris Develder
Klim Zaporojets
Aneiss Ghodsi
Simon Ellershaw
Jack R. Collins
Christopher Potts
50
10
0
22 May 2023
"According to ...": Prompting Language Models Improves Quoting from Pre-Training Data
Orion Weller
Marc Marone
Nathaniel Weir
Dawn J Lawrie
Daniel Khashabi
Benjamin Van Durme
HILM
78
44
0
22 May 2023
MAGE: Machine-generated Text Detection in the Wild
Yafu Li
Qintong Li
Leyang Cui
Wei Bi
Zhilin Wang
Longyue Wang
Linyi Yang
Shuming Shi
Yue Zhang
DeLMO
41
42
0
22 May 2023
SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables
Xinyuan Lu
Liangming Pan
Qian Liu
Preslav Nakov
Min-Yen Kan
LMTD
38
24
0
22 May 2023
ExplainCPE: A Free-text Explanation Benchmark of Chinese Pharmacist Examination
Dongfang Li
Jindi Yu
Baotian Hu
Zhenran Xu
M. Zhang
ELM
6
11
0
22 May 2023
DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection
Xiao Yu
Yuang Qi
Kejiang Chen
Guoqiang Chen
Xi Yang
Pengyuan Zhu
Xiuwei Shang
Weiming Zhang
Neng H. Yu
DeLMO
13
11
0
21 May 2023
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering
Fangkai Yang
Pu Zhao
Zezhong Wang
Lu Wang
Jue Zhang
Mohit Garg
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
37
47
0
19 May 2023
Document Understanding Dataset and Evaluation (DUDE)
Jordy Van Landeghem
Rubèn Pérez Tito
Łukasz Borchmann
Michal Pietruszka
Pawel Józiak
...
Bertrand Ackaert
Ernest Valveny
Matthew Blaschko
Sien Moens
Tomasz Stanislawek
VGen
24
52
0
15 May 2023
MatSci-NLP: Evaluating Scientific Language Models on Materials Science Language Tasks Using Text-to-Schema Modeling
Yurun Song
Santiago Miret
Bang Liu
30
29
0
14 May 2023
Zero-shot Faithful Factual Error Correction
Kung-Hsiang Huang
Hou Pong Chan
Heng Ji
KELM
HILM
26
30
0
13 May 2023
Improving Small Language Models on PubMedQA via Generative Data Augmentation
Zhen Guo
Peiqi Wang
Yanwei Wang
Shangdi Yu
LM&MA
MedIm
18
10
0
12 May 2023
Long-Tailed Question Answering in an Open World
Yinpei Dai
Hao Lang
Yinhe Zheng
Fei Huang
Yongbin Li
VLM
29
7
0
11 May 2023
PMC-LLaMA: Towards Building Open-source Language Models for Medicine
Chaoyi Wu
Weixiong Lin
Xiaoman Zhang
Ya-Qin Zhang
Yanfeng Wang
Weidi Xie
LM&MA
AI4MH
98
75
0
27 Apr 2023
A Lightweight Constrained Generation Alternative for Query-focused Summarization
Zhichao Xu
Daniel Cohen
26
10
0
23 Apr 2023
LLMMaps -- A Visual Metaphor for Stratified Evaluation of Large Language Models
Patrik Puchert
Poonam Poonam
Christian van Onzenoodt
Timo Ropinski
20
8
0
02 Apr 2023
CQSumDP: A ChatGPT-Annotated Resource for Query-Focused Abstractive Summarization Based on Debatepedia
Md Tahmid Rahman Laskar
Mizanur Rahman
Israt Jahan
Enamul Hoque
J. Huang
40
8
0
31 Mar 2023
Capabilities of GPT-4 on Medical Challenge Problems
Harsha Nori
Nicholas King
S. McKinney
Dean Carignan
Eric Horvitz
LM&MA
ELM
AI4MH
41
766
0
20 Mar 2023
Generating multiple-choice questions for medical question answering with distractors and cue-masking
Damien Sileo
Kanimozhi Uma
Marie-Francine Moens
37
5
0
13 Mar 2023
Almanac: Retrieval-Augmented Language Models for Clinical Medicine
C. Zakka
Akash Chaurasia
R. Shad
Alex R. Dalal
Jennifer L. Kim
...
Kathleen Boyd
Karen Hirsch
C. Langlotz
Joanna Nelson
W. Hiesinger
LM&MA
116
144
0
01 Mar 2023
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
Shayne Longpre
Le Hou
Tu Vu
Albert Webson
Hyung Won Chung
...
Denny Zhou
Quoc V. Le
Barret Zoph
Jason W. Wei
Adam Roberts
ALM
41
628
0
31 Jan 2023