ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.06146
  4. Cited By
PubMedQA: A Dataset for Biomedical Research Question Answering

PubMedQA: A Dataset for Biomedical Research Question Answering

13 September 2019
Qiao Jin
Bhuwan Dhingra
Zhengping Liu
William W. Cohen
Xinghua Lu
ArXivPDFHTML

Papers citing "PubMedQA: A Dataset for Biomedical Research Question Answering"

50 / 525 papers shown
Title
Reward-RAG: Enhancing RAG with Reward Driven Supervision
Reward-RAG: Enhancing RAG with Reward Driven Supervision
Thang Nguyen
Peter Chin
Yu-Wing Tai
RALM
42
4
0
03 Oct 2024
Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and
  Reliability
Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and Reliability
Weitong Zhang
Chengqi Zang
Bernhard Kainz
31
0
0
01 Oct 2024
DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data
  Mining
DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining
Vinayak Arannil
Neha Narwal
Sourav Sanjukta Bhabesh
Sai Nikhil Thirandas
Darren Yow-Bang Wang
Graham Horwood
Alex Anto Chirayath
Gouri Pandeshwar
41
0
0
30 Sep 2024
SciDFM: A Large Language Model with Mixture-of-Experts for Science
SciDFM: A Large Language Model with Mixture-of-Experts for Science
Liangtai Sun
Danyu Luo
Da Ma
Zihan Zhao
Baocai Chen
Zhennan Shen
Su Zhu
Lu Chen
Xin Chen
Kai Yu
MoE
40
2
0
27 Sep 2024
Efficient In-Domain Question Answering for Resource-Constrained
  Environments
Efficient In-Domain Question Answering for Resource-Constrained Environments
Isaac Chung
Phat Vo
Arman Kizilkale
Aaron Reite
RALM
23
0
0
26 Sep 2024
Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task
  Learning Via Connector-MoE
Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE
Xun Zhu
Ying Hu
Fanbin Mo
Miao Li
Ji Wu
49
8
0
26 Sep 2024
Zero-Shot Detection of LLM-Generated Text using Token Cohesiveness
Zero-Shot Detection of LLM-Generated Text using Token Cohesiveness
Shixuan Ma
Quan Wang
40
2
0
25 Sep 2024
A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?
A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?
Yunfei Xie
Juncheng Wu
Haoqin Tu
Siwei Yang
Bingchen Zhao
Yongshuo Zong
Qiao Jin
Cihang Xie
Yuyin Zhou
LM&MA
ELM
LRM
49
19
0
23 Sep 2024
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining
  for Clinical LLMs
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs
Clément Christophe
Tathagata Raha
Svetlana Maslenkova
Muhammad Umar Salman
Praveen K Kanithi
Marco AF Pimentel
Shadab Khan
LM&MA
35
2
0
23 Sep 2024
Reliable and diverse evaluation of LLM medical knowledge mastery
Reliable and diverse evaluation of LLM medical knowledge mastery
Yuxuan Zhou
Xien Liu
Chen Ning
Xiao Zhang
Ji Wu
MedIm
34
0
0
22 Sep 2024
JMedBench: A Benchmark for Evaluating Japanese Biomedical Large Language
  Models
JMedBench: A Benchmark for Evaluating Japanese Biomedical Large Language Models
Junfeng Jiang
Jiahao Huang
Akiko Aizawa
LM&MA
35
4
0
20 Sep 2024
Development and bilingual evaluation of Japanese medical large language
  model within reasonably low computational resources
Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources
Issey Sukeda
ELM
44
1
0
18 Sep 2024
Eir: Thai Medical Large Language Models
Eir: Thai Medical Large Language Models
Yutthakorn Thiprak
Rungtam Ngodngamthaweesuk
Songtam Ngodngamtaweesuk
LM&MA
ELM
40
0
0
13 Sep 2024
Leveraging Unstructured Text Data for Federated Instruction Tuning of
  Large Language Models
Leveraging Unstructured Text Data for Federated Instruction Tuning of Large Language Models
Rui Ye
Rui Ge
Yuchi Fengting
Jingyi Chai
Yanfeng Wang
Siheng Chen
FedML
40
1
0
11 Sep 2024
A Dataset for Evaluating LLM-based Evaluation Functions for Research
  Question Extraction Task
A Dataset for Evaluating LLM-based Evaluation Functions for Research Question Extraction Task
Yuya Fujisaki
Shiro Takagi
Hideki Asoh
Wataru Kumagai
23
0
0
10 Sep 2024
Language agents achieve superhuman synthesis of scientific knowledge
Language agents achieve superhuman synthesis of scientific knowledge
Michael D. Skarlinski
Sam Cox
Jon M. Laurent
James D. Braza
Michaela M. Hinks
M. Hammerling
Manvitha Ponnapati
Samuel G. Rodriques
Andrew D. White
ELM
HILM
ALM
35
29
0
10 Sep 2024
Enhancing Healthcare LLM Trust with Atypical Presentations Recalibration
Enhancing Healthcare LLM Trust with Atypical Presentations Recalibration
Jeremy Qin
Bang Liu
Quoc Dinh Nguyen
35
2
0
05 Sep 2024
The AdEMAMix Optimizer: Better, Faster, Older
The AdEMAMix Optimizer: Better, Faster, Older
Matteo Pagliardini
Pierre Ablin
David Grangier
ODL
28
8
0
05 Sep 2024
Pooling And Attention: What Are Effective Designs For LLM-Based
  Embedding Models?
Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models?
Yixuan Tang
Yi Yang
33
3
0
04 Sep 2024
Pre-training data selection for biomedical domain adaptation using
  journal impact metrics
Pre-training data selection for biomedical domain adaptation using journal impact metrics
Mathieu Lai-king
P. Paroubek
34
0
0
04 Sep 2024
NDP: Next Distribution Prediction as a More Broad Target
NDP: Next Distribution Prediction as a More Broad Target
Junhao Ruan
Abudukeyumu Abudula
Xinyu Liu
Bei Li
Yinqiao Li
Chenglong Wang
Yuchun Fan
Yuan Ge
Tong Xiao
Jingbo Zhu
32
0
0
30 Aug 2024
Flexible and Effective Mixing of Large Language Models into a Mixture of
  Domain Experts
Flexible and Effective Mixing of Large Language Models into a Mixture of Domain Experts
Rhui Dih Lee
L. Wynter
R. Ganti
MoE
45
1
0
30 Aug 2024
A Survey for Large Language Models in Biomedicine
A Survey for Large Language Models in Biomedicine
Chong Wang
Mengyao Li
Junjun He
Zhongruo Wang
Erfan Darzi
...
Yi Yu
Pietro Liò
Tianyun Wang
Yu Guang Wang
Yiqing Shen
LM&MA
34
9
0
29 Aug 2024
Toward the Evaluation of Large Language Models Considering Score
  Variance across Instruction Templates
Toward the Evaluation of Large Language Models Considering Score Variance across Instruction Templates
Yusuke Sakai
Adam Nohejl
Jiangnan Hang
Hidetaka Kamigaito
Taro Watanabe
ELM
36
2
0
22 Aug 2024
Aligning (Medical) LLMs for (Counterfactual) Fairness
Aligning (Medical) LLMs for (Counterfactual) Fairness
Raphael Poulain
Hamed Fayyaz
Rahmatollah Beheshti
34
3
0
22 Aug 2024
Unconditional Truthfulness: Learning Conditional Dependency for
  Uncertainty Quantification of Large Language Models
Unconditional Truthfulness: Learning Conditional Dependency for Uncertainty Quantification of Large Language Models
Artem Vazhentsev
Ekaterina Fadeeva
Rui Xing
Alexander Panchenko
Preslav Nakov
Timothy Baldwin
Maxim Panov
Artem Shelmanov
HILM
32
0
0
20 Aug 2024
WPN: An Unlearning Method Based on N-pair Contrastive Learning in
  Language Models
WPN: An Unlearning Method Based on N-pair Contrastive Learning in Language Models
Guitao Chen
Yunshen Wang
Hongye Sun
Guang Chen
MU
28
1
0
18 Aug 2024
FEDMEKI: A Benchmark for Scaling Medical Foundation Models via Federated
  Knowledge Injection
FEDMEKI: A Benchmark for Scaling Medical Foundation Models via Federated Knowledge Injection
Jiaqi Wang
Xiaochen Wang
Lingjuan Lyu
Jinghui Chen
Fenglong Ma
82
5
0
17 Aug 2024
Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge
Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge
Ravi Raju
Swayambhoo Jain
Bo Li
Jonathan Li
Urmish Thakker
ALM
ELM
42
11
0
16 Aug 2024
RealMedQA: A pilot biomedical question answering dataset containing
  realistic clinical questions
RealMedQA: A pilot biomedical question answering dataset containing realistic clinical questions
Gregory Kell
A. Roberts
Serge Umansky
Yuti Khare
Najma Ahmed
...
Chloe Simela
Jack Coumbe
Julian Rozario
Ryan-Rhys Griffiths
Iain J. Marshall
49
0
0
16 Aug 2024
Med42-v2: A Suite of Clinical LLMs
Med42-v2: A Suite of Clinical LLMs
Clément Christophe
Praveen K Kanithi
Tathagata Raha
Shadab Khan
Marco AF Pimentel
ELM
LM&MA
AI4MH
28
20
0
12 Aug 2024
Get Confused Cautiously: Textual Sequence Memorization Erasure with
  Selective Entropy Maximization
Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization
Zhaohan Zhang
Ziquan Liu
Ioannis Patras
39
2
0
09 Aug 2024
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph
  Retrieval-Augmented Generation
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation
Junde Wu
Jiayuan Zhu
Yunli Qi
34
24
0
08 Aug 2024
Learning to Rewrite: Generalized LLM-Generated Text Detection
Learning to Rewrite: Generalized LLM-Generated Text Detection
Wei Hao
Ran Li
Weiliang Zhao
Junfeng Yang
Chengzhi Mao
DeLMO
56
3
0
08 Aug 2024
CoverBench: A Challenging Benchmark for Complex Claim Verification
CoverBench: A Challenging Benchmark for Complex Claim Verification
Alon Jacovi
Moran Ambar
Eyal Ben-David
Uri Shaham
Amir Feder
Mor Geva
Dror Marcus
Avi Caciularu
LMTD
49
3
0
06 Aug 2024
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented
  Generation
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation
Daniel Fleischer
Moshe Berchansky
Moshe Wasserblat
Peter Izsak
3DV
58
4
0
05 Aug 2024
Improving Retrieval-Augmented Generation in Medicine with Iterative
  Follow-up Questions
Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions
Guangzhi Xiong
Qiao Jin
Xiao Wang
Minjia Zhang
Zhiyong Lu
Aidong Zhang
RALM
54
24
0
01 Aug 2024
MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning
MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning
Yupeng Chen
Senmiao Wang
Zhihang Lin
Zhihang Lin
Yushun Zhang
Tian Ding
Ruoyu Sun
Ruoyu Sun
CLL
80
1
0
30 Jul 2024
CollectiveSFT: Scaling Large Language Models for Chinese Medical
  Benchmark with Collective Instructions in Healthcare
CollectiveSFT: Scaling Large Language Models for Chinese Medical Benchmark with Collective Instructions in Healthcare
Jingwei Zhu
Minghuan Tan
Min Yang
Ruixue Li
Hamid Alinejad-Rokny
ALM
LM&MA
38
0
0
29 Jul 2024
Large Language Models as Co-Pilots for Causal Inference in Medical
  Studies
Large Language Models as Co-Pilots for Causal Inference in Medical Studies
Ahmed Alaa
Rachael V. Phillips
Emre Kiciman
Laura B. Balzer
Mark van der Laan
Maya L Petersen
CML
ELM
LM&MA
46
0
0
26 Jul 2024
Know Your Limits: A Survey of Abstention in Large Language Models
Know Your Limits: A Survey of Abstention in Large Language Models
Bingbing Wen
Jihan Yao
Shangbin Feng
Chenjun Xu
Yulia Tsvetkov
Bill Howe
Lucy Lu Wang
59
5
0
25 Jul 2024
Learn while Unlearn: An Iterative Unlearning Framework for Generative Language Models
Learn while Unlearn: An Iterative Unlearning Framework for Generative Language Models
Haoyu Tang
Ye Liu
Xukai Liu
Xukai Liu
Yanghai Zhang
Kai Zhang
Xiaofang Zhou
Enhong Chen
MU
75
3
0
25 Jul 2024
ScholarChemQA: Unveiling the Power of Language Models in Chemical
  Research Question Answering
ScholarChemQA: Unveiling the Power of Language Models in Chemical Research Question Answering
Xiuying Chen
Tairan Wang
Taicheng Guo
Kehan Guo
Juexiao Zhou
Haoyang Li
Mingchen Zhuge
Jürgen Schmidhuber
Xin Gao
Xiangliang Zhang
52
3
0
24 Jul 2024
RadioRAG: Factual Large Language Models for Enhanced Diagnostics in
  Radiology Using Dynamic Retrieval Augmented Generation
RadioRAG: Factual Large Language Models for Enhanced Diagnostics in Radiology Using Dynamic Retrieval Augmented Generation
Soroosh Tayebi Arasteh
Mahshad Lotfinia
Keno Bressem
R. Siepmann
Dyke Ferber
Christiane Kuhl
Jakob Nikolas Kather
S. Nebelung
Daniel Truhn
RALM
LM&MA
VLM
27
2
0
22 Jul 2024
An Empirical Study of Retrieval Augmented Generation with
  Chain-of-Thought
An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought
Yuetong Zhao
Hongyu Cao
Xianyu Zhao
Zhijian Ou
RALM
LRM
25
3
0
22 Jul 2024
Domain-Specific Pretraining of Language Models: A Comparative Study in
  the Medical Field
Domain-Specific Pretraining of Language Models: A Comparative Study in the Medical Field
Tobias Kerner
ELM
LM&MA
41
4
0
19 Jul 2024
SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning
SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning
Joseph Marvin Imperial
Harish Tayyar Madabushi
41
1
0
18 Jul 2024
Scientific QA System with Verifiable Answers
Scientific QA System with Verifiable Answers
Adela Ljajić
Milos Kosprdic
Bojana Bašaragin
Darija Medvecki
Lorenzo Cassano
Nikola Milosevic
23
1
0
16 Jul 2024
MSEval: A Dataset for Material Selection in Conceptual Design to
  Evaluate Algorithmic Models
MSEval: A Dataset for Material Selection in Conceptual Design to Evaluate Algorithmic Models
Yash Jain
Daniele Grandi
Allin Groom
Brandon Cramer
Christopher McComb
44
0
0
12 Jul 2024
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers
Shraman Pramanick
Rama Chellappa
Subhashini Venugopalan
50
13
0
12 Jul 2024
Previous
12345...91011
Next