ResearchTrend.AI
Improving Small Language Models on PubMedQA via Generative Data Augmentation

12 May 2023
Zhen Guo
Peiqi Wang
Yanwei Wang
Shangdi Yu
    LM&MA
    MedIm
arXiv: 2305.07804

Papers citing "Improving Small Language Models on PubMedQA via Generative Data Augmentation"

23 / 23 papers shown
Capabilities of GPT-4 on Medical Challenge Problems
Harsha Nori
Nicholas King
S. McKinney
Dean Carignan
Eric Horvitz
LM&MA
ELM
AI4MH
107
795
0
20 Mar 2023
A Short Survey of Viewing Large Language Models in Legal Aspect
Zhongxiang Sun
AILaw
ELM
82
68
0
16 Mar 2023
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
1.1K
13,100
0
27 Feb 2023
BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining
Renqian Luo
Liai Sun
Yingce Xia
Tao Qin
Sheng Zhang
Hoifung Poon
Tie-Yan Liu
MedIm
AI4CE
LM&MA
86
825
0
19 Oct 2022
Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning
Haokun Liu
Derek Tam
Mohammed Muqeeth
Jay Mohta
Tenghao Huang
Joey Tianyi Zhou
Colin Raffel
84
899
0
11 May 2022
Training Compute-Optimal Large Language Models
Jordan Hoffmann
Sebastian Borgeaud
A. Mensch
Elena Buchatskaya
Trevor Cai
...
Karen Simonyan
Erich Elsen
Jack W. Rae
Oriol Vinyals
Laurent Sifre
AI4TS
180
1,936
0
29 Mar 2022
Guiding Generative Language Models for Data Augmentation in Few-Shot Text Classification
A. Edwards
Asahi Ushio
Jose Camacho-Collados
Hélène de Ribaupierre
Alun D. Preece
VLM
46
23
0
17 Nov 2021
Program Synthesis with Large Language Models
Jacob Austin
Augustus Odena
Maxwell Nye
Maarten Bosma
Henryk Michalewski
...
Ellen Jiang
Carrie J. Cai
Michael Terry
Quoc V. Le
Charles Sutton
ELM
AIMat
ReCod
ALM
171
1,925
0
16 Aug 2021
Evaluating Large Language Models Trained on Code
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
...
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
ELM
ALM
203
5,454
0
07 Jul 2021
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
371
10,226
0
17 Jun 2021
Few-Shot Question Answering by Pretraining Span Selection
Ori Ram
Yuval Kirstain
Jonathan Berant
Amir Globerson
Omer Levy
74
97
0
02 Jan 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Xiang Lisa Li
Percy Liang
213
4,238
0
01 Jan 2021
Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing
Yu Gu
Robert Tinn
Hao Cheng
Michael R. Lucas
Naoto Usuyama
Xiaodong Liu
Tristan Naumann
Jianfeng Gao
Hoifung Poon
LM&MA
AI4CE
68
1,757
0
31 Jul 2020
SqueezeBERT: What can computer vision teach NLP about efficient neural networks?
F. Iandola
Albert Eaton Shaw
Ravi Krishna
Kurt Keutzer
VLM
62
127
0
19 Jun 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
604
41,736
0
28 May 2020
Inexpensive Domain Adaptation of Pretrained Language Models: Case Studies on Biomedical NER and Covid-19 QA
Nina Poerner
Ulli Waltinger
Hinrich Schütze
OOD
38
50
0
07 Apr 2020
PubMedQA: A Dataset for Biomedical Research Question Answering
Qiao Jin
Bhuwan Dhingra
Zhengping Liu
William W. Cohen
Xinghua Lu
353
883
0
13 Sep 2019
Energy and Policy Considerations for Deep Learning in NLP
Emma Strubell
Ananya Ganesh
Andrew McCallum
62
2,647
0
05 Jun 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.4K
94,511
0
11 Oct 2018
Medical Image Synthesis for Data Augmentation and Anonymization using Generative Adversarial Networks
Hoo-Chang Shin
Neil A. Tenenholtz
Jameson K. Rogers
C. Schwarz
M. Senjem
J. Gunter
Katherine P. Andriole
Mark H. Michalski
MedIm
96
540
0
26 Jul 2018
Interactive Supercomputing on 40,000 Cores for Machine Learning and Data Analysis
Albert Reuther
J. Kepner
Chansup Byun
S. Samsi
William Arcand
...
J. Mullen
Andrew Prout
Antonio Rosa
Charles Yee
Peter Michaleas
LRM
ReLM
307
280
0
20 Jul 2018
Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Yujun Lin
Song Han
Huizi Mao
Yu Wang
W. Dally
120
1,407
0
05 Dec 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
626
130,942
0
12 Jun 2017