ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.03118
  4. Cited By
Not Enough Data? Deep Learning to the Rescue!
v1v2 (latest)

Not Enough Data? Deep Learning to the Rescue!

8 November 2019
Ateret Anaby-Tavor
Boaz Carmeli
Esther Goldbraich
Amir Kantor
George Kour
Segev Shlomov
N. Tepper
Naama Zwerdling
ArXiv (abs)PDFHTML

Papers citing "Not Enough Data? Deep Learning to the Rescue!"

50 / 167 papers shown
Title
Bridging Generative and Discriminative Learning: Few-Shot Relation Extraction via Two-Stage Knowledge-Guided Pre-training
Bridging Generative and Discriminative Learning: Few-Shot Relation Extraction via Two-Stage Knowledge-Guided Pre-training
Quanjiang Guo
Jinchuan Zhang
Sijie Wang
Ling Tian
Zhao Kang
Bin Yan
Weidong Xiao
72
1
0
18 May 2025
WHERE and WHICH: Iterative Debate for Biomedical Synthetic Data Augmentation
WHERE and WHICH: Iterative Debate for Biomedical Synthetic Data Augmentation
Zhengyi Zhao
Shubo Zhang
Bin Liang
Binyang Li
Kam-Fai Wong
SyDa
83
0
0
31 Mar 2025
HILGEN: Hierarchically-Informed Data Generation for Biomedical NER Using Knowledgebases and Large Language Models
Yao Ge
Yuting Guo
Sudeshna Das
Swati Rajwal
Selen Bozkurt
A. Sarker
MedImLM&MA
101
0
0
06 Mar 2025
Synthetic vs. Gold: The Role of LLM-Generated Labels and Data in Cyberbullying Detection
Synthetic vs. Gold: The Role of LLM-Generated Labels and Data in Cyberbullying Detection
Arefeh Kazemi
Sri Balaaji Natarajan Kalaivendan
Joachim Wagner
Hamza Qadeer
Brian Davis
155
1
0
21 Feb 2025
Diversity-oriented Data Augmentation with Large Language Models
Diversity-oriented Data Augmentation with Large Language Models
Zaitian Wang
Jinghan Zhang
Xinhao Zhang
Kunpeng Liu
Pengfei Wang
Yuanchun Zhou
121
3
0
17 Feb 2025
CorrSynth -- A Correlated Sampling Method for Diverse Dataset Generation from LLMs
CorrSynth -- A Correlated Sampling Method for Diverse Dataset Generation from LLMs
Suhas S Kowshik
Abhishek Divekar
Vijit Malik
SyDa
158
0
0
13 Nov 2024
Relation-based Counterfactual Data Augmentation and Contrastive Learning
  for Robustifying Natural Language Inference Models
Relation-based Counterfactual Data Augmentation and Contrastive Learning for Robustifying Natural Language Inference Models
H. Yang
Sseung-won Hwang
Jungmin So
70
0
0
28 Oct 2024
The Effects of Hallucinations in Synthetic Training Data for Relation
  Extraction
The Effects of Hallucinations in Synthetic Training Data for Relation Extraction
Steven Rogulsky
Nicholas Popovic
Michael Färber
HILM
52
1
0
10 Oct 2024
A Target-Aware Analysis of Data Augmentation for Hate Speech Detection
A Target-Aware Analysis of Data Augmentation for Hate Speech Detection
Camilla Casula
Sara Tonelli
54
1
0
10 Oct 2024
Generating Synthetic Datasets for Few-shot Prompt Tuning
Generating Synthetic Datasets for Few-shot Prompt Tuning
Xu Guo
Zilin Du
Boyang Li
Chunyan Miao
90
2
0
08 Oct 2024
Reducing and Exploiting Data Augmentation Noise through Meta Reweighting
  Contrastive Learning for Text Classification
Reducing and Exploiting Data Augmentation Noise through Meta Reweighting Contrastive Learning for Text Classification
Guanyi Mou
Yichuan Li
Kyumin Lee
110
3
0
26 Sep 2024
An Effective, Robust and Fairness-aware Hate Speech Detection Framework
An Effective, Robust and Fairness-aware Hate Speech Detection Framework
Guanyi Mou
Kyumin Lee
69
2
0
25 Sep 2024
An Effective Deployment of Diffusion LM for Data Augmentation in
  Low-Resource Sentiment Classification
An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification
Zhuowei Chen
Lianxi Wang
Yuben Wu
Xinfeng Liao
Yujia Tian
Junyang Zhong
DiffM
107
1
0
05 Sep 2024
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering
  LLM Weaknesses
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Yulong Chen
Yang Liu
Jianhao Yan
X. Bai
Ming Zhong
Yinghao Yang
Ziyi Yang
Chenguang Zhu
Yue Zhang
ALMELM
81
11
0
16 Aug 2024
Model Agnostic Hybrid Sharding For Heterogeneous Distributed Inference
Model Agnostic Hybrid Sharding For Heterogeneous Distributed Inference
Claudio Angione
Yue Zhao
Harry Yang
Ahmad Farhan
Fielding Johnston
James Buban
Patrick Colangelo
90
1
0
29 Jul 2024
TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts
TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts
Ruida Wang
Jipeng Zhang
Yizhen Jia
Boyao Wang
Shizhe Diao
Renjie Pi
Tong Zhang
LRM
97
23
0
03 Jul 2024
Prompting-based Synthetic Data Generation for Few-Shot Question
  Answering
Prompting-based Synthetic Data Generation for Few-Shot Question Answering
Maximilian Schmidt
Andrea Bartezzaghi
Ngoc Thang Vu
SyDa
90
6
0
15 May 2024
A Comprehensive Survey on Data Augmentation
A Comprehensive Survey on Data Augmentation
Zaitian Wang
Pengfei Wang
Kunpeng Liu
Pengyang Wang
Yanjie Fu
Chang-Tien Lu
Charu Aggarwal
Jian Pei
Yuanchun Zhou
ViT
165
27
0
15 May 2024
UniGen: Universal Domain Generalization for Sentiment Classification via
  Zero-shot Dataset Generation
UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation
Juhwan Choi
Yeonghwa Kim
Seunguk Yu
Jungmin Yun
Youngbin Kim
82
3
0
02 May 2024
LLM-Augmented Retrieval: Enhancing Retrieval Models Through Language
  Models and Doc-Level Embedding
LLM-Augmented Retrieval: Enhancing Retrieval Models Through Language Models and Doc-Level Embedding
Mingrui Wu
Sheng Cao
KELMRALM
72
3
0
08 Apr 2024
Edisum: Summarizing and Explaining Wikipedia Edits at Scale
Edisum: Summarizing and Explaining Wikipedia Edits at Scale
Marija Sakota
Isaac Johnson
Guosheng Feng
Robert West
SyDaKELM
70
2
0
04 Apr 2024
Controllable and Diverse Data Augmentation with Large Language Model for
  Low-Resource Open-Domain Dialogue Generation
Controllable and Diverse Data Augmentation with Large Language Model for Low-Resource Open-Domain Dialogue Generation
Zhenhua Liu
Tong Zhu
Jianxiang Xiang
Wenliang Chen
120
3
0
30 Mar 2024
Adverb Is the Key: Simple Text Data Augmentation with Adverb Deletion
Adverb Is the Key: Simple Text Data Augmentation with Adverb Deletion
Juhwan Choi
Youngbin Kim
74
0
0
29 Mar 2024
Enhancing Effectiveness and Robustness in a Low-Resource Regime via
  Decision-Boundary-aware Data Augmentation
Enhancing Effectiveness and Robustness in a Low-Resource Regime via Decision-Boundary-aware Data Augmentation
Kyohoon Jin
Junho Lee
Juhwan Choi
Sangmin Song
Youngbin Kim
69
0
0
22 Mar 2024
Beyond Surface Similarity: Detecting Subtle Semantic Shifts in Financial
  Narratives
Beyond Surface Similarity: Detecting Subtle Semantic Shifts in Financial Narratives
Jiaxin Liu
Yi Yang
Kar Yan Tam
AIFinAI4TS
71
6
0
21 Mar 2024
LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named
  Entity Recognition
LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition
Junjie Ye
Nuo Xu
Yikun Wang
Jie Zhou
Qi Zhang
Tao Gui
Xuanjing Huang
64
16
0
22 Feb 2024
Advancing NLP Models with Strategic Text Augmentation: A Comprehensive
  Study of Augmentation Methods and Curriculum Strategies
Advancing NLP Models with Strategic Text Augmentation: A Comprehensive Study of Augmentation Methods and Curriculum Strategies
Himmet Toprak Kesgin
M. Amasyalı
49
9
0
14 Feb 2024
Improving Black-box Robustness with In-Context Rewriting
Improving Black-box Robustness with In-Context Rewriting
Kyle O'Brien
Nathan Ng
Isha Puri
Jorge Mendez
Hamid Palangi
Yoon Kim
Marzyeh Ghassemi
Tom Hartvigsen
104
7
0
13 Feb 2024
AutoAugment Is What You Need: Enhancing Rule-based Augmentation Methods
  in Low-resource Regimes
AutoAugment Is What You Need: Enhancing Rule-based Augmentation Methods in Low-resource Regimes
Juhwan Choi
Kyohoon Jin
Junho Lee
Sangmin Song
Youngbin Kim
52
1
0
08 Feb 2024
A Survey on Data Augmentation in Large Model Era
A Survey on Data Augmentation in Large Model Era
Yue Zhou
Chenlu Guo
Xu Wang
Yi-Ju Chang
Yuan Wu
LM&MAVLM
128
27
0
27 Jan 2024
Cheap Learning: Maximising Performance of Language Models for Social
  Data Science Using Minimal Data
Cheap Learning: Maximising Performance of Language Models for Social Data Science Using Minimal Data
Leonardo Castro-Gonzalez
Yi-Ling Chung
Hannak Rose Kirk
John Francis
Angus R. Williams
Pica Johansson
Jonathan Bright
69
1
0
22 Jan 2024
InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large
  Multimodal and Language Models
InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large Multimodal and Language Models
Bingbing Wen
Zhengyuan Yang
Jianfeng Wang
Zhe Gan
Bill Howe
Lijuan Wang
MLLM
64
1
0
21 Dec 2023
Localized Symbolic Knowledge Distillation for Visual Commonsense Models
Localized Symbolic Knowledge Distillation for Visual Commonsense Models
Jinho Park
Jack Hessel
Khyathi Chandu
Paul Pu Liang
Ximing Lu
...
Youngjae Yu
Qiuyuan Huang
Jianfeng Gao
Ali Farhadi
Yejin Choi
VLM
77
13
0
08 Dec 2023
BERT Goes Off-Topic: Investigating the Domain Transfer Challenge using
  Genre Classification
BERT Goes Off-Topic: Investigating the Domain Transfer Challenge using Genre Classification
D. Roussinov
Serge Sharoff
37
2
0
27 Nov 2023
Generative AI for Hate Speech Detection: Evaluation and Findings
Generative AI for Hate Speech Detection: Evaluation and Findings
Sagi Pendzel
Tomer Wullach
Amir Adler
Einat Minkov
60
11
0
16 Nov 2023
Exploring ChatGPT's Capabilities on Vulnerability Management
Exploring ChatGPT's Capabilities on Vulnerability Management
Peiyu Liu
Junming Liu
Lirong Fu
Kangjie Lu
Yifan Xia
Xuhong Zhang
Wenzhi Chen
Haiqin Weng
Shouling Ji
Wenhai Wang
85
18
0
11 Nov 2023
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme
  Large Language Model Compression
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression
Jiduan Liu
Jiahao Liu
Qifan Wang
Jingang Wang
Xunliang Cai
Dongyan Zhao
Ran Wang
Rui Yan
61
4
0
24 Oct 2023
Text generation for dataset augmentation in security classification
  tasks
Text generation for dataset augmentation in security classification tasks
Alexander P. Welsh
Matthew Edwards
40
1
0
22 Oct 2023
PromptMix: A Class Boundary Augmentation Method for Large Language Model
  Distillation
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
Gaurav Sahu
Olga Vechtomova
Dzmitry Bahdanau
I. Laradji
VLM
109
27
0
22 Oct 2023
Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large
  Language Models by Extrapolating Errors from Small Models
Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models
Ruida Wang
Wangchunshu Zhou
Mrinmaya Sachan
95
32
0
20 Oct 2023
Does Synthetic Data Make Large Language Models More Efficient?
Does Synthetic Data Make Large Language Models More Efficient?
Sia Gholami
Marwan Omar
79
15
0
11 Oct 2023
"A Tale of Two Movements": Identifying and Comparing Perspectives in
  #BlackLivesMatter and #BlueLivesMatter Movements-related Tweets using Weakly
  Supervised Graph-based Structured Prediction
"A Tale of Two Movements": Identifying and Comparing Perspectives in #BlackLivesMatter and #BlueLivesMatter Movements-related Tweets using Weakly Supervised Graph-based Structured Prediction
Shamik Roy
Dan Goldwasser
88
4
0
11 Oct 2023
InstructProtein: Aligning Human and Protein Language via Knowledge
  Instruction
InstructProtein: Aligning Human and Protein Language via Knowledge Instruction
Zeyuan Wang
Qiang Zhang
Keyan Ding
Ming Qin
Zhuang Xiang
Xiaotong Li
Huajun Chen
104
32
0
05 Oct 2023
Can LLMs Augment Low-Resource Reading Comprehension Datasets?
  Opportunities and Challenges
Can LLMs Augment Low-Resource Reading Comprehension Datasets? Opportunities and Challenges
Vinay Samuel
Houda Aynaou
Arijit Ghosh Chowdhury
Karthik Venkat Ramanan
Aman Chadha
SyDa
105
11
0
21 Sep 2023
Distributional Data Augmentation Methods for Low Resource Language
Distributional Data Augmentation Methods for Low Resource Language
Mosleh Mahamud
Zed Lee
Isak Samsten
80
2
0
09 Sep 2023
Community-Based Hierarchical Positive-Unlabeled (PU) Model Fusion for
  Chronic Disease Prediction
Community-Based Hierarchical Positive-Unlabeled (PU) Model Fusion for Chronic Disease Prediction
Yang Wu
Xurui Li
Xuhong Zhang
Yangyang Kang
Changlong Sun
Xiaozhong Liu
58
3
0
06 Sep 2023
I-WAS: a Data Augmentation Method with GPT-2 for Simile Detection
I-WAS: a Data Augmentation Method with GPT-2 for Simile Detection
Yongzhu Chang
Rongsheng Zhang
Jiashu Pu
53
1
0
08 Aug 2023
From Fake to Hyperpartisan News Detection Using Domain Adaptation
From Fake to Hyperpartisan News Detection Using Domain Adaptation
Razvan-Alexandru Smadu
Sebastian-Vasile Echim
Dumitru-Clementin Cercel
Iuliana Marin
Florin-Catalin Pop
67
3
0
04 Aug 2023
Leveraging Few-Shot Data Augmentation and Waterfall Prompting for
  Response Generation
Leveraging Few-Shot Data Augmentation and Waterfall Prompting for Response Generation
Lea Krause
Selene Báez Santamaría
Michiel van der Meer
Urja Khurana
69
3
0
02 Aug 2023
Exploring Format Consistency for Instruction Tuning
Exploring Format Consistency for Instruction Tuning
Shi Liang
Runchu Tian
Kunlun Zhu
Yujia Qin
Huadong Wang
Xin Cong
Zhiyuan Liu
Xiaojiang Liu
Maosong Sun
ALM
70
13
0
28 Jul 2023
1234
Next