RAISE: Reinforced Adaptive Instruction Selection For Large Language Models


9 April 2025
Lv Qingsong
Yangning Li
Zihua Lan
Zishan Xu
Jiwei Tang
Hai-Tao Zheng
Wenhao Jiang
Wanshi Xu
Philip S. Yu

Papers citing "RAISE: Reinforced Adaptive Instruction Selection For Large Language Models"

44 / 44 papers shown
Semi-supervised Node Importance Estimation with Informative Distribution Modeling for Uncertainty Regularization
Yankai Chen
Taotao Wang
Yixiang Fang
Yunyu Xiao
BDL
170
1
0
26 Mar 2025
ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning
Bo Liu
Yunxiang Li
Yangqiu Song
Hanjing Wang
Linyi Yang
...
Jun Wang
Weinan Zhang
Shuyue Hu
Ying Wen
LLMAG
KELM
LRM
AI4CE
129
10
0
12 Mar 2025
One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs
Hai-Tao Zheng
Jiayi Kuang
Haojing Huang
Zhikun Xu
Xinnian Liang
...
Jue Chen
Chao Qu
Ying Shen
Hai-Tao Zheng
Philip S. Yu
LRM
101
2
0
12 Feb 2025
Refine Knowledge of Large Language Models via Adaptive Contrastive Learning
Hai-Tao Zheng
Haojing Huang
Jiayi Kuang
Yangning Li
Shu Guo
Chao Qu
Jue Chen
Hai-Tao Zheng
Ying Shen
Philip S. Yu
CLL
90
5
0
11 Feb 2025
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
Yangning Li
Hai-Tao Zheng
Xinyu Wang
Yong Jiang
Zhen Zhang
...
Hui Wang
Hai-Tao Zheng
Pengjun Xie
Philip S. Yu
Fei Huang
94
22
0
05 Nov 2024
Recent Advances of Multimodal Continual Learning: A Comprehensive Survey
Dianzhi Yu
Xinni Zhang
Yankai Chen
Aiwei Liu
Yifei Zhang
Philip S. Yu
Irwin King
VLM
CLL
79
12
0
07 Oct 2024
G-DIG: Towards Gradient-based Diverse and High-quality Instruction Data Selection for Machine Translation
Xingyuan Pan
Luyang Huang
Liyan Kang
Zhicheng Liu
Yu Lu
Shanbo Cheng
ALM
92
14
0
21 May 2024
Large Language Models and Causal Inference in Collaboration: A Survey
Xiaoyu Liu
Paiheng Xu
Junda Wu
Jiaxin Yuan
Yifan Yang
...
Haoliang Wang
Tong Yu
Julian McAuley
Wei Ai
Furong Huang
ELM
LRM
97
5
0
14 Mar 2024
ItD: Large Language Models Can Teach Themselves Induction through Deduction
Wangtao Sun
Haotian Xu
Xuanqing Yu
Pei Chen
Shizhu He
Jun Zhao
Kang Liu
LRM
57
10
0
09 Mar 2024
Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark
Zhikun Xu
Hai-Tao Zheng
Ruixue Ding
Xinyu Wang
Boli Chen
Yong Jiang
Hai-Tao Zheng
Wenlian Lu
Pengjun Xie
Fei Huang
82
11
0
29 Feb 2024
Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions
Xuming Hu
Xiaochuan Li
Junzhe Chen
Hai-Tao Zheng
Yangning Li
...
Yasheng Wang
Qun Liu
Lijie Wen
Philip S. Yu
Zhijiang Guo
AAML
ELM
59
4
0
25 Feb 2024
LESS: Selecting Influential Data for Targeted Instruction Tuning
Mengzhou Xia
Sadhika Malladi
Suchin Gururangan
Sanjeev Arora
Danqi Chen
127
231
0
06 Feb 2024
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
Wei Liu
Weihao Zeng
Keqing He
Yong Jiang
Junxian He
ALM
94
231
0
25 Dec 2023
MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning
Fuxiao Liu
Xiaoyang Wang
Wenlin Yao
Jianshu Chen
Kaiqiang Song
Sangwoo Cho
Yaser Yacoob
Dong Yu
43
106
0
15 Nov 2023
A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check
Haojing Huang
Jingheng Ye
Qingyu Zhou
Hai-Tao Zheng
Yangning Li
Feng Zhou
Hai-Tao Zheng
LRM
70
14
0
13 Oct 2023
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning
Ming Li
Yong Zhang
Zhitao Li
Jiuhai Chen
Lichang Chen
Ning Cheng
Jianzong Wang
Dinesh Manocha
Jing Xiao
102
203
0
23 Aug 2023
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding
Tianyu Yu
Chengyue Jiang
Chao Lou
Shen Huang
Xiaobin Wang
...
Haitao Zheng
Ningyu Zhang
Pengjun Xie
Fei Huang
Yong Jiang
LRM
98
16
0
21 Aug 2023
MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities
Yongqian Li
Tingwei Lu
Hai-Tao Zheng
Tianyu Yu
Shulin Huang
Haitao Zheng
Rui Zhang
Jun Yuan
74
11
0
27 Jul 2023
On the (In)Effectiveness of Large Language Models for Chinese Text Correction
Hai-Tao Zheng
Haojing Huang
Shirong Ma
Yong Jiang
Yongqian Li
F. Zhou
Haitao Zheng
Qingyu Zhou
57
46
0
18 Jul 2023
HYTREL: Hypergraph-enhanced Tabular Data Representation Learning
Pei Chen
Soumajyoti Sarkar
Leonard Lausen
Balasubramaniam Srinivasan
Sheng Zha
Ruihong Huang
George Karypis
LMTD
67
37
0
14 Jul 2023
Correct Like Humans: Progressive Learning Framework for Chinese Text Error Correction
Hai-Tao Zheng
Shirong Ma
Shaoshen Chen
Haojing Huang
Shulin Huang
Yongqian Li
Haitao Zheng
Ying Shen
50
10
0
30 Jun 2023
LIMA: Less Is More for Alignment
Chunting Zhou
Pengfei Liu
Puxin Xu
Srini Iyer
Jiao Sun
...
Susan Zhang
Gargi Ghosh
M. Lewis
Luke Zettlemoyer
Omer Levy
ALM
92
838
0
18 May 2023
GPT-4 Technical Report
OpenAI
Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
1.3K
14,313
0
15 Mar 2023
Vision, Deduction and Alignment: An Empirical Study on Multi-modal Knowledge Graph Alignment
Yongqian Li
Jiaoyan Chen
Hai-Tao Zheng
Yuejia Xiang
Xi Chen
Haitao Zheng
70
25
0
17 Feb 2023
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
Shayne Longpre
Le Hou
Tu Vu
Albert Webson
Hyung Won Chung
...
Denny Zhou
Quoc V. Le
Barret Zoph
Jason W. Wei
Adam Roberts
ALM
93
669
0
31 Jan 2023
A Curriculum Learning Approach for Multi-domain Text Classification Using Keyword weight Ranking
Zilin Yuan
Hai-Tao Zheng
Yongqian Li
Rui Xie
Wei Wu
Haitao Zheng
66
1
0
27 Oct 2022
Linguistic Rules-Based Corpus Generation for Native Chinese Grammatical Error Correction
Shirong Ma
Hai-Tao Zheng
Rongyi Sun
Qingyu Zhou
Shulin Huang
...
Ruiyang Liu
Zhongli Li
Yunbo Cao
Haitao Zheng
Ying Shen
58
27
0
19 Oct 2022
Learning from the Dictionary: Heterogeneous Knowledge Guided Fine-tuning for Chinese Spell Checking
Hai-Tao Zheng
Shirong Ma
Qingyu Zhou
Zhongli Li
Li Yangning
Shulin Huang
R. Liu
Chao Li
Yunbo Cao
Haitao Zheng
36
36
0
19 Oct 2022
Automatic Context Pattern Generation for Entity Set Expansion
Hai-Tao Zheng
Shulin Huang
Xinwei Zhang
Qingyu Zhou
Yongqian Li
Ruiyang Liu
Yunbo Cao
Haitao Zheng
Ying Shen
68
23
0
17 Jul 2022
Contextual Similarity is More Valuable than Character Similarity: An Empirical Study for Chinese Spell Checking
Dingchao Zhang
Hai-Tao Zheng
Qingyu Zhou
Shirong Ma
Yongqian Li
Yunbo Cao
Haitao Zheng
48
16
0
17 Jul 2022
Contrastive Learning with Hard Negative Entities for Entity Set Expansion
Hai-Tao Zheng
Yongqian Li
Yuxin He
Tianyu Yu
Ying Shen
Haitao Zheng
44
32
0
16 Apr 2022
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Yizhong Wang
Swaroop Mishra
Pegah Alipoormolabashi
Yeganeh Kordi
Amirreza Mirzaei
...
Chitta Baral
Yejin Choi
Noah A. Smith
Hannaneh Hajishirzi
Daniel Khashabi
ELM
111
840
0
16 Apr 2022
The Past Mistake is the Future Wisdom: Error-driven Contrastive Probability Optimization for Chinese Spell Checking
Hai-Tao Zheng
Qingyu Zhou
Yongqian Li
Zhongli Li
Ruiyang Liu
Rongyi Sun
Zizhen Wang
Chao Li
Yunbo Cao
Haitao Zheng
109
70
0
02 Mar 2022
Are we ready for a new paradigm shift? A Survey on Visual Deep MLP
Ruiyang Liu
Hai-Tao Zheng
Li Tao
Dun Liang
Haitao Zheng
145
99
0
07 Nov 2021
Training Verifiers to Solve Math Word Problems
K. Cobbe
V. Kosaraju
Mohammad Bavarian
Mark Chen
Heewoo Jun
...
Jerry Tworek
Jacob Hilton
Reiichiro Nakano
Christopher Hesse
John Schulman
ReLM
OffRL
LRM
241
4,392
0
27 Oct 2021
Finetuned Language Models Are Zero-Shot Learners
Jason W. Wei
Maarten Bosma
Vincent Zhao
Kelvin Guu
Adams Wei Yu
Brian Lester
Nan Du
Andrew M. Dai
Quoc V. Le
ALM
UQCV
150
3,742
0
03 Sep 2021
CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLP
Qinyuan Ye
Bill Yuchen Lin
Xiang Ren
280
184
0
18 Apr 2021
Measuring Massive Multitask Language Understanding
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
D. Song
Jacob Steinhardt
ELM
RALM
166
4,413
0
07 Sep 2020
Aligning AI With Shared Human Values
Dan Hendrycks
Collin Burns
Steven Basart
Andrew Critch
Jingkai Li
D. Song
Jacob Steinhardt
142
553
0
05 Aug 2020
UnifiedQA: Crossing Format Boundaries With a Single QA System
Daniel Khashabi
Sewon Min
Tushar Khot
Ashish Sabharwal
Oyvind Tafjord
Peter Clark
Hannaneh Hajishirzi
122
738
0
02 May 2020
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge
Alon Talmor
Jonathan Herzig
Nicholas Lourie
Jonathan Berant
RALM
140
1,725
0
02 Nov 2018
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge
Peter Clark
Isaac Cowhey
Oren Etzioni
Tushar Khot
Ashish Sabharwal
Carissa Schoenick
Oyvind Tafjord
ELM
RALM
LRM
153
2,583
0
14 Mar 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
468
19,006
0
20 Jul 2017
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
84
3,406
0
08 Jun 2015