ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.03713
  4. Cited By
RLDBF: Enhancing LLMs Via Reinforcement Learning With DataBase FeedBack

RLDBF: Enhancing LLMs Via Reinforcement Learning With DataBase FeedBack

28 March 2025
Weichen Dai
Zijie Dai
Zhijie Huang
Yixuan Pan
Xinhe Li
Xi Li
Yi Zhou
Ji Qi
Wu Jiang
ArXivPDFHTML

Papers citing "RLDBF: Enhancing LLMs Via Reinforcement Learning With DataBase FeedBack"

5 / 5 papers shown
Title
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM
VLM
OffRL
AI4TS
LRM
316
1,589
0
22 Jan 2025
ChemLLM: A Chemical Large Language Model
ChemLLM: A Chemical Large Language Model
Di Zhang
Wei Liu
Qian Tan
Jingdan Chen
Hang Yan
...
Dongzhan Zhou
Shufei Zhang
Mao Su
Han-Sen Zhong
Yuqiang Li
AI4MH
57
42
0
10 Feb 2024
Large Language Models as Master Key: Unlocking the Secrets of Materials
  Science with GPT
Large Language Models as Master Key: Unlocking the Secrets of Materials Science with GPT
Tong Xie
Yuwei Wan
Wei-Ping Huang
Yufei Zhou
Yixuan Liu
...
Shaozhou Wang
Chunyu Kit
Clara Grazian
Weinan Zhang
Hoex
52
52
0
05 Apr 2023
Sharpness-Aware Minimization for Efficiently Improving Generalization
Sharpness-Aware Minimization for Efficiently Improving Generalization
Pierre Foret
Ariel Kleiner
H. Mobahi
Behnam Neyshabur
AAML
184
1,342
0
03 Oct 2020
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Nils Reimers
Iryna Gurevych
953
12,129
0
27 Aug 2019
1