Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.23157
Cited By
v1
v2 (latest)
Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL
29 March 2025
Mohammadreza Pourreza
Shayan Talaei
Ruoxi Sun
Xingchen Wan
Hailong Li
Azalia Mirhoseini
Amin Saberi
Sercan O. Arik
ReLM
AI4TS
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL"
32 / 32 papers shown
Title
CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement Learning
Lei Sheng
Shuai-Shuai Xu
LRM
57
0
0
19 May 2025
Graph-Reward-SQL: Execution-Free Reinforcement Learning for Text-to-SQL via Graph Matching and Stepwise Reward
Han Weng
Boyi Liu
Yuanfeng Song
Dun Zeng
Yingxiang Yang
Yi Zhan
Longjie Cui
Xiaoming Yin
Yang Sun
45
0
0
18 May 2025
Sparks of Tabular Reasoning via Text2SQL Reinforcement Learning
Josefa Lia Stoisser
Marc Boubnovski Martell
Julien Fauqueur
LMTD
ReLM
AI4TS
LRM
154
0
0
23 Apr 2025
DB-Explore: Automated Database Exploration and Instruction Synthesis for Text-to-SQL
Haoyuan Ma
Yongliang Shen
Hengwei Liu
Wenqi Zhang
Haolei Xu
Qiuying Peng
Jun Wang
Weiming Lu
104
2
0
06 Mar 2025
OpenSearch-SQL: Enhancing Text-to-SQL with Dynamic Few-shot and Consistency Alignment
Xiangjin Xie
Guangwei Xu
Lingyan Zhao
Ruijie Guo
AI4TS
68
9
0
19 Feb 2025
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Tianzhe Chu
Yuexiang Zhai
Jihan Yang
Shengbang Tong
Saining Xie
Dale Schuurmans
Quoc V. Le
Sergey Levine
Yi-An Ma
OffRL
228
123
0
28 Jan 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM
VLM
OffRL
AI4TS
LRM
380
1,970
0
22 Jan 2025
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Kimi Team
Angang Du
Bofei Gao
Bowei Xing
Changjiu Jiang
...
Zihao Huang
Ziyao Xu
Zhiyong Yang
Zonghan Yang
Zongyu Lin
OffRL
ALM
AI4TS
VLM
LRM
271
330
0
22 Jan 2025
O1 Replication Journey: A Strategic Progress Report -- Part 1
Yiwei Qin
Xuefeng Li
Haoyang Zou
Yixiu Liu
Shijie Xia
...
Yixin Ye
Weizhe Yuan
Hector Liu
Yuezun Li
Pengfei Liu
VLM
89
91
0
08 Oct 2024
E-SQL: Direct Schema Linking via Question Enrichment in Text-to-SQL
Hasan Alp Caferoğlu
Özgür Ulusoy
91
22
0
25 Sep 2024
Qwen2.5-Coder Technical Report
Binyuan Hui
Jian Yang
Zeyu Cui
Jiaxi Yang
Dayiheng Liu
...
Fei Huang
Xingzhang Ren
Xuancheng Ren
Jingren Zhou
Junyang Lin
OSLM
113
331
0
18 Sep 2024
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Charlie Snell
Jaehoon Lee
Kelvin Xu
Aviral Kumar
LRM
192
692
0
06 Aug 2024
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Zhihong Shao
Peiyi Wang
Qihao Zhu
Runxin Xu
Jun-Mei Song
...
Haowei Zhang
Mingchuan Zhang
Yiming Li
Yu-Huan Wu
Daya Guo
ReLM
LRM
146
1,274
0
05 Feb 2024
DTS-SQL: Decomposed Text-to-SQL with Small Large Language Models
Mohammadreza Pourreza
Davood Rafiei
48
28
0
02 Feb 2024
C3: Zero-shot Text-to-SQL with ChatGPT
Xuemei Dong
Chuxu Zhang
Yuhang Ge
Yuren Mao
Yunjun Gao
Lu Chen
Jinshu Lin
Dongfang Lou
64
147
0
14 Jul 2023
Let's Verify Step by Step
Hunter Lightman
V. Kosaraju
Yura Burda
Harrison Edwards
Bowen Baker
Teddy Lee
Jan Leike
John Schulman
Ilya Sutskever
K. Cobbe
ALM
OffRL
LRM
193
1,233
0
31 May 2023
SQL-PaLM: Improved Large Language Model Adaptation for Text-to-SQL (extended)
Ruoxi Sun
Sercan O. Arik
Alex Muzio
Lesly Miculicich
S. Gundabathula
...
Hanjun Dai
Hootan Nakhost
Rajarishi Sinha
Zifeng Wang
Tomas Pfister
79
33
0
26 May 2023
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
Jinyang Li
Binyuan Hui
Ge Qu
Jiaxi Yang
Binhua Li
...
Guoliang Li
Kevin C. C. Chang
Fei Huang
Reynold Cheng
Yongbin Li
LMTD
112
419
0
04 May 2023
DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction
Mohammadreza Pourreza
Davood Rafiei
ReLM
LRM
68
352
0
21 Apr 2023
Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care
Anna L. Trella
Kelly W. Zhang
Inbal Nahum-Shani
Vivek Shetty
Finale Doshi-Velez
Susan Murphy
OnRL
65
21
0
15 Aug 2022
STaR: Bootstrapping Reasoning With Reasoning
E. Zelikman
Yuhuai Wu
Jesse Mu
Noah D. Goodman
ReLM
LRM
144
508
0
28 Mar 2022
Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration
Desik Rengarajan
G. Vaidya
Akshay Sarvesh
D. Kalathil
S. Shakkottai
OffRL
60
58
0
09 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
836
9,644
0
28 Jan 2022
SADGA: Structure-Aware Dual Graph Aggregation Network for Text-to-SQL
Ruichu Cai
Jinjie Yuan
Boyan Xu
Zhifeng Hao
65
64
0
01 Nov 2021
PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models
Torsten Scholak
Nathan Schucher
Dzmitry Bahdanau
209
391
0
10 Sep 2021
Towards Robustness of Text-to-SQL Models against Synonym Substitution
Yujian Gan
Xinyun Chen
Qiuping Huang
Matthew Purver
J. Woodward
Jinxia Xie
Pengsheng Huang
AAML
66
112
0
02 Jun 2021
Reinforcement Learning Approaches in Social Robotics
Neziha Akalin
Amy Loutfi
OffRL
61
103
0
21 Sep 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
865
42,379
0
28 May 2020
RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers
Bailin Wang
Richard Shin
Xiaodong Liu
Oleksandr Polozov
Matthew Richardson
92
592
0
10 Nov 2019
Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation
Jiaqi Guo
Zecheng Zhan
Yan Gao
Yan Xiao
Jian-Guang Lou
Ting Liu
Dongmei Zhang
58
384
0
20 May 2019
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
532
19,265
0
20 Jul 2017
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
109
1,229
0
16 Nov 2016
1