Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2309.05653
Cited By
v1
v2
v3 (latest)
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
11 September 2023
Xiang Yue
Xingwei Qu
Ge Zhang
Yao Fu
Wenhao Huang
Huan Sun
Yu-Chuan Su
Wenhu Chen
AIMat
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning"
50 / 324 papers shown
Title
Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality
Yuto Harada
Yusuke Yamauchi
Yusuke Oda
Yohei Oseki
Yusuke Miyao
Yu Takagi
ALM
33
0
0
17 Jun 2025
A Survey on Large Language Models for Mathematical Reasoning
Peng-Yuan Wang
Tian-Shuo Liu
Chenyang Wang
Yi-Di Wang
Shu Yan
...
Xu-Hui Liu
Xin-Wei Chen
Jia-Cheng Xu
Ziniu Li
Yang Yu
LRM
39
0
0
10 Jun 2025
Synthesis by Design: Controlled Data Generation via Structural Guidance
Lei Xu
Sirui Chen
Yuxuan Huang
Chaochao Lu
33
0
0
09 Jun 2025
SPARQ: Synthetic Problem Generation for Reasoning via Quality-Diversity Algorithms
Alex Havrilla
Edward Hughes
Mikayel Samvelyan
Jacob Abernethy
SyDa
LRM
45
0
0
06 Jun 2025
Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning
Zhiyuan Ma
Jiayu Liu
Xianzhen Luo
Zhenya Huang
Qingfu Zhu
Wanxiang Che
LLMAG
204
0
0
05 Jun 2025
ClozeMath: Improving Mathematical Reasoning in Language Models by Learning to Fill Equations
Quang Hieu Pham
T. Nguyen
Tung Pham
Anh Tuan Luu
Dat Quoc Nguyen
ReLM
LRM
143
0
0
04 Jun 2025
Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning
Muling Wu
Qi Qian
Wenhao Liu
Xiaohua Wang
Z. Huang
...
Zhibo Xu
Lina Chen
Tianlong Li
Xiaoqing Zheng
Xuanjing Huang
LRM
104
0
0
04 Jun 2025
Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem
Yubo Wang
Ping Nie
Kai Zou
Lijun Wu
Wenhu Chen
OffRL
ReLM
LRM
35
0
0
03 Jun 2025
SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought
Guanghao Li
Wenhao Jiang
Mingfeng Chen
Yan Li
Hao Yu
Shuting Dong
Tao Ren
Ming Tang
Chun Yuan
ReLM
LRM
36
0
0
30 May 2025
ASyMOB: Algebraic Symbolic Mathematical Operations Benchmark
M. Shalyt
Rotem Elimelech
I. Kaminer
37
0
0
28 May 2025
Scaling Reasoning without Attention
Xueliang Zhao
Wei Wu
Lingpeng Kong
OffRL
ReLM
LRM
VLM
81
0
0
28 May 2025
Understand, Think, and Answer: Advancing Visual Reasoning with Large Multimodal Models
Yufei Zhan
Hongyin Zhao
Yousong Zhu
Shurong Zheng
Fan Yang
Ming Tang
Jinqiao Wang
VLM
LRM
66
0
0
27 May 2025
FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities
Jin Wang
Yao Lai
Aoxue Li
Shifeng Zhang
Jiacheng Sun
Ning Kang
Chengyue Wu
Zhenguo Li
Ping Luo
76
2
0
26 May 2025
Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models
Kai Sun
Yushi Bai
Zhen-Yi Yang
Jiajie Zhang
Ji Qi
Lei Hou
Juanzi Li
VLM
19
0
0
26 May 2025
ALPS: Attention Localization and Pruning Strategy for Efficient Alignment of Large Language Models
Hao Chen
Haoze Li
Zhiqing Xiao
Lirong Gao
Qi Zhang
Xiaomeng Hu
Ningtao Wang
Xing Fu
Junbo Zhao
212
0
0
24 May 2025
DuFFin: A Dual-Level Fingerprinting Framework for LLMs IP Protection
Yuliang Yan
Haochun Tang
Shuo Yan
Enyan Dai
68
1
0
22 May 2025
Optimal Policy Minimum Bayesian Risk
Ramón Fernandez Astudillo
Md Arafat Sultan
Aashka Trivedi
Yousef El-Kurdi
Tahira Naseem
Radu Florian
Salim Roukos
OffRL
45
0
0
22 May 2025
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space
Zhen Zhang
Xuehai He
Weixiang Yan
Ao Shen
Chenyang Zhao
Shuaiqiang Wang
Yelong Shen
Xin Eric Wang
LRM
121
3
0
21 May 2025
Towards Reliable Proof Generation with LLMs: A Neuro-Symbolic Approach
Oren Sultan
Eitan Stern
Dafna Shahaf
LRM
158
0
0
20 May 2025
AutoMathKG: The automated mathematical knowledge graph based on LLM and vector database
Rong Bian
Yu Geng
Zijian Yang
Bing Cheng
131
0
0
19 May 2025
Learnware of Language Models: Specialized Small Language Models Can Do Big
Zhi-Hao Tan
Zi-Chen Zhao
Hao-Yu Shi
Xin-Yu Zhang
Peng Tan
Yang Yu
Zhi Zhou
145
0
0
19 May 2025
RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning
Qiguang Chen
Libo Qin
Jinhao Liu
Yue Liao
Jiaqi Wang
Jingxuan Zhou
Wanxiang Che
LRM
53
0
0
19 May 2025
Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought
Hanlin Zhu
Shibo Hao
Zhiting Hu
Jiantao Jiao
Stuart Russell
Yuandong Tian
OffRL
LRM
136
0
0
18 May 2025
Spectral Policy Optimization: Coloring your Incorrect Reasoning in GRPO
Peter Chen
Xiaopeng Li
Zhiyu Li
Xi Chen
Tianyi Lin
103
0
0
16 May 2025
Crosslingual Reasoning through Test-Time Scaling
Zheng-Xin Yong
Muhammad Farid Adilazuarda
Jonibek Mansurov
Ruochen Zhang
Niklas Muennighoff
Carsten Eickhoff
Genta Indra Winata
Julia Kreutzer
Stephen H. Bach
Alham Fikri Aji
LRM
ELM
465
9
0
08 May 2025
Chain-of-Thought Tokens are Computer Program Variables
Fangwei Zhu
Peiyi Wang
Zhifang Sui
LRM
148
1
0
08 May 2025
CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation
Jiahao Li
Weijian Ma
Xueyang Li
Yunzhong Lou
G. Zhou
Xiangdong Zhou
136
3
0
07 May 2025
VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model
Zuwei Long
Yunhang Shen
Chaoyou Fu
Heting Gao
Lijiang Li
...
Jinlong Peng
Haoyu Cao
Ke Li
Rongrong Ji
Xing Sun
83
2
0
06 May 2025
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Haoran Xu
Baolin Peng
Hany Awadalla
DongDong Chen
Yen-Chun Chen
...
Yelong Shen
Shuaiqiang Wang
Weijian Xu
Jianfeng Gao
Weizhu Chen
ReLM
LRM
176
5
0
30 Apr 2025
Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks
Yixin Cao
Shibo Hong
Xuzhao Li
Jiahao Ying
Yubo Ma
...
Juanzi Li
Aixin Sun
Xuanjing Huang
Tat-Seng Chua
Tianwei Zhang
ALM
ELM
261
7
0
26 Apr 2025
Pushing the boundary on Natural Language Inference
Pablo Miralles-González
Javier Huertas-Tato
Alejandro Martín
David Camacho
LRM
227
0
0
25 Apr 2025
GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning
Liangyu Xu
Yingxiu Zhao
Jiadong Wang
Yingyao Wang
Bu Pi
...
Jihao Gu
Xinfeng Li
Xiaoyong Zhu
Jun Song
Jian Xu
LRM
512
6
0
17 Apr 2025
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
Jiazhan Feng
Shijue Huang
Xingwei Qu
Ge Zhang
Yujia Qin
Baoquan Zhong
Chengquan Jiang
Jinxin Chi
Wanjun Zhong
OffRL
ReLM
SyDa
KELM
LRM
183
35
0
15 Apr 2025
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
Wei Xiong
Jiarui Yao
Yuhui Xu
Bo Pang
Lei Wang
...
Junnan Li
Nan Jiang
Tong Zhang
Caiming Xiong
Hanze Dong
OffRL
LRM
122
32
0
15 Apr 2025
FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding
Zheng Liu
Mengjie Liu
Jianfei Chen
Jingwei Xu
Tengjiao Wang
Zeang Sheng
Wentao Zhang
MLLM
164
1
0
14 Apr 2025
Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems
Zaid Khan
Elias Stengel-Eskin
Archiki Prasad
Jaemin Cho
Joey Tianyi Zhou
170
0
0
14 Apr 2025
Breaking the Data Barrier -- Building GUI Agents Through Task Generalization
Junlei Zhang
Zichen Ding
Chang Ma
Zijie Chen
Qiushi Sun
Zhenzhong Lan
Junxian He
492
3
0
14 Apr 2025
Syzygy of Thoughts: Improving LLM CoT with the Minimal Free Resolution
Chenghao Li
Chaoning Zhang
Yi Lu
Jing Zhang
Qigan Sun
X. Wang
Jiwei Wei
Guoqing Wang
Yang Yang
Jikang Cheng
LRM
149
2
0
13 Apr 2025
Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
FangZhi Xu
Hang Yan
Chang Ma
Haiteng Zhao
Qiushi Sun
Kanzhi Cheng
Junxian He
Jun Liu
Zhiyong Wu
LRM
74
5
0
11 Apr 2025
Kimi-VL Technical Report
Kimi Team
Angang Du
B. Yin
Bowei Xing
Bowen Qu
...
Z. Huang
Zhe Chen
Zijia Zhao
Ziwei Chen
Zongyu Lin
MLLM
VLM
MoE
408
32
0
10 Apr 2025
From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models
C. Xu
Ming-Yu Liu
Peng Xu
Ziwei Liu
Wei Ping
Mohammad Shoeybi
Bo Li
Bryan Catanzaro
133
4
0
08 Apr 2025
SmolVLM: Redefining small and efficient multimodal models
Andres Marafioti
Orr Zohar
Miquel Farré
Merve Noyan
Elie Bakouch
...
Hugo Larcher
Mathieu Morlon
Lewis Tunstall
Leandro von Werra
Thomas Wolf
VLM
99
16
0
07 Apr 2025
Entropy-Based Adaptive Weighting for Self-Training
Xiaoxuan Wang
Yihe Deng
Mingyu Derek Ma
Wei Wang
LRM
89
0
0
31 Mar 2025
Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection
Ryan Marinelli
Josef Pichlmeier
Tamás Bisztray
LRM
78
0
0
27 Mar 2025
Vision as LoRA
Han Wang
Yongjie Ye
Bingru Li
Yuxiang Nie
Jinghui Lu
Jingqun Tang
Yanjie Wang
Can Huang
142
2
0
26 Mar 2025
A Survey on Mathematical Reasoning and Optimization with Large Language Models
Ali Forootani
OffRL
LRM
AI4CE
127
1
0
22 Mar 2025
LEMMA: Learning from Errors for MatheMatical Advancement in LLMs
Zhuoshi Pan
Yu Li
Honglin Lin
Qizhi Pei
Zinan Tang
Wei Wu
Chenlin Ming
H. Vicky Zhao
Zeang Sheng
Lijun Wu
LRM
157
6
0
21 Mar 2025
MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems
Felix Chen
Hangjie Yuan
Yunqiu Xu
Tao Feng
Jun Cen
Pengwei Liu
Zeying Huang
Yi Yang
LRM
105
1
0
19 Mar 2025
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model
Tao Wang
Changxu Cheng
Lingfeng Wang
Senda Chen
Wuyue Zhao
VLM
100
1
0
17 Mar 2025
A Survey on Federated Fine-tuning of Large Language Models
Yebo Wu
Chunlin Tian
Jingguang Li
He Sun
Kahou Tam
Zhanting Zhou
Haicheng Liao
Zhijiang Guo
Li Li
Chengzhong Xu
FedML
161
5
0
15 Mar 2025
1
2
3
4
5
6
7
Next