ResearchTrend.AI
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning

11 September 2023
Xiang Yue
Xingwei Qu
Ge Zhang
Yao Fu
Wenhao Huang
Huan Sun
Yu-Chuan Su
Wenhu Chen
AIMat, LRM

Papers citing "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning"

50 / 324 papers shown
Enhancing Mathematical Reasoning in LLMs by Stepwise Correction
Zhenyu Wu
Qingkai Zeng
Zizhuo Zhang
Zhaoxuan Tan
Chao Shen
Meng Jiang
KELM, LRM
149
4
0
16 Oct 2024
A Survey on Data Synthesis and Augmentation for Large Language Models
Ke Wang
Jiahui Zhu
Minjie Ren
Ziqiang Liu
Shiwei Li
...
Yiming Lei
Xiaoyu Wu
Qiqi Zhan
Qingjie Liu
Yunhong Wang
SyDa
186
21
0
16 Oct 2024
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
Guorui Zheng
Xidong Wang
Juhao Liang
Nuo Chen
Yuping Zheng
Benyou Wang
MoE
136
5
0
14 Oct 2024
SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
L. Yang
Zhaochen Yu
Tianze Zhang
Minkai Xu
Joseph E. Gonzalez
Tengjiao Wang
Shuicheng Yan
ELM, ReLM, LRM
89
0
0
11 Oct 2024
MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code
Zimu Lu
Aojun Zhou
Ke Wang
Houxing Ren
Weikang Shi
Junting Pan
Mingjie Zhan
Hongsheng Li
LRM
116
11
0
10 Oct 2024
AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories
Yifan Song
Weimin Xiong
Xiutian Zhao
Dawei Zhu
Wenhao Wu
Ke Wang
Cheng Li
Wei Peng
Sujian Li
LLMAG
66
11
0
10 Oct 2024
MoDEM: Mixture of Domain Expert Models
Toby Simonds
Kemal Kurniawan
Jey Han Lau
MoE
74
2
0
09 Oct 2024
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Xiyao Wang
Linfeng Song
Ye Tian
Dian Yu
Baolin Peng
Haitao Mi
Furong Huang
Dong Yu
LRM
134
14
0
09 Oct 2024
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server
Wenhao Wang
Xiaoyu Liang
Rui Ye
Jingyi Chai
Siheng Chen
Yanfeng Wang
SyDa
95
6
0
08 Oct 2024
Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Qingyu Yin
Xuzheng He
Luoao Deng
Chak Tou Leong
Fan Wang
Yanzhao Yan
Xiaoyu Shen
Qiang Zhang
137
5
0
07 Oct 2024
From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample Efficiency
Kaiyue Wen
Huaqing Zhang
Hongzhou Lin
Jingzhao Zhang
MoE, LRM
188
7
0
07 Oct 2024
Polymath: A Challenging Multi-modal Mathematical Reasoning Benchmark
Himanshu Gupta
Shreyas Verma
Ujjwala Anantheswaran
Kevin Scaria
Mihir Parmar
Swaroop Mishra
Chitta Baral
ReLM, LRM
76
8
0
06 Oct 2024
Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification
Zhenwen Liang
Ye Liu
Tong Niu
Xiangliang Zhang
Yingbo Zhou
Semih Yavuz
LRM
85
25
0
05 Oct 2024
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
Murong Yue
Wenlin Yao
Haitao Mi
Dian Yu
Ziyu Yao
Dong Yu
LRM
72
7
0
04 Oct 2024
Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection
Tianrun Chen
Zhentao Tan
Tao Gong
Yue Wu
Qi Chu
Bin Liu
Jieping Ye
Nenghai Yu
KELM
83
4
0
03 Oct 2024
Evaluating Robustness of Reward Models for Mathematical Reasoning
Sunghwan Kim
Dongjin Kang
Taeyoon Kwon
Hyungjoo Chae
Jungsoo Won
Dongha Lee
Jinyoung Yeo
76
7
0
02 Oct 2024
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data
Shubham Toshniwal
Wei Du
Ivan Moshkov
Branislav Kisacanin
Alexan Ayrapetyan
Igor Gitman
LRM
107
71
0
02 Oct 2024
Mixing It Up: The Cocktail Effect of Multi-Task Fine-Tuning on LLM Performance -- A Case Study in Finance
Meni Brief
Oded Ovadia
Gil Shenderovitz
Noga Ben Yoash
Rachel Lemberg
Eitam Sheetrit
85
4
0
01 Oct 2024
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Haotian Zhang
Mingfei Gao
Zhe Gan
Philipp Dufter
Nina Wenzel
...
Haoxuan You
Zirui Wang
Afshin Dehghan
Peter Grasch
Yinfei Yang
VLM, MLLM
138
41
1
30 Sep 2024
SciDFM: A Large Language Model with Mixture-of-Experts for Science
Liangtai Sun
Danyu Luo
Da Ma
Zihan Zhao
Baocai Chen
Zhennan Shen
Su Zhu
Lu Chen
Xin Chen
Kai Yu
MoE
56
2
0
27 Sep 2024
BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search
Linzhuang Sun
Hao Liang
Jingxuan Wei
Bihui Yu
Conghui He
Guosheng Dong
Wentao Zhang
79
7
0
26 Sep 2024
Enhancing elusive clues in knowledge learning by contrasting attention of language models
Jian Gao
Xiao Zhang
Ji Wu
Miao Li
109
0
0
26 Sep 2024
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely
Siyun Zhao
Yuqing Yang
Zilong Wang
Zhiyuan He
Luna Qiu
Lili Qiu
SyDa, RALM, 3DV
122
42
0
23 Sep 2024
Phantom of Latent for Large Language and Vision Models
Byung-Kwan Lee
Sangyun Chung
Chae Won Kim
Beomchan Park
Yong Man Ro
VLM, LRM
100
7
0
23 Sep 2024
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning
Daniele Rege Cambrin
Giuseppe Gallipoli
Irene Benedetto
Luca Cagliero
Paolo Garza
55
0
0
20 Sep 2024
ControlMath: Controllable Data Generation Promotes Math Generalist Models
Nuo Chen
Ning Wu
Jianhui Chang
Jia Li
100
4
0
20 Sep 2024
LogicPro: Improving Complex Logical Reasoning via Program-Guided Learning
Jin Jiang
Yuchen Yan
Yang Liu
Yonggang Jin
Shuai Peng
Hao Fei
Xunliang Cai
Yixin Cao
Liangcai Gao
Zhi Tang
LRM
145
7
0
19 Sep 2024
NVLM: Open Frontier-Class Multimodal LLMs
Wenliang Dai
Nayeon Lee
Wei Ping
Zhuoling Yang
Zihan Liu
Jon Barker
Tuomas Rintamaki
Mohammad Shoeybi
Bryan Catanzaro
Ming-Yu Liu
MLLM, VLM, LRM
127
73
0
17 Sep 2024
Self-Evolutionary Large Language Models through Uncertainty-Enhanced Preference Optimization
Jianing Wang
Yang Zhou
Xiaocheng Zhang
Mengjiao Bao
Peng Yan
73
2
0
17 Sep 2024
MathGLM-Vision: Solving Mathematical Problems with Multi-Modal Large Language Model
Zhen Yang
Jinhao Chen
Zhengxiao Du
Wenmeng Yu
Weihan Wang
Wenyi Hong
Zhihuan Jiang
Bin Xu
Yuxiao Dong
Jie Tang
VLM, LRM
87
11
0
10 Sep 2024
POINTS: Improving Your Vision-language Model with Affordable Strategies
Yuan Liu
Zhongyin Zhao
Ziyuan Zhuang
Le Tian
Xiao Zhou
Jie Zhou
VLM
99
9
0
07 Sep 2024
An overview of domain-specific foundation model: key technologies, applications and challenges
Haolong Chen
Hanzhi Chen
Zijian Zhao
Kaifeng Han
Guangxu Zhu
Yichen Zhao
Ying Du
Wei Xu
Qingjiang Shi
ALM, VLM
134
5
0
06 Sep 2024
Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling
Hritik Bansal
Arian Hosseini
Rishabh Agarwal
Vinh Q. Tran
Mehran Kazemi
SyDa, OffRL, LRM
122
49
0
29 Aug 2024
SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models
Dian Yu
Baolin Peng
Ye Tian
Linfeng Song
Haitao Mi
Dong Yu
ALM, LRM
75
3
0
28 Aug 2024
Building and better understanding vision-language models: insights and future directions
Hugo Laurençon
Andrés Marafioti
Victor Sanh
Léo Tronchon
VLM
138
78
0
22 Aug 2024
Multi-tool Integration Application for Math Reasoning Using Large Language Model
Zhihua Duan
Jialin Wang
LLMAG, LRM
104
0
0
22 Aug 2024
Towards Efficient Large Language Models for Scientific Text: A Review
H. To
Ming Liu
Guangyan Huang
69
1
0
20 Aug 2024
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications
J. Huang
Dong Li
Mengxi Xiao
Zihao Jiang
Yuzhe Yang
...
Benyou Wang
Alejandro Lopez-Lira
Qianqian Xie
Sophia Ananiadou
Junichi Tsujii
AIFin, AI4TS
81
25
0
20 Aug 2024
Threshold Filtering Packing for Supervised Fine-Tuning: Training Related Samples within Packs
Jiancheng Dong
Lei Jiang
Wei Jin
Lu Cheng
110
1
0
18 Aug 2024
Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning
Wenwen Zhuang
Xin Huang
Xiantao Zhang
Jin Zeng
LRM
126
31
0
16 Aug 2024
Leveraging Web-Crawled Data for High-Quality Fine-Tuning
Jing Zhou
Chenglin Jiang
Wei Shen
Xiao Zhou
Xiaonan He
ALM
90
4
0
15 Aug 2024
Can Large Language Models Understand Symbolic Graphics Programs?
Zeju Qiu
Weiyang Liu
Haiwen Feng
Zhen Liu
Tim Z. Xiao
Katherine M. Collins
J. Tenenbaum
Adrian Weller
Michael J. Black
Bernhard Schölkopf
127
14
0
15 Aug 2024
MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark
Minxuan Zhou
Hao Liang
Tianpeng Li
Zhiyu Wu
Mingan Lin
...
Yujing Qiao
Weipeng Chen
Bin Cui
Wentao Zhang
Guosheng Dong
129
5
0
14 Aug 2024
InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic Mathematical Reasoning
Bo-Wen Zhang
Yan Yan
Lin Li
Guang Liu
ReLM, LRM
33
6
0
09 Aug 2024
Synthesizing Text-to-SQL Data from Weak and Strong LLMs
Jiaxi Yang
Binyuan Hui
Min Yang
Jian Yang
Junyang Lin
Chang Zhou
SyDa
102
34
0
06 Aug 2024
AI-Assisted Generation of Difficult Math Questions
Vedant Shah
Dingli Yu
Kaifeng Lyu
Simon Park
Nan Rosemary Ke
...
Yoshua Bengio
Sanjeev Arora
Anirudh Goyal
123
18
0
30 Jul 2024
Towards Effective and Efficient Continual Pre-training of Large Language Models
Jie Chen
Zhipeng Chen
Jiapeng Wang
Kun Zhou
Yutao Zhu
...
Rui Yan
Zhewei Wei
Di Hu
Wenbing Huang
Ji-Rong Wen
KELM, ALM, CLL, ELM, LRM
337
6
0
26 Jul 2024
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Tianduo Wang
Shichen Li
Wei Lu
LRM, AI4CE
88
20
1
25 Jul 2024
Scaling Granite Code Models to 128K Context
Matt Stallone
Vaibhav Saxena
Leonid Karlinsky
Bridget McGinn
Tim Bula
...
Rogerio Feris
Nirmit Desai
David D. Cox
Ruchir Puri
Yikang Shen
69
4
0
18 Jul 2024
COMET: "Cone of experience" enhanced large multimodal model for mathematical problem generation
Sannyuya Liu
Jintian Feng
Zongkai Yang
Yawei Luo
Qian Wan
Xiaoxuan Shen
Jianwen Sun
90
7
0
16 Jul 2024