Teaching Small Language Models to Reason

16 December 2022
Lucie Charlotte Magister
Jonathan Mallinson
Jakub Adamek
Eric Malmi
Aliaksei Severyn
    LRM
    AI4CE
    ReLM

Papers citing "Teaching Small Language Models to Reason"

50 / 191 papers shown
MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models
Justin Chih-Yao Chen
Swarnadeep Saha
Elias Stengel-Eskin
Mohit Bansal
LRM
LLMAG
32
15
0
02 Feb 2024
Contextualization Distillation from Large Language Model for Knowledge Graph Completion
Dawei Li
Zhen Tan
Tianlong Chen
Huan Liu
KELM
25
12
0
28 Jan 2024
TPD: Enhancing Student Language Model Reasoning via Principle Discovery and Guidance
Haorui Wang
Rongzhi Zhang
Yinghao Li
Lingkai Kong
Yuchen Zhuang
Xiusi Chen
Chao Zhang
LRM
43
5
0
24 Jan 2024
Distilling Mathematical Reasoning Capabilities into Small Language Models
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
LRM
40
9
0
22 Jan 2024
Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges
Qingyao Li
Lingyue Fu
Weiming Zhang
Xianyu Chen
Jingwei Yu
Wei Xia
Weinan Zhang
Ruiming Tang
Yong Yu
AI4Ed
ELM
43
18
0
27 Dec 2023
GeomVerse: A Systematic Evaluation of Large Models for Geometric Reasoning
Mehran Kazemi
Hamidreza Alvari
Ankit Anand
Jialin Wu
Xi Chen
Radu Soricut
LRM
ReLM
39
53
0
19 Dec 2023
A Performance Evaluation of a Quantized Large Language Model on Various Smartphones
Tolga Çöplü
Marc Loedi
Arto Bendiken
Mykhailo Makohin
Joshua J. Bouw
Stephen Cobb
MQ
21
5
0
19 Dec 2023
A Survey of Reasoning with Foundation Models
Jiankai Sun
Chuanyang Zheng
E. Xie
Zhengying Liu
Ruihang Chu
...
Xipeng Qiu
Yi-Chen Guo
Hui Xiong
Qun Liu
Zhenguo Li
ReLM
LRM
AI4CE
30
76
0
17 Dec 2023
Efficient Toxic Content Detection by Bootstrapping and Distilling Large Language Models
Jiang Zhang
Qiong Wu
Yiming Xu
Cheng Cao
Zheng Du
Konstantinos Psounis
36
15
0
13 Dec 2023
Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Models
Hongzhan Lin
Ziyang Luo
Jing Ma
Long Chen
29
9
0
09 Dec 2023
Grounding Foundation Models through Federated Transfer Learning: A General Framework
Yan Kang
Tao Fan
Hanlin Gu
Xiaojin Zhang
Lixin Fan
Qiang Yang
AI4CE
68
19
0
29 Nov 2023
LLM-Assisted Code Cleaning For Training Accurate Code Generators
Naman Jain
Tianjun Zhang
Wei-Lin Chiang
Joseph E. Gonzalez
Koushik Sen
Ion Stoica
39
27
0
25 Nov 2023
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Zhuosheng Zhang
Yao Yao
Aston Zhang
Xiangru Tang
Xinbei Ma
...
Yiming Wang
Mark B. Gerstein
Rui Wang
Gongshen Liu
Hai Zhao
LLMAG
LM&Ro
LRM
42
53
0
20 Nov 2023
Mind's Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models
Weize Liu
Guocong Li
Kai Zhang
Bang Du
Qiyuan Chen
Xuming Hu
Hongxia Xu
Jintai Chen
Jian Wu
LRM
18
6
0
15 Nov 2023
Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster
Hongxuan Zhang
Zhining Liu
Yao Zhao
Jiaqi Zheng
Chenyi Zhuang
Jinjie Gu
Guihai Chen
LRM
MLLM
25
1
0
14 Nov 2023
The ART of LLM Refinement: Ask, Refine, and Trust
Kumar Shridhar
Koustuv Sinha
Andrew Cohen
Tianlu Wang
Ping Yu
Ramakanth Pasunuru
Mrinmaya Sachan
Jason Weston
Asli Celikyilmaz
LLMAG
ReLM
LRM
35
24
0
14 Nov 2023
First-Step Advantage: Importance of Starting Right in Multi-Step Math Reasoning
Kushal Kumar Jain
Moritz Miller
Niket Tandon
Kumar Shridhar
ReLM
LRM
43
2
0
14 Nov 2023
VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit Consistency
Vernon Toh
Ratish Puduppully
Nancy F. Chen
LRM
30
5
0
13 Nov 2023
Towards the Law of Capacity Gap in Distilling Language Models
Chen Zhang
Dawei Song
Zheyu Ye
Yan Gao
ELM
38
20
0
13 Nov 2023
Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers
Weiwei Sun
Zheng Chen
Xinyu Ma
Lingyong Yan
Shuaiqiang Wang
Pengjie Ren
Zhumin Chen
Dawei Yin
Zhaochun Ren
ALM
37
19
0
02 Nov 2023
Learning From Mistakes Makes LLM Better Reasoner
Shengnan An
Zexiong Ma
Zeqi Lin
Nanning Zheng
Jian-Guang Lou
Weizhu Chen
LRM
32
75
0
31 Oct 2023
LLM4DyG: Can Large Language Models Solve Spatial-Temporal Problems on Dynamic Graphs?
Zeyang Zhang
Xin Wang
Ziwei Zhang
Haoyang Li
Yi Qin
Wenwu Zhu
40
26
0
26 Oct 2023
Improving Diversity of Demographic Representation in Large Language Models via Collective-Critiques and Self-Voting
Preethi Lahoti
Nicholas Blumm
Xiao Ma
Raghavendra Kotikalapudi
Sahitya Potluri
...
Hansa Srinivasan
Ben Packer
Ahmad Beirami
Alex Beutel
Jilin Chen
44
28
0
25 Oct 2023
DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models
Ge Zheng
Bin Yang
Jiajin Tang
Hong-Yu Zhou
Sibei Yang
LRM
MLLM
35
93
0
25 Oct 2023
MCC-KD: Multi-CoT Consistent Knowledge Distillation
Hongzhan Chen
Siyue Wu
Xiaojun Quan
Rui Wang
Ming Yan
Ji Zhang
LRM
19
17
0
23 Oct 2023
Merging Generated and Retrieved Knowledge for Open-Domain QA
Yunxiang Zhang
Muhammad Khalifa
Lajanugen Logeswaran
Moontae Lee
Honglak Lee
Lu Wang
RALM
30
37
0
22 Oct 2023
Democratizing Reasoning Ability: Tailored Learning from Large Language Model
Zhaoyang Wang
Shaohan Huang
Yuxuan Liu
Jiahai Wang
Minghui Song
...
Haizhen Huang
Furu Wei
Weiwei Deng
Feng Sun
Qi Zhang
LRM
40
11
0
20 Oct 2023
Enhancing Conversational Search: Large Language Model-Aided Informative Query Rewriting
Fanghua Ye
Meng Fang
Shenghui Li
Emine Yilmaz
KELM
54
45
0
15 Oct 2023
KwaiYiiMath: Technical Report
Jia-Yi Fu
Lei Lin
Xiaoyang Gao
Pengli Liu
Zhengzong Chen
...
Zijia Lin
Fuzheng Zhang
Zhongyuan Wang
Di Zhang
Kun Gai
LRM
ReLM
RALM
51
2
0
11 Oct 2023
Rationale-Enhanced Language Models are Better Continual Relation Learners
Weimin Xiong
Yifan Song
Peiyi Wang
Sujian Li
KELM
LRM
CLL
13
10
0
10 Oct 2023
Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
Archiki Prasad
Elias Stengel-Eskin
Mohit Bansal
ReLM
LRM
33
8
0
09 Oct 2023
DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models
Chengcheng Han
Xiaowei Du
Che Zhang
Yixin Lian
Xiang Li
Ming Gao
Baoyuan Wang
LRM
37
14
0
08 Oct 2023
Towards Better Chain-of-Thought Prompting Strategies: A Survey
Zihan Yu
Liang He
Zhen Wu
Xinyu Dai
Jiajun Chen
LRM
129
45
0
08 Oct 2023
Crystal: Introspective Reasoners Reinforced with Self-Feedback
Jiacheng Liu
Ramakanth Pasunuru
Hannaneh Hajishirzi
Yejin Choi
Asli Celikyilmaz
LRM
ReLM
31
22
0
07 Oct 2023
Amortizing intractable inference in large language models
Marvin Schmitt
Moksh Jain
Daniel Habermann
Younesse Kaddar
Ullrich Kothe
Stefan T. Radev
Nikolay Malkin
AIFin
BDL
32
47
0
06 Oct 2023
CITING: Large Language Models Create Curriculum for Instruction Tuning
Tao Feng
Zifeng Wang
Jimeng Sun
ALM
33
14
0
04 Oct 2023
Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future
Zheng Chu
Jingchang Chen
Qianglong Chen
Weijiang Yu
Tao He
Haotian Wang
Weihua Peng
Ming-Yu Liu
Bing Qin
Ting Liu
LRM
AI4CE
37
153
0
27 Sep 2023
Large Language Model Alignment: A Survey
Tianhao Shen
Renren Jin
Yufei Huang
Chuang Liu
Weilong Dong
Zishan Guo
Xinwei Wu
Yan Liu
Deyi Xiong
LM&MA
19
177
0
26 Sep 2023
Effective Distillation of Table-based Reasoning Ability from LLMs
Bohao Yang
Chen Tang
Kangning Zhao
Chenghao Xiao
Chenghua Lin
LRM
29
22
0
22 Sep 2023
ReConcile: Round-Table Conference Improves Reasoning via Consensus among Diverse LLMs
Justin Chih-Yao Chen
Swarnadeep Saha
Joey Tianyi Zhou
LLMAG
LRM
40
122
0
22 Sep 2023
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
L. Yu
Weisen Jiang
Han Shi
Jincheng Yu
Zhengying Liu
Yu Zhang
James T. Kwok
Zheng Li
Adrian Weller
Weiyang Liu
OSLM
LRM
50
337
0
21 Sep 2023
SCREWS: A Modular Framework for Reasoning with Revisions
K. Shridhar
Harsh Jhamtani
Hao Fang
Benjamin Van Durme
Jason Eisner
Patrick Xia
KELM
LRM
30
14
0
20 Sep 2023
Multimodal Multi-Hop Question Answering Through a Conversation Between Tools and Efficiently Finetuned Large Language Models
Hossein Rajabzadeh
Suyuchen Wang
Hyock Ju Kwon
Bang Liu
KELM
29
3
0
16 Sep 2023
ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer
Arkadiy Saakyan
Smaranda Muresan
26
3
0
15 Sep 2023
Auto-Regressive Next-Token Predictors are Universal Learners
Eran Malach
LRM
24
36
0
13 Sep 2023
FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning
Xinyi Wang
John Wieting
J. Clark
CLL
ALM
29
1
0
09 Sep 2023
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
54
14
0
23 Aug 2023
Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models
Mohamed S. Elaraby
Mengyin Lu
Jacob Dunn
Xueying Zhang
Yu Wang
Shizhu Liu
Pingchuan Tian
Yuping Wang
Yuxuan Wang
HILM
36
26
0
22 Aug 2023
A Survey on Model Compression for Large Language Models
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
36
193
0
15 Aug 2023
Thinking Like an Expert:Multimodal Hypergraph-of-Thought (HoT) Reasoning to boost Foundation Modals
Fanglong Yao
Changyuan Tian
Jintao Liu
Zequn Zhang
Qing Liu
Li Jin
Shuchao Li
Xiaoyu Li
Xian Sun
ReLM
LRM
25
16
0
11 Aug 2023