arXiv: 2310.08559
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement

12 October 2023
Linlu Qiu
Liwei Jiang
Ximing Lu
Melanie Sclar
Valentina Pyatkin
Chandra Bhagavatula
Bailin Wang
Yoon Kim
Yejin Choi
Nouha Dziri
Xiang Ren
    LRM
    ReLM

Papers citing "Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement"

50 / 58 papers shown
Improve Rule Retrieval and Reasoning with Self-Induction and Relevance ReEstimate
Ziyang Huang
Wangtao Sun
Jun Zhao
Kang-Jun Liu
LRM
12
0
0
16 May 2025
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models
Xiaobao Wu
LRM
72
1
0
05 May 2025
HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation
Haokun Liu
Sicong Huang
Jingyu Hu
Yangqiaoyu Zhou
Chenhao Tan
30
0
0
15 Apr 2025
Improving In-Context Learning with Reasoning Distillation
Nafis Sadeq
Xin Xu
Zhouhang Xie
Julian McAuley
Byungkyu Kang
Prarit Lamba
Xiang Gao
RALM
ReLM
LRM
38
0
0
14 Apr 2025
On Language Models' Sensitivity to Suspicious Coincidences
Sriram Padmanabhan
Kanishka Misra
Kyle Mahowald
Eunsol Choi
ReLM
LRM
37
0
0
13 Apr 2025
The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning
T. Zheng
Yixiang Chen
Chengxi Li
Chunyang Li
Qing Zong
Haochen Shi
Baixuan Xu
Yangqiu Song
Ginny Wong
Simon See
LRM
39
0
0
07 Apr 2025
Instruct-of-Reflection: Enhancing Large Language Models Iterative Reflection Capabilities via Dynamic-Meta Instruction
Liping Liu
Chunhong Zhang
Likang Wu
Chuang Zhao
Zheng Hu
Ming He
Jianping Fan
LLMAG
LRM
38
0
0
02 Mar 2025
Dataset Featurization: Uncovering Natural Language Features through Unsupervised Data Reconstruction
Michal Bravansky
Vaclav Kubon
Suhas Hariharan
Robert Kirk
69
0
0
24 Feb 2025
Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations
Chunyang Li
Weiqi Wang
Tianshi Zheng
Yangqiu Song
LRM
49
2
0
22 Feb 2025
InductionBench: LLMs Fail in the Simplest Complexity Class
Wenyue Hua
Tyler Wong
Sun Fei
Liangming Pan
Adam Jardine
William Yang Wang
LRM
73
2
0
20 Feb 2025
LogiDynamics: Unraveling the Dynamics of Logical Inference in Large Language Model Reasoning
Tianshi Zheng
Jiayang Cheng
Chunyang Li
Haochen Shi
Zhilin Wang
Jiaxin Bai
Yangqiu Song
Ginny Wong
Simon See
LRM
46
2
0
16 Feb 2025
On the Role of Model Prior in Real-World Inductive Reasoning
Zhuo Liu
Ding Yu
Hangfeng He
LRM
82
0
0
18 Dec 2024
Self-Healing Machine Learning: A Framework for Autonomous Adaptation in Real-World Environments
Paulius Rauba
Nabeel Seedat
Krzysztof Kacprzyk
M. Schaar
AI4CE
54
1
0
31 Oct 2024
IdeaBench: Benchmarking Large Language Models for Research Idea Generation
Sikun Guo
Amir Hassan Shariatmadari
Guangzhi Xiong
Albert Huang
Eric Xie
Stefan Bekiranov
Aidong Zhang
LM&MA
38
6
0
31 Oct 2024
Matchmaker: Self-Improving Large Language Model Programs for Schema Matching
Nabeel Seedat
M. Schaar
39
2
0
31 Oct 2024
MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models
Jiachun Li
Pengfei Cao
Zhuoran Jin
Yubo Chen
Kang-Jun Liu
Jun Zhao
LRM
ELM
37
3
0
12 Oct 2024
Mars: Situated Inductive Reasoning in an Open-World Environment
Xiaojuan Tang
Jiaqi Li
Yitao Liang
Song-chun Zhu
Muhan Zhang
Zilong Zheng
LM&Ro
LRM
LLMAG
29
1
0
10 Oct 2024
System 2 Reasoning via Generality and Adaptation
Sejin Kim
Sundong Kim
LRM
AI4CE
73
0
0
10 Oct 2024
Counterfactual Causal Inference in Natural Language with Large Language Models
Gael Gendron
Jože M. Rožanec
Michael Witbrock
Gillian Dobbie
CML
34
0
0
08 Oct 2024
Explaining Datasets in Words: Statistical Models with Natural Language Parameters
Ruiqi Zhong
Heng Wang
Dan Klein
Jacob Steinhardt
37
6
0
13 Sep 2024
Hypothesizing Missing Causal Variables with LLMs
Ivaxi Sheth
Sahar Abdelnabi
Mario Fritz
CML
LRM
46
4
0
04 Sep 2024
Symbolic Working Memory Enhances Language Models for Complex Rule Application
Siyuan Wang
Zhongyu Wei
Yejin Choi
Xiang Ren
LRM
LLMAG
33
11
0
24 Aug 2024
The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability
Aaron Mueller
Jannik Brinkmann
Millicent Li
Samuel Marks
Koyena Pal
...
Arnab Sen Sharma
Jiuding Sun
Eric Todd
David Bau
Yonatan Belinkov
CML
52
18
0
02 Aug 2024
Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models
Logan Cross
Violet Xiang
Agam Bhatia
Daniel L. K. Yamins
Nick Haber
LM&Ro
LRM
LLMAG
48
4
0
09 Jul 2024
Knowledge-based Consistency Testing of Large Language Models
Sai Sathiesh Rajan
E. Soremekun
Sudipta Chattopadhyay
27
2
0
03 Jul 2024
Large Language Models Assume People are More Rational than We Really are
Ryan Liu
Jiayi Geng
Joshua C. Peterson
Ilia Sucholutsky
Thomas L. Griffiths
76
16
0
24 Jun 2024
Is Programming by Example solved by LLMs?
Wen-Ding Li
Kevin Ellis
37
10
0
12 Jun 2024
Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning
Xinlu Zhang
Zhi Chen
Xi Ye
Xianjun Yang
Lichang Chen
William Yang Wang
Linda R. Petzold
LRM
61
10
0
30 May 2024
Code Repair with LLMs gives an Exploration-Exploitation Tradeoff
Hao Tang
Keya Hu
Jin Peng Zhou
Sicheng Zhong
Wei-Long Zheng
Xujie Si
Kevin Ellis
39
13
0
26 May 2024
Hypothesis Generation with Large Language Models
Yangqiaoyu Zhou
Haokun Liu
Tejes Srivastava
Hongyuan Mei
Chenhao Tan
LRM
36
26
0
05 Apr 2024
An Incomplete Loop: Deductive, Inductive, and Abductive Learning in Large Language Models
Emmy Liu
Graham Neubig
Jacob Andreas
ReLM
LRM
35
6
0
03 Apr 2024
Beyond Accuracy: Evaluating the Reasoning Behavior of Large Language Models -- A Survey
Philipp Mondorf
Barbara Plank
ELM
LRM
LM&MA
33
35
0
02 Apr 2024
Few-shot Dialogue Strategy Learning for Motivational Interviewing via Inductive Reasoning
Zhouhang Xie
Bodhisattwa Prasad Majumder
Mengjie Zhao
Yoshinori Maeda
Keiichi Yamada
Hiromi Wakaki
Julian McAuley
40
3
0
23 Mar 2024
PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual Patterns
Yew Ken Chia
Vernon Toh Yan Han
Deepanway Ghosal
Lidong Bing
Soujanya Poria
LRM
ReLM
41
13
0
20 Mar 2024
Reasoning Abilities of Large Language Models: In-Depth Analysis on the Abstraction and Reasoning Corpus
Seungpil Lee
Woochang Sim
Donghyeon Shin
Sanha Hwang
Wongyu Seo
Jiwon Park
Seokki Lee
Sejin Kim
Sundong Kim
LRM
42
19
0
18 Mar 2024
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
E. Zelikman
Georges Harik
Yijia Shao
Varuna Jayasiri
Nick Haber
Noah D. Goodman
LLMAG
ReLM
LRM
52
113
0
14 Mar 2024
Meaningful Learning: Advancing Abstract Reasoning in Large Language Models via Generic Fact Guidance
Kai Xiong
Xiao Ding
Ting Liu
Bing Qin
Dongliang Xu
Qing Yang
Hongtao Liu
Yixin Cao
LRM
36
3
0
14 Mar 2024
ItD: Large Language Models Can Teach Themselves Induction through Deduction
Wangtao Sun
Haotian Xu
Xuanqing Yu
Pei Chen
Shizhu He
Jun Zhao
Kang Liu
LRM
35
10
0
09 Mar 2024
Automated Statistical Model Discovery with Language Models
Michael Y. Li
Emily B. Fox
Noah D. Goodman
42
14
0
27 Feb 2024
Data-driven Discovery with Large Generative Models
Bodhisattwa Prasad Majumder
Harshit Surana
Dhruv Agarwal
Sanchaita Hazra
Ashish Sabharwal
Peter Clark
43
9
0
21 Feb 2024
WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment
Hao Tang
Darren Key
Kevin Ellis
LLMAG
20
27
0
19 Feb 2024
Doing Experiments and Revising Rules with Natural Language and Probabilistic Reasoning
Wasu Top Piriyakulkij
Cassidy Langenfeld
Tuan Anh Le
Kevin Ellis
LRM
21
0
0
08 Feb 2024
Limits of Transformer Language Models on Learning to Compose Algorithms
Jonathan Thomm
Aleksandar Terzić
Giacomo Camposampiero
Michael Hersche
Bernhard Schölkopf
Abbas Rahimi
39
3
0
08 Feb 2024
Speak It Out: Solving Symbol-Related Problems with Symbol-to-Language Conversion for Language Models
Yile Wang
Sijie Cheng
Zixin Sun
Peng Li
Yang Liu
ReLM
LRM
32
4
0
22 Jan 2024
CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs' Mathematical Reasoning Capabilities
Yujun Mao
Yoon Kim
Yilun Zhou
LRM
ReLM
26
17
0
13 Jan 2024
Evaluating Large Language Models on the GMAT: Implications for the Future of Business Education
Vahid Ashrafimoghari
Necdet Gurkan
Jordan W. Suchow
ELM
27
6
0
02 Jan 2024
DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines
Arnav Singhvi
Manish Shetty
Shangyin Tan
Christopher Potts
Koushik Sen
Matei A. Zaharia
Omar Khattab
17
16
0
20 Dec 2023
Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
Yifan Hou
Jiaoda Li
Yu Fei
Alessandro Stolfo
Wangchunshu Zhou
Guangtao Zeng
Antoine Bosselut
Mrinmaya Sachan
LRM
30
40
0
23 Oct 2023
Large Language Models can Learn Rules
Zhaocheng Zhu
Yuan Xue
Xinyun Chen
Denny Zhou
Jian Tang
Dale Schuurmans
Hanjun Dai
LRM
ReLM
32
63
0
10 Oct 2023
Large Language Models Are Not Strong Abstract Reasoners
Gael Gendron
Qiming Bao
Michael Witbrock
Gillian Dobbie
ELM
LRM
29
30
0
31 May 2023