2007.02924
INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving
6 July 2020
Yuhuai Wu
Albert Qiaochu Jiang
Jimmy Ba
Roger C. Grosse
AIMat
Papers citing "INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving"
16 / 16 papers shown
Mathematical Language Models: A Survey
Wen Liu
Hanglei Hu
Jie Zhou
Yuyang Ding
Junsong Li
...
Mengliang He
Qin Chen
Bo Jiang
Aimin Zhou
Liang He
LRM
103
13
0
03 Jan 2025
Alchemy: Amplifying Theorem-Proving Capability through Symbolic Mutation
Shaonan Wu
Shuai Lu
Yeyun Gong
Nan Duan
Ping Wei
AIMat
51
0
0
21 Oct 2024
What Matters in Hierarchical Search for Combinatorial Reasoning Problems?
Michał Zawalski
Gracjan Góral
Michał Tyrolski
Emilia Wisnios
Franciszek Budrowski
Marek Cygan
Łukasz Kuciński
Piotr Miłoś
52
0
0
05 Jun 2024
Specification Overfitting in Artificial Intelligence
Benjamin Roth
Pedro Henrique Luz de Araujo
Yuxi Xia
Saskia Kaltenbrunner
Christoph Korab
58
1
0
13 Mar 2024
TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models
Jing Xiong
Jianhao Shen
Ye Yuan
Haiming Wang
Yichun Yin
...
Yinya Huang
Chuanyang Zheng
Xiaodan Liang
Ming Zhang
Qun Liu
AIMat
LRM
35
15
0
16 Oct 2023
Abstract Visual Reasoning: An Algebraic Approach for Solving Raven's Progressive Matrices
Jingyi Xu
Tushar Vaidya
Y. Blankenship
Saket Chandra
Zhangsheng Lai
Kai Fong Ernest Chong
50
8
0
21 Mar 2023
Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs
Albert Q. Jiang
Sean Welleck
Jin Peng Zhou
Wenda Li
Jiacheng Liu
M. Jamnik
Timothée Lacroix
Yuhuai Wu
Guillaume Lample
AIMat
77
160
0
21 Oct 2022
Exploring Length Generalization in Large Language Models
Cem Anil
Yuhuai Wu
Anders Andreassen
Aitor Lewkowycz
Vedant Misra
V. Ramasesh
Ambrose Slone
Guy Gur-Ari
Ethan Dyer
Behnam Neyshabur
ReLM
LRM
38
162
0
11 Jul 2022
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search
Michał Zawalski
Michał Tyrolski
K. Czechowski
Tomasz Odrzygóźdź
Damian Stachura
Piotr Piękos
Yuhuai Wu
Łukasz Kuciński
Piotr Miłoś
LRM
34
9
0
01 Jun 2022
Learning to Find Proofs and Theorems by Learning to Refine Search Strategies: The Case of Loop Invariant Synthesis
Jonathan Laurent
André Platzer
38
9
0
27 May 2022
HyperTree Proof Search for Neural Theorem Proving
Guillaume Lample
Marie-Anne Lachaux
Thibaut Lavril
Xavier Martinet
Amaury Hayat
Gabriel Ebner
Aurelien Rodriguez
Timothée Lacroix
AIMat
49
139
0
23 May 2022
Formal Mathematics Statement Curriculum Learning
Stanislas Polu
Jesse Michael Han
Kunhao Zheng
Mantas Baksys
Igor Babuschkin
Ilya Sutskever
AIMat
93
118
0
03 Feb 2022
Generating Symbolic Reasoning Problems with Transformer GANs
Jens U. Kreber
Christopher Hahn
AI4CE
44
6
0
19 Oct 2021
MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics
Kunhao Zheng
Jesse Michael Han
Stanislas Polu
AIMat
46
154
0
31 Aug 2021
LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning
Yuhuai Wu
M. Rabe
Wenda Li
Jimmy Ba
Roger C. Grosse
Christian Szegedy
AIMat
LRM
82
55
0
15 Jan 2021
Deep Graph Library: A Graph-Centric, Highly-Performant Package for Graph Neural Networks
Minjie Wang
Da Zheng
Zihao Ye
Quan Gan
Mufei Li
...
Jiaqi Zhao
Haotong Zhang
Alex Smola
Jinyang Li
Zheng Zhang
AI4CE
GNN
206
748
0
03 Sep 2019