ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2409.18486
  4. Cited By
Evaluation of OpenAI o1: Opportunities and Challenges of AGI

Evaluation of OpenAI o1: Opportunities and Challenges of AGI

27 September 2024
Tianyang Zhong
Zhengliang Liu
Yi Pan
Yutong Zhang
Yifan Zhou
Yu Bao
Zihao Wu
Yanjun Lyu
Peng Shu
Xiaowei Yu
Chao-Yang Cao
Hanqi Jiang
Hanxu Chen
Yiwei Li
Junhao Chen
Huawen Hu
Yihen Liu
Huaqin Zhao
Shaochen Xu
Haixing Dai
Lin Zhao
Ruidong Zhang
Wei Zhao
Zhenyuan Yang
Jingyuan Chen
Peilong Wang
Wei Ruan
Hui Wang
Huan Zhao
Jing Zhang
Yiming Ren
Shihuan Qin
Tong Chen
Jiaxi Li
Arif Hassan Zidan
Afrar Jahin
Minheng Chen
Sichen Xia
J. Holmes
Yan Zhuang
Jiaqi Wang
Bochen Xu
Weiran Xia
Jichao Yu
Kaibo Tang
Yaxuan Yang
Bo Shen
Tao Yang
Guoyu Lu
Xianqiao Wang
Lilong Chai
He Li
Jin Lu
Lichao Sun
Xin Zhang
Bao Ge
Xintao Hu
Lian-Cheng Zhang
Hua Zhou
Lu Zhang
Shu Zhang
Ninghao Liu
Bei Jiang
Linglong Kong
Zhen Xiang
Yudan Ren
Jun Liu
Xi Jiang
Wei Zhang
Wei Zhang
Xiang Li
Gang Li
Wei Liu
Dinggang Shen
Andrea Sikora
Xiaoming Zhai
Dajiang Zhu
Tianming Liu
    ReLMLRMAI4CEELMVLM
ArXiv (abs)PDFHTML

Papers citing "Evaluation of OpenAI o1: Opportunities and Challenges of AGI"

29 / 29 papers shown
Title
"Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation
Aladin Djuhera
Amin Seffo
Masataro Asai
Holger Boche
LM&Ro
32
0
0
04 Jun 2025
ER-REASON: A Benchmark Dataset for LLM-Based Clinical Reasoning in the Emergency Room
ER-REASON: A Benchmark Dataset for LLM-Based Clinical Reasoning in the Emergency Room
Nikita Mehandru
Niloufar Golchini
David Bamman
Travis Zack
Melanie F. Molina
Ahmed Alaa
ELM
76
0
0
28 May 2025
Reason-Align-Respond: Aligning LLM Reasoning with Knowledge Graphs for KGQA
Reason-Align-Respond: Aligning LLM Reasoning with Knowledge Graphs for KGQA
Xiangqing Shen
Fanfan Wang
Rui Xia
RALM
22
0
0
27 May 2025
One Demo Is All It Takes: Planning Domain Derivation with LLMs from A Single Demonstration
One Demo Is All It Takes: Planning Domain Derivation with LLMs from A Single Demonstration
Jinbang Huang
Yixin Xiao
Zhanguang Zhang
Mark Coates
Jianye Hao
Yingxue Zhang
LM&RoLRM
71
0
0
23 May 2025
Adaptive Plan-Execute Framework for Smart Contract Security Auditing
Adaptive Plan-Execute Framework for Smart Contract Security Auditing
Zhiyuan Wei
Jing Sun
Zijian Zhang
Zhe Hou
Zixiao Zhao
189
0
0
21 May 2025
IQBench: How "Smart'' Are Vision-Language Models? A Study with Human IQ Tests
IQBench: How "Smart'' Are Vision-Language Models? A Study with Human IQ Tests
Tan-Hanh Pham
Phu-Vinh Nguyen
Dang The Hung
Bui Trong Duong
Vu Nguyen Thanh
Chris Ngo
Tri Quang Truong
Truong-Son Hy
ReLMCoGeVLMLRM
64
0
0
17 May 2025
Disentangling Reasoning and Knowledge in Medical Large Language Models
Disentangling Reasoning and Knowledge in Medical Large Language Models
Rahul Thapa
Qingyang Wu
Kevin Wu
Harrison Zhang
Angela Zhang
...
Joseph Boen
Shriya Reddy
Ben Athiwaratkun
Shuaiwen Leon Song
James Zou
ELMAI4MHLM&MALRM
105
2
0
16 May 2025
Focus on the Likely: Test-time Instance-based Uncertainty Removal
Focus on the Likely: Test-time Instance-based Uncertainty Removal
Johannes Schneider
80
0
0
02 May 2025
AutoP2C: An LLM-Based Agent Framework for Code Repository Generation from Multimodal Content in Academic Papers
AutoP2C: An LLM-Based Agent Framework for Code Repository Generation from Multimodal Content in Academic Papers
Zijie Lin
Yiqing Shen
Qilin Cai
He Sun
Jinrui Zhou
Mingjun Xiao
136
0
0
28 Apr 2025
Generative Evaluation of Complex Reasoning in Large Language Models
Generative Evaluation of Complex Reasoning in Large Language Models
Haowei Lin
Xiang Wang
Ruilin Yan
Baizhou Huang
Haotian Ye
Jianhua Zhu
Zihao Wang
James Zou
Jianzhu Ma
Yitao Liang
ReLMELMLRM
449
0
0
03 Apr 2025
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
Juncheng Wu
Wenlong Deng
Xiaochen Li
Sheng Liu
Taomian Mi
...
Yihan Cao
Hui Ren
Xuzhao Li
Xiaoxiao Li
Yuyin Zhou
AI4MHLRM
132
16
0
01 Apr 2025
Aurelia: Test-time Reasoning Distillation in Audio-Visual LLMs
Aurelia: Test-time Reasoning Distillation in Audio-Visual LLMs
Sanjoy Chowdhury
Hanan Gani
Nishit Anand
Sayan Nag
Ruohan Gao
Mohamed Elhoseiny
Salman Khan
Dinesh Manocha
LRM
177
1
0
29 Mar 2025
MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection
MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection
Yibo Yan
Shen Wang
Jiahao Huo
Philip S. Yu
Xuming Hu
Qingsong Wen
352
8
0
23 Mar 2025
ReviewAgents: Bridging the Gap Between Human and AI-Generated Paper Reviews
ReviewAgents: Bridging the Gap Between Human and AI-Generated Paper Reviews
Xian Gao
Jiacheng Ruan
Jingsheng Gao
Ting Liu
Yuzhuo Fu
109
3
0
11 Mar 2025
InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models
InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models
Yuchen Yan
Yongliang Shen
Yuhang Liu
Jin Jiang
Hao Fei
Jian Shao
Yueting Zhuang
LRMReLM
142
10
0
09 Mar 2025
Knowledge Augmentation in Federation: Rethinking What Collaborative Learning Can Bring Back to Decentralized Data
Wentai Wu
Ligang He
Saiqin Long
Ahmed M. Abdelmoniem
Yingliang Wu
Rui Mao
126
0
0
05 Mar 2025
Marco-o1 v2: Towards Widening The Distillation Bottleneck for Reasoning Models
Marco-o1 v2: Towards Widening The Distillation Bottleneck for Reasoning Models
Huifeng Yin
Yu Zhao
Mingyang Wu
Xuanfan Ni
Bo Zeng
...
Liangying Shao
Chenyang Lyu
Longyue Wang
Weihua Luo
Kaifu Zhang
LRM
117
4
0
03 Mar 2025
Stay Focused: Problem Drift in Multi-Agent Debate
Stay Focused: Problem Drift in Multi-Agent Debate
Jonas Becker
Lars Benedikt Kaesberg
Andreas Stephan
Jan Philip Wahle
Terry Ruas
Bela Gipp
143
2
0
26 Feb 2025
CMCTS: A Constrained Monte Carlo Tree Search Framework for Mathematical Reasoning in Large Language Model
CMCTS: A Constrained Monte Carlo Tree Search Framework for Mathematical Reasoning in Large Language Model
Qingwen Lin
Boyan Xu
Zijian Li
Zijian Li
Keli Zhang
Ruichu Cai
Ruichu Cai
LRM
107
4
0
16 Feb 2025
LLM-Powered Preference Elicitation in Combinatorial Assignment
LLM-Powered Preference Elicitation in Combinatorial Assignment
Ermis Soumalias
Yanchen Jiang
Kehang Zhu
Michael J. Curry
Sven Seuken
David C. Parkes
120
1
0
14 Feb 2025
Scaling Public Health Text Annotation: Zero-Shot Learning vs. Crowdsourcing for Improved Efficiency and Labeling Accuracy
Scaling Public Health Text Annotation: Zero-Shot Learning vs. Crowdsourcing for Improved Efficiency and Labeling Accuracy
Kamyar Kazari
Yong Chen
Zahra Shakeri
AI4MH
102
1
0
10 Feb 2025
InSTA: Towards Internet-Scale Training For Agents
InSTA: Towards Internet-Scale Training For Agents
Brandon Trabucco
Gunnar Sigurdsson
Robinson Piramuthu
Ruslan Salakhutdinov
ALM
195
4
0
10 Feb 2025
Dynamic Chain-of-Thought: Towards Adaptive Deep Reasoning
Dynamic Chain-of-Thought: Towards Adaptive Deep Reasoning
Libo Wang
LRM
482
3
0
07 Feb 2025
Reasoning-as-Logic-Units: Scaling Test-Time Reasoning in Large Language Models Through Logic Unit Alignment
Reasoning-as-Logic-Units: Scaling Test-Time Reasoning in Large Language Models Through Logic Unit Alignment
Cheryl Li
Tianyuan Xu
Yiwen Guo
LRM
473
3
0
05 Feb 2025
LLM-ProS: Analyzing Large Language Models' Performance in Competitive Problem Solving
LLM-ProS: Analyzing Large Language Models' Performance in Competitive Problem Solving
Md Sifat Hossain
Anika Tabassum
Md. Fahim Arefin
Tarannum Shaila Zaman
ELMLRM
148
1
0
04 Feb 2025
Predictable Artificial Intelligence
Predictable Artificial Intelligence
Lexin Zhou
Pablo Antonio Moreno Casares
Fernando Martínez-Plumed
John Burden
Ryan Burnell
...
Seán Ó hÉigeartaigh
Danaja Rutar
Wout Schellaert
Konstantinos Voudouris
José Hernández-Orallo
146
3
0
08 Jan 2025
Visual Large Language Models for Generalized and Specialized Applications
Yifan Li
Zhixin Lai
Wentao Bao
Zhen Tan
Anh Dao
Kewei Sui
Jiayi Shen
Dong Liu
Huan Liu
Yu Kong
VLM
171
15
0
06 Jan 2025
Unifying KV Cache Compression for Large Language Models with LeanKV
Unifying KV Cache Compression for Large Language Models with LeanKV
Yanqi Zhang
Yuwei Hu
Runyuan Zhao
John C. S. Lui
Haibo Chen
MQ
286
7
0
04 Dec 2024
An Open-Source Reproducible Chess Robot for Human-Robot Interaction Research
An Open-Source Reproducible Chess Robot for Human-Robot Interaction Research
Renchi Zhang
J. D. Winter
Dimitra Dodou
H. Seyffert
Y. B. Eisma
101
0
0
28 May 2024
1