Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.20013
Cited By
WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback
26 May 2025
Minda Hu
Tianqing Fang
Jianshu Zhang
Junyu Ma
Zhisong Zhang
Jingyan Zhou
Hongming Zhang
Haitao Mi
Dong Yu
Irwin King
LLMAG
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback"
24 / 24 papers shown
Title
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
Zhepei Wei
Wenlin Yao
Yao Liu
Weizhi Zhang
Qin Lu
...
Puyang Xu
Chao Zhang
Bing Yin
Hyokun Yun
Lihong Li
OffRL
CLL
OnRL
LRM
41
4
0
22 May 2025
Beyond Áha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models
Zhiyuan Hu
Yansen Wang
Hanze Dong
Yuhui Xu
Amrita Saha
Caiming Xiong
Bryan Hooi
Junnan Li
LRM
42
2
0
15 May 2025
WebThinker: Empowering Large Reasoning Models with Deep Research Capability
Xiaochen Li
Jiajie Jin
Guanting Dong
Hongjin Qian
Yutao Zhu
Yongkang Wu
Ji-Rong Wen
Zhicheng Dou
LLMAG
LRM
117
8
0
30 Apr 2025
Between Underthinking and Overthinking: An Empirical Study of Reasoning Length and correctness in LLMs
Jinyan Su
Jennifer Healey
Preslav Nakov
Claire Cardie
LRM
179
9
0
30 Apr 2025
WebEvolver: Enhancing Web Agent Self-Improvement with Coevolving World Model
Tianqing Fang
Huatian Zhang
Zikai Zhang
Kaixin Ma
Wenhao Yu
Haitao Mi
Dong Yu
LLMAG
KELM
308
1
0
23 Apr 2025
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
Yuhang Liu
Pengxiang Li
C. Xie
Xavier Hu
Xiaotian Han
Shengyu Zhang
Hongxia Yang
Fei Wu
LLMAG
LM&Ro
LRM
AI4CE
81
8
0
19 Apr 2025
Enhancing Web Agents with Explicit Rollback Mechanisms
Zizhuo Zhang
Tianqing Fang
Kaixin Ma
Wenhao Yu
Han Zhang
Haitao Mi
Dong Yu
KELM
81
3
0
16 Apr 2025
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't
Quy-Anh Dang
Chris Ngo
OffRL
LRM
67
13
0
20 Mar 2025
Streaming Looking Ahead with Token-level Self-reward
Han Zhang
Ruixin Hong
Dong Yu
49
2
0
24 Feb 2025
The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks
Alejandro Cuadron
Dacheng Li
Wenjie Ma
Xingyao Wang
Yichuan Wang
...
Aditya Desai
Ion Stoica
Ana Klimovic
Graham Neubig
Joseph E. Gonzalez
LRM
AI4CE
179
39
0
12 Feb 2025
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Zehan Qi
Xiao-Chang Liu
Iat Long Iong
Hanyu Lai
Xingwu Sun
...
Shuntian Yao
Tianjie Zhang
Wei Xu
J. Tang
Yuxiao Dong
111
27
0
28 Jan 2025
WebWalker: Benchmarking LLMs in Web Traversal
Jialong Wu
Wenbiao Yin
Yong Jiang
Zhenglin Wang
Zekun Xi
...
Linhai Zhang
Yulan He
Deyu Zhou
Pengjun Xie
Fei Huang
66
11
0
13 Jan 2025
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Han Zhang
Xiaoman Pan
Hongwei Wang
Kaixin Ma
Wenhao Yu
Dong Yu
LLMAG
76
4
0
03 Jan 2025
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization
Hongliang He
Wenlin Yao
Kaixin Ma
Wenhao Yu
Han Zhang
Tianqing Fang
Zhenzhong Lan
Dong Yu
LM&Ro
LLMAG
48
11
0
25 Oct 2024
Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks
Mengzhao Jia
Wenhao Yu
Kaixin Ma
Tianqing Fang
Zhihan Zhang
Siru Ouyang
Hongming Zhang
Meng Jiang
Dong Yu
VLM
44
7
0
02 Oct 2024
WebCanvas: Benchmarking Web Agents in Online Environments
Yichen Pan
Dehan Kong
Sida Zhou
Cheng Cui
Yifei Leng
...
Hangyu Liu
Yanyi Shang
Shuyan Zhou
Tongshuang Wu
Zhengyang Wu
45
31
0
18 Jun 2024
Large Language Models Can Self-Improve At Web Agent Tasks
Ajay Patel
M. Hofmarcher
Claudiu Leoveanu-Condrei
Marius-Constantin Dinu
Chris Callison-Burch
Sepp Hochreiter
LLMAG
50
28
0
30 May 2024
MMInA: Benchmarking Multihop Multimodal Internet Agents
Ziniu Zhang
Shulin Tian
Liangyu Chen
Ziwei Liu
LLMAG
LM&Ro
45
15
0
15 Apr 2024
Do We Really Need a Complex Agent System? Distill Embodied Agent into a Single Model
Zhonghan Zhao
Ke Ma
Wenhao Chai
Xuan Wang
Kewei Chen
Dongxu Guo
Yanting Zhang
Hongwei Wang
Gaoang Wang
54
18
0
06 Apr 2024
MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models
Justin Chih-Yao Chen
Swarnadeep Saha
Elias Stengel-Eskin
Mohit Bansal
LRM
LLMAG
42
18
0
02 Feb 2024
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Renat Aksitov
Sobhan Miryoosefi
Zong-xiao Li
Daliang Li
Sheila Babayan
...
Sushant Prakash
Pranesh Srinivasan
Manzil Zaheer
Felix X. Yu
Sanjiv Kumar
LRM
ReLM
LLMAG
KELM
23
47
0
15 Dec 2023
WebArena: A Realistic Web Environment for Building Autonomous Agents
Shuyan Zhou
Frank F. Xu
Hao Zhu
Xuhui Zhou
Robert Lo
...
Tianyue Ou
Yonatan Bisk
Daniel Fried
Uri Alon
Graham Neubig
LLMAG
64
420
0
25 Jul 2023
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
312
2,709
0
06 Oct 2022
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Shunyu Yao
Howard Chen
John Yang
Karthik Narasimhan
LLMAG
LM&Ro
50
472
0
04 Jul 2022
1