ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.12856
  4. Cited By
A Real-World WebAgent with Planning, Long Context Understanding, and
  Program Synthesis

A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis

24 July 2023
Izzeddin Gur
Hiroki Furuta
Austin Huang
Mustafa Safdari
Yutaka Matsuo
Douglas Eck
Aleksandra Faust
    LM&Ro
    LLMAG
ArXivPDFHTML

Papers citing "A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis"

50 / 160 papers shown
Title
Beyond Browsing: API-Based Web Agents
Beyond Browsing: API-Based Web Agents
Yueqi Song
Frank F. Xu
Shuyan Zhou
Graham Neubig
53
15
0
21 Oct 2024
SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Jingxuan Chen
Derek Yuen
Bin Xie
Y. Yang
Gongwei Chen
...
Liqiang Nie
Yasheng Wang
Jianye Hao
Jun Wang
Kun Shao
LLMAG
45
5
0
19 Oct 2024
An Evolved Universal Transformer Memory
An Evolved Universal Transformer Memory
Edoardo Cetin
Qi Sun
Tianyu Zhao
Yujin Tang
140
0
0
17 Oct 2024
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation
Hyungjoo Chae
Namyoung Kim
Kai Tzu-iunn Ong
Minju Gwak
Gwanwoo Song
Jihoon Kim
S. Kim
Dongha Lee
Jinyoung Yeo
LLMAG
33
14
0
17 Oct 2024
Agent Skill Acquisition for Large Language Models via CycleQD
Agent Skill Acquisition for Large Language Models via CycleQD
So Kuroki
Taishi Nakamura
Takuya Akiba
Yujin Tang
MoMe
34
0
0
16 Oct 2024
Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents
Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents
Priyanshu Kumar
Elaine Lau
Saranya Vijayakumar
Tu Trinh
Scale Red Team
...
Sean Hendryx
Shuyan Zhou
Matt Fredrikson
Summer Yue
Zifan Wang
LLMAG
34
17
0
11 Oct 2024
Agent S: An Open Agentic Framework that Uses Computers Like a Human
Agent S: An Open Agentic Framework that Uses Computers Like a Human
Saaket Agashe
Jiuzhou Han
Shuyu Gan
Jiachen Yang
Ang Li
Xin Eric Wang
LLMAG
LM&Ro
AIFin
39
20
0
10 Oct 2024
ClickAgent: Enhancing UI Location Capabilities of Autonomous Agents
ClickAgent: Enhancing UI Location Capabilities of Autonomous Agents
Jakub Hoscilowicz
Bartosz Maj
Bartosz Kozakiewicz
Oleksii Tymoshchuk
Artur Janicki
LLMAG
47
5
0
09 Oct 2024
TinyClick: Single-Turn Agent for Empowering GUI Automation
TinyClick: Single-Turn Agent for Empowering GUI Automation
Pawel Pawlowski
Krystian Zawistowski
Wojciech Lapacz
Marcin Skorupa
Adam Wiacek
Sebastien Postansque
Jakub Hoscilowicz
MLLM
LLMAG
LRM
44
6
0
09 Oct 2024
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Boyu Gou
Ruohan Wang
Boyuan Zheng
Yanan Xie
Cheng Chang
Yiheng Shu
Huan Sun
Yu Su
LM&Ro
LLMAG
76
49
0
07 Oct 2024
Aligning LLMs with Individual Preferences via Interaction
Aligning LLMs with Individual Preferences via Interaction
Shujin Wu
May Fung
Cheng Qian
Jeonghwan Kim
Dilek Z. Hakkani-Tür
Heng Ji
31
10
0
04 Oct 2024
From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with
  LLM-Guided Knowledge
From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge
Xiefeng Wu
OffRL
34
1
0
02 Oct 2024
Synatra: Turning Indirect Knowledge into Direct Demonstrations for
  Digital Agents at Scale
Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at Scale
Tianyue Ou
Frank F. Xu
Aman Madaan
J. Liu
Robert Lo
Abishek Sridhar
Sudipta Sengupta
Dan Roth
Graham Neubig
Shuyan Zhou
OffRL
36
9
0
24 Sep 2024
Steward: Natural Language Web Automation
Steward: Natural Language Web Automation
Brian Tang
Kang G. Shin
LLMAG
29
1
0
23 Sep 2024
NaviQAte: Functionality-Guided Web Application Navigation
NaviQAte: Functionality-Guided Web Application Navigation
M. Shahbandeh
Parsa Alian
Noor Nashid
Ali Mesbah
23
2
0
16 Sep 2024
Untie the Knots: An Efficient Data Augmentation Strategy for
  Long-Context Pre-Training in Language Models
Untie the Knots: An Efficient Data Augmentation Strategy for Long-Context Pre-Training in Language Models
Junfeng Tian
Da Zheng
Yang Cheng
Rui-cang Wang
C. Zhang
Debing Zhang
28
4
0
07 Sep 2024
From Grounding to Planning: Benchmarking Bottlenecks in Web Agents
From Grounding to Planning: Benchmarking Bottlenecks in Web Agents
Segev Shlomov
Ben wiesel
Aviad Sela
Ido Levy
Liane Galanti
Roy Abitbol
LLMAG
34
3
0
03 Sep 2024
BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition
  Capabilities of Language Models in Multi-Agent Systems
BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems
Wei Wang
Dan Zhang
Tao Feng
Boyan Wang
Jie Tang
LLMAG
ELM
31
2
0
28 Aug 2024
LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction
LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction
Songwei Li
Jie Feng
Jiawei Chi
Xinyuan Hu
Xiaomeng Zhao
Fengli Xu
29
4
0
23 Aug 2024
Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents
Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents
Pranav Putta
Edmund Mills
Naman Garg
S. Motwani
Chelsea Finn
Divyansh Garg
Rafael Rafailov
LLMAG
LRM
28
65
0
13 Aug 2024
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in
  Long-Horizon Tasks
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
Zaijing Li
Yuquan Xie
Rui Shao
Gongwei Chen
Dongmei Jiang
Liqiang Nie
54
18
0
07 Aug 2024
Autonomous LLM-Enhanced Adversarial Attack for Text-to-Motion
Autonomous LLM-Enhanced Adversarial Attack for Text-to-Motion
Honglei Miao
Fan Ma
Ruijie Quan
Kun Zhan
Yi Yang
AAML
36
0
0
01 Aug 2024
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Zehui Chen
Kuikun Liu
Qiuchen Wang
Jiangning Liu
Wenwei Zhang
Kai Chen
Feng Zhao
LLMAG
70
19
0
29 Jul 2024
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?
Ori Yoran
S. Amouyal
Chaitanya Malaviya
Ben Bogin
Ofir Press
Jonathan Berant
LLMAG
39
31
0
22 Jul 2024
MMedAgent: Learning to Use Medical Tools with Multi-modal Agent
MMedAgent: Learning to Use Medical Tools with Multi-modal Agent
Binxu Li
Tiankai Yan
Yuanting Pan
Zhe Xu
Jie Luo
Ruiyang Ji
Shilong Liu
Haoyu Dong
Zihao Lin
Yixin Wang
LM&MA
36
25
0
02 Jul 2024
Granite-Function Calling Model: Introducing Function Calling Abilities
  via Multi-task Learning of Granular Tasks
Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks
Ibrahim Abdelaziz
Kinjal Basu
Mayank Agarwal
Sadhana Kumaravel
Matthew Stallone
...
Merve Unuvar
David D. Cox
Salim Roukos
Luis A. Lastras
Pavan Kapanipathi
LLMAG
34
19
0
27 Jun 2024
LLM-ARC: Enhancing LLMs with an Automated Reasoning Critic
LLM-ARC: Enhancing LLMs with an Automated Reasoning Critic
Aditya Kalyanpur
Kailash Saravanakumar
Victor Barres
Jennifer Chu-Carroll
David O. Melville
David Ferrucci
LLMAG
LRM
31
9
0
25 Jun 2024
Identifying User Goals from UI Trajectories
Identifying User Goals from UI Trajectories
Omri Berkovitch
Sapir Caduri
Noam Kahlon
Anatoly Efros
Avi Caciularu
Ido Dagan
LLMAG
32
4
0
20 Jun 2024
WebCanvas: Benchmarking Web Agents in Online Environments
WebCanvas: Benchmarking Web Agents in Online Environments
Yichen Pan
Dehan Kong
Sida Zhou
Cheng Cui
Yifei Leng
...
Hangyu Liu
Yanyi Shang
Shuyan Zhou
Tongshuang Wu
Zhengyang Wu
37
26
0
18 Jun 2024
IDs for AI Systems
IDs for AI Systems
Alan Chan
Noam Kolt
Peter Wills
Usman Anwar
Christian Schroeder de Witt
Nitarshan Rajkumar
Lewis Hammond
David M. Krueger
Lennart Heim
Markus Anderljung
41
6
0
17 Jun 2024
GUICourse: From General Vision Language Models to Versatile GUI Agents
GUICourse: From General Vision Language Models to Versatile GUI Agents
Wentong Chen
Junbo Cui
Jinyi Hu
Yujia Qin
Junjie Fang
...
Yupeng Huo
Yuan Yao
Yankai Lin
Zhiyuan Liu
Maosong Sun
LLMAG
33
30
0
17 Jun 2024
VideoGUI: A Benchmark for GUI Automation from Instructional Videos
VideoGUI: A Benchmark for GUI Automation from Instructional Videos
Kevin Qinghong Lin
Linjie Li
Difei Gao
Qinchen Wu
Mingyi Yan
Zhengyuan Yang
Lijuan Wang
Mike Zheng Shou
43
10
0
14 Jun 2024
From Text to Life: On the Reciprocal Relationship between Artificial
  Life and Large Language Models
From Text to Life: On the Reciprocal Relationship between Artificial Life and Large Language Models
Eleni Nisioti
Claire Glanois
Elias Najarro
Andrew Dai
Elliot Meyerson
J. Pedersen
Laetitia Teodorescu
Conor F. Hayes
Shyam Sudhakaran
Sebastian Risi
AI4CE
LM&Ro
45
2
0
14 Jun 2024
GuardAgent: Safeguard LLM Agents by a Guard Agent via Knowledge-Enabled Reasoning
GuardAgent: Safeguard LLM Agents by a Guard Agent via Knowledge-Enabled Reasoning
Zhen Xiang
Linzhi Zheng
Yanjie Li
Junyuan Hong
Qinbin Li
...
Zidi Xiong
Chulin Xie
Carl Yang
Dawn Song
Bo Li
LLMAG
45
23
0
13 Jun 2024
Two Tales of Persona in LLMs: A Survey of Role-Playing and
  Personalization
Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization
Yu-Min Tseng
Yu-Chao Huang
Teng-Yun Hsiao
Yu-Ching Hsu
Chao-Wei Huang
Jia-Yin Foo
Yun-Nung Chen
LLMAG
256
67
0
03 Jun 2024
Mobile-Agent-v2: Mobile Device Operation Assistant with Effective
  Navigation via Multi-Agent Collaboration
Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration
Junyang Wang
Haiyang Xu
Haitao Jia
Xi Zhang
Ming Yan
Weizhou Shen
Ji Zhang
Fei Huang
Jitao Sang
LM&Ro
LLMAG
34
45
0
03 Jun 2024
The Importance of Directional Feedback for LLM-based Optimizers
The Importance of Directional Feedback for LLM-based Optimizers
Allen Nie
Ching-An Cheng
Andrey Kolobov
Adith Swaminathan
35
16
0
26 May 2024
Devil's Advocate: Anticipatory Reflection for LLM Agents
Devil's Advocate: Anticipatory Reflection for LLM Agents
Haoyu Wang
Tao Li
Zhiwei Deng
Dan Roth
Yang Li
LLMAG
34
2
0
25 May 2024
Latent State Estimation Helps UI Agents to Reason
Latent State Estimation Helps UI Agents to Reason
Will Bishop
Alice Li
Christopher Rawles
Oriana Riva
LRM
LLMAG
19
3
0
17 May 2024
Enhancing the Efficiency and Accuracy of Underlying Asset Reviews in
  Structured Finance: The Application of Multi-agent Framework
Enhancing the Efficiency and Accuracy of Underlying Asset Reviews in Structured Finance: The Application of Multi-agent Framework
Xiangpeng Wan
Haicheng Deng
Kai Zou
Shiqi Xu
LLMAG
23
5
0
07 May 2024
Enhancing Q-Learning with Large Language Model Heuristics
Enhancing Q-Learning with Large Language Model Heuristics
Xiefeng Wu
LRM
32
0
0
06 May 2024
Unifying Bias and Unfairness in Information Retrieval: A Survey of
  Challenges and Opportunities with Large Language Models
Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models
Sunhao Dai
Chen Xu
Shicheng Xu
Liang Pang
Zhenhua Dong
Jun Xu
48
59
0
17 Apr 2024
Autonomous Evaluation and Refinement of Digital Agents
Autonomous Evaluation and Refinement of Digital Agents
Jiayi Pan
Yichi Zhang
Nicholas Tomlin
Yifei Zhou
Sergey Levine
Alane Suhr
ELM
41
48
0
09 Apr 2024
WILBUR: Adaptive In-Context Learning for Robust and Accurate Web Agents
WILBUR: Adaptive In-Context Learning for Robust and Accurate Web Agents
Michael Lutz
Arth Bohra
Manvel Saroyan
Artem Harutyunyan
Giovanni Campagna
LLMAG
40
13
0
08 Apr 2024
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
Keen You
Haotian Zhang
E. Schoop
Floris Weers
Amanda Swearngin
Jeffrey Nichols
Yinfei Yang
Zhe Gan
MLLM
47
82
0
08 Apr 2024
AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web
  Navigating Agent
AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent
Hanyu Lai
Xiao Liu
Iat Long Iong
Shuntian Yao
Yuxuan Chen
...
Hao Yu
Hanchen Zhang
Xiaohan Zhang
Yuxiao Dong
Jie Tang
LM&Ro
LLMAG
36
44
0
04 Apr 2024
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Kevin Xu
Yeganeh Kordi
Kate Sanders
Yizhong Wang
Adam Byerly
Kate Sanders
Adam Byerly
Jingyu Zhang
Benjamin Van Durme
Daniel Khashabi
LLMAG
72
6
0
18 Mar 2024
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work
  Tasks?
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Alexandre Drouin
Maxime Gasse
Massimo Caccia
I. Laradji
Manuel Del Verme
...
Megh Thakkar
Quentin Cappart
David Vazquez
Nicolas Chapados
Alexandre Lacoste
LLMAG
51
53
0
12 Mar 2024
RL-GPT: Integrating Reinforcement Learning and Code-as-policy
RL-GPT: Integrating Reinforcement Learning and Code-as-policy
Shaoteng Liu
Haoqi Yuan
Minda Hu
Yanwei Li
Yukang Chen
Shu Liu
Zongqing Lu
Jiaya Jia
LLMAG
42
14
0
29 Feb 2024
Researchy Questions: A Dataset of Multi-Perspective, Decompositional
  Questions for LLM Web Agents
Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Corby Rosset
Ho-Lam Chung
Guanghui Qin
Ethan C. Chau
Zhuo Feng
Ahmed Hassan Awadallah
Jennifer Neville
Nikhil Rao
37
10
0
27 Feb 2024
Previous
1234
Next