ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.10076
  4. Cited By
VideoAgent: Self-Improving Video Generation
v1v2v3 (latest)

VideoAgent: Self-Improving Video Generation

14 October 2024
Achint Soni
Sreyas Venkataraman
Abhranil Chandra
Sebastian Fischmeister
Percy Liang
Bo Dai
Sherry Yang
    LM&RoVGen
ArXiv (abs)PDFHTML

Papers citing "VideoAgent: Self-Improving Video Generation"

50 / 65 papers shown
Title
Self-Adapting Improvement Loops for Robotic Learning
Self-Adapting Improvement Loops for Robotic Learning
Calvin Luo
Zilai Zeng
Mingxi Jia
Yilun Du
Chen Sun
24
0
0
07 Jun 2025
A Challenge to Build Neuro-Symbolic Video Agents
A Challenge to Build Neuro-Symbolic Video Agents
Sahil Shah
Harsh Goel
Sai Shankar Narasimhan
Minkyu Choi
S P Sharan
Oguzhan Akcin
Sandeep Chinchali
AI4TS
75
0
0
20 May 2025
Stealing Creator's Workflow: A Creator-Inspired Agentic Framework with Iterative Feedback Loop for Improved Scientific Short-form Generation
Stealing Creator's Workflow: A Creator-Inspired Agentic Framework with Iterative Feedback Loop for Improved Scientific Short-form Generation
J. Park
Maanas Taneja
Qianwen Wang
Dongyeop Kang
VGen
135
0
0
26 Apr 2025
FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models
FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models
Kuanting Wu
Kei Ota
Asako Kanezaki
DiffMVGen
118
0
0
20 Apr 2025
Exploring the Evolution of Physics Cognition in Video Generation: A Survey
Exploring the Evolution of Physics Cognition in Video Generation: A Survey
Minghui Lin
Xiang Wang
Yansen Wang
Shu Wang
Fengqi Dai
...
Cunxiang Wang
Zhengrong Zuo
Nong Sang
Siteng Huang
Donglin Wang
EGVMVGen
156
5
0
27 Mar 2025
Predictive Inverse Dynamics Models are Scalable Learners for Robotic
  Manipulation
Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
Yang Tian
Sizhe Yang
Jia Zeng
P. Wang
Dahua Lin
Hao Dong
Jiangmiao Pang
163
21
0
19 Dec 2024
Artificial Intelligence for Biomedical Video Generation
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
177
0
0
12 Nov 2024
WorldSimBench: Towards Video Generation Models as World Simulators
WorldSimBench: Towards Video Generation Models as World Simulators
Yiran Qin
Zhelun Shi
Jiwen Yu
Xijun Wang
Enshen Zhou
...
Lu Sheng
Jing Shao
Junlin Wu
Wanli Ouyang
Ruimao Zhang
EGVMVGen
218
477
0
23 Oct 2024
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human
  Feedback for Video Generation
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Xuan He
Dongfu Jiang
Ge Zhang
Max Ku
Achint Soni
...
Yaswanth Narsupalli
Rongqi Fan
Zhiheng Lyu
Yuchen Lin
Wenhu Chen
EGVMVGenALM
136
56
0
21 Jun 2024
Toward Self-Improvement of LLMs via Imagination, Searching, and
  Criticizing
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Ye Tian
Baolin Peng
Linfeng Song
Lifeng Jin
Dian Yu
Haitao Mi
Dong Yu
LRMReLM
110
85
0
18 Apr 2024
Multistep Consistency Models
Multistep Consistency Models
Jonathan Heek
Emiel Hoogeboom
Tim Salimans
68
37
0
11 Mar 2024
Video as the New Language for Real-World Decision Making
Video as the New Language for Real-World Decision Making
Sherry Yang
Jacob Walker
Jack Parker-Holder
Yilun Du
Jake Bruce
Andre Barreto
Pieter Abbeel
Dale Schuurmans
VGen
121
56
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
245
103
0
27 Feb 2024
Genie: Generative Interactive Environments
Genie: Generative Interactive Environments
Jake Bruce
Michael Dennis
Ashley D. Edwards
Jack Parker-Holder
Yuge Shi
...
Konrad Zolna
Jeff Clune
Nando de Freitas
Satinder Singh
Tim Rocktaschel
VGenVLM
156
188
0
23 Feb 2024
Self-Rewarding Language Models
Self-Rewarding Language Models
Weizhe Yuan
Richard Yuanzhe Pang
Kyunghyun Cho
Xian Li
Sainbayar Sukhbaatar
Jing Xu
Jason Weston
ReLMSyDaALMLRM
403
338
0
18 Jan 2024
DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven
  Policy Learning
DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning
Sabariswaran Mani
Sreyas Venkataraman
Abhranil Chandra
Adyan Rizvi
Yash Sirvi
Soumojit Bhattacharya
Aritra Hazra
OffRL
93
1
0
17 Jan 2024
Any-point Trajectory Modeling for Policy Learning
Any-point Trajectory Modeling for Policy Learning
Chuan Wen
Xingyu Lin
John So
Kai-xiang Chen
Qi Dou
Yang Gao
Pieter Abbeel
PINNVGen
135
99
0
28 Dec 2023
A Survey of Reinforcement Learning from Human Feedback
A Survey of Reinforcement Learning from Human Feedback
Timo Kaufmann
Paul Weng
Viktor Bengs
Eyke Hüllermeier
OffRL
97
155
0
22 Dec 2023
Improved Techniques for Training Consistency Models
Improved Techniques for Training Consistency Models
Yang Song
Prafulla Dhariwal
85
182
0
22 Oct 2023
Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion
  Models
Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models
Kevin Black
Mitsuhiko Nakamoto
P. Atreya
Homer Walke
Chelsea Finn
Aviral Kumar
Sergey Levine
DiffMLM&Ro
133
143
0
16 Oct 2023
Video Language Planning
Video Language Planning
Yilun Du
Mengjiao Yang
Peter R. Florence
Fei Xia
Ayzaan Wahid
...
Pieter Abbeel
Josh Tenenbaum
L. Kaelbling
Andy Zeng
Jonathan Tompson
PINNLM&Ro
182
100
0
16 Oct 2023
Learning to Act from Actionless Videos through Dense Correspondences
Learning to Act from Actionless Videos through Dense Correspondences
Po-Chen Ko
Jiayuan Mao
Yilun Du
Shao-Hua Sun
Josh Tenenbaum
101
89
0
12 Oct 2023
Learning Interactive Real-World Simulators
Learning Interactive Real-World Simulators
Mengjiao Yang
Yilun Du
Kamyar Ghasemipour
Jonathan Tompson
Leslie Kaelbling
Dale Schuurmans
Pieter Abbeel
LM&RoPINN
79
215
0
09 Oct 2023
Compositional Foundation Models for Hierarchical Planning
Compositional Foundation Models for Hierarchical Planning
Anurag Ajay
Seung-Jun Han
Yilun Du
Shaung Li
Abhi Gupta
Tommi Jaakkola
Josh Tenenbaum
L. Kaelbling
Akash Srivastava
Pulkit Agrawal
LRM
114
71
0
15 Sep 2023
BridgeData V2: A Dataset for Robot Learning at Scale
BridgeData V2: A Dataset for Robot Learning at Scale
Homer Walke
Kevin Black
Abraham Lee
Moo Jin Kim
Maximilian Du
...
Andre Wang He
Vivek Myers
Kuan Fang
Chelsea Finn
Sergey Levine
150
243
0
24 Aug 2023
Open Problems and Fundamental Limitations of Reinforcement Learning from
  Human Feedback
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Stephen Casper
Xander Davies
Claudia Shi
T. Gilbert
Jérémy Scheurer
...
Erdem Biyik
Anca Dragan
David M. Krueger
Dorsa Sadigh
Dylan Hadfield-Menell
ALMOffRL
155
533
0
27 Jul 2023
MovieFactory: Automatic Movie Creation from Text using Large Generative
  Models for Language and Images
MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images
Sitong Su
Huan Yang
Huiguo He
Wenjing Wang
Zixi Tuo
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
VGenDiffM
90
40
0
12 Jun 2023
Probabilistic Adaptation of Text-to-Video Models
Probabilistic Adaptation of Text-to-Video Models
Mengjiao Yang
Yilun Du
Bo Dai
Dale Schuurmans
J. Tenenbaum
Pieter Abbeel
VGenDiffM
137
26
0
02 Jun 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward
  Model
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
405
4,184
0
29 May 2023
Video Prediction Models as Rewards for Reinforcement Learning
Video Prediction Models as Rewards for Reinforcement Learning
Alejandro Escontrela
Ademi Adeniji
Wilson Yan
Ajay Jain
Xue Bin Peng
Ken Goldberg
Youngwoon Lee
Danijar Hafner
Pieter Abbeel
108
59
0
23 May 2023
Align your Latents: High-Resolution Video Synthesis with Latent
  Diffusion Models
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGSVGen
241
1,106
0
18 Apr 2023
GPT-4 Technical Report
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAGMLLM
1.6K
14,832
0
15 Mar 2023
Text-to-image Diffusion Models in Generative AI: A Survey
Text-to-image Diffusion Models in Generative AI: A Survey
Chenshuang Zhang
Chaoning Zhang
Mengchun Zhang
In So Kweon
VLM
120
280
0
14 Mar 2023
Consistency Models
Consistency Models
Yang Song
Prafulla Dhariwal
Mark Chen
Ilya Sutskever
VLMDiffM
117
982
0
02 Mar 2023
Diffusion Model-Augmented Behavioral Cloning
Diffusion Model-Augmented Behavioral Cloning
Shangcheng Chen
Hsiang-Chun Wang
Ming-Hao Hsu
Chun-Mao Lai
Shao-Hua Sun
DiffM
150
31
0
26 Feb 2023
Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be
  Consistent
Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent
Giannis Daras
Y. Dagan
A. Dimakis
C. Daskalakis
DiffM
127
48
0
17 Feb 2023
Learning Universal Policies via Text-Guided Video Generation
Learning Universal Policies via Text-Guided Video Generation
Yilun Du
Mengjiao Yang
Bo Dai
H. Dai
Ofir Nachum
J. Tenenbaum
Dale Schuurmans
Pieter Abbeel
PINNLM&Ro
135
264
0
31 Jan 2023
Imagen Video: High Definition Video Generation with Diffusion Models
Imagen Video: High Definition Video Generation with Diffusion Models
Jonathan Ho
William Chan
Chitwan Saharia
Jay Whang
Ruiqi Gao
...
Diederik P. Kingma
Ben Poole
Mohammad Norouzi
David J. Fleet
Tim Salimans
VGen
181
1,548
0
05 Oct 2022
Make-A-Video: Text-to-Video Generation without Text-Video Data
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer
Adam Polyak
Thomas Hayes
Xiaoyue Yin
Jie An
...
Oron Ashual
Oran Gafni
Devi Parikh
Sonal Gupta
Yaniv Taigman
DiffMVGen
97
1,439
0
29 Sep 2022
Analog Bits: Generating Discrete Data using Diffusion Models with
  Self-Conditioning
Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning
Ting-Li Chen
Ruixiang Zhang
Geoffrey E. Hinton
DiffM
132
313
0
08 Aug 2022
Human-to-Robot Imitation in the Wild
Human-to-Robot Imitation in the Wild
Shikhar Bahl
Abhi Gupta
Deepak Pathak
109
174
0
19 Jul 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online
  Videos
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
159
304
0
23 Jun 2022
DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling
  in Around 10 Steps
DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps
Cheng Lu
Yuhao Zhou
Fan Bao
Jianfei Chen
Chongxuan Li
Jun Zhu
DiffM
287
1,472
0
02 Jun 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via
  Transformers
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
343
632
0
29 May 2022
Fast Sampling of Diffusion Models with Exponential Integrator
Fast Sampling of Diffusion Models with Exponential Integrator
Qinsheng Zhang
Yongxin Chen
DiffM
114
439
0
29 Apr 2022
Training Compute-Optimal Large Language Models
Training Compute-Optimal Large Language Models
Jordan Hoffmann
Sebastian Borgeaud
A. Mensch
Elena Buchatskaya
Trevor Cai
...
Karen Simonyan
Erich Elsen
Jack W. Rae
Oriol Vinyals
Laurent Sifre
AI4TS
217
1,992
0
29 Mar 2022
STaR: Bootstrapping Reasoning With Reasoning
STaR: Bootstrapping Reasoning With Reasoning
E. Zelikman
Yuhuai Wu
Jesse Mu
Noah D. Goodman
ReLMLRM
157
512
0
28 Mar 2022
R3M: A Universal Visual Representation for Robot Manipulation
R3M: A Universal Visual Representation for Robot Manipulation
Suraj Nair
Aravind Rajeswaran
Vikash Kumar
Chelsea Finn
Abhi Gupta
LM&Ro
115
588
0
23 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLMALM
930
13,272
0
04 Mar 2022
Adversarial Imitation Learning from Video using a State Observer
Adversarial Imitation Learning from Video using a State Observer
Haresh Karnan
Garrett A. Warnell
F. Torabi
Peter Stone
GAN
108
13
0
01 Feb 2022
12
Next