ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.02319
  4. Cited By
GenXD: Generating Any 3D and 4D Scenes
v1v2 (latest)

GenXD: Generating Any 3D and 4D Scenes

4 November 2024
Yuyang Zhao
Chung-Ching Lin
Kevin Qinghong Lin
Zhiwen Yan
Linjie Li
Zhiyong Yang
Jianfeng Wang
G. Lee
Lijuan Wang
    VGen
ArXiv (abs)PDFHTML

Papers citing "GenXD: Generating Any 3D and 4D Scenes"

20 / 20 papers shown
Title
Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting
Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting
Jiaxin Huang
Sheng Miao
BangBnag Yang
Yuewen Ma
Yiyi Liao
VGenMDE
144
0
0
15 Apr 2025
Wonderland: Navigating 3D Scenes from a Single Image
Wonderland: Navigating 3D Scenes from a Single Image
Hanwen Liang
Junli Cao
Vidit Goel
Guocheng Qian
Sergei Korolev
Demetri Terzopoulos
Konstantinos N. Plataniotis
Sergey Tulyakov
Jian Ren
VGen
201
14
0
16 Dec 2024
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
Baorui Ma
Huachen Gao
Haoge Deng
Zhengxiong Luo
Tiejun Huang
Lulu Tang
Xinlong Wang
DiffMVGen
235
16
0
09 Dec 2024
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation
Zhuoyan Luo
Fengyuan Shi
Yixiao Ge
Yujiu Yang
Limin Wang
Ying Shan
VLM
134
59
0
06 Sep 2024
SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
Yiming Xie
Chun-Han Yao
Vikram S. Voleti
Huaizu Jiang
Varun Jampani
VGen
126
47
0
24 Jul 2024
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
Kepan Nan
Rui Xie
Penghao Zhou
Tiehan Fan
Zhenheng Yang
Zhijie Chen
Xiang Li
Jian Yang
Ying Tai
140
93
0
02 Jul 2024
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed
  Diffusion Models
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling
Seung Wook Kim
Antonio Torralba
Sanja Fidler
Karsten Kreis
DiffM3DGS
80
123
0
21 Dec 2023
Animate124: Animating One Image to 4D Dynamic Scene
Animate124: Animating One Image to 4D Dynamic Scene
Yuyang Zhao
Zhiwen Yan
Enze Xie
Lanqing Hong
Zhenguo Li
Gim Hee Lee
VGen
97
66
0
24 Nov 2023
One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View
  Generation and 3D Diffusion
One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion
Minghua Liu
Ruoxi Shi
Linghao Chen
Zhuoyang Zhang
Chao Xu
Xinyue Wei
Hansheng Chen
Chong Zeng
Jiayuan Gu
Hao Su
115
207
0
14 Nov 2023
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Haoxin Chen
Menghan Xia
Yin-Yin He
Yong Zhang
Xiaodong Cun
...
Yaofang Liu
Qifeng Chen
Xintao Wang
Chao-Liang Weng
Ying Shan
DiffM
75
312
0
30 Oct 2023
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D
  and 3D Diffusion Priors
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
Guocheng Qian
Jinjie Mai
Abdullah Hamdi
Jian Ren
Aliaksandr Siarohin
...
Hsin-Ying Lee
Ivan Skorokhodov
Peter Wonka
Sergey Tulyakov
Guohao Li
DiffM
140
364
0
30 Jun 2023
MVImgNet: A Large-scale Dataset of Multi-view Images
MVImgNet: A Large-scale Dataset of Multi-view Images
Xianggang Yu
Mutian Xu
Yidan Zhang
Haolin Liu
Chongjie Ye
...
Zhangyang Xiong
Tianyou Liang
Guanying Chen
Shuguang Cui
Xiaoguang Han
3DV
129
171
0
10 Mar 2023
Text-To-4D Dynamic Scene Generation
Text-To-4D Dynamic Scene Generation
Uriel Singer
Shelly Sheynin
Adam Polyak
Oron Ashual
Iurii Makarov
...
Naman Goyal
Andrea Vedaldi
Devi Parikh
Justin Johnson
Yaniv Taigman
DiffM
91
156
0
26 Jan 2023
DreamFusion: Text-to-3D using 2D Diffusion
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
177
2,439
0
29 Sep 2022
ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving
  Cameras in the Wild
ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild
Wang Zhao
Shaohui Liu
Hengkai Guo
Wenping Wang
Yang Liu
123
65
0
19 Jul 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
505
15,788
0
20 Dec 2021
Masked-attention Mask Transformer for Universal Image Segmentation
Masked-attention Mask Transformer for Universal Image Segmentation
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
272
2,385
0
02 Dec 2021
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
Max Bain
Arsha Nagrani
Gül Varol
Andrew Zisserman
VGen
167
1,189
0
01 Apr 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
1.0K
29,926
0
26 Feb 2021
Infinite Nature: Perpetual View Generation of Natural Scenes from a
  Single Image
Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image
Andrew Liu
Richard Tucker
Varun Jampani
A. Makadia
Noah Snavely
Angjoo Kanazawa
VGen
131
171
0
17 Dec 2020
1