Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations

25 November 2021
Mehdi S. M. Sajjadi
H. Meyer
Etienne Pot
Urs M. Bergmann
Klaus Greff
Noha Radwan
Suhani Vora
Mario Lucic
Daniel Duckworth
Alexey Dosovitskiy
Jakob Uszkoreit
Thomas Funkhouser
Andrea Tagliasacchi
    ViT

Papers citing "Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations"

50 / 150 papers shown
RayZer: A Self-supervised Large View Synthesis Model
Hanwen Jiang
Hao Tan
Peng Wang
Haian Jin
Yue Zhao
...
Kai Zhang
Fujun Luan
Kalyan Sunkavalli
Qixing Huang
Georgios Pavlakos
68
0
0
01 May 2025
Direct Motion Models for Assessing Generated Videos
Kelsey R. Allen
Carl Doersch
Guangyao Zhou
Mohammed Suhail
Danny Driess
...
Thomas Kipf
Mehdi S. M. Sajjadi
Kevin P. Murphy
João Carreira
Sjoerd van Steenkiste
EGVM
DiffM
VGen
78
0
0
30 Apr 2025
SpinMeRound: Consistent Multi-View Identity Generation Using Diffusion Models
Stathis Galanakis
Alexandros Lattas
Stylianos Moschoglou
Bernhard Kainz
S. Zafeiriou
DiffM
35
0
0
14 Apr 2025
3D Scene Understanding Through Local Random Access Sequence Modeling
Wanhee Lee
Klemen Kotar
R. Venkatesh
Jared Watrous
Honglin Chen
Khai Loong Aw
Daniel L. K. Yamins
3DV
42
0
0
04 Apr 2025
Enhanced Cross-modal 3D Retrieval via Tri-modal Reconstruction
Junlong Ren
Hao Wang
45
0
0
02 Apr 2025
ERUPT: Efficient Rendering with Unposed Patch Transformer
Maxim V. Shugaev
Vincent Chen
Maxim Karrenbach
Kyle Ashley
Bridget Kennedy
Naresh P. Cuntoor
32
0
0
31 Mar 2025
SplatVoxel: History-Aware Novel View Streaming without Temporal Training
Yiming Wang
Lucy Chai
Xuan Luo
Michael Niemeyer
Manuel Lagunas
Stephen Lombardi
Siyu Tang
Tiancheng Sun
3DGS
58
0
0
18 Mar 2025
Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors
Katja Schwarz
Norman Mueller
Peter Kontschieder
3DGS
98
2
0
17 Mar 2025
Next-Scale Autoregressive Models are Zero-Shot Single-Image Object View Synthesizers
Shiran Yuan
Hao Zhao
DiffM
54
0
0
17 Mar 2025
Fake It To Make It: Virtual Multiviews to Enhance Monocular Indoor Semantic Scene Completion
Anith Selvakumar
Manasa Bharadwaj
42
0
0
07 Mar 2025
SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation
Junlong Ren
Hao Wu
Hui Xiong
Haoran Wang
68
0
0
26 Feb 2025
Textured 3D Regenerative Morphing with 3D Diffusion Prior
Songlin Yang
Yushi Lan
Honghua Chen
Xingang Pan
DiffM
68
0
0
21 Feb 2025
FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views
Shangzhan Zhang
Jianyuan Wang
Yinghao Xu
Nan Xue
Christian Rupprecht
Xiaowei Zhou
Yujun Shen
Gordon Wetzstein
123
7
0
17 Feb 2025
MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data
Hanwen Jiang
Zexiang Xu
Desai Xie
Z. Chen
Haian Jin
...
Xin Sun
Jiuxiang Gu
Qixing Huang
Georgios Pavlakos
Hao Tan
159
1
0
18 Dec 2024
MV-Adapter: Multi-view Consistent Image Generation Made Easy
Zehuan Huang
Y. Guo
Haoran Wang
Ran Yi
Lizhuang Ma
Yan-Pei Cao
Lu Sheng
107
8
0
04 Dec 2024
GaussianAnything: Interactive Point Cloud Flow Matching For 3D Object Generation
Yushi Lan
Shangchen Zhou
Zhaoyang Lyu
Fangzhou Hong
Shuai Yang
Bo Dai
Xingang Pan
Chen Change Loy
3DGS
55
0
0
12 Nov 2024
$SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation
Yinshuang Xu
Dian Chen
Katherine Liu
Sergey Zakharov
Rares Ambrus
Kostas Daniilidis
Vitor Campagnolo Guizilini
MDE
35
0
0
11 Nov 2024
Moving Off-the-Grid: Scene-Grounded Video Representations
Sjoerd van Steenkiste
Daniel Zoran
Yi Yang
Yulia Rubanova
Rishabh Kabra
...
Thomas Keck
João Carreira
Alexey Dosovitskiy
Mehdi S. M. Sajjadi
Thomas Kipf
31
3
0
08 Nov 2024
MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views
Yuedong Chen
Chuanxia Zheng
Haofei Xu
Bohan Zhuang
Andrea Vedaldi
Tat-Jen Cham
Jianfei Cai
3DGS
63
14
0
07 Nov 2024
Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis
Zhiyuan Min
Yawei Luo
Jianwen Sun
Yi Yang
3DGS
41
0
0
30 Oct 2024
Bringing NeRFs to the Latent Space: Inverse Graphics Autoencoder
Antoine Schnepf
Karim Kassab
Jean-Yves Franceschi
Laurent Caraffa
Flavian Vasile
Jeremie Mary
Andrew Comport
Valérie Gouet-Brunet
40
2
0
30 Oct 2024
PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views
Xin Fei
Wenzhao Zheng
Yueqi Duan
W. Zhan
M. Tomizuka
Kurt Keutzer
Jiwen Lu
3DGS
30
3
0
24 Oct 2024
Large Spatial Model: End-to-end Unposed Images to Semantic 3D
Zhiwen Fan
Jian Zhang
Wenyan Cong
Peihao Wang
Renjie Li
...
Zhilin Wang
Danfei Xu
Boris Ivanovic
Marco Pavone
Yue Wang
3DV
41
11
0
24 Oct 2024
G3R: Gradient Guided Generalizable Reconstruction
Yun Chen
Jingkang Wang
Ze Yang
S. Manivasagam
R. Urtasun
36
9
0
28 Sep 2024
MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement
Xu He
Xiaoyu Li
Di Kang
Jiangnan Ye
Chaopeng Zhang
Liyang Chen
Xiangjun Gao
Han Zhang
Zhiyong Wu
Haolin Zhuang
DiffM
34
7
0
26 Aug 2024
SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views
Chao Xu
Ang Li
Linghao Chen
Yulin Liu
Ruoxi Shi
Hao Su
Minghua Liu
3DGS
57
21
0
19 Aug 2024
3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs)
Jaydeep Rade
Ethan Herron
S. Sarkar
Anwesha Sarkar
A. Krishnamurthy
AI4CE
31
0
0
12 Aug 2024
Controlling Space and Time with Diffusion Models
Daniel Watson
Saurabh Saxena
Lala Li
Andrea Tagliasacchi
David J. Fleet
VGen
68
27
0
10 Jul 2024
MultiDiff: Consistent Novel View Synthesis from a Single Image
Norman Muller
Katja Schwarz
Barbara Roessle
Lorenzo Porzi
Samuel Rota Buló
Matthias Nießner
Peter Kontschieder
DiffM
49
22
0
26 Jun 2024
Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image
Stanislaw Szymanowicz
Eldar Insafutdinov
Chuanxia Zheng
Dylan Campbell
João F. Henriques
Christian Rupprecht
Andrea Vedaldi
3DGS
33
49
0
06 Jun 2024
ReFiNe: Recursive Field Networks for Cross-modal Multi-scene Representation
Sergey Zakharov
Katherine Liu
Adrien Gaidon
Rares Ambrus
38
1
0
06 Jun 2024
CAT3D: Create Anything in 3D with Multi-View Diffusion Models
Ruiqi Gao
Aleksander Holynski
Philipp Henzler
Arthur Brussee
Ricardo Martín Brualla
Pratul P. Srinivasan
Jonathan T. Barron
Ben Poole
43
150
0
16 May 2024
FastScene: Text-Driven Fast 3D Indoor Scene Generation via Panoramic Gaussian Splatting
Yikun Ma
Dandan Zhan
Zhi Jin
3DGS
30
9
0
09 May 2024
COM3D: Leveraging Cross-View Correspondence and Cross-Modal Mining for 3D Retrieval
Hao Wu
Ruochong Li
Hao Wang
Hui Xiong
3DPC
32
2
0
07 May 2024
MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View
Emmanuelle Bourigault
Pauline Bourigault
29
2
0
06 May 2024
Lightplane: Highly-Scalable Components for Neural 3D Fields
Ang Cao
Justin Johnson
Andrea Vedaldi
David Novotny
36
8
0
30 Apr 2024
FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features
Andre Rochow
Max Schwarz
Sven Behnke
ViT
48
6
0
15 Apr 2024
MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
Yuedong Chen
Haofei Xu
Chuanxia Zheng
Bohan Zhuang
Marc Pollefeys
Andreas Geiger
Tat-Jen Cham
Jianfei Cai
3DGS
55
157
0
21 Mar 2024
LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation
Yushi Lan
Fangzhou Hong
Shuai Yang
Shangchen Zhou
Xuyi Meng
Bo Dai
Xingang Pan
Chen Change Loy
40
39
0
18 Mar 2024
VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model
Qi Zuo
Xiaodong Gu
Lingteng Qiu
Yuan Dong
Zhengyi Zhao
...
Rui Peng
Siyu Zhu
Zilong Dong
Liefeng Bo
Qixing Huang
DiffM
VGen
39
25
0
18 Mar 2024
Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors
Ruicheng Wang
Jianfeng Xiang
Jiaolong Yang
Xin Tong
DiffM
34
4
0
18 Mar 2024
GGRt: Towards Pose-free Generalizable 3D Gaussian Splatting in Real-time
Hao Li
Yuanyuan Gao
Chenming Wu
Dingwen Zhang
Yalun Dai
Chen Zhao
Haocheng Feng
Errui Ding
Jingdong Wang
Junwei Han
3DGS
29
9
0
15 Mar 2024
3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surface
Linyi Jin
Nilesh Kulkarni
David Fouhey
3DV
28
2
0
13 Mar 2024
V3D: Video Diffusion Models are Effective 3D Generators
Zilong Chen
Yikai Wang
Feng Wang
Zhengyi Wang
Huaping Liu
VGen
40
61
0
11 Mar 2024
PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis
Jason J. Yu
Tristan Aumentado-Armstrong
Fereshteh Forghani
Konstantinos G. Derpanis
Marcus A. Brubaker
38
5
0
28 Feb 2024
Parallelized Spatiotemporal Binding
Gautam Singh
Yue Wang
Jiawei Yang
Boris Ivanovic
Sungjin Ahn
Marco Pavone
Tong Che
48
1
0
26 Feb 2024
Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review
Thang-Anh-Quan Nguyen
Amine Bourki
Mátyás Macudzinski
Anthony Brunel
M. Bennamoun
32
10
0
17 Feb 2024
SPAD : Spatially Aware Multiview Diffusers
Yash Kant
Ziyi Wu
Michael Vasilkovsky
Guocheng Qian
Jian Ren
R. A. Guler
Guohao Li
Sergey Tulyakov
Igor Gilitschenski
Aliaksandr Siarohin
DiffM
24
35
0
07 Feb 2024
ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis
Bernard Spiegl
Andrea Perin
Stéphane Deny
Alexander Ilin
DiffM
16
2
0
05 Feb 2024
Robust Inverse Graphics via Probabilistic Inference
Tuan Anh Le
Pavel Sountsov
Matthew D. Hoffman
Ben Lee
Brian Patton
Rif A. Saurous
32
0
0
02 Feb 2024