Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.12039
Cited By
RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
26 March 2020
Zachary Teed
Jia Deng
MDE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RAFT: Recurrent All-Pairs Field Transforms for Optical Flow"
50 / 1,492 papers shown
Title
VISION-XL: High Definition Video Inverse Problem Solver using Latent Image Diffusion Models
Taesung Kwon
Jong Chul Ye
82
1
0
29 Nov 2024
OMNI-DC: Highly Robust Depth Completion with Multiresolution Depth Integration
Yiming Zuo
Willow Yang
Zeyu Ma
Jia Deng
MDE
90
2
0
28 Nov 2024
On Moving Object Segmentation from Monocular Video with Transformers
Christian Homeyer
Christoph Schnörr
107
3
0
28 Nov 2024
TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video
Jinyuan Qu
Hongyang Li
Shilong Liu
Tianhe Ren
Zhaoyang Zeng
Lei Zhang
3DPC
85
1
0
27 Nov 2024
MotionCharacter: Identity-Preserving and Motion Controllable Human Video Generation
Haopeng Fang
Di Qiu
Binjie Mao
Pengfei Yan
He Tang
VGen
DiffM
78
4
0
27 Nov 2024
RoMo: Robust Motion Segmentation Improves Structure from Motion
Lily Goli
S. Sabour
Mark J. Matthews
Marcus A. Brubaker
Dmitry Lagun
Alec Jacobson
David J. Fleet
Saurabh Saxena
Andrea Tagliasacchi
VOS
117
3
0
27 Nov 2024
GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation
Wenbo Cui
Chengyang Zhao
Songlin Wei
Jiazhao Zhang
Haoran Geng
Yaran Chen
H. Wang
He Wang
105
1
0
27 Nov 2024
Depth-PC: A Visual Servo Framework Integrated with Cross-Modality Fusion for Sim2Real Transfer
Haoyu Zhang
Weiyang Lin
Yimu Jiang
Chao Ye
80
0
0
26 Nov 2024
Neural-Network-Enhanced Metalens Camera for High-Definition, Dynamic Imaging in the Long-Wave Infrared Spectrum
Jing-Yang Wei
Hao Huang
Xinming Zhang
De-Mao Ye
Yi Li
Liwen Wang
Yao-Guang Ma
Yang-Hui Li
77
0
0
26 Nov 2024
SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting
Gyeongjin Kang
Jisang Yoo
Jihyeon Park
Seungtae Nam
Hyeonsoo Im
Sangheon Shin
Sangpil Kim
Eunbyung Park
3DGS
161
4
0
26 Nov 2024
RealTraj: Towards Real-World Pedestrian Trajectory Forecasting
Ryo Fujii
Hideo Saito
Ryo Hachiuma
AI4TS
115
1
0
26 Nov 2024
PreF3R: Pose-Free Feed-Forward 3D Gaussian Splatting from Variable-length Image Sequence
Zequn Chen
Jiezhi Yang
Heng Yang
3DGS
74
2
0
25 Nov 2024
Edge Weight Prediction For Category-Agnostic Pose Estimation
Or Hirschorn
S. Avidan
96
0
0
25 Nov 2024
Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency
Y. Wang
Jiajie Teng
Jiajiong Cao
Yuming Li
Chenguang Ma
Hongteng Xu
Dixin Luo
VGen
DiffM
81
0
0
25 Nov 2024
Context-Aware Input Orchestration for Video Inpainting
Hoyoung Kim
Azimbek Khudoyberdiev
Seonghwan Jeong
Jihoon Ryoo
88
0
0
25 Nov 2024
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Zhichao Zhang
Wei Sun
Xinyue Li
Yunhao Li
Qihang Ge
...
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guangtao Zhai
EGVM
122
1
0
25 Nov 2024
PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments
Yiming Li
Xiangqi Meng
Xingxing Zuo
Zhe Liu
Hesheng Wang
Daniel Cremers
3DGS
82
1
0
24 Nov 2024
M3-CVC: Controllable Video Compression with Multimodal Generative Models
Rui Wan
Qi Zheng
Yibo Fan
VGen
DiffM
71
0
0
24 Nov 2024
Benchmarking the Robustness of Optical Flow Estimation to Corruptions
Zhonghua Yi
Hao-miao Shi
Zhijie Xu
Yao Gao
Ze Wang
Yujian Zhang
Kailun Yang
Kaiwei Wang
AAML
87
1
0
22 Nov 2024
VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing
Jiahao Hu
Tianxiong Zhong
Xuebo Wang
Boyuan Jiang
Xingye Tian
Fei Yang
Pengfei Wan
Di Zhang
VGen
74
2
0
22 Nov 2024
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Weijia Wu
Mingyu Liu
Zeyu Zhu
Xi Xia
Haoen Feng
Wen Wang
Kevin Qinghong Lin
Chunhua Shen
Mike Zheng Shou
DiffM
VGen
122
1
0
22 Nov 2024
PhysFlow: Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation
Zhuoman Liu
Weicai Ye
Yan Luximon
Pengfei Wan
Di Zhang
VGen
AI4CE
117
3
0
21 Nov 2024
Extending Video Masked Autoencoders to 128 frames
N. B. Gundavarapu
Luke Friedman
Raghav Goyal
Chaitra Hegde
Eirikur Agustsson
...
Mikhail Sirotenko
Ming Yang
Tobias Weyand
Boqing Gong
Leonid Sigal
89
1
0
20 Nov 2024
DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild
Weicai Ye
Xinyu Chen
Ruohao Zhan
Di Huang
Xiaoshui Huang
Haoyi Zhu
Hujun Bao
Wanli Ouyang
Tong He
Guofeng Zhang
82
5
0
20 Nov 2024
GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views
Boyao Zhou
Shunyuan Zheng
Hanzhang Tu
Ruizhi Shao
Boning Liu
Shengping Zhang
Liqiang Nie
Yebin Liu
3DGS
91
1
0
18 Nov 2024
SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input
Zhen Lv
Yangqi Long
Congzhentao Huang
Cao Li
Chengfei Lv
Hao Ren
Dian Zheng
DiffM
VGen
MDE
114
5
0
18 Nov 2024
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing
Chang-Shu Liu
Rui Li
Kaidong Zhang
Yunwei Lan
Dong Liu
DiffM
VGen
63
3
0
17 Nov 2024
Iterative Camera-LiDAR Extrinsic Optimization via Surrogate Diffusion
N. Ou
Zhuo Chen
Xinru Zhang
Junzheng Wang
37
0
0
17 Nov 2024
DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment
Mangyu Kong
Jaewon Lee
Seongwon Lee
Euntai Kim
3DGS
32
1
0
16 Nov 2024
BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation
Yufei Wei
Sha Lu
Fuzhang Han
R. Xiong
Yue Wang
33
1
0
15 Nov 2024
OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models
Mathis Koroglu
Hugo Caselles-Dupré
Guillaume Jeanneret Sanmiguel
Matthieu Cord
VGen
DiffM
29
1
0
15 Nov 2024
MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation
Jonas Serych
Michal Neoral
Jirí Matas
31
3
0
14 Nov 2024
4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization
Mijeong Kim
Jongwoo Lim
Bohyung Han
3DGS
41
2
0
13 Nov 2024
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
Xiaofeng Wang
Kang Zhao
F. Liu
Jiayu Wang
Guosheng Zhao
Xiaoyi Bao
Zheng Hua Zhu
Yingya Zhang
Xingang Wang
VGen
66
6
0
13 Nov 2024
Adaptive and Temporally Consistent Gaussian Surfels for Multi-view Dynamic Reconstruction
Decai Chen
Brianne Oberson
I. Feldmann
O. Schreer
Anna Hilsmann
Peter Eisert
3DGS
38
1
0
10 Nov 2024
Improved Video VAE for Latent Video Diffusion Model
Pingyu Wu
Kai Zhu
Yu Liu
Liming Zhao
Wei-dong Zhai
Yang Cao
Zheng-jun Zha
VGen
DiffM
61
4
0
10 Nov 2024
I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength
Wanquan Feng
Jiawei Liu
Pengqi Tu
Tianhao Qi
Mingzhen Sun
Tianxiang Ma
Mingcong Liu
Siyu Zhou
Qian He
VGen
60
7
0
10 Nov 2024
Autoregressive Models in Vision: A Survey
Jing Xiong
Gongye Liu
Lun Huang
Chengyue Wu
Taiqiang Wu
...
Hao Fei
Guillermo Sapiro
Jiebo Luo
Ping Luo
Ngai Wong
VGen
53
9
0
08 Nov 2024
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
Wenqiang Sun
Shuo Chen
F. Liu
Zilong Chen
Yueqi Duan
Jun Zhang
Yikai Wang
VGen
56
31
0
07 Nov 2024
DEIO: Deep Event Inertial Odometry
Weipeng Guan
Fuling Lin
Peiyu Chen
P. Lu
59
2
0
06 Nov 2024
Object and Contact Point Tracking in Demonstrations Using 3D Gaussian Splatting
Michael Büttner
Jonathan M Francis
Helge Rhodin
Andrew Melnik
3DPC
48
0
0
05 Nov 2024
A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding
Yitong Dong
Yijin Li
Zhaoyang Huang
Weikang Bian
Qingbin Liu
Hujun Bao
Zhaopeng Cui
Hongsheng Li
Guofeng Zhang
3DV
59
0
0
04 Nov 2024
Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation
Zhenbin Wang
Lei Zhang
Lituan Wang
Minjuan Zhu
Zhenwei Zhang
VGen
MedIm
64
1
0
03 Nov 2024
Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuli
Matthias Tangemann
Matthias Kümmerer
Matthias Bethge
VOS
47
0
0
03 Nov 2024
Infinite-Resolution Integral Noise Warping for Diffusion Models
Yitong Deng
Winnie Lin
Lingxiao Li
Dmitriy Smirnov
Ryan Burgert
Ning Yu
Vincent Dedun
Mohammad H. Taghavi
36
2
0
02 Nov 2024
Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning
Penghui Ruan
Pichao Wang
Divya Saxena
Jiannong Cao
Yuhui Shi
DiffM
VGen
41
0
0
31 Oct 2024
XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM
Xiaomeng Wang
Nan Wang
Guofeng Zhang
42
0
0
31 Oct 2024
GS-Blur: A 3D Scene-Based Dataset for Realistic Image Deblurring
Dongwoo Lee
J. Park
Kyoung Mu Lee
3DGS
41
0
0
31 Oct 2024
DELTA: Dense Efficient Long-range 3D Tracking for any video
Tuan Duc Ngo
Peiye Zhuang
Chuang Gan
E. Kalogerakis
Sergey Tulyakov
Hsin-Ying Lee
Chaoyang Wang
60
5
0
31 Oct 2024
LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM
Yucheng Huang
Luping Ji
Hudong Liu
Mao Ye
48
0
0
30 Oct 2024
Previous
1
2
3
4
5
6
...
28
29
30
Next