Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.12039
Cited By
v1
v2
v3 (latest)
RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
26 March 2020
Zachary Teed
Jia Deng
MDE
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3586★)
Papers citing
"RAFT: Recurrent All-Pairs Field Transforms for Optical Flow"
50 / 1,532 papers shown
Title
TAPNext: Tracking Any Point (TAP) as Next Token Prediction
Artem Zholus
Carl Doersch
Yi Yang
Skanda Koppula
Viorica Patraucean
Xu He
Ignacio Rocco
Mehdi S. M. Sajjadi
Sarath Chandar
Ross Goroshin
89
0
0
08 Apr 2025
Saliency-Motion Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation
Xiangyu Zheng
Wanyun Li
Songcheng He
Jianping Fan
Xiaoqiang Li
We Zhang
VOS
91
0
0
08 Apr 2025
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
Mengchao Wang
Qiang Wang
Fan Jiang
Yaqi Fan
Yunpeng Zhang
Yonggang Qi
Kun Zhao
Mu Xu
DiffM
VGen
85
5
0
07 Apr 2025
Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM
Zhicong Sun
Jacqueline Lo
Jinxing Hu
3DGS
69
0
0
07 Apr 2025
Video-Bench: Human-Aligned Video Generation Benchmark
Hui Han
Siyuan Li
Jiaqi Chen
Yiwen Yuan
Yuling Wu
...
You Li
Jing Zhang
Chi Zhang
Li Li
Yongxin Ni
EGVM
VGen
214
0
0
07 Apr 2025
FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency
Shiyan Liu
Rui Qu
Yan Jin
88
0
0
06 Apr 2025
3D Scene Understanding Through Local Random Access Sequence Modeling
Wanhee Lee
Klemen Kotar
R. Venkatesh
Jared Watrous
Honglin Chen
Khai Loong Aw
Daniel L. K. Yamins
3DV
67
0
0
04 Apr 2025
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation
Xin Zhang
Robby T. Tan
Mamba
100
1
0
04 Apr 2025
PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation
Lihua Liu
Jiehong Lin
Zhenxin Liu
Kui Jia
83
0
0
03 Apr 2025
BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation
Van Nguyen Nguyen
Stephen Tyree
Andrew Guo
Mederic Fourmy
Anas Gouda
...
Stan Birchfield
Jiri Matas
Yann Labbé
M. Sundermeyer
Tomás Hodan
3DPC
156
4
0
03 Apr 2025
Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization
Jiadong Wang
Jingyuan Liu
Xin Sun
Krishna Kumar Singh
Zhixin Shu
...
Nanxuan Zhao
Tuanfeng Y. Wang
Simon Chen
Ulrich Neumann
Jae Shin Yoon
74
0
0
03 Apr 2025
Training-free Dense-Aligned Diffusion Guidance for Modular Conditional Image Synthesis
Zixuan Wang
Duo Peng
Feng Chen
Yue Yang
Yinjie Lei
DiffM
144
0
0
02 Apr 2025
Scene-Centric Unsupervised Panoptic Segmentation
Oliver Hahn
Christoph Reich
Nikita Araslanov
Daniel Cremers
Christian Rupprecht
Stefan Roth
OCL
144
0
0
02 Apr 2025
Think Small, Act Big: Primitive Prompt Learning for Lifelong Robot Manipulation
Yuanqi Yao
Siao Liu
Haoming Song
Delin Qu
Qizhi Chen
Yan Ding
Bin Zhao
Ziyi Wang
Xiaochen Li
Dong Wang
CLL
142
1
0
01 Apr 2025
Hierarchical Flow Diffusion for Efficient Frame Interpolation
Yang Hai
Guo Wang
Tan Su
Wenjie Jiang
Yinlin Hu
DiffM
113
0
0
01 Apr 2025
Point Tracking in Surgery--The 2024 Surgical Tattoos in Infrared (STIR) Challenge
Adam Schmidt
Mert Asim Karaoglu
Soham Sinha
Mingang Jang
Ho-Gun Ha
...
Zijian Wu
A. Ladikos
S. DiMaio
Septimiu E. Salcudean
Omid Mohareri
86
1
0
31 Mar 2025
JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation
Fangda Chen
Shanshan Zhao
Chuanfu Xu
Long Lan
VGen
91
2
0
31 Mar 2025
VideoGen-Eval: Agent-based System for Video Generation Evaluation
Yuhang Yang
Ke Fan
Siyang Song
Hongxiang Li
Ailing Zeng
FeiLin Han
Wei-dong Zhai
Wen Liu
Yang Cao
Zheng-jun Zha
EGVM
VGen
123
1
0
30 Mar 2025
VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior
Xindi Yang
Baolu Li
Yanzhe Zhang
Zhenfei Yin
Lei Bai
...
Zhiyong Wang
Jianfei Cai
Tien-Tsin Wong
Huchuan Lu
Xu Jia
DiffM
VGen
147
1
0
30 Mar 2025
AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos
Felix Wimbauer
Weirong Chen
Dominik Muhle
Christian Rupprecht
Daniel Cremers
VGen
166
0
0
30 Mar 2025
Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges
Ukcheol Shin
Jinsun Park
3DV
MDE
81
0
0
28 Mar 2025
Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision
Rulin Zhou
Wenlong He
An Wang
Qiqi Yao
Haijun Hu
Jiankun Wang
Xi Zhang an Hongliang Ren
69
0
0
28 Mar 2025
Can Video Diffusion Model Reconstruct 4D Geometry?
Jinjie Mai
Wenxuan Zhu
Haozhe Liu
Bing Li
Cheng Zheng
Jürgen Schmidhuber
Bernard Ghanem
VGen
MDE
157
0
0
27 Mar 2025
Multispectral Demosaicing via Dual Cameras
SaiKiran Tedla
Junyong Lee
Beixuan Yang
Mahmoud Afifi
M. Brown
97
0
0
27 Mar 2025
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness
Dian Zheng
Ziqi Huang
Hongbo Liu
Kai Zou
Yinan He
...
Yize Zhang
Jingwen He
Wei-Shi Zheng
Yu Qiao
Ziwei Liu
EGVM
VGen
130
14
0
27 Mar 2025
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
Shijie Zhou
Hui Ren
Yijia Weng
Shuwang Zhang
Zhen Wang
...
Zhiwen Fan
Suya You
Ziyi Wang
Leonidas Guibas
A. Kadambi
VGen
3DGS
150
2
0
26 Mar 2025
MVFNet: Multipurpose Video Forensics Network using Multiple Forms of Forensic Evidence
Tai D. Nguyen
Matthew C. Stamm
127
0
0
26 Mar 2025
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
T. Liu
Z. Huang
Zhaoxi Chen
Guangcong Wang
Shoukang Hu
Liao Shen
Huiqiang Sun
Z. Cao
Wei Li
Ziwei Liu
VGen
3DGS
129
1
0
26 Mar 2025
FullDiT: Multi-Task Video Generative Foundation Model with Full Attention
Xuan Ju
Weicai Ye
Quande Liu
Qiulin Wang
Xintao Wang
Pengfei Wan
Di Zhang
Kun Gai
Qiang Xu
VGen
108
4
0
25 Mar 2025
Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better
Zihang Lai
Andrea Vedaldi
71
1
0
25 Mar 2025
Burst Image Super-Resolution with Mamba
Ozan Unal
Steven Marty
Dengxin Dai
Mamba
74
0
0
25 Mar 2025
Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals
Stefan Stojanov
David Wendt
Seungwoo Kim
R. Venkatesh
Kevin T. Feigelis
Jiajun Wu
Daniel L. K. Yamins
SSL
99
0
0
25 Mar 2025
AMD-Hummingbird: Towards an Efficient Text-to-Video Model
Takashi Isobe
He Cui
Dong Zhou
Mengmeng Ge
D. Li
E. Barsoum
VGen
93
1
0
24 Mar 2025
Aether: Geometric-Aware Unified World Modeling
Aether Team
Haoyi Zhu
Yanjie Wang
Jianjun Zhou
Wenzheng Chang
...
Zizun Li
Junyi Chen
Chunhua Shen
Jiangmiao Pang
Tong He
DiffM
VGen
124
9
0
24 Mar 2025
MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion
Yikun Ma
Yiqing Li
Jiawei Wu
Xing Luo
Zhi Jin
DiffM
VGen
154
0
0
22 Mar 2025
Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image
Jerred Chen
Ronald Clark
114
1
0
21 Mar 2025
Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos
Yuang Feng
Shuyong Gao
Fuzhen Yan
Yicheng Song
Lingyi Hong
J. Hu
Wenqiang Zhang
VOS
85
0
0
21 Mar 2025
Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks
Bhishma Dedhia
David Bourgin
Krishna Kumar Singh
Yuheng Li
Yan Kang
Zhan Xu
N. Jha
Yixiao Liu
DiffM
VGen
120
0
0
21 Mar 2025
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer
Qingyu Shi
Jianzong Wu
Jinbin Bai
Jing Zhang
Lu Qi
Xuelong Li
Yunhai Tong
95
0
0
21 Mar 2025
HyperNVD: Accelerating Neural Video Decomposition via Hypernetworks
Maria Pilligua
Danna Xue
Javier Vázquez-Corral
87
0
0
21 Mar 2025
Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction
Edgar Sucar
Zihang Lai
Eldar Insafutdinov
Andrea Vedaldi
81
1
0
20 Mar 2025
DIPLI: Deep Image Prior Lucky Imaging for Blind Astronomical Image Restoration
Suraj Singh
Anastasia Batsheva
Oleg Y. Rogov
Ahmed Bouridane
70
0
0
20 Mar 2025
ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos
Haolin Yang
Feilong Tang
Ming Hu
Yulong Li
Junjie Guo
...
Zelin Peng
Junjun He
Junjun He
Zongyuan Ge
Imran Razzak
DiffM
VGen
301
2
0
20 Mar 2025
4D Gaussian Splatting SLAM
Yanyan Li
Youxu Fang
Zunjie Zhu
Kunyi Li
Yong Ding
Federico Tombari
3DGS
137
0
0
20 Mar 2025
Nano-3D: Metasurface-Based Neural Depth Imaging
Bingxuan Li
Jiahao Wu
Yuan Xu
Yunxiang Zhang
Zezheng Zhu
Nanfang Yu
Qi Sun
MDE
66
0
0
20 Mar 2025
DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework
Henrique Morimitsu
Xiaobin Zhu
Roberto M. Cesar Jr.
Xiangyang Ji
Xu-Cheng Yin
MDE
102
0
0
19 Mar 2025
Learn Your Scales: Towards Scale-Consistent Generative Novel View Synthesis
Fereshteh Forghani
Jason J. Yu
Tristan Aumentado-Armstrong
Konstantinos G. Derpanis
Marcus A. Brubaker
DiffM
81
0
0
19 Mar 2025
High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight
Cédric Vincent
Taehyoung Kim
Henri Meeß
79
0
0
19 Mar 2025
xMOD: Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D motion
Saad Lahlali
Sandra Kara
Hejer Ammar
Florian Chabot
Nicolas Granger
Hervé Le Borgne
Q. C. Pham
3DPC
103
0
0
19 Mar 2025
Limb-Aware Virtual Try-On Network with Progressive Clothing Warping
Shengping Zhang
Xiaoyu Han
Weigang Zhang
Xiangyuan Lan
Hongxun Yao
Qingming Huang
3DH
169
7
0
18 Mar 2025
Previous
1
2
3
4
5
6
...
29
30
31
Next