Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1704.00675
Cited By
v1
v2
v3 (latest)
The 2017 DAVIS Challenge on Video Object Segmentation
3 April 2017
Jordi Pont-Tuset
Federico Perazzi
Sergi Caelles
Pablo Arbeláez
A. Sorkine-Hornung
Luc Van Gool
VGen
VOS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The 2017 DAVIS Challenge on Video Object Segmentation"
50 / 679 papers shown
Title
ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts
Shiu-hong Kao
Yu-Wing Tai
Chi-Keung Tang
VOS
MLLM
VGen
LRM
105
0
0
01 Jul 2025
Emergent Temporal Correspondences from Video Diffusion Transformers
Jisu Nam
Soowon Son
Dahyun Chung
Jiyoung Kim
Siyoon Jin
Junhwa Hur
Seungryong Kim
VGen
23
0
0
20 Jun 2025
A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects
Guohuan Xie
Syed Ariff Syed Hesham
Wenya Guo
Bing Li
Ming-Ming Cheng
Guolei Sun
Yun-Hai Liu
26
0
0
16 Jun 2025
Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors
Wen-Hsuan Chu
Lei Ke
Jianmeng Liu
Mingxiao Huo
P. Tokmakov
Katerina Fragkiadaki
3DGS
29
0
0
15 Jun 2025
StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams
Zike Wu
Qi Yan
Xuanyu Yi
Lele Wang
Renjie Liao
3DGS
21
0
0
10 Jun 2025
Consistent Video Editing as Flow-Driven Image-to-Video Generation
Ge Wang
Songlin Fan
Hangxu Liu
Quanjian Song
Hewei Wang
Jinfeng Xu
DiffM
VGen
29
0
0
09 Jun 2025
Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding
Boyu Chen
Siran Chen
Kunchang Li
Qinglin Xu
Yu Qiao
Yali Wang
VOS
25
0
0
09 Jun 2025
Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models
Sangwon Jang
Taekyung Ki
Jaehyeong Jo
Jaehong Yoon
Soo Ye Kim
Zhe Lin
Sung Ju Hwang
DiffM
VGen
25
0
0
08 Jun 2025
THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation
Mingqi Gao
Haoran Duan
Tianlu Zhang
Jungong Han
10
0
0
07 Jun 2025
FADE: Frequency-Aware Diffusion Model Factorization for Video Editing
Yixuan Zhu
Haolin Wang
Shilin Ma
Wenliang Zhao
Yansong Tang
Lei Chen
Jie Zhou
DiffM
VGen
43
0
0
06 Jun 2025
FRAME: Pre-Training Video Feature Representations via Anticipation and Memory
Sethuraman TV
Savya Khosla
Vignesh Srinivasakumar
Jiahui Huang
Seoung Wug Oh
Simon Jenni
Derek Hoiem
Joon-Young Lee
34
0
0
05 Jun 2025
SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost
Haiyang Mei
Pengyu Zhang
Mike Zheng Shou
VLM
47
0
0
02 Jun 2025
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
Yang-tian Sun
Xin Yu
Zehuan Huang
Yi-Hua Huang
Yuan-Chen Guo
Ziyi Yang
Yan-Pei Cao
Xiaojuan Qi
DiffM
VGen
MDE
46
1
0
30 May 2025
Maximum Likelihood Learning of Latent Dynamics Without Reconstruction
Samo Hromadka
Kai Biegun
Lior Fox
James Heald
M. Sahani
BDL
AI4TS
23
0
0
29 May 2025
MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
Sihan Yang
Runsen Xu
Yiman Xie
Sizhe Yang
Mo Li
...
Haodong Duan
Xiangyu Yue
Dahua Lin
Tai Wang
Jiangmiao Pang
VLM
LRM
53
1
0
29 May 2025
Sci-Fi: Symmetric Constraint for Frame Inbetweening
Liuhan Chen
Xiaodong Cun
Xiaoyu Li
Xianyi He
Shenghai Yuan
Jie Chen
Ying Shan
Li Yuan
VGen
81
0
0
27 May 2025
TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs
Juntong Wang
Jiarui Wang
Huiyu Duan
Guangtao Zhai
Xiongkuo Min
41
1
0
26 May 2025
A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking
Zixiang Zhao
Haowen Bai
Bingxin Ke
Yukun Cui
Lilun Deng
Yulun Zhang
Kai Zhang
Konrad Schindler
VGen
48
0
0
26 May 2025
Reasoning Segmentation for Images and Videos: A Survey
Yiqing Shen
Chenjia Li
Fei Xiong
Jeong-O Jeong
Tianpeng Wang
Michael Latman
Mathias Unberath
VOS
244
0
0
24 May 2025
RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers
Ahmet Berke Gokmen
Yigit Ekin
Bahri Batuhan Bilecen
Aysegül Dündar
153
0
0
19 May 2025
Efficient Neural Video Representation with Temporally Coherent Modulation
Seungjun Shin
S. Kim
Dokwan Oh
72
0
0
01 May 2025
DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining
Wei Zhuo
Zhiyue Tang
Wufeng Xue
Hao Ding
Linlin Shen
111
0
0
22 Apr 2025
Plug-and-Play Versatile Compressed Video Enhancement
Huimin Zeng
Jiacheng Li
Zhiwei Xiong
65
0
0
21 Apr 2025
MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video
Minh-Quan Viet Bui
Jongmin Park
J. P. Bello
Jaeho Moon
Jihyong Oh
Munchurl Kim
440
0
0
21 Apr 2025
Seurat: From Moving Points to Depth
Seokju Cho
Jiahui Huang
S. Kim
Joon-Young Lee
3DPC
MDE
77
1
0
20 Apr 2025
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models
Haiwen Huang
Anpei Chen
Volodymyr Havrylov
Andreas Geiger
Dan Zhang
64
2
0
18 Apr 2025
UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models
Guanlong Jiao
Biqing Huang
Kuan-Chieh Wang
Renjie Liao
DiffM
137
0
0
17 Apr 2025
Physical Reservoir Computing in Hook-Shaped Rover Wheel Spokes for Real-Time Terrain Identification
Xiao Jin
Zihan Wang
Zhenhua Yu
Changrak Choi
Kalind Carpenter
T. Nanayakkara
76
2
0
17 Apr 2025
Perception Encoder: The best visual embeddings are not at the output of the network
Daniel Bolya
Po-Yao (Bernie) Huang
Peize Sun
Jang Hyun Cho
Andrea Madotto
...
Shiyu Dong
Nikhila Ravi
Daniel Li
Piotr Dollár
Christoph Feichtenhofer
ObjD
VOS
329
9
0
17 Apr 2025
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency
Mengshi Qi
Pengfei Zhu
Xianrui Li
Xiaoyang Bi
Lu Qi
Huadong Ma
Ming-Hsuan Yang
VOS
VLM
133
0
0
16 Apr 2025
Understanding Attention Mechanism in Video Diffusion Models
Bingyan Liu
Chengyu Wang
Tongtong Su
Huan Ten
Jun Huang
K. Guo
Kui Jia
VGen
101
1
0
16 Apr 2025
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Henghui Ding
Chang Liu
Nikhila Ravi
Shuting He
Y. Wei
...
Haobo Yuan
Xuelong Li
Tao Zhang
Lu Qi
Ming-Hsuan Yang
92
1
0
15 Apr 2025
MASSeg : 2nd Technical Report for 4th PVUW MOSE Track
Xuqiang Cao
Linnan Zhao
Jiaxuan Zhao
Fang Liu
Puhua Chen
Wenping Ma
81
0
0
14 Apr 2025
FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution
Mengjiao Wang
Junpei Zhang
Xu Liu
Yuting Yang
Mengru Ma
VOS
77
0
0
13 Apr 2025
UniFlowRestore: A General Video Restoration Framework via Flow Matching and Prompt Guidance
Siyang Song
Yu Zhang
Chen Wu
Dianjie Lu
Dianjie Lu
Guijuan Zhan
Yang Weng
Zhuoran Zheng
DiffM
VGen
61
0
0
12 Apr 2025
STSeg-Complex Video Object Segmentation: The 1st Solution for 4th PVUW MOSE Challenge
Kehuan Song
Xinglin Xie
Kexin Zhang
Licheng Jiao
Lingling Li
Steve Yang
VOS
87
0
0
11 Apr 2025
Robust SAM: On the Adversarial Robustness of Vision Foundation Models
Jiahuan Long
Zhengqin Xu
Tingsong Jiang
Wen Yao
Shuai Jia
Chao Ma
Xiaoqian Chen
AAML
VLM
98
1
0
11 Apr 2025
ZS-VCOS: Zero-Shot Outperforms Supervised Video Camouflaged Object Segmentation
Wenqi Guo
Shan Du
VLM
97
0
0
10 Apr 2025
D^2USt3R: Enhancing 3D Reconstruction with 4D Pointmaps for Dynamic Scenes
Jisang Han
Honggyu An
Jaewoo Jung
Takuya Narihira
Junyoung Seo
Kazumi Fukuda
Chaehyun Kim
Sunghwan Hong
Yuki Mitsufuji
Seungryong Kim
100
2
0
08 Apr 2025
Studying Image Diffusion Features for Zero-Shot Video Object Segmentation
Thanos Delatolas
Vicky S. Kalogeiton
Dim P. Papadopoulos
DiffM
VOS
128
2
0
07 Apr 2025
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting
Yunlong Tang
Jing Bi
Chao Huang
Susan Liang
Daiki Shimada
...
Jinxi He
Liu He
Zeliang Zhang
Jiebo Luo
Chenliang Xu
107
1
0
07 Apr 2025
ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer
Jiayi Gao
Zijin Yin
Changcheng Hua
Yuxin Peng
Kongming Liang
Zhanyu Ma
Jiaxin Guo
Yang Liu
VGen
DiffM
104
2
0
03 Apr 2025
Zero-Shot 4D Lidar Panoptic Segmentation
Yushan Zhang
Aljosa Osep
Laura Leal-Taixé
Tim Meinhardt
3DPC
98
1
0
01 Apr 2025
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Bernard Ghanem
140
0
0
01 Apr 2025
FreeInv: Free Lunch for Improving DDIM Inversion
Yuxiang Bao
Huijie Liu
Xun Gao
Huan Fu
Guoliang Kang
57
0
0
29 Mar 2025
Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion
Jangho Park
Taesung Kwon
Jong Chul Ye
VGen
105
1
0
28 Mar 2025
Segment Any Motion in Videos
Nan Huang
Wenzhao Zheng
Chenfeng Xu
Kurt Keutzer
Shanghang Zhang
Angjoo Kanazawa
Qianqian Wang
VOS
100
1
0
28 Mar 2025
MVFNet: Multipurpose Video Forensics Network using Multiple Forms of Forensic Evidence
Tai D. Nguyen
Matthew C. Stamm
124
0
0
26 Mar 2025
EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models
Yufei Cai
Hu Han
Yuxiang Wei
Shiguang Shan
Xilin Chen
DiffM
VGen
95
0
0
25 Mar 2025
TransAnimate: Taming Layer Diffusion to Generate RGBA Video
Xuewei Chen
Zhimin Chen
Yiren Song
VGen
123
2
0
23 Mar 2025
1
2
3
4
...
12
13
14
Next