Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1704.00675
Cited By
v1
v2
v3 (latest)
The 2017 DAVIS Challenge on Video Object Segmentation
3 April 2017
Jordi Pont-Tuset
Federico Perazzi
Sergi Caelles
Pablo Arbeláez
A. Sorkine-Hornung
Luc Van Gool
VGen
VOS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The 2017 DAVIS Challenge on Video Object Segmentation"
50 / 679 papers shown
Title
V2P-Bench: Evaluating Video-Language Understanding with Visual Prompts for Better Human-Model Interaction
Yiming Zhao
Y. Zeng
Yukun Qi
Yi Liu
Lin Yen-Chen
Zehui Chen
Xikun Bao
Jie Zhao
Feng Zhao
VLM
116
2
0
22 Mar 2025
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer
Qingyu Shi
Jianzong Wu
Jinbin Bai
Jing Zhang
Lu Qi
Xuelong Li
Yunhai Tong
95
0
0
21 Mar 2025
Structured-Noise Masked Modeling for Video, Audio and Beyond
Aritra Bhowmik
Fida Mohammad Thoker
Carlos Hinojosa
Bernard Ghanem
Cees G. M. Snoek
VGen
108
0
0
20 Mar 2025
SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation
Chun-Han Yao
Yiming Xie
Vikram S. Voleti
Huaizu Jiang
Varun Jampani
3DGS
VGen
140
1
0
20 Mar 2025
EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation
Zihao Zhang
Haoran Chen
Haoyu Zhao
Guansong Lu
Yanwei Fu
Hang Xu
Zuxuan Wu
VGen
DiffM
173
2
0
20 Mar 2025
How to Train Your Dragon: Automatic Diffusion-Based Rigging for Characters with Diverse Topologies
Zeqi Gu
Difan Liu
Timothy Langlois
Matthew Fisher
Abe Davis
DiffM
3DH
115
0
0
19 Mar 2025
DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework
Henrique Morimitsu
Xiaobin Zhu
Roberto M. Cesar Jr.
Xiangyang Ji
Xu-Cheng Yin
MDE
100
0
0
19 Mar 2025
SAM2 for Image and Video Segmentation: A Comprehensive Survey
Zhang Jiaxing
Tang Hao
VLM
107
0
0
17 Mar 2025
GIFT: Generated Indoor video frames for Texture-less point tracking
Jianzheng Huang
Xianyu Mo
Ziling Liu
Jinyu Yang
Feng Zheng
DiffM
3DPC
3DV
VGen
99
0
0
17 Mar 2025
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Hyeonho Jeong
Suhyeon Lee
Jong Chul Ye
VGen
490
2
0
12 Mar 2025
Investigation of Frame Differences as Motion Cues for Video Object Segmentation
Sota Kawamura
Hirotada Honda
Shugo Nakamura
Takashi Sano
VOS
105
0
0
12 Mar 2025
MVGSR: Multi-View Consistency Gaussian Splatting for Robust Surface Reconstruction
Chenfeng Hou
Qi Xun Yeo
Mengqi Guo
Yongxin Su
Yanyan Li
G. Lee
3DGS
107
2
0
11 Mar 2025
Small Vision-Language Models: A Survey on Compact Architectures and Techniques
Nitesh Patnaik
Navdeep Nayak
Himani Bansal Agrawal
Moinak Chinmoy Khamaru
Gourav Bal
Saishree Smaranika Panda
Rishi Raj
Vishal Meena
Kartheek Vadlamani
VLM
97
0
0
09 Mar 2025
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
Mark YU
Wenbo Hu
Jinbo Xing
Ying Shan
VGen
152
12
0
07 Mar 2025
SMITE: Segment Me In TimE
Amirhossein Alimohammadi
Sauradip Nag
Saeid Asgari Taghanaki
Andrea Tagliasacchi
Ghassan Hamarneh
Ali Mahdavi-Amiri
VLM
VOS
531
3
0
20 Feb 2025
MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching
Yen-Siang Wu
Chi-Pin Huang
Fu-En Yang
Yu-Jie Wang
DiffM
VGen
125
1
0
18 Feb 2025
Simplifying DINO via Coding Rate Regularization
Ziyang Wu
Jingyuan Zhang
Druv Pai
Xinze Wang
Chandan Singh
Jianwei Yang
Jianfeng Gao
Yi-An Ma
548
1
0
17 Feb 2025
Object-Centric Latent Action Learning
Albina Klepach
Alexander Nikulin
Ilya Zisman
Denis Tarasov
Alexander Derevyagin
Andrei Polubarov
Nikita Lyubaykin
Vladislav Kurenkov
129
0
0
13 Feb 2025
Consistent Video Colorization via Palette Guidance
Han Wang
Yuang Zhang
Yuhong Zhang
Lingxiao Lu
Li Song
DiffM
VGen
127
0
0
31 Jan 2025
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Fu Rong
Meng Lan
Qian Zhang
Lefei Zhang
VOS
VGen
114
1
0
23 Jan 2025
Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks
Michael Schwingshackl
Fabio Francisco Oberweger
Markus Murschitz
80
1
0
20 Jan 2025
EdgeTAM: On-Device Track Anything Model
Chong Zhou
Chenchen Zhu
Yunyang Xiong
Saksham Suri
Fanyi Xiao
...
Raghuraman Krishnamoorthi
Bo Dai
Chen Change Loy
Vikas Chandra
Bilge Soran
VLM
106
1
0
13 Jan 2025
Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy
Risha Goel
Zain Shabeeb
Isabel Panicker
Vida Jamali
VLM
43
0
0
06 Jan 2025
ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking
Tingyang Zhang
Chen Wang
Zhiyang Dou
Qingzhe Gao
Jiahui Lei
Baoquan Chen
Lingjie Liu
3DV
119
0
0
06 Jan 2025
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Hao Fei
Shengqiong Wu
Hao Zhang
Tat-Seng Chua
Shuicheng Yan
185
42
0
31 Dec 2024
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM
Yuqian Yuan
Hang Zhang
Wentong Li
Zesen Cheng
Boqiang Zhang
...
Deli Zhao
Wenqiao Zhang
Yueting Zhuang
Jianke Zhu
Lidong Bing
164
10
0
31 Dec 2024
The Dynamic Duo of Collaborative Masking and Target for Advanced Masked Autoencoder Learning
Shentong Mo
100
0
0
23 Dec 2024
Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation
Luoxu Jin
Hiroshi Watanabe
DiffM
VGen
243
0
0
22 Dec 2024
First-frame Supervised Video Polyp Segmentation via Propagative and Semantic Dual-teacher Network
Qiang Hu
Mei Liu
Qiang Li
Zhiwei Wang
128
0
0
21 Dec 2024
M
3
^3
3
-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
Zixuan Chen
Jiaxin Li
Liming Tan
Yejie Guo
Junxuan Liang
Cewu Lu
Yongqian Li
VOS
124
0
0
18 Dec 2024
SAMIC: Segment Anything with In-Context Spatial Prompt Engineering
S. Nagendra
Kashif Rashid
Chaopeng Shen
Daniel Kifer
VLM
143
2
0
16 Dec 2024
Spatiotemporal Blind-Spot Network with Calibrated Flow Alignment for Self-Supervised Video Denoising
Zikang Chen
Tao Jiang
Xiaowan Hu
Wang Zhang
Huaqiu Li
Haoqian Wang
104
0
0
16 Dec 2024
Generative Inbetweening through Frame-wise Conditions-Driven Video Generation
Tianyi Zhu
Dongwei Ren
Qilong Wang
Xiaohe Wu
W. Zuo
VGen
135
3
0
16 Dec 2024
EgoPoints: Advancing Point Tracking for Egocentric Videos
Ahmad Darkhalil
Rhodri Guerrier
Adam W. Harley
Dima Damen
106
3
0
05 Dec 2024
Gen-SIS: Generative Self-augmentation Improves Self-supervised Learning
Varun Belagali
Srikar Yellapragada
Alexandros Graikos
S. Kapse
Zilinghan Li
Tarak Nandi
Ravi K. Madduri
Prateek Prasanna
Joel H. Saltz
Dimitris Samaras
DiffM
144
2
0
02 Dec 2024
VISION-XL: High Definition Video Inverse Problem Solver using Latent Image Diffusion Models
Taesung Kwon
Jong Chul Ye
173
1
0
29 Nov 2024
On Moving Object Segmentation from Monocular Video with Transformers
Christian Homeyer
Christoph Schnörr
149
3
0
28 Nov 2024
A Distractor-Aware Memory for Visual Object Tracking with SAM2
Jovana Videnovic
A. Lukežič
Matej Kristan
VLM
147
3
0
26 Nov 2024
UVCG: Leveraging Temporal Consistency for Universal Video Protection
KaiZhou Li
Jindong Gu
Xinchun Yu
Junjie Cao
Yansong Tang
Xiao-Ping Zhang
AAML
121
0
0
25 Nov 2024
Context-Aware Input Orchestration for Video Inpainting
Hoyoung Kim
Azimbek Khudoyberdiev
Seonghwan Jeong
Jihoon Ryoo
167
0
0
25 Nov 2024
Integrating Deep Metric Learning with Coreset for Active Learning in 3D Segmentation
Arvind Murari Vepa
Zukang Yang
Andrew Choi
Jungseock Joo
Fabien Scalzo
Yizhou Sun
3DPC
115
1
0
24 Nov 2024
IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos
Yunong Liu
Cristobal Eyzaguirre
Manling Li
Shubh Khanna
Juan Carlos Niebles
Vineeth Ravi
Saumitra Mishra
Weiyu Liu
Jiajun Wu
122
1
0
18 Nov 2024
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing
Chang-Shu Liu
Rui Li
Kaidong Zhang
Yunwei Lan
Dong Liu
DiffM
VGen
87
7
0
17 Nov 2024
Video Denoising in Fluorescence Guided Surgery
Trevor Seets
Andreas Velten
MedIm
AI4CE
59
0
0
14 Nov 2024
MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation
Jonas Serych
Michal Neoral
Jirí Matas
116
3
0
14 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
175
0
0
12 Nov 2024
State Chrono Representation for Enhancing Generalization in Reinforcement Learning
Jianda Chen
Wen Zheng Terence Ng
Zichen Chen
Sinno Jialin Pan
Tianwei Zhang
OffRL
66
0
0
09 Nov 2024
Moving Off-the-Grid: Scene-Grounded Video Representations
Sjoerd van Steenkiste
Daniel Zoran
Yi Yang
Yulia Rubanova
Rishabh Kabra
...
Thomas Keck
João Carreira
Alexey Dosovitskiy
Mehdi S. M. Sajjadi
Thomas Kipf
75
4
0
08 Nov 2024
LiVOS: Light Video Object Segmentation with Gated Linear Matching
Qin Liu
Jianfeng Wang
Zhiyong Yang
Linjie Li
Kevin Qinghong Lin
Marc Niethammer
Lijuan Wang
VOS
83
1
0
05 Nov 2024
DELTA: Dense Efficient Long-range 3D Tracking for any video
Tuan Duc Ngo
Peiye Zhuang
Chuang Gan
E. Kalogerakis
Sergey Tulyakov
Hsin-Ying Lee
Chaoyang Wang
195
8
0
31 Oct 2024
Previous
1
2
3
4
5
...
12
13
14
Next