ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1704.00675
  4. Cited By
The 2017 DAVIS Challenge on Video Object Segmentation
v1v2v3 (latest)

The 2017 DAVIS Challenge on Video Object Segmentation

3 April 2017
Jordi Pont-Tuset
Federico Perazzi
Sergi Caelles
Pablo Arbeláez
A. Sorkine-Hornung
Luc Van Gool
    VGenVOS
ArXiv (abs)PDFHTML

Papers citing "The 2017 DAVIS Challenge on Video Object Segmentation"

50 / 679 papers shown
Title
ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts
ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts
Shiu-hong Kao
Yu-Wing Tai
Chi-Keung Tang
VOSMLLMVGenLRM
105
0
0
01 Jul 2025
Emergent Temporal Correspondences from Video Diffusion Transformers
Emergent Temporal Correspondences from Video Diffusion Transformers
Jisu Nam
Soowon Son
Dahyun Chung
Jiyoung Kim
Siyoon Jin
Junhwa Hur
Seungryong Kim
VGen
23
0
0
20 Jun 2025
A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects
A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects
Guohuan Xie
Syed Ariff Syed Hesham
Wenya Guo
Bing Li
Ming-Ming Cheng
Guolei Sun
Yun-Hai Liu
29
0
0
16 Jun 2025
Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors
Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors
Wen-Hsuan Chu
Lei Ke
Jianmeng Liu
Mingxiao Huo
P. Tokmakov
Katerina Fragkiadaki
3DGS
32
0
0
15 Jun 2025
StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams
StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams
Zike Wu
Qi Yan
Xuanyu Yi
Lele Wang
Renjie Liao
3DGS
26
0
0
10 Jun 2025
Consistent Video Editing as Flow-Driven Image-to-Video Generation
Consistent Video Editing as Flow-Driven Image-to-Video Generation
Ge Wang
Songlin Fan
Hangxu Liu
Quanjian Song
Hewei Wang
Jinfeng Xu
DiffMVGen
29
0
0
09 Jun 2025
Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding
Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding
Boyu Chen
Siran Chen
Kunchang Li
Qinglin Xu
Yu Qiao
Yali Wang
VOS
25
0
0
09 Jun 2025
Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models
Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models
Sangwon Jang
Taekyung Ki
Jaehyeong Jo
Jaehong Yoon
Soo Ye Kim
Zhe Lin
Sung Ju Hwang
DiffMVGen
25
0
0
08 Jun 2025
THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation
THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation
Mingqi Gao
Haoran Duan
Tianlu Zhang
Jungong Han
12
0
0
07 Jun 2025
FADE: Frequency-Aware Diffusion Model Factorization for Video Editing
FADE: Frequency-Aware Diffusion Model Factorization for Video Editing
Yixuan Zhu
Haolin Wang
Shilin Ma
Wenliang Zhao
Yansong Tang
Lei Chen
Jie Zhou
DiffMVGen
47
0
0
06 Jun 2025
FRAME: Pre-Training Video Feature Representations via Anticipation and Memory
FRAME: Pre-Training Video Feature Representations via Anticipation and Memory
Sethuraman TV
Savya Khosla
Vignesh Srinivasakumar
Jiahui Huang
Seoung Wug Oh
Simon Jenni
Derek Hoiem
Joon-Young Lee
36
0
0
05 Jun 2025
SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost
SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost
Haiyang Mei
Pengyu Zhang
Mike Zheng Shou
VLM
47
0
0
02 Jun 2025
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
Yang-tian Sun
Xin Yu
Zehuan Huang
Yi-Hua Huang
Yuan-Chen Guo
Ziyi Yang
Yan-Pei Cao
Xiaojuan Qi
DiffMVGenMDE
46
1
0
30 May 2025
Maximum Likelihood Learning of Latent Dynamics Without Reconstruction
Maximum Likelihood Learning of Latent Dynamics Without Reconstruction
Samo Hromadka
Kai Biegun
Lior Fox
James Heald
M. Sahani
BDLAI4TS
28
0
0
29 May 2025
MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
Sihan Yang
Runsen Xu
Yiman Xie
Sizhe Yang
Mo Li
...
Haodong Duan
Xiangyu Yue
Dahua Lin
Tai Wang
Jiangmiao Pang
VLMLRM
53
1
0
29 May 2025
Sci-Fi: Symmetric Constraint for Frame Inbetweening
Sci-Fi: Symmetric Constraint for Frame Inbetweening
Liuhan Chen
Xiaodong Cun
Xiaoyu Li
Xianyi He
Shenghai Yuan
Jie Chen
Ying Shan
Li Yuan
VGen
81
0
0
27 May 2025
TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs
TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs
Juntong Wang
Jiarui Wang
Huiyu Duan
Guangtao Zhai
Xiongkuo Min
41
1
0
26 May 2025
A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking
A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking
Zixiang Zhao
Haowen Bai
Bingxin Ke
Yukun Cui
Lilun Deng
Yulun Zhang
Kai Zhang
Konrad Schindler
VGen
48
0
0
26 May 2025
Reasoning Segmentation for Images and Videos: A Survey
Reasoning Segmentation for Images and Videos: A Survey
Yiqing Shen
Chenjia Li
Fei Xiong
Jeong-O Jeong
Tianpeng Wang
Michael Latman
Mathias Unberath
VOS
244
0
0
24 May 2025
RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers
RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers
Ahmet Berke Gokmen
Yigit Ekin
Bahri Batuhan Bilecen
Aysegül Dündar
155
0
0
19 May 2025
Efficient Neural Video Representation with Temporally Coherent Modulation
Efficient Neural Video Representation with Temporally Coherent Modulation
Seungjun Shin
S. Kim
Dokwan Oh
72
0
0
01 May 2025
DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining
DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining
Wei Zhuo
Zhiyue Tang
Wufeng Xue
Hao Ding
Linlin Shen
111
0
0
22 Apr 2025
Plug-and-Play Versatile Compressed Video Enhancement
Plug-and-Play Versatile Compressed Video Enhancement
Huimin Zeng
Jiacheng Li
Zhiwei Xiong
65
0
0
21 Apr 2025
MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video
MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular Video
Minh-Quan Viet Bui
Jongmin Park
J. P. Bello
Jaeho Moon
Jihyong Oh
Munchurl Kim
440
0
0
21 Apr 2025
Seurat: From Moving Points to Depth
Seurat: From Moving Points to Depth
Seokju Cho
Jiahui Huang
S. Kim
Joon-Young Lee
3DPCMDE
77
1
0
20 Apr 2025
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models
Haiwen Huang
Anpei Chen
Volodymyr Havrylov
Andreas Geiger
Dan Zhang
64
2
0
18 Apr 2025
UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models
UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models
Guanlong Jiao
Biqing Huang
Kuan-Chieh Wang
Renjie Liao
DiffM
139
0
0
17 Apr 2025
Physical Reservoir Computing in Hook-Shaped Rover Wheel Spokes for Real-Time Terrain Identification
Physical Reservoir Computing in Hook-Shaped Rover Wheel Spokes for Real-Time Terrain Identification
Xiao Jin
Zihan Wang
Zhenhua Yu
Changrak Choi
Kalind Carpenter
T. Nanayakkara
76
2
0
17 Apr 2025
Perception Encoder: The best visual embeddings are not at the output of the network
Perception Encoder: The best visual embeddings are not at the output of the network
Daniel Bolya
Po-Yao (Bernie) Huang
Peize Sun
Jang Hyun Cho
Andrea Madotto
...
Shiyu Dong
Nikhila Ravi
Daniel Li
Piotr Dollár
Christoph Feichtenhofer
ObjDVOS
329
9
0
17 Apr 2025
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency
Mengshi Qi
Pengfei Zhu
Xianrui Li
Xiaoyang Bi
Lu Qi
Huadong Ma
Ming-Hsuan Yang
VOSVLM
133
0
0
16 Apr 2025
Understanding Attention Mechanism in Video Diffusion Models
Understanding Attention Mechanism in Video Diffusion Models
Bingyan Liu
Chengyu Wang
Tongtong Su
Huan Ten
Jun Huang
K. Guo
Kui Jia
VGen
101
1
0
16 Apr 2025
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Henghui Ding
Chang Liu
Nikhila Ravi
Shuting He
Y. Wei
...
Haobo Yuan
Xuelong Li
Tao Zhang
Lu Qi
Ming-Hsuan Yang
92
1
0
15 Apr 2025
MASSeg : 2nd Technical Report for 4th PVUW MOSE Track
MASSeg : 2nd Technical Report for 4th PVUW MOSE Track
Xuqiang Cao
Linnan Zhao
Jiaxuan Zhao
Fang Liu
Puhua Chen
Wenping Ma
81
0
0
14 Apr 2025
FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution
FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution
Mengjiao Wang
Junpei Zhang
Xu Liu
Yuting Yang
Mengru Ma
VOS
77
0
0
13 Apr 2025
UniFlowRestore: A General Video Restoration Framework via Flow Matching and Prompt Guidance
UniFlowRestore: A General Video Restoration Framework via Flow Matching and Prompt Guidance
Siyang Song
Yu Zhang
Chen Wu
Dianjie Lu
Dianjie Lu
Guijuan Zhan
Yang Weng
Zhuoran Zheng
DiffMVGen
61
0
0
12 Apr 2025
STSeg-Complex Video Object Segmentation: The 1st Solution for 4th PVUW MOSE Challenge
STSeg-Complex Video Object Segmentation: The 1st Solution for 4th PVUW MOSE Challenge
Kehuan Song
Xinglin Xie
Kexin Zhang
Licheng Jiao
Lingling Li
Steve Yang
VOS
87
0
0
11 Apr 2025
Robust SAM: On the Adversarial Robustness of Vision Foundation Models
Robust SAM: On the Adversarial Robustness of Vision Foundation Models
Jiahuan Long
Zhengqin Xu
Tingsong Jiang
Wen Yao
Shuai Jia
Chao Ma
Xiaoqian Chen
AAMLVLM
98
1
0
11 Apr 2025
ZS-VCOS: Zero-Shot Outperforms Supervised Video Camouflaged Object Segmentation
ZS-VCOS: Zero-Shot Outperforms Supervised Video Camouflaged Object Segmentation
Wenqi Guo
Shan Du
VLM
97
0
0
10 Apr 2025
D^2USt3R: Enhancing 3D Reconstruction with 4D Pointmaps for Dynamic Scenes
D^2USt3R: Enhancing 3D Reconstruction with 4D Pointmaps for Dynamic Scenes
Jisang Han
Honggyu An
Jaewoo Jung
Takuya Narihira
Junyoung Seo
Kazumi Fukuda
Chaehyun Kim
Sunghwan Hong
Yuki Mitsufuji
Seungryong Kim
100
2
0
08 Apr 2025
Studying Image Diffusion Features for Zero-Shot Video Object Segmentation
Studying Image Diffusion Features for Zero-Shot Video Object Segmentation
Thanos Delatolas
Vicky S. Kalogeiton
Dim P. Papadopoulos
DiffMVOS
128
2
0
07 Apr 2025
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting
Yunlong Tang
Jing Bi
Chao Huang
Susan Liang
Daiki Shimada
...
Jinxi He
Liu He
Zeliang Zhang
Jiebo Luo
Chenliang Xu
107
1
0
07 Apr 2025
ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer
ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer
Jiayi Gao
Zijin Yin
Changcheng Hua
Yuxin Peng
Kongming Liang
Zhanyu Ma
Jiaxin Guo
Yang Liu
VGenDiffM
104
2
0
03 Apr 2025
Zero-Shot 4D Lidar Panoptic Segmentation
Zero-Shot 4D Lidar Panoptic Segmentation
Yushan Zhang
Aljosa Osep
Laura Leal-Taixé
Tim Meinhardt
3DPC
98
1
0
01 Apr 2025
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Bernard Ghanem
140
0
0
01 Apr 2025
FreeInv: Free Lunch for Improving DDIM Inversion
FreeInv: Free Lunch for Improving DDIM Inversion
Yuxiang Bao
Huijie Liu
Xun Gao
Huan Fu
Guoliang Kang
59
0
0
29 Mar 2025
Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion
Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion
Jangho Park
Taesung Kwon
Jong Chul Ye
VGen
105
1
0
28 Mar 2025
Segment Any Motion in Videos
Segment Any Motion in Videos
Nan Huang
Wenzhao Zheng
Chenfeng Xu
Kurt Keutzer
Shanghang Zhang
Angjoo Kanazawa
Qianqian Wang
VOS
100
1
0
28 Mar 2025
MVFNet: Multipurpose Video Forensics Network using Multiple Forms of Forensic Evidence
MVFNet: Multipurpose Video Forensics Network using Multiple Forms of Forensic Evidence
Tai D. Nguyen
Matthew C. Stamm
124
0
0
26 Mar 2025
EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models
EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models
Yufei Cai
Hu Han
Yuxiang Wei
Shiguang Shan
Xilin Chen
DiffMVGen
95
0
0
25 Mar 2025
TransAnimate: Taming Layer Diffusion to Generate RGBA Video
TransAnimate: Taming Layer Diffusion to Generate RGBA Video
Xuewei Chen
Zhimin Chen
Yiren Song
VGen
123
2
0
23 Mar 2025
1234...121314
Next