ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1704.00675
  4. Cited By
The 2017 DAVIS Challenge on Video Object Segmentation

The 2017 DAVIS Challenge on Video Object Segmentation

3 April 2017
Jordi Pont-Tuset
Federico Perazzi
Sergi Caelles
Pablo Arbeláez
A. Sorkine-Hornung
Luc Van Gool
    VGen
    VOS
ArXivPDFHTML

Papers citing "The 2017 DAVIS Challenge on Video Object Segmentation"

50 / 291 papers shown
Title
I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion
  Models
I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models
Wenqi Ouyang
Yi Dong
Lei Yang
Jianlou Si
Xingang Pan
VGen
DiffM
49
11
0
26 May 2024
Looking Backward: Streaming Video-to-Video Translation with Feature Banks
Looking Backward: Streaming Video-to-Video Translation with Feature Banks
Feng Liang
Akio Kodaira
Chenfeng Xu
Masayoshi Tomizuka
Kurt Keutzer
Diana Marculescu
DiffM
VGen
70
7
0
24 May 2024
DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation
DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation
Volodymyr Fedynyak
Yaroslav Romanus
Bohdan Hlovatskyi
Bohdan Sydor
Oles Dobosevych
Igor Babin
Roman Riazantsev
VOS
50
3
0
11 May 2024
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular
  Videos
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos
Wen-Hsuan Chu
Lei Ke
Katerina Fragkiadaki
3DGS
VGen
37
29
0
03 May 2024
Zero-Shot Monocular Motion Segmentation in the Wild by Combining Deep
  Learning with Geometric Motion Model Fusion
Zero-Shot Monocular Motion Segmentation in the Wild by Combining Deep Learning with Geometric Motion Model Fusion
Yuxiang Huang
Yuhao Chen
John S. Zelek
44
1
0
02 May 2024
Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in
  the Wild
Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild
Donggyun Kim
Seongwoong Cho
Semin Kim
Chong Luo
Seunghoon Hong
VLM
50
2
0
29 Apr 2024
360VOTS: Visual Object Tracking and Segmentation in Omnidirectional
  Videos
360VOTS: Visual Object Tracking and Segmentation in Omnidirectional Videos
Yinzhe Xu
Huajian Huang
Yingshu Chen
Sai-Kit Yeung
VOS
47
1
0
22 Apr 2024
Translation-based Video-to-Video Synthesis
Translation-based Video-to-Video Synthesis
Pratim Saha
Chengcui Zhang
DiffM
31
1
0
03 Apr 2024
Towards Online Real-Time Memory-based Video Inpainting Transformers
Towards Online Real-Time Memory-based Video Inpainting Transformers
Guillaume Thiry
Hao Tang
Radu Timofte
Luc Van Gool
ViT
26
0
0
24 Mar 2024
DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot
  Video Editing
DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing
Hyeonho Jeong
Jinho Chang
Geon Yeong Park
Jong Chul Ye
DiffM
VGen
31
13
0
18 Mar 2024
Video Relationship Detection Using Mixture of Experts
Video Relationship Detection Using Mixture of Experts
A. Shaabana
Zahra Gharaee
Paul Fieguth
39
1
0
06 Mar 2024
Explicit Motion Handling and Interactive Prompting for Video Camouflaged Object Detection
Explicit Motion Handling and Interactive Prompting for Video Camouflaged Object Detection
Xin Zhang
Tao Xiao
Gepeng Ji
Xuan Wu
Keren Fu
Qijun Zhao
57
2
0
04 Mar 2024
Contextualized Diffusion Models for Text-Guided Image and Video
  Generation
Contextualized Diffusion Models for Text-Guided Image and Video Generation
Ling Yang
Zhilong Zhang
Zhaochen Yu
Jingwei Liu
Minkai Xu
Stefano Ermon
Bin Cui
49
4
0
26 Feb 2024
Self-supervised Video Object Segmentation with Distillation Learning of
  Deformable Attention
Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention
Quang-Trung Truong
Duc Thanh Nguyen
Binh-Son Hua
Sai-Kit Yeung
VOS
39
1
0
25 Jan 2024
Object-Centric Diffusion for Efficient Video Editing
Object-Centric Diffusion for Efficient Video Editing
Kumara Kahatapitiya
Adil Karjauv
Davide Abati
Fatih Porikli
Yuki M. Asano
A. Habibian
VGen
40
12
0
11 Jan 2024
Geometric-Aware Low-Light Image and Video Enhancement via Depth Guidance
Geometric-Aware Low-Light Image and Video Enhancement via Depth Guidance
Yingqi Lin
Xiaogang Xu
Yan Han
Xiaogang Xu
Zhe Liu
37
0
0
26 Dec 2023
RealCraft: Attention Control as A Tool for Zero-Shot Consistent Video Editing
RealCraft: Attention Control as A Tool for Zero-Shot Consistent Video Editing
Shutong Jin
Ruiyu Wang
Florian T. Pokorny
DiffM
VGen
89
1
0
19 Dec 2023
MaskINT: Video Editing via Interpolative Non-autoregressive Masked
  Transformers
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Haoyu Ma
Shahin Mahdizadehaghdam
Bichen Wu
Zhipeng Fan
Yuchao Gu
Wenliang Zhao
Lior Shapira
Xiaohui Xie
DiffM
VGen
30
4
0
19 Dec 2023
DreamVideo: Composing Your Dream Videos with Customized Subject and
  Motion
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
Yujie Wei
Shiwei Zhang
Zhiwu Qing
Hangjie Yuan
Zhiheng Liu
Yu Liu
Yingya Zhang
Jingren Zhou
Hongming Shan
DiffM
VGen
24
90
0
07 Dec 2023
Memory-Efficient Optical Flow via Radius-Distribution Orthogonal Cost Volume
Memory-Efficient Optical Flow via Radius-Distribution Orthogonal Cost Volume
Gangwei Xu
Shujun Chen
Hao Jia
Miaojie Feng
Xin Yang
57
5
0
06 Dec 2023
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion
  Models
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models
Zhen Xing
Qi Dai
Zihao Zhang
Hui Zhang
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
58
17
0
30 Nov 2023
Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer
Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer
Danah Yatim
Rafail Fridman
Omer Bar-Tal
Yoni Kasten
Tali Dekel
DiffM
VGen
34
51
0
28 Nov 2023
Sketch Video Synthesis
Sketch Video Synthesis
Yudian Zheng
Xiaodong Cun
Menghan Xia
Chi-Man Pun
VGen
DiffM
28
2
0
26 Nov 2023
Is ImageNet worth 1 video? Learning strong image encoders from 1 long
  unlabelled video
Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video
Shashanka Venkataramanan
Mamshad Nayeem Rizve
João Carreira
Yuki M. Asano
Yannis Avrithis
SSL
42
18
0
12 Oct 2023
Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models
Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models
Wen-Hsuan Chu
Adam W. Harley
P. Tokmakov
Achal Dave
Leonidas J. Guibas
Katerina Fragkiadaki
VLM
40
7
0
10 Oct 2023
How Physics and Background Attributes Impact Video Transformers in
  Robotic Manipulation: A Case Study on Planar Pushing
How Physics and Background Attributes Impact Video Transformers in Robotic Manipulation: A Case Study on Planar Pushing
Shutong Jin
Ruiyu Wang
Muhammad Zahid
Florian T. Pokorny
38
1
0
03 Oct 2023
Segmenting the motion components of a video: A long-term unsupervised
  model
Segmenting the motion components of a video: A long-term unsupervised model
E. Meunier
P. Bouthemy
27
0
0
02 Oct 2023
Doduo: Learning Dense Visual Correspondence from Unsupervised
  Semantic-Aware Flow
Doduo: Learning Dense Visual Correspondence from Unsupervised Semantic-Aware Flow
Zhenyu Jiang
Hanwen Jiang
Yuke Zhu
VOS
36
4
0
26 Sep 2023
Tiled Multiplane Images for Practical 3D Photography
Tiled Multiplane Images for Practical 3D Photography
Numair Khan
Douglas Lanman
Lei Xiao
23
7
0
25 Sep 2023
Masked Momentum Contrastive Learning for Zero-shot Semantic
  Understanding
Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding
Jiantao Wu
Shentong Mo
Muhammad Awais
Sara Atito
Zhenhua Feng
J. Kittler
VLM
36
4
0
22 Aug 2023
MeViS: A Large-scale Benchmark for Video Segmentation with Motion
  Expressions
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Chen Change Loy
VOS
49
101
0
16 Aug 2023
Recurrent Self-Supervised Video Denoising with Denser Receptive Field
Recurrent Self-Supervised Video Denoising with Denser Receptive Field
Zichun Wang
Yulun Zhang
Debing Zhang
Ying Fu
47
8
0
07 Aug 2023
Stochastic positional embeddings improve masked image modeling
Stochastic positional embeddings improve masked image modeling
Amir Bar
Florian Bordes
Assaf Shocher
Mahmoud Assran
Pascal Vincent
Nicolas Ballas
Trevor Darrell
Amir Globerson
Yann LeCun
36
3
0
31 Jul 2023
VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by
  Using Diffusion Model with ControlNet
VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet
Zhihao Hu
Dong Xu
DiffM
VGen
43
65
0
26 Jul 2023
Tracking Anything in High Quality
Tracking Anything in High Quality
Jiawen Zhu
Zhe Chen
Zeqi Hao
Shijie Chang
Lu Zhang
...
Bin Luo
Ju He
Jinpeng Lan
Hanyuan Chen
Chenyang Li
VOS
21
7
0
26 Jul 2023
MC-JEPA: A Joint-Embedding Predictive Architecture for Self-Supervised
  Learning of Motion and Content Features
MC-JEPA: A Joint-Embedding Predictive Architecture for Self-Supervised Learning of Motion and Content Features
Adrien Bardes
Jean Ponce
Yann LeCun
MDE
44
25
0
24 Jul 2023
TokenFlow: Consistent Diffusion Features for Consistent Video Editing
TokenFlow: Consistent Diffusion Features for Consistent Video Editing
Michal Geyer
Omer Bar-Tal
Shai Bagon
Tali Dekel
VGen
DiffM
25
251
0
19 Jul 2023
Hierarchical Spatiotemporal Transformers for Video Object Segmentation
Hierarchical Spatiotemporal Transformers for Video Object Segmentation
Jun-Sang Yoo
H. Lee
Seung‐Won Jung
VOS
39
1
0
17 Jul 2023
CoTracker: It is Better to Track Together
CoTracker: It is Better to Track Together
Nikita Karaev
Ignacio Rocco
Benjamin Graham
Natalia Neverova
Andrea Vedaldi
Christian Rupprecht
VOT
ViT
56
246
0
14 Jul 2023
Test-Time Training on Video Streams
Test-Time Training on Video Streams
Renhao Wang
Yu Sun
Yossi Gandelsman
Xinlei Chen
Alexei A. Efros
Alexei A. Efros
Xiaolong Wang
TTA
ViT
3DGS
47
16
0
11 Jul 2023
ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single
  Object Tracking
ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single Object Tracking
Yuanyou Xu
Jiahao Li
Zongxin Yang
Yi Yang
Yueting Zhuang
27
1
0
05 Jul 2023
ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised
  Video Object Segmentation
ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation
Jiahao Li
Yuanyou Xu
Zongxin Yang
Yi Yang
Yueting Zhuang
VOS
51
0
0
05 Jul 2023
Collaborative Score Distillation for Consistent Visual Synthesis
Collaborative Score Distillation for Consistent Visual Synthesis
Subin Kim
Kyungmin Lee
June Suk Choi
Jongheon Jeong
Kihyuk Sohn
Jinwoo Shin
DiffM
34
21
0
04 Jul 2023
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring
  Video Object Segmentation
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation
Yonglin Li
Jing Zhang
Xiao Teng
Long Lan
VOS
VLM
28
18
0
03 Jul 2023
Bidirectional Correlation-Driven Inter-Frame Interaction Transformer for
  Referring Video Object Segmentation
Bidirectional Correlation-Driven Inter-Frame Interaction Transformer for Referring Video Object Segmentation
Meng Lan
Fu Rong
Zuchao Li
Wei Yu
Lefei Zhang
VOS
38
5
0
02 Jul 2023
Unsupervised Coordinate-Based Video Denoising
Unsupervised Coordinate-Based Video Denoising
M. Aiyetigbo
Dineshchandar Ravichandran
Reda Chalhoub
Peter Kalivas
Nianyi Li
DiffM
27
0
0
01 Jul 2023
Simplified Temporal Consistency Reinforcement Learning
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
30
13
0
15 Jun 2023
Object-Centric Learning for Real-World Videos by Predicting Temporal
  Feature Similarities
Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities
Andrii Zadaianchuk
Maximilian Seitzer
Georg Martius
OCL
23
39
0
07 Jun 2023
Towards Consistent Video Editing with Text-to-Image Diffusion Models
Towards Consistent Video Editing with Text-to-Image Diffusion Models
Zicheng Zhang
Bonan li
Xuecheng Nie
Congying Han
Tiande Guo
Luoqi Liu
DiffM
26
24
0
27 May 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video
  Object Segmentation
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
27
30
0
25 May 2023
Previous
123456
Next