XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

14 July 2022
Ho Kei Cheng
A. Schwing
    VLM
    VOS
arXiv: 2207.07115

Papers citing "XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model"

50 / 67 papers shown
MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection
Q. Yang
Yuan Yao
Miaomiao Cui
Liefeng Bo
VLM
61
0
0
30 Apr 2025
TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation
Ling You
Wenxuan Huang
Xinni Xie
Xiangyi Wei
Bangyan Li
Shaohui Lin
Yang Li
Changbo Wang
VGen
157
0
0
24 Apr 2025
RGB-D Video Object Segmentation via Enhanced Multi-store Feature Memory
Boyue Xu
Ruichao Hou
Tongwei Ren
Gangshan Wu
VOS
36
1
0
23 Apr 2025
Saliency-Motion Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation
Xiangyu Zheng
Wanyun Li
Songcheng He
Jianping Fan
Xiaoqiang Li
Wei Zhang
VOS
35
0
0
08 Apr 2025
ReferDINO-Plus: 2nd Solution for 4th PVUW MeViS Challenge at CVPR 2025
Tianming Liang
Haichao Jiang
Wei-Shi Zheng
Jian-Fang Hu
44
0
0
30 Mar 2025
WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes
L. Yang
Kaixin Zhu
Juanxi Tian
Bohan Zeng
Matthieu Lin
Hongjuan Pei
Wentao Zhang
Shuicheng Yan
VGen
75
0
0
17 Mar 2025
2HandedAfforder: Learning Precise Actionable Bimanual Affordances from Human Videos
Marvin Heidinger
Snehal Jauhri
V. Prasad
Georgia Chalvatzaki
68
0
0
12 Mar 2025
StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition
Xin Ding
Hao Wu
Yuqing Yang
Shiqi Jiang
Donglin Bai
Zhibo Chen
Ting Cao
145
0
0
08 Mar 2025
SMITE: Segment Me In TimE
Amirhossein Alimohammadi
Sauradip Nag
Saeid Asgari Taghanaki
Andrea Tagliasacchi
Ghassan Hamarneh
Ali Mahdavi-Amiri
VLM
VOS
137
2
0
20 Feb 2025
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Fu Rong
Meng Lan
Q. Zhang
L. Zhang
VOS
VGen
73
1
0
23 Jan 2025
6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting
Yufeng Jin
V. Prasad
Snehal Jauhri
Mathias Franzius
Georgia Chalvatzaki
3DGS
92
0
0
02 Dec 2024
VideoOrion: Tokenizing Object Dynamics in Videos
Yicheng Feng
Yijiang Li
Wanpeng Zhang
Sipeng Zheng
Zongqing Lu
109
1
0
25 Nov 2024
QuadWBG: Generalizable Quadrupedal Whole-Body Grasping
Jilong Wang
Javokhirbek Rajabov
Chaoyi Xu
Yiming Zheng
He Wang
43
1
0
11 Nov 2024
Human-inspired Perspectives: A Survey on AI Long-term Memory
Zihong He
Weizhe Lin
Hao Zheng
Fan Zhang
Matt Jones
Laurence Aitchison
X. Xu
Miao Liu
Per Ola Kristensson
Junxiao Shen
77
2
0
01 Nov 2024
BYOCL: Build Your Own Consistent Latent with Hierarchical Representative Latent Clustering
Jiayue Dai
Yunya Wang
Yihan Fang
Yuetong Chen
Butian Xiong
VLM
29
0
0
19 Oct 2024
VideoSAM: Open-World Video Segmentation
Pinxue Guo
Zixu Zhao
Jianxiong Gao
Chongruo Wu
Tong He
Zheng Zhang
Tianjun Xiao
Wenqiang Zhang
VOS
28
0
0
11 Oct 2024
LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS
Xinyu Liu
Jing Zhang
Kexin Zhang
Xu Liu
Lingling Li
28
1
0
20 Aug 2024
Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning
Haofeng Liu
Erli Zhang
Junde Wu
Mingxuan Hong
Yueming Jin
MedIm
53
14
0
15 Aug 2024
ViLLa: Video Reasoning Segmentation with Large Language Model
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
VOS
LRM
72
2
0
18 Jul 2024
FoodMem: Near Real-time and Precise Food Video Segmentation
Ahmad AlMughrabi
Adrián Galán
Ricardo Marques
P. Radeva
VOS
38
1
0
16 Jul 2024
Learning Spatial-Semantic Features for Robust Video Object Segmentation
Xin Li
Deshui Miao
Zhenyu He
Yixuan Wang
Huchuan Lu
Ming Yang
VOS
56
4
0
10 Jul 2024
PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Henghui Ding
Chang Liu
Yunchao Wei
Nikhila Ravi
Shuting He
...
Bo-Lu Zhao
Jing Liu
Feiyu Pan
Hao Fang
Xiankai Lu
56
8
0
24 Jun 2024
Zero-Shot Scene Change Detection
Kyusik Cho
Dong Yeop Kim
Euntai Kim
38
1
0
17 Jun 2024
RMem: Restricted Memory Banks Improve Video Object Segmentation
Junbao Zhou
Ziqi Pang
Yu-xiong Wang
VOS
60
7
0
12 Jun 2024
1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation
Mingqi Gao
Jingnan Luo
Jinyu Yang
Jungong Han
Feng Zheng
42
2
0
11 Jun 2024
HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction
Jikai Wang
Qifan Zhang
Yu-Wei Chao
Bowen Wen
Xiaohu Guo
Yu Xiang
3DH
53
2
0
10 Jun 2024
SAM-PM: Enhancing Video Camouflaged Object Detection using Spatio-Temporal Attention
Muhammad Nawfal Meeran
Gokul Adethya T
Bhanu Pratyush Mantha
35
3
0
09 Jun 2024
Matching Anything by Segmenting Anything
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
37
22
0
06 Jun 2024
How Much You Ate? Food Portion Estimation on Spoons
Aaryam Sharma
Chris Czarnecki
Yuhao Chen
Pengcheng Xi
Linlin Xu
Alexander Wong
18
1
0
12 May 2024
DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation
Volodymyr Fedynyak
Yaroslav Romanus
Bohdan Hlovatskyi
Bohdan Sydor
Oles Dobosevych
Igor Babin
Roman Riazantsev
VOS
48
3
0
11 May 2024
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos
Wen-Hsuan Chu
Lei Ke
Katerina Fragkiadaki
3DGS
VGen
25
29
0
03 May 2024
Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild
Donggyun Kim
Seongwoong Cho
Semin Kim
Chong Luo
Seunghoon Hong
VLM
45
2
0
29 Apr 2024
360VOTS: Visual Object Tracking and Segmentation in Omnidirectional Videos
Yinzhe Xu
Huajian Huang
Yingshu Chen
Sai-Kit Yeung
VOS
42
1
0
22 Apr 2024
Koala: Key frame-conditioned long video-LLM
Reuben Tan
Ximeng Sun
Ping Hu
Jui-hsien Wang
Hanieh Deilamsalehy
Bryan A. Plummer
Bryan C. Russell
Kate Saenko
38
35
0
05 Apr 2024
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
Ming-hui Li
Shuai Li
Xindong Zhang
Lei Zhang
VOS
47
16
0
28 Feb 2024
Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention
Quang-Trung Truong
Duc Thanh Nguyen
Binh-Son Hua
Sai-Kit Yeung
VOS
34
1
0
25 Jan 2024
VONet: Unsupervised Video Object Learning With Parallel U-Net Attention and Object-wise Sequential VAE
Haonan Yu
Wei Xu
ViT
36
1
0
20 Jan 2024
I'M HOI: Inertia-aware Monocular Capture of 3D Human-Object Interactions
Chengfeng Zhao
Juze Zhang
Jiashen Du
Ziwei Shan
Junye Wang
Jingyi Yu
Jingya Wang
Lan Xu
21
18
0
10 Dec 2023
DragVideo: Interactive Drag-style Video Editing
Yufan Deng
Ruida Wang
Yuhao Zhang
Yu-Wing Tai
Chi-Keung Tang
DiffM
VGen
24
20
0
03 Dec 2023
TrafficMOT: A Challenging Dataset for Multi-Object Tracking in Complex Traffic Scenarios
Lihao Liu
Yanqi Cheng
Zhongying Deng
Shujun Wang
Dongdong Chen
Xiaowei Hu
Pietro Liò
Carola-Bibiane Schönlieb
Angelica Aviles-Rivero
42
1
0
30 Nov 2023
Learning Generalizable Manipulation Policies with Object-Centric 3D Representations
Yifeng Zhu
Zhenyu Jiang
Peter Stone
Yuke Zhu
3DPC
24
43
0
22 Oct 2023
Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models
Wen-Hsuan Chu
Adam W. Harley
P. Tokmakov
Achal Dave
Leonidas J. Guibas
Katerina Fragkiadaki
VLM
28
7
0
10 Oct 2023
Tracking Anything with Decoupled Video Segmentation
Ho Kei Cheng
Seoung Wug Oh
Brian L. Price
Alexander Schwing
Joon-Young Lee
VOS
VLM
43
121
0
07 Sep 2023
Automated Conversion of Music Videos into Lyric Videos
Jia Ma
Anyi Rao
Li-Yi Wei
Rubaiat Habib Kazi
Hijung Valentina Shin
Maneesh Agrawala
24
5
0
28 Aug 2023
A One Stop 3D Target Reconstruction and multilevel Segmentation Method
J. Xu
Wei-Ye Zhao
Zhiyan Tang
X. Gan
3DV
24
2
0
14 Aug 2023
Color-NeuS: Reconstructing Neural Implicit Surfaces with Color
Licheng Zhong
Lixin Yang
Kailin Li
Haoyu Zhen
Mei Han
Cewu Lu
3DH
28
4
0
14 Aug 2023
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Cheng-Yu Hsieh
Sibei Chen
Chun-Liang Li
Yasuhisa Fujii
Alexander Ratner
Chen-Yu Lee
Ranjay Krishna
Tomas Pfister
LLMAG
SyDa
43
41
0
01 Aug 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming Yang
F. Khan
VLM
38
118
0
25 Jul 2023
Online Unsupervised Video Object Segmentation via Contrastive Motion Clustering
Lin Xi
Weihai Chen
Xingming Wu
Zhong Liu
Zhengguo Li
VOS
26
9
0
21 Jun 2023
READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object Segmentation
Stéphane Vujasinović
Sebastian Bullinger
S. Becker
N. Scherer-Negenborn
Michael Arens
Rainer Stiefelhagen
VOS
29
2
0
22 May 2023