ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.08006
  4. Cited By
Video Object Segmentation with Language Referring Expressions

Video Object Segmentation with Language Referring Expressions

21 March 2018
Anna Khoreva
Anna Rohrbach
Bernt Schiele
    VOS
ArXivPDFHTML

Papers citing "Video Object Segmentation with Language Referring Expressions"

50 / 61 papers shown
Title
Reasoning Segmentation for Images and Videos: A Survey
Reasoning Segmentation for Images and Videos: A Survey
Yiqing Shen
Chenjia Li
Fei Xiong
Jeong-O Jeong
Tianpeng Wang
Michael Latman
Mathias Unberath
VOS
174
0
0
24 May 2025
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Haobo Yuan
Xianrui Li
Tao Zhang
Zilong Huang
Shilin Xu
S. Ji
Yunhai Tong
Lu Qi
Jiashi Feng
Ming-Hsuan Yang
VLM
132
19
0
07 Jan 2025
Referring Video Object Segmentation via Language-aligned Track Selection
Referring Video Object Segmentation via Language-aligned Track Selection
Seongchan Kim
Woojeong Jin
Sangbeom Lim
Heeji Yoon
Hyunwook Choi
Seungryong Kim
VOS
128
0
0
02 Dec 2024
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation
Claudia Cuttano
Gabriele Trivigno
Gabriele Rosi
Carlo Masone
Giuseppe Averta
VOS
146
2
0
26 Nov 2024
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
Andong Deng
Tongjia Chen
Shoubin Yu
Taojiannan Yang
Lincoln Spencer
Yapeng Tian
Ajmal Mian
Joey Tianyi Zhou
Chen Chen
LRM
86
2
0
15 Nov 2024
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
Shehan Munasinghe
Hanan Gani
Wenqi Zhu
Jiale Cao
Eric P. Xing
Fahad Shahbaz Khan
Salman Khan
MLLM
VGen
VLM
70
6
0
07 Nov 2024
ViLLa: Video Reasoning Segmentation with Large Language Model
ViLLa: Video Reasoning Segmentation with Large Language Model
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
VOS
LRM
95
4
0
18 Jul 2024
Language Prompt for Autonomous Driving
Language Prompt for Autonomous Driving
Dongming Wu
Wencheng Han
Tiancai Wang
Yingfei Liu
Cheng-zhong Xu
Jianbing Shen
Jianbing Shen
VLM
87
82
0
08 Sep 2023
Interpretable and Globally Optimal Prediction for Textual Grounding
  using Image Concepts
Interpretable and Globally Optimal Prediction for Textual Grounding using Image Concepts
Raymond A. Yeh
Jinjun Xiong
Wen-mei W. Hwu
Minh Do
Alex Schwing
47
57
0
29 Mar 2018
MaskRNN: Instance Level Video Object Segmentation
MaskRNN: Instance Level Video Object Segmentation
Yuan-Ting Hu
Jia-Bin Huang
Alex Schwing
VOS
57
183
0
29 Mar 2018
Actor and Action Video Segmentation from a Sentence
Actor and Action Video Segmentation from a Sentence
Kirill Gavrilyuk
Amir Ghodrati
Zhenyang Li
Cees G. M. Snoek
VLM
58
149
0
20 Mar 2018
The 2018 DAVIS Challenge on Video Object Segmentation
The 2018 DAVIS Challenge on Video Object Segmentation
Sergi Caelles
Alberto Montes
Kevis-Kokitsi Maninis
Yuhua Chen
Luc Van Gool
Federico Perazzi
Jordi Pont-Tuset
VGen
VOS
107
121
0
01 Mar 2018
MAttNet: Modular Attention Network for Referring Expression
  Comprehension
MAttNet: Modular Attention Network for Referring Expression Comprehension
Licheng Yu
Zhe Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Joey Tianyi Zhou
Tamara L. Berg
ObjD
97
825
0
24 Jan 2018
Object Referring in Videos with Language and Human Gaze
Object Referring in Videos with Language and Human Gaze
A. Vasudevan
Dengxin Dai
Luc Van Gool
VOS
55
75
0
04 Jan 2018
Interactive Video Object Segmentation in the Wild
Interactive Video Object Segmentation in the Wild
Arnaud Benard
Michael Gygli
VOS
38
49
0
31 Dec 2017
Deep Extreme Cut: From Extreme Points to Object Segmentation
Deep Extreme Cut: From Extreme Points to Object Segmentation
Kevis-Kokitsi Maninis
Sergi Caelles
Jordi Pont-Tuset
Luc Van Gool
65
418
0
24 Nov 2017
SegFlow: Joint Learning for Video Object Segmentation and Optical Flow
SegFlow: Joint Learning for Video Object Segmentation and Optical Flow
Jingchun Cheng
Yi-Hsuan Tsai
Shengjin Wang
Ming-Hsuan Yang
VOS
60
416
0
20 Sep 2017
Video Object Segmentation Without Temporal Information
Video Object Segmentation Without Temporal Information
Kevis-Kokitsi Maninis
Sergi Caelles
Yuhua Chen
Jordi Pont-Tuset
Laura Leal-Taixé
Daniel Cremers
Luc Van Gool
VOS
118
342
0
18 Sep 2017
Learning to Segment Instances in Videos with Spatial Propagation Network
Learning to Segment Instances in Videos with Spatial Propagation Network
Jingchun Cheng
Sifei Liu
Yi-Hsuan Tsai
Wei-Chih Hung
Shalini De Mello
Liang Feng
Jan Kautz
Shengjin Wang
Ming-Hsuan Yang
VOS
57
22
0
14 Sep 2017
Pixel-Level Matching for Video Object Segmentation using Convolutional
  Neural Networks
Pixel-Level Matching for Video Object Segmentation using Convolutional Neural Networks
Jae Shin Yoon
François Rameau
Junsik Kim
Seokju Lee
Seunghak Shin
In So Kweon
VOS
54
158
0
17 Aug 2017
Query-guided Regression Network with Context Policy for Phrase Grounding
Query-guided Regression Network with Context Policy for Phrase Grounding
Kan Chen
Rama Kovvuri
Ram Nevatia
58
142
0
04 Aug 2017
Localizing Moments in Video with Natural Language
Localizing Moments in Video with Natural Language
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Sivic
Trevor Darrell
Bryan C. Russell
110
946
0
04 Aug 2017
Video Object Segmentation with Re-identification
Video Object Segmentation with Re-identification
Xiaoxiao Li
Yuankai Qi
Zhe Wang
Kai-xiang Chen
Ziwei Liu
Jianping Shi
Ping Luo
Xiaoou Tang
Chen Change Loy
VOS
61
64
0
01 Aug 2017
Online Adaptation of Convolutional Neural Networks for Video Object
  Segmentation
Online Adaptation of Convolutional Neural Networks for Video Object Segmentation
P. Voigtlaender
Bastian Leibe
VOS
112
396
0
28 Jun 2017
Rethinking Atrous Convolution for Semantic Image Segmentation
Rethinking Atrous Convolution for Semantic Image Segmentation
Liang-Chieh Chen
George Papandreou
Florian Schroff
Hartwig Adam
SSeg
215
8,455
0
17 Jun 2017
Learning Video Object Segmentation with Visual Memory
Learning Video Object Segmentation with Visual Memory
P. Tokmakov
Alahari Karteek
Cordelia Schmid
VOS
51
325
0
19 Apr 2017
Discriminative Bimodal Networks for Visual Localization and Detection
  with Natural Language Queries
Discriminative Bimodal Networks for Visual Localization and Detection with Natural Language Queries
Y. Zhang
Luyao Yuan
Yijie Guo
Zhiyuan He
I-An Huang
Honglak Lee
ObjD
64
57
0
12 Apr 2017
The 2017 DAVIS Challenge on Video Object Segmentation
The 2017 DAVIS Challenge on Video Object Segmentation
Jordi Pont-Tuset
Federico Perazzi
Sergi Caelles
Pablo Arbeláez
A. Sorkine-Hornung
Luc Van Gool
VGen
VOS
78
1,205
0
03 Apr 2017
Lucid Data Dreaming for Video Object Segmentation
Lucid Data Dreaming for Video Object Segmentation
Anna Khoreva
Rodrigo Benenson
Eddy Ilg
Thomas Brox
Bernt Schiele
VOS
110
38
0
28 Mar 2017
Recurrent Multimodal Interaction for Referring Image Segmentation
Recurrent Multimodal Interaction for Referring Image Segmentation
Chenxi Liu
Zhe Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Alan Yuille
EgoV
68
237
0
23 Mar 2017
Mask R-CNN
Mask R-CNN
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
344
27,129
0
20 Mar 2017
Super-Trajectory for Video Segmentation
Super-Trajectory for Video Segmentation
Wenguan Wang
Jianbing Shen
Jianwen Xie
Fatih Porikli
VOS
47
46
0
28 Feb 2017
Understanding Convolution for Semantic Segmentation
Understanding Convolution for Semantic Segmentation
Panqu Wang
Pengfei Chen
Ye Yuan
Ding Liu
Zehua Huang
Xiaodi Hou
G. Cottrell
SSeg
67
1,686
0
27 Feb 2017
FusionSeg: Learning to combine motion and appearance for fully automatic
  segmention of generic objects in videos
FusionSeg: Learning to combine motion and appearance for fully automatic segmention of generic objects in videos
Enis Berk Çoban
Bo Xiong
Michael I. Mandel
VOS
118
382
0
19 Jan 2017
Comprehension-guided referring expressions
Comprehension-guided referring expressions
Ruotian Luo
Gregory Shakhnarovich
ObjD
89
171
0
12 Jan 2017
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions
Licheng Yu
Hao Tan
Joey Tianyi Zhou
Tamara L. Berg
ObjD
94
275
0
30 Dec 2016
Learning Motion Patterns in Videos
Learning Motion Patterns in Videos
P. Tokmakov
Alahari Karteek
Cordelia Schmid
VOS
88
267
0
21 Dec 2016
Video Propagation Networks
Video Propagation Networks
Varun Jampani
Raghudeep Gadde
Peter V. Gehler
DiffM
52
230
0
16 Dec 2016
Learning Video Object Segmentation from Static Images
Learning Video Object Segmentation from Static Images
Anna Khoreva
Federico Perazzi
Rodrigo Benenson
Bernt Schiele
A. Sorkine-Hornung
VOS
63
586
0
08 Dec 2016
FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
Eddy Ilg
N. Mayer
Tonmoy Saikia
Margret Keuper
Alexey Dosovitskiy
Thomas Brox
3DPC
242
3,077
0
06 Dec 2016
Modeling Relationships in Referential Expressions with Compositional
  Modular Networks
Modeling Relationships in Referential Expressions with Compositional Modular Networks
Ronghang Hu
Marcus Rohrbach
Jacob Andreas
Trevor Darrell
Kate Saenko
73
406
0
30 Nov 2016
One-Shot Video Object Segmentation
One-Shot Video Object Segmentation
Sergi Caelles
Kevis-Kokitsi Maninis
Jordi Pont-Tuset
Laura Leal-Taixé
Daniel Cremers
Luc Van Gool
VOS
74
914
0
16 Nov 2016
Modeling Context in Referring Expressions
Modeling Context in Referring Expressions
Licheng Yu
Patrick Poirson
Shan Yang
Alexander C. Berg
Tamara L. Berg
127
1,261
0
31 Jul 2016
Click Carving: Segmenting Objects in Video with Point Clicks
Click Carving: Segmenting Objects in Video with Point Clicks
S. Jain
Kristen Grauman
51
54
0
05 Jul 2016
FOMTrace: Interactive Video Segmentation By Image Graphs and Fuzzy
  Object Models
FOMTrace: Interactive Video Segmentation By Image Graphs and Fuzzy Object Models
T. V. Spina
A. X. Falcão
VOS
33
8
0
10 Jun 2016
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets,
  Atrous Convolution, and Fully Connected CRFs
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Liang-Chieh Chen
George Papandreou
Iasonas Kokkinos
Kevin Patrick Murphy
Alan Yuille
SSeg
227
18,195
0
02 Jun 2016
Fully Convolutional Networks for Semantic Segmentation
Fully Convolutional Networks for Semantic Segmentation
Evan Shelhamer
Jonathan Long
Trevor Darrell
VOS
SSeg
632
37,806
0
20 May 2016
ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic
  Segmentation
ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation
Di Lin
Jifeng Dai
Jiaya Jia
Kaiming He
Jian Sun
SSeg
121
1,006
0
18 Apr 2016
Segmentation from Natural Language Expressions
Segmentation from Natural Language Expressions
Ronghang Hu
Marcus Rohrbach
Trevor Darrell
VLM
EgoV
69
434
0
20 Mar 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense
  Image Annotations
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
...
Yannis Kalantidis
Li Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
194
5,726
0
23 Feb 2016
12
Next