Local-Global Context Aware Transformer for Language-Guided Video Segmentation

18 March 2022
Chen Liang
Wenguan Wang
Tianfei Zhou
Jiaxu Miao
Yawei Luo
Yi Yang
    VOS

Papers citing "Local-Global Context Aware Transformer for Language-Guided Video Segmentation"

Simple BERT Models for Relation Extraction and Semantic Role Labeling
Peng Shi
Jimmy J. Lin
VLM
76
446
0
10 Apr 2019
Cross-Modal Self-Attention Network for Referring Image Segmentation
Linwei Ye
Mrigank Rochan
Zhi Liu
Yang Wang
EgoV
68
478
0
09 Apr 2019
VideoBERT: A Joint Model for Video and Language Representation Learning
Chen Sun
Austin Myers
Carl Vondrick
Kevin Patrick Murphy
Cordelia Schmid
VLM, SSL
90
1,250
0
03 Apr 2019
Video Object Segmentation using Space-Time Memory Networks
Seoung Wug Oh
Joon-Young Lee
N. Xu
Seon Joo Kim
VOS
81
711
0
01 Apr 2019
RVOS: End-to-End Recurrent Network for Video Object Segmentation
Carles Ventura
Míriam Bellver
Andreu Girbau
Amaia Salvador
F. Marqués
Xavier Giró-i-Nieto
VOS
82
230
0
13 Mar 2019
Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing
Xihui Liu
Zihao Wang
Jing Shao
Xiaogang Wang
Hongsheng Li
ObjD
90
184
0
03 Mar 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
263
3,747
0
09 Jan 2019
Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
197
480
0
12 Dec 2018
Neighbourhood Watch: Referring Expression Comprehension via Language-guided Graph Attention Networks
Peng Wang
Qi Wu
Jiewei Cao
Chunhua Shen
Lianli Gao
Anton Van Den Hengel
ObjD
93
255
0
12 Dec 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM, SSL, SSeg
1.8K
95,324
0
11 Oct 2018
VideoMatch: Matching based Video Object Segmentation
Yuan-Ting Hu
Jia-Bin Huang
Alex Schwing
VOS
68
275
0
04 Sep 2018
Dynamic Multimodal Instance Segmentation guided by natural language queries
Edgar Margffoy-Tuay
Juan C. Pérez
Emilio Botero
Pablo Arbelaez
82
176
0
06 Jul 2018
Linguistically-Informed Self-Attention for Semantic Role Labeling
Emma Strubell
Pat Verga
D. Andor
David J. Weiss
Andrew McCallum
OffRL
91
380
0
23 Apr 2018
Video Object Segmentation with Language Referring Expressions
Anna Khoreva
Anna Rohrbach
Bernt Schiele
VOS
74
197
0
21 Mar 2018
Actor and Action Video Segmentation from a Sentence
Kirill Gavrilyuk
Amir Ghodrati
Zhenyang Li
Cees G. M. Snoek
VLM
79
151
0
20 Mar 2018
Learning Latent Super-Events to Detect Multiple Activities in Videos
A. Piergiovanni
Michael S. Ryoo
76
90
0
05 Dec 2017
Pixel-Level Matching for Video Object Segmentation using Convolutional Neural Networks
Jae Shin Yoon
François Rameau
Junsik Kim
Seokju Lee
Seunghak Shin
In So Kweon
VOS
83
159
0
17 Aug 2017
Online Adaptation of Convolutional Neural Networks for Video Object Segmentation
P. Voigtlaender
Bastian Leibe
VOS
131
396
0
28 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
811
132,725
0
12 Jun 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
240
8,045
0
22 May 2017
Recurrent Multimodal Interaction for Referring Image Segmentation
Chenxi Liu
Zhe Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Alan Yuille
EgoV
83
240
0
23 Mar 2017
Neural Map: Structured Memory for Deep Reinforcement Learning
Emilio Parisotto
Ruslan Salakhutdinov
75
261
0
27 Feb 2017
Comprehension-guided referring expressions
Ruotian Luo
Gregory Shakhnarovich
ObjD
107
171
0
12 Jan 2017
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions
Licheng Yu
Hao Tan
Joey Tianyi Zhou
Tamara L. Berg
ObjD
94
275
0
30 Dec 2016
Video Propagation Networks
Varun Jampani
Raghudeep Gadde
Peter V. Gehler
DiffM
89
230
0
16 Dec 2016
Learning Video Object Segmentation from Static Images
Anna Khoreva
Federico Perazzi
Rodrigo Benenson
Bernt Schiele
A. Sorkine-Hornung
VOS
87
588
0
08 Dec 2016
Pyramid Scene Parsing Network
Hengshuang Zhao
Jianping Shi
Xiaojuan Qi
Xiaogang Wang
Jiaya Jia
VOS, SSeg
665
12,046
0
04 Dec 2016
Modeling Relationships in Referential Expressions with Compositional Modular Networks
Ronghang Hu
Marcus Rohrbach
Jacob Andreas
Trevor Darrell
Kate Saenko
82
406
0
30 Nov 2016
One-Shot Video Object Segmentation
Sergi Caelles
Kevis-Kokitsi Maninis
Jordi Pont-Tuset
Laura Leal-Taixé
Daniel Cremers
Luc Van Gool
VOS
81
917
0
16 Nov 2016
Modeling Context in Referring Expressions
Licheng Yu
Patrick Poirson
Shan Yang
Alexander C. Berg
Tamara L. Berg
133
1,279
0
31 Jul 2016
Predicting Personal Traits from Facial Images using Convolutional Neural Networks Augmented with Facial Landmark Information
Yoad Lewenberg
Valliappa Chockalingam
Satinder Singh
Honglak Lee
CVBM
62
305
0
29 May 2016
Long-term Temporal Convolutions for Action Recognition
Gül Varol
Ivan Laptev
Cordelia Schmid
89
912
0
15 Apr 2016
Segmentation from Natural Language Expressions
Ronghang Hu
Marcus Rohrbach
Trevor Darrell
VLM, EgoV
83
438
0
20 Mar 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
MedIm
2.3K
194,641
0
10 Dec 2015
Natural Language Object Retrieval
Ronghang Hu
Huazhe Xu
Marcus Rohrbach
Jiashi Feng
Kate Saenko
Trevor Darrell
ObjD
108
554
0
13 Nov 2015
Generation and Comprehension of Unambiguous Object Descriptions
Junhua Mao
Jonathan Huang
Alexander Toshev
Oana-Maria Camburu
Alan Yuille
Kevin Patrick Murphy
ObjD
140
1,359
0
07 Nov 2015
Bidirectional LSTM-CRF Models for Sequence Tagging
Zhiheng Huang
Wei Xu
Kai Yu
186
4,035
0
09 Aug 2015
Holistically-Nested Edge Detection
Saining Xie
Zhuowen Tu
151
3,497
0
24 Apr 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,433
0
22 Dec 2014
Neural Turing Machines
Alex Graves
Greg Wayne
Ivo Danihelka
115
2,333
0
20 Oct 2014
Memory Networks
Jason Weston
S. Chopra
Antoine Bordes
GNN, KELM
162
1,709
0
15 Oct 2014
Deeply-Supervised Nets
Chen-Yu Lee
Saining Xie
Patrick W. Gallagher
Zhengyou Zhang
Zhuowen Tu
354
2,243
0
18 Sep 2014
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
450
20,606
0
10 Sep 2014
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt, MDE
1.7K
100,575
0
04 Sep 2014
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
Kyunghyun Cho
B. V. Merrienboer
Dzmitry Bahdanau
Yoshua Bengio
AI4CE, AIMat
270
6,791
0
03 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
584
27,345
0
01 Sep 2014