1806.01810
Videos as Space-Time Region Graphs
5 June 2018
Xiaolong Wang
Abhinav Gupta
Papers citing
"Videos as Space-Time Region Graphs"
50 / 154 papers shown
Title
ActionCLIP: A New Paradigm for Video Action Recognition
Mengmeng Wang
Jiazheng Xing
Yong Liu
VLM
152
362
0
17 Sep 2021
EAN: Event Adaptive Network for Enhanced Action Recognition
Yuan Tian
Yichao Yan
Guangtao Zhai
G. Guo
Zhiyong Gao
35
41
0
22 Jul 2021
Instance-Level Relative Saliency Ranking with Graph Reasoning
Nian Liu
Long Li
Wangbo Zhao
Junwei Han
Ling Shao
30
27
0
08 Jul 2021
Graph Convolution for Re-ranking in Person Re-identification
Yuqi Zhang
Qiang Qi
Chong Liu
Weihua Chen
Fan Wang
Hao Li
Rong Jin
25
12
0
05 Jul 2021
Hierarchical Object-oriented Spatio-Temporal Reasoning for Video Question Answering
Long Hoang Dang
T. Le
Vuong Le
T. Tran
30
60
0
25 Jun 2021
Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation
Lei Ke
Xia Li
Martin Danelljan
Yu-Wing Tai
Chi-Keung Tang
Fisher Yu
VOS
21
71
0
22 Jun 2021
Towards Long-Form Video Understanding
Chao-Yuan Wu
Philipp Krähenbühl
VLM
ViT
49
166
0
21 Jun 2021
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
Michael S. Ryoo
A. Piergiovanni
Anurag Arnab
Mostafa Dehghani
A. Angelova
ViT
37
127
0
21 Jun 2021
Exploring Visual Context for Weakly Supervised Person Search
Yichao Yan
Jinpeng Li
Tianran Ouyang
Jie Qin
Bingbing Ni
Xiaokang Yang
Ling Shao
34
33
0
19 Jun 2021
Semi-Supervised 3D Hand-Object Poses Estimation with Interactions in Time
Shao-Wei Liu
Hanwen Jiang
Jiarui Xu
Sifei Liu
Xiaolong Wang
3DH
38
161
0
09 Jun 2021
CT-Net: Channel Tensorization Network for Video Classification
Kunchang Li
Xianhang Li
Yali Wang
Jun Wang
Yu Qiao
ViT
30
55
0
03 Jun 2021
DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning
Wenhao Wu
Yuxiang Zhao
Yanwu Xu
Xiao Tan
Dongliang He
...
Jinxing Ye
Yingying Li
Mingde Yao
Zichao Dong
Yifeng Shi
AI4TS
30
27
0
25 May 2021
Cross-Modal Progressive Comprehension for Referring Segmentation
Si Liu
Tianrui Hui
Shaofei Huang
Yunchao Wei
Bo-wen Li
Guanbin Li
EgoV
VOS
28
123
0
15 May 2021
Adaptive Focus for Efficient Video Recognition
Yulin Wang
Zhaoxi Chen
Haojun Jiang
Shiji Song
Yizeng Han
Gao Huang
45
98
0
07 May 2021
Probabilistic Ranking-Aware Ensembles for Enhanced Object Detections
Mingyuan Mao
Baochang Zhang
David Doermann
Jie Guo
Shumin Han
Yuan Feng
Xiaodi Wang
Errui Ding
14
2
0
07 May 2021
Learning Multi-Granular Hypergraphs for Video-Based Person Re-Identification
Yichao Yan
Jie Qin
Jiaxin Chen
Li Liu
Fan Zhu
Ying Tai
Ling Shao
25
130
0
30 Apr 2021
Interaction-GCN: A Graph Convolutional Network based framework for social interaction recognition in egocentric videos
Simone Felicioni
Mariella Dimiccoli
26
1
0
28 Apr 2021
Multiscale Vision Transformers
Haoqi Fan
Bo Xiong
K. Mangalam
Yanghao Li
Zhicheng Yan
Jitendra Malik
Christoph Feichtenhofer
ViT
63
1,224
0
22 Apr 2021
Spatiotemporal Deformable Scene Graphs for Complex Activity Detection
Salman Khan
Fabio Cuzzolin
3DPC
51
5
0
16 Apr 2021
Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
Weiyao Wang
Matt Feiszli
Heng Wang
Du Tran
VOS
15
123
0
10 Apr 2021
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning
Zhenfang Chen
Jiayuan Mao
Jiajun Wu
Kwan-Yee K. Wong
J. Tenenbaum
Chuang Gan
VGen
36
92
0
30 Mar 2021
No frame left behind: Full Video Action Recognition
X. Liu
S. Pintea
F. Karimi Nejadasl
Olaf Booij
Jan van Gemert
19
40
0
29 Mar 2021
Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers
Lei Ke
Yu-Wing Tai
Chi-Keung Tang
ISeg
30
165
0
23 Mar 2021
Decoupled Spatial Temporal Graphs for Generic Visual Grounding
Qi Feng
Yunchao Wei
Ming-Ming Cheng
Yi Yang
27
5
0
18 Mar 2021
Foundations of Population-Based SHM, Part IV: The Geometry of Spaces of Structures and their Feature Spaces
G. Tsialiamanis
Charilaos Mylonas
Eleni Chatzi
N. Dervilis
D. Wagg
Keith Worden
28
40
0
05 Mar 2021
Learning to Recognize Actions on Objects in Egocentric Video with Attention Dictionaries
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
EgoV
27
15
0
16 Feb 2021
TDN: Temporal Difference Networks for Efficient Action Recognition
Limin Wang
Zhan Tong
Bin Ji
Gangshan Wu
28
391
0
18 Dec 2020
GTA: Global Temporal Attention for Video Action Understanding
Bo He
Xitong Yang
Zuxuan Wu
Hao Chen
Ser-Nam Lim
Abhinav Shrivastava
ViT
33
27
0
15 Dec 2020
A Comprehensive Study of Deep Video Action Recognition
Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi-Li Zhang
Joseph Tighe
R. Manmatha
Mu Li
VLM
AI4TS
38
185
0
11 Dec 2020
Spatial-Temporal Alignment Network for Action Recognition and Detection
Junwei Liang
Liangliang Cao
Xuehan Xiong
Ting Yu
Alexander G. Hauptmann
3DPC
16
9
0
04 Dec 2020
VLG-Net: Video-Language Graph Matching Network for Video Grounding
Mattia Soldan
Mengmeng Xu
Sisi Qu
Jesper N. Tegnér
Guohao Li
35
69
0
19 Nov 2020
Object-aware Feature Aggregation for Video Object Detection
Qichuan Geng
Hong Zhang
Na Jiang
Xiaojuan Qi
Liangjun Zhang
Zhongjun Zhou
VOS
36
3
0
23 Oct 2020
Pose And Joint-Aware Action Recognition
Anshul B. Shah
Shlok Kumar Mishra
Ankan Bansal
Jun-Cheng Chen
Ramalingam Chellappa
Abhinav Shrivastava
39
33
0
16 Oct 2020
Effective Action Recognition with Embedded Key Point Shifts
Haozhi Cao
Yuecong Xu
Jianfei Yang
K. Mao
Jianxiong Yin
Simon See
15
7
0
26 Aug 2020
AssembleNet++: Assembling Modality Representations via Attention Connections
Michael S. Ryoo
A. Piergiovanni
Juhana Kangaspunta
A. Angelova
15
44
0
18 Aug 2020
Bipartite Graph Reasoning GANs for Person Image Generation
Hao Tang
S. Bai
Philip Torr
N. Sebe
32
58
0
10 Aug 2020
PAN: Towards Fast Action Recognition via Learning Persistence of Appearance
Can Zhang
Yuexian Zou
Guang Chen
Lei Gan
15
39
0
08 Aug 2020
Cascade Graph Neural Networks for RGB-D Salient Object Detection
Ao Luo
Xin Li
Fan Yang
Zhicheng Jiao
Hong Cheng
Siwei Lyu
19
107
0
07 Aug 2020
Boundary Content Graph Neural Network for Temporal Action Proposal Generation
Y. Bai
Yingying Wang
Yunhai Tong
Yang Yang
Qiyue Liu
Junhui Liu
27
161
0
04 Aug 2020
Approximated Bilinear Modules for Temporal Modeling
Xinqi Zhu
Chang Xu
Langwen Hui
Cewu Lu
Dacheng Tao
25
23
0
25 Jul 2020
MotionSqueeze: Neural Motion Feature Learning for Video Understanding
Heeseung Kwon
Manjin Kim
Suha Kwak
Minsu Cho
FAtt
20
128
0
20 Jul 2020
Visual Relation Grounding in Videos
Junbin Xiao
Xindi Shang
Xun Yang
Sheng Tang
Tat-Seng Chua
20
40
0
17 Jul 2020
SumGraph: Video Summarization via Recursive Graph Modeling
Jungin Park
Jiyoung Lee
Ig-Jae Kim
Kwanghoon Sohn
27
53
0
17 Jul 2020
Temporal Distinct Representation Learning for Action Recognition
Junwu Weng
Donghao Luo
Yabiao Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Xudong Jiang
Junsong Yuan
17
26
0
15 Jul 2020
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers
Shijie Geng
Peng Gao
Moitreya Chatterjee
Chiori Hori
Jonathan Le Roux
Yongfeng Zhang
Hongsheng Li
A. Cherian
27
11
0
08 Jul 2020
Comprehensive Information Integration Modeling Framework for Video Titling
Shengyu Zhang
Ziqi Tan
Jin Yu
Zhou Zhao
Kun Kuang
Tan Jiang
Jingren Zhou
Hongxia Yang
Fei Wu
31
40
0
24 Jun 2020
Exploiting Visual Semantic Reasoning for Video-Text Retrieval
Zerun Feng
Zhimin Zeng
Caili Guo
Zheng Li
22
34
0
16 Jun 2020
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization
Junting Pan
Siyu Chen
Zheng Shou
Yu Liu
Jing Shao
Hongsheng Li
3DPC
19
150
0
14 Jun 2020
GNN3DMOT: Graph Neural Network for 3D Multi-Object Tracking with Multi-Feature Learning
Xinshuo Weng
Yongxin Wang
Yunze Man
Kris Kitani
3DPC
VOT
45
215
0
12 Jun 2020
Egocentric Object Manipulation Graphs
Eadom Dessalene
Michael Maynord
Chinmaya Devaraj
Cornelia Fermüller
Yiannis Aloimonos
EgoV
27
19
0
05 Jun 2020