Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2209.08956
Cited By
Panoramic Vision Transformer for Saliency Detection in 360° Videos
19 September 2022
Heeseung Yun
Se-Ho Lee
Gunhee Kim
ViT
MDE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Panoramic Vision Transformer for Saliency Detection in 360° Videos"
29 / 29 papers shown
Title
Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization
Jinheng Xie
Cheng Luo
Xiangping Zhu
Ziqi Jin
Weizeng Lu
Linlin Shen
WSOL
57
55
0
12 Oct 2021
Pano-AVQA: Grounded Audio-Visual Question Answering on 360
∘
^\circ
∘
Videos
Heeseung Yun
Youngjae Yu
Wonsuk Yang
Kangil Lee
Gunhee Kim
96
86
0
11 Oct 2021
Localizing Objects with Self-Supervised Transformers and no Labels
Oriane Siméoni
Gilles Puy
Huy V. Vo
Simon Roburin
Spyros Gidaris
Andrei Bursuc
P. Pérez
Renaud Marlet
Jean Ponce
ViT
239
203
0
29 Sep 2021
Improving 360 Monocular Depth Estimation via Non-local Dense Prediction Transformer and Joint Supervised and Self-supervised Learning
I. Yun
Hyuk-Jae Lee
Chae-Eun Rhee
ViT
MDE
64
28
0
22 Sep 2021
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
Enze Xie
Wenhai Wang
Zhiding Yu
Anima Anandkumar
J. Álvarez
Ping Luo
ViT
332
5,095
0
31 May 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
735
6,139
0
29 Apr 2021
VidTr: Video Transformer Without Convolutions
Yanyi Zhang
Xinyu Li
Chunhui Liu
Bing Shuai
Yi Zhu
Biagio Brattoli
Hao Chen
I. Marsic
Joseph Tighe
ViT
241
197
0
23 Apr 2021
Multiscale Vision Transformers
Haoqi Fan
Bo Xiong
K. Mangalam
Yanghao Li
Zhicheng Yan
Jitendra Malik
Christoph Feichtenhofer
ViT
135
1,265
0
22 Apr 2021
An Empirical Study of Training Self-Supervised Vision Transformers
Xinlei Chen
Saining Xie
Kaiming He
ViT
161
1,873
0
05 Apr 2021
Going deeper with Image Transformers
Hugo Touvron
Matthieu Cord
Alexandre Sablayrolles
Gabriel Synnaeve
Hervé Jégou
ViT
165
1,021
0
31 Mar 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
403
2,066
0
09 Feb 2021
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Sixiao Zheng
Jiachen Lu
Hengshuang Zhao
Xiatian Zhu
Zekun Luo
...
Yanwei Fu
Jianfeng Feng
Tao Xiang
Philip Torr
Li Zhang
ViT
194
2,912
0
31 Dec 2020
Point Transformer
Nico Engel
Vasileios Belagiannis
Klaus C. J. Dietmayer
3DPC
186
2,008
0
02 Nov 2020
Tangent Images for Mitigating Spherical Distortion
Marc Eder
Mykhailo Shvets
John Lim
Jan-Michael Frahm
69
107
0
19 Dec 2019
Vision-Language Navigation with Self-Supervised Auxiliary Reasoning Tasks
Fengda Zhu
Yi Zhu
Xiaojun Chang
Xiaodan Liang
LRM
103
243
0
18 Nov 2019
Orientation-aware Semantic Segmentation on Icosahedron Spheres
Chao Zhang
Stephan Liwicki
William A. P. Smith
R. Cipolla
71
80
0
30 Jul 2019
WoodScape: A multi-task, multi-camera fisheye dataset for autonomous driving
S. Yogamani
Ciarán Hughes
Jonathan Horgan
Ganesh Sistu
P. Varley
...
Sumanth Chennupati
Sanjaya Nayak
Saquib Mansoor
Xavier Perroton
P. Pérez
HAI
75
266
0
04 May 2019
Multi-source weak supervision for saliency detection
Yu Zeng
Yunzhi Zhuge
Huchuan Lu
Lulu Zhang
Mingyang Qian
Yizhou Yu
82
169
0
01 Apr 2019
Spherical CNNs on Unstructured Grids
Chiyu “Max”
Jingwei Huang
K. Kashinath
Lawrence Berkeley Nat’l Lab
P. Marcus
Matthias Niessner
95
183
0
07 Jan 2019
Kernel Transformer Networks for Compact Spherical Convolution
Yu-Chuan Su
Kristen Grauman
ViT
67
125
0
07 Dec 2018
SpherePHD: Applying CNNs on a Spherical PolyHeDron Representation of 360 degree Images
Yeonkun Lee
Jaeseok Jeong
J. Yun
Wonjune Cho
Kuk-Jin Yoon
85
99
0
20 Nov 2018
Cube Padding for Weakly-Supervised Saliency Prediction in 360° Videos
Hsien-Tzu Cheng
Chun-Hung Chao
Jin-Dong Dong
Hao Wen
Tyng-Luh Liu
Min Sun
70
193
0
04 Jun 2018
A Deep Ranking Model for Spatio-Temporal Highlight Detection from a 360 Video
Youngjae Yu
Sangho Lee
Joonil Na
Jaeyun Kang
Gunhee Kim
46
44
0
31 Jan 2018
Learning SO(3) Equivariant Representations with Spherical CNNs
Carlos Esteves
Christine Allen-Blanchette
A. Makadia
Kostas Daniilidis
121
515
0
17 Nov 2017
SalGAN: Visual Saliency Prediction with Generative Adversarial Networks
Junting Pan
Cristian Canton Ferrer
Kevin McGuinness
Noel E. O'Connor
Jordi Torres
E. Sayrol
Xavier Giró-i-Nieto
GAN
92
398
0
04 Jan 2017
Pano2Vid: Automatic Cinematography for Watching 360
∘
^{\circ}
∘
Videos
Yu-Chuan Su
Dinesh Jayaraman
Kristen Grauman
VGen
69
129
0
07 Dec 2016
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
435
10,541
0
21 Jul 2016
Gaussian Error Linear Units (GELUs)
Dan Hendrycks
Kevin Gimpel
174
5,049
0
27 Jun 2016
Learning Deep Features for Discriminative Localization
Bolei Zhou
A. Khosla
Àgata Lapedriza
A. Oliva
Antonio Torralba
SSL
SSeg
FAtt
253
9,342
0
14 Dec 2015
1