ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.08956
  4. Cited By
Panoramic Vision Transformer for Saliency Detection in 360° Videos

Panoramic Vision Transformer for Saliency Detection in 360° Videos

19 September 2022
Heeseung Yun
Se-Ho Lee
Gunhee Kim
    ViTMDE
ArXiv (abs)PDFHTML

Papers citing "Panoramic Vision Transformer for Saliency Detection in 360° Videos"

29 / 29 papers shown
Title
Online Refinement of Low-level Feature Based Activation Map for Weakly
  Supervised Object Localization
Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization
Jinheng Xie
Cheng Luo
Xiangping Zhu
Ziqi Jin
Weizeng Lu
Linlin Shen
WSOL
57
55
0
12 Oct 2021
Pano-AVQA: Grounded Audio-Visual Question Answering on 360$^\circ$
  Videos
Pano-AVQA: Grounded Audio-Visual Question Answering on 360∘^\circ∘ Videos
Heeseung Yun
Youngjae Yu
Wonsuk Yang
Kangil Lee
Gunhee Kim
96
86
0
11 Oct 2021
Localizing Objects with Self-Supervised Transformers and no Labels
Localizing Objects with Self-Supervised Transformers and no Labels
Oriane Siméoni
Gilles Puy
Huy V. Vo
Simon Roburin
Spyros Gidaris
Andrei Bursuc
P. Pérez
Renaud Marlet
Jean Ponce
ViT
239
203
0
29 Sep 2021
Improving 360 Monocular Depth Estimation via Non-local Dense Prediction
  Transformer and Joint Supervised and Self-supervised Learning
Improving 360 Monocular Depth Estimation via Non-local Dense Prediction Transformer and Joint Supervised and Self-supervised Learning
I. Yun
Hyuk-Jae Lee
Chae-Eun Rhee
ViTMDE
64
28
0
22 Sep 2021
SegFormer: Simple and Efficient Design for Semantic Segmentation with
  Transformers
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
Enze Xie
Wenhai Wang
Zhiding Yu
Anima Anandkumar
J. Álvarez
Ping Luo
ViT
332
5,095
0
31 May 2021
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
735
6,139
0
29 Apr 2021
VidTr: Video Transformer Without Convolutions
VidTr: Video Transformer Without Convolutions
Yanyi Zhang
Xinyu Li
Chunhui Liu
Bing Shuai
Yi Zhu
Biagio Brattoli
Hao Chen
I. Marsic
Joseph Tighe
ViT
241
197
0
23 Apr 2021
Multiscale Vision Transformers
Multiscale Vision Transformers
Haoqi Fan
Bo Xiong
K. Mangalam
Yanghao Li
Zhicheng Yan
Jitendra Malik
Christoph Feichtenhofer
ViT
135
1,265
0
22 Apr 2021
An Empirical Study of Training Self-Supervised Vision Transformers
An Empirical Study of Training Self-Supervised Vision Transformers
Xinlei Chen
Saining Xie
Kaiming He
ViT
161
1,873
0
05 Apr 2021
Going deeper with Image Transformers
Going deeper with Image Transformers
Hugo Touvron
Matthieu Cord
Alexandre Sablayrolles
Gabriel Synnaeve
Hervé Jégou
ViT
165
1,021
0
31 Mar 2021
Is Space-Time Attention All You Need for Video Understanding?
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
403
2,066
0
09 Feb 2021
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective
  with Transformers
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Sixiao Zheng
Jiachen Lu
Hengshuang Zhao
Xiatian Zhu
Zekun Luo
...
Yanwei Fu
Jianfeng Feng
Tao Xiang
Philip Torr
Li Zhang
ViT
194
2,912
0
31 Dec 2020
Point Transformer
Point Transformer
Nico Engel
Vasileios Belagiannis
Klaus C. J. Dietmayer
3DPC
186
2,008
0
02 Nov 2020
Tangent Images for Mitigating Spherical Distortion
Tangent Images for Mitigating Spherical Distortion
Marc Eder
Mykhailo Shvets
John Lim
Jan-Michael Frahm
69
107
0
19 Dec 2019
Vision-Language Navigation with Self-Supervised Auxiliary Reasoning
  Tasks
Vision-Language Navigation with Self-Supervised Auxiliary Reasoning Tasks
Fengda Zhu
Yi Zhu
Xiaojun Chang
Xiaodan Liang
LRM
103
243
0
18 Nov 2019
Orientation-aware Semantic Segmentation on Icosahedron Spheres
Orientation-aware Semantic Segmentation on Icosahedron Spheres
Chao Zhang
Stephan Liwicki
William A. P. Smith
R. Cipolla
71
80
0
30 Jul 2019
WoodScape: A multi-task, multi-camera fisheye dataset for autonomous
  driving
WoodScape: A multi-task, multi-camera fisheye dataset for autonomous driving
S. Yogamani
Ciarán Hughes
Jonathan Horgan
Ganesh Sistu
P. Varley
...
Sumanth Chennupati
Sanjaya Nayak
Saquib Mansoor
Xavier Perroton
P. Pérez
HAI
75
266
0
04 May 2019
Multi-source weak supervision for saliency detection
Multi-source weak supervision for saliency detection
Yu Zeng
Yunzhi Zhuge
Huchuan Lu
Lulu Zhang
Mingyang Qian
Yizhou Yu
82
169
0
01 Apr 2019
Spherical CNNs on Unstructured Grids
Spherical CNNs on Unstructured Grids
Chiyu “Max”
Jingwei Huang
K. Kashinath
Lawrence Berkeley Nat’l Lab
P. Marcus
Matthias Niessner
95
183
0
07 Jan 2019
Kernel Transformer Networks for Compact Spherical Convolution
Kernel Transformer Networks for Compact Spherical Convolution
Yu-Chuan Su
Kristen Grauman
ViT
67
125
0
07 Dec 2018
SpherePHD: Applying CNNs on a Spherical PolyHeDron Representation of 360
  degree Images
SpherePHD: Applying CNNs on a Spherical PolyHeDron Representation of 360 degree Images
Yeonkun Lee
Jaeseok Jeong
J. Yun
Wonjune Cho
Kuk-Jin Yoon
85
99
0
20 Nov 2018
Cube Padding for Weakly-Supervised Saliency Prediction in 360°
  Videos
Cube Padding for Weakly-Supervised Saliency Prediction in 360° Videos
Hsien-Tzu Cheng
Chun-Hung Chao
Jin-Dong Dong
Hao Wen
Tyng-Luh Liu
Min Sun
70
193
0
04 Jun 2018
A Deep Ranking Model for Spatio-Temporal Highlight Detection from a 360
  Video
A Deep Ranking Model for Spatio-Temporal Highlight Detection from a 360 Video
Youngjae Yu
Sangho Lee
Joonil Na
Jaeyun Kang
Gunhee Kim
46
44
0
31 Jan 2018
Learning SO(3) Equivariant Representations with Spherical CNNs
Learning SO(3) Equivariant Representations with Spherical CNNs
Carlos Esteves
Christine Allen-Blanchette
A. Makadia
Kostas Daniilidis
121
515
0
17 Nov 2017
SalGAN: Visual Saliency Prediction with Generative Adversarial Networks
SalGAN: Visual Saliency Prediction with Generative Adversarial Networks
Junting Pan
Cristian Canton Ferrer
Kevin McGuinness
Noel E. O'Connor
Jordi Torres
E. Sayrol
Xavier Giró-i-Nieto
GAN
92
398
0
04 Jan 2017
Pano2Vid: Automatic Cinematography for Watching 360$^{\circ}$ Videos
Pano2Vid: Automatic Cinematography for Watching 360∘^{\circ}∘ Videos
Yu-Chuan Su
Dinesh Jayaraman
Kristen Grauman
VGen
69
129
0
07 Dec 2016
Layer Normalization
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
435
10,541
0
21 Jul 2016
Gaussian Error Linear Units (GELUs)
Gaussian Error Linear Units (GELUs)
Dan Hendrycks
Kevin Gimpel
174
5,049
0
27 Jun 2016
Learning Deep Features for Discriminative Localization
Learning Deep Features for Discriminative Localization
Bolei Zhou
A. Khosla
Àgata Lapedriza
A. Oliva
Antonio Torralba
SSLSSegFAtt
253
9,342
0
14 Dec 2015
1