ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1506.02025
  4. Cited By
Spatial Transformer Networks
v1v2v3 (latest)

Spatial Transformer Networks

5 June 2015
Max Jaderberg
Karen Simonyan
Andrew Zisserman
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Spatial Transformer Networks"

50 / 2,879 papers shown
Title
SALISA: Saliency-based Input Sampling for Efficient Video Object
  Detection
SALISA: Saliency-based Input Sampling for Efficient Video Object Detection
B. Bejnordi
A. Habibian
Fatih Porikli
Amir Ghodrati
81
12
0
05 Apr 2022
Vision Transformer Equipped with Neural Resizer on Facial Expression
  Recognition Task
Vision Transformer Equipped with Neural Resizer on Facial Expression Recognition Task
Hyeonbin Hwang
Soyeon Kim
Wei-Jin Park
Jiho Seo
Kyungtae Ko
Hyeon Yeo
ViT
83
9
0
05 Apr 2022
Revisiting Near/Remote Sensing with Geospatial Attention
Revisiting Near/Remote Sensing with Geospatial Attention
Scott Workman
M. U. Rafique
Hunter Blanton
Nathan Jacobs
121
17
0
04 Apr 2022
FoV-Net: Field-of-View Extrapolation Using Self-Attention and
  Uncertainty
FoV-Net: Field-of-View Extrapolation Using Self-Attention and Uncertainty
Liqian Ma
Stamatios Georgoulis
Xu Jia
Luc Van Gool
62
6
0
04 Apr 2022
Time Lens++: Event-based Frame Interpolation with Parametric Non-linear
  Flow and Multi-scale Fusion
Time Lens++: Event-based Frame Interpolation with Parametric Non-linear Flow and Multi-scale Fusion
S. Tulyakov
Alfredo Bochicchio
Daniel Gehrig
Stamatios Georgoulis
Yuan-zheng Li
Davide Scaramuzza
96
125
0
31 Mar 2022
Weakly Supervised Patch Label Inference Networks for Efficient Pavement
  Distress Detection and Recognition in the Wild
Weakly Supervised Patch Label Inference Networks for Efficient Pavement Distress Detection and Recognition in the Wild
Sheng Huang
Wenhao Tang
Guixin Huang
Luwen Huangfu
Dan Yang
121
9
0
31 Mar 2022
CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs
CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs
Jiteng Mu
Shalini De Mello
Zhiding Yu
Nuno Vasconcelos
Xinyu Wang
Jan Kautz
Sifei Liu
GAN
81
18
0
30 Mar 2022
Surface Vision Transformers: Attention-Based Modelling applied to
  Cortical Analysis
Surface Vision Transformers: Attention-Based Modelling applied to Cortical Analysis
Simon Dahan
Abdulah Fawaz
Logan Z. J. Williams
Chunhui Yang
Timothy S. Coalson
M. Glasser
A. Edwards
Daniel Rueckert
E. C. Robinson
MedImViT
81
21
0
30 Mar 2022
ReplaceBlock: An improved regularization method based on background
  information
ReplaceBlock: An improved regularization method based on background information
Zhemin Zhang
Xun Gong
Jinyi Wu
OOD
30
0
0
30 Mar 2022
Efficient Virtual View Selection for 3D Hand Pose Estimation
Efficient Virtual View Selection for 3D Hand Pose Estimation
Jian Cheng
Yanguang Wan
Dexin Zuo
Cuixia Ma
Jian Gu
Ping Tan
Hongan Wang
Xiaoming Deng
Yinda Zhang
3DH
110
25
0
29 Mar 2022
Long-term Video Frame Interpolation via Feature Propagation
Long-term Video Frame Interpolation via Feature Propagation
Dawit Mureja Argaw
In So Kweon
77
8
0
29 Mar 2022
Vision Transformers in Medical Computer Vision -- A Contemplative
  Retrospection
Vision Transformers in Medical Computer Vision -- A Contemplative Retrospection
Arshi Parvaiz
Muhammad Anwaar Khalid
Rukhsana Zafar
Huma Ameer
M. Ali
M. Fraz
MedIm
79
64
0
29 Mar 2022
Affine Medical Image Registration with Coarse-to-Fine Vision Transformer
Affine Medical Image Registration with Coarse-to-Fine Vision Transformer
Tony C. W. Mok
Albert C. S. Chung
ViTMedIm
104
64
0
29 Mar 2022
Learning Optical Flow, Depth, and Scene Flow without Real-World Labels
Learning Optical Flow, Depth, and Scene Flow without Real-World Labels
Vitor Campagnolo Guizilini
Kuan-Hui Lee
Rares Andrei Ambrus
Adrien Gaidon
SSL3DPCMDE
87
47
0
28 Mar 2022
Part-based Pseudo Label Refinement for Unsupervised Person
  Re-identification
Part-based Pseudo Label Refinement for Unsupervised Person Re-identification
Y. Cho
Woo Jae Kim
Seunghoon Hong
Sung-eui Yoon
87
172
0
28 Mar 2022
Reference-based Video Super-Resolution Using Multi-Camera Video Triplets
Reference-based Video Super-Resolution Using Multi-Camera Video Triplets
Junyong Lee
Myeonghee Lee
Sunghyun Cho
Seungyong Lee
SupR
68
27
0
28 Mar 2022
UV Volumes for Real-time Rendering of Editable Free-view Human
  Performance
UV Volumes for Real-time Rendering of Editable Free-view Human Performance
Yue Chen
Xuan Wang
Xingyu Chen
Qi Zhang
Xiaoyu Li
Yu-Xiao Guo
Jue Wang
Fei Wang
DiffM3DH
120
45
0
27 Mar 2022
Visual-based Safe Landing for UAVs in Populated Areas: Real-time
  Validation in Virtual Environments
Visual-based Safe Landing for UAVs in Populated Areas: Real-time Validation in Virtual Environments
Hector Tovanche-Picon
J. Gonzalez-Trejo
A. Flores-Abad
D. Mercado-Ravell
124
7
0
25 Mar 2022
Unsupervised Image Deraining: Optimization Model Driven Deep CNN
Unsupervised Image Deraining: Optimization Model Driven Deep CNN
Changfeng Yu
Yi Chang
Yi Li
Xile Zhao
Luxin Yan
68
30
0
25 Mar 2022
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Transformers Meet Visual Learning Understanding: A Comprehensive Review
Yuting Yang
Licheng Jiao
Xuantong Liu
Fan Liu
Shuyuan Yang
Zhixi Feng
Xu Tang
ViTMedIm
120
28
0
24 Mar 2022
Industrial Style Transfer with Large-scale Geometric Warping and Content
  Preservation
Industrial Style Transfer with Large-scale Geometric Warping and Content Preservation
Jinchao Yang
Fei Guo
Shuo Chen
Jun Yu Li
Jian Yang
95
11
0
24 Mar 2022
Unsupervised Simultaneous Learning for Camera Re-Localization and Depth
  Estimation from Video
Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video
S. Taguchi
Noriaki Hirose
SSLMDE
60
1
0
24 Mar 2022
Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image
  Translation
Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation
Yanwu Xu
Shaoan Xie
Wenhao Wu
Kun Zhang
Biwei Huang
Kayhan Batmanghelich
84
25
0
23 Mar 2022
CroMo: Cross-Modal Learning for Monocular Depth Estimation
CroMo: Cross-Modal Learning for Monocular Depth Estimation
Yannick Verdié
Jifei Song
Barnabé Mas
Benjamin Busam
Alevs Leonardis
Jingyu Sun
MDE
73
15
0
23 Mar 2022
Real-time Object Detection for Streaming Perception
Real-time Object Detection for Streaming Perception
Jinrong Yang
Songtao Liu
Zeming Li
Xiaoping Li
Jian Sun
112
51
0
23 Mar 2022
Deep Frequency Filtering for Domain Generalization
Deep Frequency Filtering for Domain Generalization
Shiqi Lin
Zhizheng Zhang
Zhipeng Huang
Yan Lu
Cuiling Lan
...
Jiang Wang
Zicheng Liu
Amey Parulkar
V. Navkal
Zhibo Chen
104
51
0
23 Mar 2022
PersFormer: 3D Lane Detection via Perspective Transformer and the
  OpenLane Benchmark
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark
Li Chen
Chonghao Sima
Yang Li
Zehan Zheng
Jiajie Xu
...
Hongyang Li
Conghui He
Jianping Shi
Yu Qiao
Junchi Yan
3DPCViT
118
191
0
21 Mar 2022
Self-Supervised Road Layout Parsing with Graph Auto-Encoding
Self-Supervised Road Layout Parsing with Graph Auto-Encoding
Chenyang Lu
Gijs Dubbelman
SSL
59
1
0
21 Mar 2022
V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision
  Transformer
V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer
Runsheng Xu
Hao Xiang
Zhengzhong Tu
Xin Xia
Ming-Hsuan Yang
Jiaqi Ma
ViT
228
384
0
20 Mar 2022
Stochastic Video Prediction with Structure and Motion
Stochastic Video Prediction with Structure and Motion
Adil Kaan Akan
Sadra Safadoust
Fatma Guney
VGen
98
10
0
20 Mar 2022
TVConv: Efficient Translation Variant Convolution for Layout-aware
  Visual Processing
TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing
Jie Chen
Tianlang He
Weipeng Zhuo
Li Ma
Sangtae Ha
Shueng-Han Gary Chan
CVBM
103
25
0
20 Mar 2022
Representation-Agnostic Shape Fields
Representation-Agnostic Shape Fields
Xiaoyang Huang
Jiancheng Yang
Yanjun Wang
Ziyu Chen
Linguo Li
Teng Li
Bingbing Ni
Wenjun Zhang
63
7
0
19 Mar 2022
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text
  Detection and Text Recognition
SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition
Mingxin Huang
Yuliang Liu
Zhenghao Peng
Chongyu Liu
Dahua Lin
Shenggao Zhu
N. Yuan
Kai Ding
Lianwen Jin
ViT
83
103
0
19 Mar 2022
Fourier Document Restoration for Robust Document Dewarping and
  Recognition
Fourier Document Restoration for Robust Document Dewarping and Recognition
Chuhui Xue
Zichen Tian
Fangneng Zhan
Shijian Lu
S. Bai
135
30
0
18 Mar 2022
Semi-Supervised Learning with Mutual Distillation for Monocular Depth
  Estimation
Semi-Supervised Learning with Mutual Distillation for Monocular Depth Estimation
Jongbeom Baek
Gyeongnyeon Kim
Seung Wook Kim
FedMLMDE
126
12
0
18 Mar 2022
Surface Defect Detection and Evaluation for Marine Vessels using
  Multi-Stage Deep Learning
Surface Defect Detection and Evaluation for Marine Vessels using Multi-Stage Deep Learning
Lianshuang Yu
Kareem M. Metwaly
James Z. Wang
V. Monga
64
5
0
17 Mar 2022
Unsupervised Semantic Segmentation by Distilling Feature Correspondences
Unsupervised Semantic Segmentation by Distilling Feature Correspondences
Mark Hamilton
Zhoutong Zhang
Bharath Hariharan
Noah Snavely
William T. Freeman
59
245
0
16 Mar 2022
Interspace Pruning: Using Adaptive Filter Representations to Improve
  Training of Sparse CNNs
Interspace Pruning: Using Adaptive Filter Representations to Improve Training of Sparse CNNs
Paul Wimmer
Jens Mehnert
Alexandru Paul Condurache
CVBM
64
20
0
15 Mar 2022
A Self-Supervised, Differentiable Kalman Filter for Uncertainty-Aware
  Visual-Inertial Odometry
A Self-Supervised, Differentiable Kalman Filter for Uncertainty-Aware Visual-Inertial Odometry
Brandon Wagstaff
Emmett Wise
Jonathan Kelly
65
12
0
14 Mar 2022
RAUM-VO: Rotational Adjusted Unsupervised Monocular Visual Odometry
RAUM-VO: Rotational Adjusted Unsupervised Monocular Visual Odometry
Claudio Cimarelli
Hriday Bavle
Jose Luis Sanchez-Lopez
Holger Voos
57
6
0
14 Mar 2022
Automated Learning for Deformable Medical Image Registration by Jointly
  Optimizing Network Architectures and Objective Functions
Automated Learning for Deformable Medical Image Registration by Jointly Optimizing Network Architectures and Objective Functions
Xin-Yue Fan
Zi Li
Ziyang Li
Xiaolin Wang
Risheng Liu
Zhongxuan Luo
Hao Huang
97
12
0
14 Mar 2022
Euclidean Invariant Recognition of 2D Shapes Using Histograms of
  Magnitudes of Local Fourier-Mellin Descriptors
Euclidean Invariant Recognition of 2D Shapes Using Histograms of Magnitudes of Local Fourier-Mellin Descriptors
Xinhua Zhang
L. Williams
18
1
0
13 Mar 2022
Training Protocol Matters: Towards Accurate Scene Text Recognition via
  Training Protocol Searching
Training Protocol Matters: Towards Accurate Scene Text Recognition via Training Protocol Searching
Xiaojie Chu
Yongtao Wang
Chunhua Shen
Jingdong Chen
Wei Chu
40
1
0
13 Mar 2022
Deep learning-based conditional inpainting for restoration of
  artifact-affected 4D CT images
Deep learning-based conditional inpainting for restoration of artifact-affected 4D CT images
F. Madesta
T. Sentker
T. Gauer
R. Werner
MedIm
28
9
0
12 Mar 2022
PC-SwinMorph: Patch Representation for Unsupervised Medical Image
  Registration and Segmentation
PC-SwinMorph: Patch Representation for Unsupervised Medical Image Registration and Segmentation
Lihao Liu
Zhening Huang
Pietro Lio
Carola-Bibiane Schönlieb
Angelica I. Aviles-Rivero
78
16
0
10 Mar 2022
Transfer of Representations to Video Label Propagation: Implementation
  Factors Matter
Transfer of Representations to Video Label Propagation: Implementation Factors Matter
Daniel McKee
Zitong Zhan
Bing Shuai
Davide Modolo
Joseph Tighe
Svetlana Lazebnik
SSL
53
4
0
10 Mar 2022
SelfTune: Metrically Scaled Monocular Depth Estimation through
  Self-Supervised Learning
SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning
Jaehoon Choi
Dongki Jung
Yonghan Lee
Deok-Won Kim
Tianyi Zhou
Donghwan Lee
MDESSL
65
5
0
10 Mar 2022
An Audio-Visual Attention Based Multimodal Network for Fake Talking Face
  Videos Detection
An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection
Gang Wang
Peng Zhang
Lei Xie
Wei Huang
Yufei Zha
Yanni Zhang
CVBM
77
5
0
10 Mar 2022
LiftReg: Limited Angle 2D/3D Deformable Registration
LiftReg: Limited Angle 2D/3D Deformable Registration
Lin Tian
Yueh Z. Lee
Raúl San José Estépar
Marc Niethammer
74
8
0
10 Mar 2022
Resource-Efficient Invariant Networks: Exponential Gains by Unrolled
  Optimization
Resource-Efficient Invariant Networks: Exponential Gains by Unrolled Optimization
Sam Buchanan
Jingkai Yan
Ellie Haber
John N. Wright
65
3
0
09 Mar 2022
Previous
123...151617...565758
Next