Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.02025
Cited By
v1
v2
v3 (latest)
Spatial Transformer Networks
5 June 2015
Max Jaderberg
Karen Simonyan
Andrew Zisserman
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Spatial Transformer Networks"
50 / 2,879 papers shown
Title
SoftEnNet: Symbiotic Monocular Depth Estimation and Lumen Segmentation for Colonoscopy Endorobots
Alwyn Mathew
Ludovic Magerand
Emanuele Trucco
L. Manfredi
MedIm
52
2
0
19 Jan 2023
Learning Transformations To Reduce the Geometric Shift in Object Detection
Vidit Vidit
Martin Engilberge
Mathieu Salzmann
ObjD
88
4
0
13 Jan 2023
Elevation Estimation-Driven Building 3D Reconstruction from Single-View Remote Sensing Imagery
Yongqiang Mao
Kaiqiang Chen
Liangjin Zhao
Wei Chen
Deke Tang
Wenjie Liu
Zhirui Wang
Wenhui Diao
Xian Sun
Kun Fu
109
33
0
11 Jan 2023
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling
Keyu Tian
Yi Jiang
Qishuai Diao
Chen Lin
Liwei Wang
Zehuan Yuan
89
106
0
09 Jan 2023
Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review
Reza Azad
Amirhossein Kazerouni
Moein Heidari
Ehsan Khodapanah Aghdam
Amir Molaei
Yiwei Jia
Abin Jose
Rijo Roy
Dorit Merhof
MedIm
ViT
118
188
0
09 Jan 2023
A Novel Improved Mask RCNN for Multiple Targets Detection in the Indoor Complex Scenes
Zongmin Liu
Jirui Wang
Jie Li
Peng Liu
Kai Ren
37
2
0
07 Jan 2023
DepthP+P: Metric Accurate Monocular Depth Estimation using Planar and Parallax
Sadra Safadoust
Fatma Guney
MDE
48
0
0
05 Jan 2023
RecRecNet: Rectangling Rectified Wide-Angle Images by Thin-Plate Spline Model and DoF-based Curriculum Learning
K. Liao
Lang Nie
Chunyu Lin
Zishuo Zheng
Yao Zhao
86
12
0
04 Jan 2023
Rethinking Rotation Invariance with Point Cloud Registration
Jianhui Yu
Chaoyi Zhang
Weidong (Tom) Cai
3DPC
112
7
0
31 Dec 2022
Morphology-based non-rigid registration of coronary computed tomography and intravascular images through virtual catheter path optimization
Karim Kadry
Abhishek Karmakar
A. Schuh
K. Petersen
M. Schaap
David Marlevi
Charles Taylor
E. Edelman
F. R. Nezami
60
5
0
30 Dec 2022
Transformers in Action Recognition: A Review on Temporal Modeling
Elham Shabaninia
Hossein Nezamabadi-pour
Fatemeh Shafizadegan
ViT
71
9
0
29 Dec 2022
SynCLay: Interactive Synthesis of Histology Images from Bespoke Cellular Layouts
Srijay Deshpande
Muhammad Dawood
F. Minhas
Nasir M. Rajpoot
MedIm
56
10
0
28 Dec 2022
MVTN: Learning Multi-View Transformations for 3D Understanding
Abdullah Hamdi
Faisal AlZahrani
Silvio Giancola
Guohao Li
3DV
3DPC
139
6
0
27 Dec 2022
A Survey of Face Recognition
Xinyi Wang
Jianteng Peng
Sufang Zhang
Bihui Chen
Yi Wang
Yan-Hua Guo
CVBM
111
0
0
26 Dec 2022
Learning to Detect and Segment for Open Vocabulary Object Detection
Tao Wang
Nan Li
VLM
ObjD
83
25
0
23 Dec 2022
FunkNN: Neural Interpolation for Functional Generation
AmirEhsan Khorashadizadeh
Anadi Chaman
Valentin Debarnot
Ivan Dokmanić
74
8
0
20 Dec 2022
Towards Unsupervised Visual Reasoning: Do Off-The-Shelf Features Know How to Reason?
Monika Wysoczañska
Tom Monnier
Tomasz Trzciñski
David Picard
ReLM
OCL
73
1
0
20 Dec 2022
Enhancing Indic Handwritten Text Recognition Using Global Semantic Information
Ajoy Mondal
C. V. Jawahar
69
2
0
15 Dec 2022
BKinD-3D: Self-Supervised 3D Keypoint Discovery from Multi-View Videos
Jennifer J. Sun
Lili Karashchuk
Amil Dravid
Serim Ryou
Sonia Fereidooni
...
Bingni W. Brunton
Georgia Gkioxari
Ann Kennedy
Yisong Yue
Pietro Perona
3DPC
72
16
0
14 Dec 2022
Improving Warped Planar Object Detection Network For Automatic License Plate Recognition
Nguyen Dinh Tra
Nguyen Cong Tri
P. D. Hung
39
1
0
14 Dec 2022
CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information Fusion
Zizhang Wu
Man Wang
Weiwei Sun
Yuchen Li
Tianhao Xu
Fan Wang
Keke Huang
110
4
0
13 Dec 2022
CbwLoss: Constrained Bidirectional Weighted Loss for Self-supervised Learning of Depth and Pose
Fei Wang
Jun Cheng
Penglei Liu
SSL
80
4
0
12 Dec 2022
Detection Selection Algorithm: A Likelihood based Optimization Method to Perform Post Processing for Object Detection
An Fan
Benjamin Ticknor
Y. Amit
63
0
0
12 Dec 2022
Benchmark Dataset and Effective Inter-Frame Alignment for Real-World Video Super-Resolution
Ruohao Wang
Xiaohui Liu
Zhilu Zhang
Xiaohe Wu
Chunling Feng
Lei Zhang
W. Zuo
SupR
64
8
0
10 Dec 2022
Progressive Multi-view Human Mesh Recovery with Self-Supervision
Xuan Gong
Liangchen Song
Meng Zheng
Benjamin Planche
Terrence Chen
Junsong Yuan
David Doermann
Ziyan Wu
3DH
82
13
0
10 Dec 2022
Motion and Context-Aware Audio-Visual Conditioned Video Prediction
Yating Xu
Conghui Hu
G. Lee
VGen
116
0
0
09 Dec 2022
ERNet: Unsupervised Collective Extraction and Registration in Neuroimaging Data
Yao Su
Zhentian Qian
Lifang He
Xiangnan Kong
37
3
0
06 Dec 2022
ABN: Anti-Blur Neural Networks for Multi-Stage Deformable Image Registration
Yao Su
Xin Dai
Lifang He
Xiangnan Kong
MedIm
38
4
0
06 Dec 2022
Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields
Rohith Agaram
Shaurya Dewan
Rahul Sajnani
A. Poulenard
Madhava Krishna
Srinath Sridhar
88
6
0
05 Dec 2022
Learning to See Through with Events
Lei Yu
Xiang Zhang
Wei Liao
Wentao Yang
Guisong Xia
82
14
0
05 Dec 2022
Visual Question Answering From Another Perspective: CLEVR Mental Rotation Tests
Christopher Beckham
Martin Weiss
Florian Golemo
S. Honari
Derek Nowrouzezahrai
C. Pal
112
7
0
03 Dec 2022
Learning Disentangled Label Representations for Multi-label Classification
Jian Jia
Fei He
Naiyu Gao
Xiaotang Chen
Kaiqi Huang
95
3
0
02 Dec 2022
ObjectStitch: Generative Object Compositing
Yi-Zhe Song
Zhifei Zhang
Zhe Lin
Scott D. Cohen
Brian L. Price
Jianming Zhang
Seunggeun Kim
Daniel G. Aliaga
DiffM
122
33
0
02 Dec 2022
ViewNet: Unsupervised Viewpoint Estimation from Conditional Generation
Octave Mariotti
Oisin Mac Aodha
Hakan Bilen
94
8
0
01 Dec 2022
Part-based Face Recognition with Vision Transformers
Zhonglin Sun
Georgios Tzimiropoulos
ViT
102
17
0
30 Nov 2022
Fourier-Net: Fast Image Registration with Band-limited Deformation
Xiaogang Jia
Joseph Bartlett
Wei Chen
Siyang Song
T. Zhang
Xi Cheng
Wenqi Lu
Zhaowen Qiu
Jinming Duan
62
28
0
29 Nov 2022
Wearing the Same Outfit in Different Ways -- A Controllable Virtual Try-on Method
Kedan Li
Jeffrey Zhang
Shao-Yu Chang
David A. Forsyth
DiffM
69
0
0
29 Nov 2022
Self-Supervised Surgical Instrument 3D Reconstruction from a Single Camera Image
Ange Lou
X. Yao
Ziteng Liu
Jingjing Han
J. Noble
47
9
0
26 Nov 2022
WALDO: Future Video Synthesis using Object Layer Decomposition and Parametric Flow Prediction
G. L. Moing
Jean Ponce
Cordelia Schmid
80
6
0
25 Nov 2022
RUST: Latent Neural Scene Representations from Unposed Imagery
Mehdi S. M. Sajjadi
Aravindh Mahendran
Thomas Kipf
Etienne Pot
Daniel Duckworth
Mario Lucic
Klaus Greff
ViT
89
31
0
25 Nov 2022
AFR-Net: Attention-Driven Fingerprint Recognition Network
Steven A. Grosz
A.K. Jain
ViT
108
30
0
25 Nov 2022
Unsupervised 3D Keypoint Discovery with Multi-View Geometry
S. Honari
Chen Zhao
Mathieu Salzmann
Pascal Fua
3DH
70
1
0
23 Nov 2022
Semantic-aware One-shot Face Re-enactment with Dense Correspondence Estimation
Yunfan Liu
Qi Li
Zhen Sun
Tieniu Tan
CVBM
64
0
0
23 Nov 2022
AeDet: Azimuth-invariant Multi-view 3D Object Detection
Chengjian Feng
Zequn Jie
Yujie Zhong
Xiangxiang Chu
Lin Ma
3DPC
51
21
0
22 Nov 2022
The Monocular Depth Estimation Challenge
Jaime Spencer
Chao Qian
Chris Russell
Simon Hadfield
E. Graf
...
Fabio Tosi
Hao Wang
Youming Zhang
Yusheng Zhang
Chaoqiang Zhao
MDE
72
21
0
22 Nov 2022
RIC-CNN: Rotation-Invariant Coordinate Convolutional Neural Network
Hanlin Mo
Guoying Zhao
81
34
0
21 Nov 2022
Compositional Scene Modeling with Global Object-Centric Representations
Tonglin Chen
Bin Li
Zhimeng Shen
Xiangyang Xue
OCL
86
2
0
21 Nov 2022
Coarse-Super-Resolution-Fine Network (CoSF-Net): A Unified End-to-End Neural Network for 4D-MRI with Simultaneous Motion Estimation and Super-Resolution
Shaohua Zhi
Yinghui Wang
Haonan Xiao
T. Bai
Yunsong Tang
Bing Li
Chenyang Liu
Wen Li
Tian Li
Jing Cai
65
4
0
21 Nov 2022
Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular Depth Estimation
S. Tomar
Maitreya Suin
A. N. Rajagopalan
ViT
MDE
85
4
0
20 Nov 2022
PointResNet: Residual Network for 3D Point Cloud Segmentation and Classification
Aadesh Desai
Saagar Parikh
S. Kumari
Shanmuganathan Raman
3DPC
3DV
120
2
0
20 Nov 2022
Previous
1
2
3
...
10
11
12
...
56
57
58
Next