Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.02025
Cited By
v1
v2
v3 (latest)
Spatial Transformer Networks
5 June 2015
Max Jaderberg
Karen Simonyan
Andrew Zisserman
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Spatial Transformer Networks"
50 / 2,879 papers shown
Title
Drawing out of Distribution with Neuro-Symbolic Generative Models
Yi-Chuan Liang
J. Tenenbaum
T. Le
N. Siddharth
74
8
0
03 Jun 2022
Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives
Jun Li
Junyu Chen
Yucheng Tang
Ce Wang
Bennett A. Landman
S. K. Zhou
ViT
OOD
MedIm
175
47
0
02 Jun 2022
Leveraging Systematic Knowledge of 2D Transformations
Jiachen Kang
W. Jia
Xiangjian He
114
4
0
02 Jun 2022
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Pengyuan Lyu
Chengquan Zhang
Shanshan Liu
Meina Qiao
Yangliu Xu
Liang Wu
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
122
43
0
01 Jun 2022
Learning Instance-Specific Augmentations by Capturing Local Invariances
Ning Miao
Tom Rainforth
Emile Mathieu
Yann Dubois
Yee Whye Teh
Adam Foster
Hyunjik Kim
113
12
0
31 May 2022
Unsupervised Image Representation Learning with Deep Latent Particles
Tal Daniel
Aviv Tamar
OCL
SSL
76
12
0
31 May 2022
ViT-BEVSeg: A Hierarchical Transformer Network for Monocular Birds-Eye-View Segmentation
Pramit Dutta
Ganesh Sistu
S. Yogamani
E. López
J. McDonald
ViT
101
16
0
31 May 2022
MontageGAN: Generation and Assembly of Multiple Components by GANs
Chean Fei Shee
Seiichi Uchida
GAN
43
0
0
31 May 2022
TubeFormer-DeepLab: Video Mask Transformer
Dahun Kim
Jun Xie
Huiyu Wang
Siyuan Qiao
Qihang Yu
Hong-Seok Kim
Hartwig Adam
In So Kweon
Liang-Chieh Chen
ViT
MedIm
133
42
0
30 May 2022
Few-shot Class-incremental Learning for 3D Point Cloud Objects
T. Chowdhury
A. Cheraghian
Sameera Ramasinghe
Sahar Ahmadi
Morteza Saberi
Shafin Rahman
3DPC
74
18
0
30 May 2022
The Devil is in the Pose: Ambiguity-free 3D Rotation-invariant Learning via Pose-aware Convolution
Ronghan Chen
Yang Cong
3DH
62
19
0
30 May 2022
Efficient Federated Learning with Spike Neural Networks for Traffic Sign Recognition
Kan Xie
Zhe Zhang
Bo Li
Jiawen Kang
Dusit Niyato
Shengli Xie
Yi Wu
51
67
0
28 May 2022
Unsupervised Multi-object Segmentation Using Attention and Soft-argmax
Bruno Sauvalle
A. de La Fortelle
3DPC
120
12
0
26 May 2022
Wireless Deep Video Semantic Transmission
Sixian Wang
Jincheng Dai
Zijian Liang
K. Niu
Zhongwei Si
Chao Dong
Xiaoqi Qin
Ping Zhang
3DV
DiffM
131
153
0
26 May 2022
Structure Unbiased Adversarial Model for Medical Image Segmentation
Tianyang Zhang
Shaoming Zheng
Jun Cheng
Xi Jia
Joseph Bartlett
Xinxing Cheng
Huazhu Fu
Zhaowen Qiu
Jiang-Dong Liu
Jinming Duan
GAN
MedIm
66
0
0
25 May 2022
Real-Time Video Deblurring via Lightweight Motion Compensation
Hyeongseok Son
Junyong Lee
Sunghyun Cho
Seungyong Lee
48
6
0
25 May 2022
Unsupervised Misaligned Infrared and Visible Image Fusion via Cross-Modality Image Generation and Registration
Di Wang
Jinyuan Liu
Xin-Yue Fan
Risheng Liu
143
184
0
24 May 2022
Unsupervised Difference Learning for Noisy Rigid Image Alignment
Yu-Xuan Chen
Dagan Feng
Hongbin Shen
57
0
0
24 May 2022
Improving Shape Awareness and Interpretability in Deep Networks Using Geometric Moments
Rajhans Singh
Ankita Shukla
Pavan Turaga
93
7
0
24 May 2022
Learning Muti-expert Distribution Calibration for Long-tailed Video Classification
Yufan Hu
Junyu Gao
Changsheng Xu
54
5
0
22 May 2022
A comprehensive survey on semantic facial attribute editing using generative adversarial networks
A. Nickabadi
Maryam Saeedi Fard
Nastaran Moradzadeh Farid
Najmeh Mohammadbagheri
CVBM
GAN
EGVM
84
9
0
21 May 2022
Fine-Grained Visual Classification using Self Assessment Classifier
Tuong Khanh Long Do
Huy Tran
Erman Tjiputra
Quang-Dieu Tran
Anh Nguyen
131
14
0
21 May 2022
Diversity vs. Recognizability: Human-like generalization in one-shot generative models
Victor Boutin
Lakshya Singhal
Xavier Thomas
Thomas Serre
75
8
0
20 May 2022
The AI Mechanic: Acoustic Vehicle Characterization Neural Networks
Adam M. Terwilliger
J. Siegel
62
2
0
19 May 2022
GeoPointGAN: Synthetic Spatial Data with Local Label Differential Privacy
Teddy Cunningham
Konstantin Klemmer
Hongkai Wen
Hakan Ferhatosmanoglu
67
12
0
18 May 2022
Visual Attention-based Self-supervised Absolute Depth Estimation using Geometric Priors in Autonomous Driving
Jie Xiang
Yun Wang
Lifeng An
Haiyang Liu
Zijun Wang
Jian Liu
MDE
91
17
0
18 May 2022
GraphMapper: Efficient Visual Navigation by Scene Graph Generation
Zachary Seymour
Niluthpol Chowdhury Mithun
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
101
8
0
17 May 2022
Dimensionality Reduced Training by Pruning and Freezing Parts of a Deep Neural Network, a Survey
Paul Wimmer
Jens Mehnert
Alexandru Paul Condurache
DD
98
21
0
17 May 2022
Incorporating Prior Knowledge into Neural Networks through an Implicit Composite Kernel
Ziyang Jiang
Tongshu Zheng
Yiling Liu
David Carlson
73
4
0
15 May 2022
Building Facade Parsing R-CNN
Sijie Wang
Qiyu Kang
Rui She
Wee Peng Tay
Diego Navarro Navarro
Andreas Hartmannsgruber
24
0
0
12 May 2022
Image2Gif: Generating Continuous Realistic Animations with Warping NODEs
Jurijs Nazarovs
Z. Huang
15
0
0
09 May 2022
GenISP: Neural ISP for Low-Light Machine Cognition
Igor Morawski
Yu-An Chen
Yu-sheng Lin
Shusil Dangi
Kai He
Winston H. Hsu
VLM
56
22
0
07 May 2022
From Easy to Hard: Learning Language-guided Curriculum for Visual Question Answering on Remote Sensing Data
Zhenghang Yuan
Lichao Mou
Q. Wang
Xiao Xiang Zhu
105
67
0
06 May 2022
Revisiting Pretraining for Semi-Supervised Learning in the Low-Label Regime
Xun Xu
Jingyi Liao
Lile Cai
M. Nguyen
Kangkang Lu
Wanyue Zhang
Yasin Yazici
Chuan-Sheng Foo
133
6
0
06 May 2022
DeepPortraitDrawing: Generating Human Body Images from Freehand Sketches
X. Wu
Chen Wang
Hongbo Fu
Ariel Shamir
Song-Hai Zhang
Shimin Hu
3DH
74
8
0
04 May 2022
Cross-View Cross-Scene Multi-View Crowd Counting
Qi Zhang
Wei Lin
Antoni B. Chan
89
63
0
03 May 2022
LayoutBERT: Masked Language Layout Model for Object Insertion
Kerem Turgutlu
Sanatan Sharma
J. Kumar
VLM
DiffM
126
2
0
30 Apr 2022
C3-STISR: Scene Text Image Super-resolution with Triple Clues
Minyi Zhao
Miaosen Wang
Fan Bai
Bingjia Li
Jie Wang
Shuigeng Zhou
71
34
0
29 Apr 2022
KING: Generating Safety-Critical Driving Scenarios for Robust Imitation via Kinematics Gradients
Niklas Hanselmann
Katrin Renz
Kashyap Chitta
Apratim Bhattacharyya
Andreas Geiger
102
91
0
28 Apr 2022
Learning to Extract Building Footprints from Off-Nadir Aerial Images
Jinwang Wang
Lingxuan Meng
Weijia Li
Wen Yang
Lei Yu
Guisong Xia
96
37
0
28 Apr 2022
Symmetric Transformer-based Network for Unsupervised Image Registration
Mingrui Ma
Lei Song
Yuanbo Xu
Gui-Xian Liu
ViT
MedIm
76
37
0
28 Apr 2022
Estimating the Resize Parameter in End-to-end Learned Image Compression
Li-Heng Chen
C. Bampis
Zhi Li
Lukávs Krasula
A. Bovik
72
4
0
26 Apr 2022
Transformation Invariant Cancerous Tissue Classification Using Spatially Transformed DenseNet
Omar Mahdi
Ali Bou Nassif
MedIm
26
2
0
23 Apr 2022
Future Object Detection with Spatiotemporal Transformers
Adam Tonderski
Joakim Johnander
Christoffer Petersson
Kalle AAstrom
ViT
67
1
0
21 Apr 2022
Self-Supervised Equivariant Learning for Oriented Keypoint Detection
Jongmin Lee
Byung-soo Kim
Minsu Cho
3DPC
114
37
0
19 Apr 2022
2D Human Pose Estimation: A Survey
Haoming Chen
Runyang Feng
Sifan Wu
Hao Xu
F. Zhou
Zhenguang Liu
3DH
100
58
0
15 Apr 2022
Points to Patches: Enabling the Use of Self-Attention for 3D Shape Recognition
Axel Berg
Magnus Oskarsson
Mark O'Connor
3DPC
ViT
82
27
0
08 Apr 2022
Aesthetic Text Logo Synthesis via Content-aware Layout Inferring
Yizhi Wang
Guo Pu
Wenhan Luo
Yexin Wang
Pengfei Xiong
Hongwen Kang
Zheng Lian
DiffM
85
26
0
06 Apr 2022
MixFormer: Mixing Features across Windows and Dimensions
Qiang Chen
Qiman Wu
Jian Wang
Qinghao Hu
T. Hu
Errui Ding
Jian Cheng
Jingdong Wang
MDE
ViT
88
109
0
06 Apr 2022
OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses
Robik Shrestha
Kushal Kafle
Christopher Kanan
CML
102
13
0
05 Apr 2022
Previous
1
2
3
...
14
15
16
...
56
57
58
Next