ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1506.02025
  4. Cited By
Spatial Transformer Networks
v1v2v3 (latest)

Spatial Transformer Networks

5 June 2015
Max Jaderberg
Karen Simonyan
Andrew Zisserman
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Spatial Transformer Networks"

50 / 2,879 papers shown
Title
Drawing out of Distribution with Neuro-Symbolic Generative Models
Drawing out of Distribution with Neuro-Symbolic Generative Models
Yi-Chuan Liang
J. Tenenbaum
T. Le
N. Siddharth
74
8
0
03 Jun 2022
Transforming medical imaging with Transformers? A comparative review of
  key properties, current progresses, and future perspectives
Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives
Jun Li
Junyu Chen
Yucheng Tang
Ce Wang
Bennett A. Landman
S. K. Zhou
ViTOODMedIm
175
47
0
02 Jun 2022
Leveraging Systematic Knowledge of 2D Transformations
Leveraging Systematic Knowledge of 2D Transformations
Jiachen Kang
W. Jia
Xiangjian He
114
4
0
02 Jun 2022
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining
Pengyuan Lyu
Chengquan Zhang
Shanshan Liu
Meina Qiao
Yangliu Xu
Liang Wu
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
122
43
0
01 Jun 2022
Learning Instance-Specific Augmentations by Capturing Local Invariances
Learning Instance-Specific Augmentations by Capturing Local Invariances
Ning Miao
Tom Rainforth
Emile Mathieu
Yann Dubois
Yee Whye Teh
Adam Foster
Hyunjik Kim
113
12
0
31 May 2022
Unsupervised Image Representation Learning with Deep Latent Particles
Unsupervised Image Representation Learning with Deep Latent Particles
Tal Daniel
Aviv Tamar
OCLSSL
76
12
0
31 May 2022
ViT-BEVSeg: A Hierarchical Transformer Network for Monocular
  Birds-Eye-View Segmentation
ViT-BEVSeg: A Hierarchical Transformer Network for Monocular Birds-Eye-View Segmentation
Pramit Dutta
Ganesh Sistu
S. Yogamani
E. López
J. McDonald
ViT
101
16
0
31 May 2022
MontageGAN: Generation and Assembly of Multiple Components by GANs
MontageGAN: Generation and Assembly of Multiple Components by GANs
Chean Fei Shee
Seiichi Uchida
GAN
43
0
0
31 May 2022
TubeFormer-DeepLab: Video Mask Transformer
TubeFormer-DeepLab: Video Mask Transformer
Dahun Kim
Jun Xie
Huiyu Wang
Siyuan Qiao
Qihang Yu
Hong-Seok Kim
Hartwig Adam
In So Kweon
Liang-Chieh Chen
ViTMedIm
133
42
0
30 May 2022
Few-shot Class-incremental Learning for 3D Point Cloud Objects
Few-shot Class-incremental Learning for 3D Point Cloud Objects
T. Chowdhury
A. Cheraghian
Sameera Ramasinghe
Sahar Ahmadi
Morteza Saberi
Shafin Rahman
3DPC
74
18
0
30 May 2022
The Devil is in the Pose: Ambiguity-free 3D Rotation-invariant Learning
  via Pose-aware Convolution
The Devil is in the Pose: Ambiguity-free 3D Rotation-invariant Learning via Pose-aware Convolution
Ronghan Chen
Yang Cong
3DH
62
19
0
30 May 2022
Efficient Federated Learning with Spike Neural Networks for Traffic Sign
  Recognition
Efficient Federated Learning with Spike Neural Networks for Traffic Sign Recognition
Kan Xie
Zhe Zhang
Bo Li
Jiawen Kang
Dusit Niyato
Shengli Xie
Yi Wu
51
67
0
28 May 2022
Unsupervised Multi-object Segmentation Using Attention and Soft-argmax
Unsupervised Multi-object Segmentation Using Attention and Soft-argmax
Bruno Sauvalle
A. de La Fortelle
3DPC
120
12
0
26 May 2022
Wireless Deep Video Semantic Transmission
Wireless Deep Video Semantic Transmission
Sixian Wang
Jincheng Dai
Zijian Liang
K. Niu
Zhongwei Si
Chao Dong
Xiaoqi Qin
Ping Zhang
3DVDiffM
131
153
0
26 May 2022
Structure Unbiased Adversarial Model for Medical Image Segmentation
Structure Unbiased Adversarial Model for Medical Image Segmentation
Tianyang Zhang
Shaoming Zheng
Jun Cheng
Xi Jia
Joseph Bartlett
Xinxing Cheng
Huazhu Fu
Zhaowen Qiu
Jiang-Dong Liu
Jinming Duan
GANMedIm
66
0
0
25 May 2022
Real-Time Video Deblurring via Lightweight Motion Compensation
Real-Time Video Deblurring via Lightweight Motion Compensation
Hyeongseok Son
Junyong Lee
Sunghyun Cho
Seungyong Lee
48
6
0
25 May 2022
Unsupervised Misaligned Infrared and Visible Image Fusion via
  Cross-Modality Image Generation and Registration
Unsupervised Misaligned Infrared and Visible Image Fusion via Cross-Modality Image Generation and Registration
Di Wang
Jinyuan Liu
Xin-Yue Fan
Risheng Liu
143
184
0
24 May 2022
Unsupervised Difference Learning for Noisy Rigid Image Alignment
Unsupervised Difference Learning for Noisy Rigid Image Alignment
Yu-Xuan Chen
Dagan Feng
Hongbin Shen
57
0
0
24 May 2022
Improving Shape Awareness and Interpretability in Deep Networks Using
  Geometric Moments
Improving Shape Awareness and Interpretability in Deep Networks Using Geometric Moments
Rajhans Singh
Ankita Shukla
Pavan Turaga
93
7
0
24 May 2022
Learning Muti-expert Distribution Calibration for Long-tailed Video
  Classification
Learning Muti-expert Distribution Calibration for Long-tailed Video Classification
Yufan Hu
Junyu Gao
Changsheng Xu
54
5
0
22 May 2022
A comprehensive survey on semantic facial attribute editing using
  generative adversarial networks
A comprehensive survey on semantic facial attribute editing using generative adversarial networks
A. Nickabadi
Maryam Saeedi Fard
Nastaran Moradzadeh Farid
Najmeh Mohammadbagheri
CVBMGANEGVM
84
9
0
21 May 2022
Fine-Grained Visual Classification using Self Assessment Classifier
Fine-Grained Visual Classification using Self Assessment Classifier
Tuong Khanh Long Do
Huy Tran
Erman Tjiputra
Quang-Dieu Tran
Anh Nguyen
131
14
0
21 May 2022
Diversity vs. Recognizability: Human-like generalization in one-shot
  generative models
Diversity vs. Recognizability: Human-like generalization in one-shot generative models
Victor Boutin
Lakshya Singhal
Xavier Thomas
Thomas Serre
75
8
0
20 May 2022
The AI Mechanic: Acoustic Vehicle Characterization Neural Networks
The AI Mechanic: Acoustic Vehicle Characterization Neural Networks
Adam M. Terwilliger
J. Siegel
62
2
0
19 May 2022
GeoPointGAN: Synthetic Spatial Data with Local Label Differential
  Privacy
GeoPointGAN: Synthetic Spatial Data with Local Label Differential Privacy
Teddy Cunningham
Konstantin Klemmer
Hongkai Wen
Hakan Ferhatosmanoglu
67
12
0
18 May 2022
Visual Attention-based Self-supervised Absolute Depth Estimation using
  Geometric Priors in Autonomous Driving
Visual Attention-based Self-supervised Absolute Depth Estimation using Geometric Priors in Autonomous Driving
Jie Xiang
Yun Wang
Lifeng An
Haiyang Liu
Zijun Wang
Jian Liu
MDE
91
17
0
18 May 2022
GraphMapper: Efficient Visual Navigation by Scene Graph Generation
GraphMapper: Efficient Visual Navigation by Scene Graph Generation
Zachary Seymour
Niluthpol Chowdhury Mithun
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
101
8
0
17 May 2022
Dimensionality Reduced Training by Pruning and Freezing Parts of a Deep
  Neural Network, a Survey
Dimensionality Reduced Training by Pruning and Freezing Parts of a Deep Neural Network, a Survey
Paul Wimmer
Jens Mehnert
Alexandru Paul Condurache
DD
98
21
0
17 May 2022
Incorporating Prior Knowledge into Neural Networks through an Implicit
  Composite Kernel
Incorporating Prior Knowledge into Neural Networks through an Implicit Composite Kernel
Ziyang Jiang
Tongshu Zheng
Yiling Liu
David Carlson
73
4
0
15 May 2022
Building Facade Parsing R-CNN
Building Facade Parsing R-CNN
Sijie Wang
Qiyu Kang
Rui She
Wee Peng Tay
Diego Navarro Navarro
Andreas Hartmannsgruber
24
0
0
12 May 2022
Image2Gif: Generating Continuous Realistic Animations with Warping NODEs
Image2Gif: Generating Continuous Realistic Animations with Warping NODEs
Jurijs Nazarovs
Z. Huang
15
0
0
09 May 2022
GenISP: Neural ISP for Low-Light Machine Cognition
GenISP: Neural ISP for Low-Light Machine Cognition
Igor Morawski
Yu-An Chen
Yu-sheng Lin
Shusil Dangi
Kai He
Winston H. Hsu
VLM
56
22
0
07 May 2022
From Easy to Hard: Learning Language-guided Curriculum for Visual
  Question Answering on Remote Sensing Data
From Easy to Hard: Learning Language-guided Curriculum for Visual Question Answering on Remote Sensing Data
Zhenghang Yuan
Lichao Mou
Q. Wang
Xiao Xiang Zhu
105
67
0
06 May 2022
Revisiting Pretraining for Semi-Supervised Learning in the Low-Label
  Regime
Revisiting Pretraining for Semi-Supervised Learning in the Low-Label Regime
Xun Xu
Jingyi Liao
Lile Cai
M. Nguyen
Kangkang Lu
Wanyue Zhang
Yasin Yazici
Chuan-Sheng Foo
133
6
0
06 May 2022
DeepPortraitDrawing: Generating Human Body Images from Freehand Sketches
DeepPortraitDrawing: Generating Human Body Images from Freehand Sketches
X. Wu
Chen Wang
Hongbo Fu
Ariel Shamir
Song-Hai Zhang
Shimin Hu
3DH
74
8
0
04 May 2022
Cross-View Cross-Scene Multi-View Crowd Counting
Cross-View Cross-Scene Multi-View Crowd Counting
Qi Zhang
Wei Lin
Antoni B. Chan
89
63
0
03 May 2022
LayoutBERT: Masked Language Layout Model for Object Insertion
LayoutBERT: Masked Language Layout Model for Object Insertion
Kerem Turgutlu
Sanatan Sharma
J. Kumar
VLMDiffM
126
2
0
30 Apr 2022
C3-STISR: Scene Text Image Super-resolution with Triple Clues
C3-STISR: Scene Text Image Super-resolution with Triple Clues
Minyi Zhao
Miaosen Wang
Fan Bai
Bingjia Li
Jie Wang
Shuigeng Zhou
71
34
0
29 Apr 2022
KING: Generating Safety-Critical Driving Scenarios for Robust Imitation
  via Kinematics Gradients
KING: Generating Safety-Critical Driving Scenarios for Robust Imitation via Kinematics Gradients
Niklas Hanselmann
Katrin Renz
Kashyap Chitta
Apratim Bhattacharyya
Andreas Geiger
102
91
0
28 Apr 2022
Learning to Extract Building Footprints from Off-Nadir Aerial Images
Learning to Extract Building Footprints from Off-Nadir Aerial Images
Jinwang Wang
Lingxuan Meng
Weijia Li
Wen Yang
Lei Yu
Guisong Xia
96
37
0
28 Apr 2022
Symmetric Transformer-based Network for Unsupervised Image Registration
Symmetric Transformer-based Network for Unsupervised Image Registration
Mingrui Ma
Lei Song
Yuanbo Xu
Gui-Xian Liu
ViTMedIm
76
37
0
28 Apr 2022
Estimating the Resize Parameter in End-to-end Learned Image Compression
Estimating the Resize Parameter in End-to-end Learned Image Compression
Li-Heng Chen
C. Bampis
Zhi Li
Lukávs Krasula
A. Bovik
72
4
0
26 Apr 2022
Transformation Invariant Cancerous Tissue Classification Using Spatially
  Transformed DenseNet
Transformation Invariant Cancerous Tissue Classification Using Spatially Transformed DenseNet
Omar Mahdi
Ali Bou Nassif
MedIm
26
2
0
23 Apr 2022
Future Object Detection with Spatiotemporal Transformers
Future Object Detection with Spatiotemporal Transformers
Adam Tonderski
Joakim Johnander
Christoffer Petersson
Kalle AAstrom
ViT
67
1
0
21 Apr 2022
Self-Supervised Equivariant Learning for Oriented Keypoint Detection
Self-Supervised Equivariant Learning for Oriented Keypoint Detection
Jongmin Lee
Byung-soo Kim
Minsu Cho
3DPC
114
37
0
19 Apr 2022
2D Human Pose Estimation: A Survey
2D Human Pose Estimation: A Survey
Haoming Chen
Runyang Feng
Sifan Wu
Hao Xu
F. Zhou
Zhenguang Liu
3DH
100
58
0
15 Apr 2022
Points to Patches: Enabling the Use of Self-Attention for 3D Shape
  Recognition
Points to Patches: Enabling the Use of Self-Attention for 3D Shape Recognition
Axel Berg
Magnus Oskarsson
Mark O'Connor
3DPCViT
82
27
0
08 Apr 2022
Aesthetic Text Logo Synthesis via Content-aware Layout Inferring
Aesthetic Text Logo Synthesis via Content-aware Layout Inferring
Yizhi Wang
Guo Pu
Wenhan Luo
Yexin Wang
Pengfei Xiong
Hongwen Kang
Zheng Lian
DiffM
85
26
0
06 Apr 2022
MixFormer: Mixing Features across Windows and Dimensions
MixFormer: Mixing Features across Windows and Dimensions
Qiang Chen
Qiman Wu
Jian Wang
Qinghao Hu
T. Hu
Errui Ding
Jian Cheng
Jingdong Wang
MDEViT
88
109
0
06 Apr 2022
OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses
OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses
Robik Shrestha
Kushal Kafle
Christopher Kanan
CML
102
13
0
05 Apr 2022
Previous
123...141516...565758
Next