1506.02025
Cited By
v3 (latest)
Spatial Transformer Networks
5 June 2015
Max Jaderberg
Karen Simonyan
Andrew Zisserman
Koray Kavukcuoglu
Papers citing "Spatial Transformer Networks"
50 / 2,879 papers shown
Title
DiffPop: Plausibility-Guided Object Placement Diffusion for Image Composition
Jiacheng Liu
Hang Zhou
Shida Wei
Rui Ma
114
4
0
12 Jun 2024
Metric Convolutions: A Unifying Theory to Adaptive Convolutions
Thomas Dagès
M. Lindenbaum
A. Bruckstein
96
1
0
08 Jun 2024
MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation
Ionuţ Grigore
Călin-Adrian Popa
Mamba
MDE
122
1
0
06 Jun 2024
Evaluating Durability: Benchmark Insights into Multimodal Watermarking
Jielin Qiu
William Jongwon Han
Xuandong Zhao
Shangbang Long
Christos Faloutsos
Lei Li
119
1
0
06 Jun 2024
Degrees of Freedom Matter: Inferring Dynamics from Point Trajectories
Yan Zhang
Sergey Prokudin
Marko Mihajlovic
Qianli Ma
Siyu Tang
88
2
0
05 Jun 2024
Region-aware Grasp Framework with Normalized Grasp Space for Efficient 6-DoF Grasping
Siang Chen
Pengwei Xie
Wei Tang
Dingchang Hu
Yixiang Dai
Guijin Wang
104
0
0
03 Jun 2024
Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement
Yung-Hui Lin
Yu-Wen Chang
H. Shih
Takahiro Ogawa
53
0
0
03 Jun 2024
Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry
Takayuki Kanai
Igor Vasiljevic
Vitor Campagnolo Guizilini
Kazuhiro Shintani
MDE
SSL
73
1
0
03 Jun 2024
W-Net: A Facial Feature-Guided Face Super-Resolution Network
Hao Liu
Yang Yang
Yunxia Liu
109
2
0
02 Jun 2024
Infinite 3D Landmarks: Improving Continuous 2D Facial Landmark Detection
Prashanth Chandran
Gaspard Zoss
Paulo F. U. Gotardo
Derek Bradley
CVBM
78
1
0
30 May 2024
Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting
Qi Zhang
Yunfei Gong
Daijie Chen
Antoni B. Chan
Hui-dan Huang
87
4
0
30 May 2024
VisTA-SR: Improving the Accuracy and Resolution of Low-Cost Thermal Imaging Cameras for Agriculture
Heesup Yun
Sassoum Lo
C. Diepenbrock
Brian N Bailey
J. M. Earles
64
1
0
29 May 2024
LLaMA-Reg: Using LLaMA 2 for Unsupervised Medical Image Registration
Mingrui Ma
Yu Yang
LM&MA
74
2
0
29 May 2024
MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution
Wenzhuo Liu
Fei Zhu
Shijie Ma
Cheng-Lin Liu
79
5
0
28 May 2024
DualContrast: Unsupervised Disentangling of Content and Transformations with Implicit Parameterization
M. R. Uddin
Min Xu
159
0
0
27 May 2024
DAFFNet: A Dual Attention Feature Fusion Network for Classification of White Blood Cells
Yuzhuo Chen
Zetong Chen
Yunuo An
Chenyang Lu
Xu Qiao
62
2
0
25 May 2024
Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning
Zhenyu Wei
Yujie He
Zhanchuan Cai
MDE
89
0
0
23 May 2024
Rethink Predicting the Optical Flow with the Kinetics Perspective
Yuhao Cheng
Si-Jia Zhang
Yiqiang Yan
91
0
0
21 May 2024
GuidedRec: Guiding Ill-Posed Unsupervised Volumetric Recovery
A. Cafaro
A. Leroy
Guillaume Beldjoudi
Pauline Maury
Charlotte Robert
Eric Deutsch
Vincent Grégoire
Vincent Lepetit
Nikos Paragios
42
0
0
20 May 2024
Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video Reconstruction
Aryan Garg
Raghav Mallampali
Akshat Joshi
Shrisudhan Govindarajan
Kaushik Mitra
86
0
0
20 May 2024
GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition
Mallika Garg
Debashis Ghosh
P. M. Pradhan
SLR
ViT
85
5
0
18 May 2024
Eddeep: Fast eddy-current distortion correction for diffusion MRI with deep learning
Antoine Legouhy
Ross Callaghan
W. Stee
Philippe Peigneux
H. Azadbakht
Hui Zhang
MedIm
53
0
0
17 May 2024
MrRegNet: Multi-resolution Mask Guided Convolutional Neural Network for Medical Image Registration with Large Deformations
Ruizhe Li
Grazziela Figueredo
Dorothee Auer
Christian Wagner
Xin Chen
MedIm
46
1
0
16 May 2024
Dance Any Beat: Blending Beats with Visuals in Dance Video Generation
Xuanchen Wang
Heng Wang
Dongnan Liu
Weidong Cai
84
5
0
15 May 2024
A Comprehensive Survey on Data Augmentation
Zaitian Wang
Pengfei Wang
Kunpeng Liu
Pengyang Wang
Yanjie Fu
Chang-Tien Lu
Charu Aggarwal
Jian Pei
Yuanchun Zhou
ViT
175
28
0
15 May 2024
Boosting House Price Estimations with Multi-Head Gated Attention
A. Sellam
C. Distante
Abdelmalik Taleb-Ahmed
P. Mazzeo
47
2
0
13 May 2024
Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft
Debabrata Pal
Anvita Singh
Saumya Saumya
Shouvik Das
54
0
0
09 May 2024
Tilt your Head: Activating the Hidden Spatial-Invariance of Classifiers
Johann Schmidt
Sebastian Stober
93
1
0
06 May 2024
Latent Fingerprint Matching via Dense Minutia Descriptor
Zhiyu Pan
Yongjie Duan
Xiongjun Guan
Jianjiang Feng
Jie Zhou
62
4
0
02 May 2024
UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement
Ruiquan Ge
Zhaojie Fang
Pengxue Wei
Zhanghao Chen
Hongyang Jiang
Ahmed Elazab
Wangting Li
Xiang Wan
Shaochong Zhang
Changmiao Wang
MedIm
45
5
0
01 May 2024
Guiding Attention in End-to-End Driving Models
Diego Porres
Yi Xiao
Gabriel Villalonga
Alexandre Levy
Antonio M. López
68
0
0
30 Apr 2024
The Third Monocular Depth Estimation Challenge
Jaime Spencer
Fabio Tosi
Matteo Poggi
Ripudaman Singh Arora
Chris Russell
...
Albert Luginov
Muhammad Shahzad
Seyed Hosseini
Aleksander Trajcevski
James H. Elder
MDE
111
8
0
25 Apr 2024
MonoPCC: Photometric-invariant Cycle Constraint for Monocular Depth Estimation of Endoscopic Images
Zhiwei Wang
Ying Zhou
Shiquan He
Ting Li
Fan Huang
Qiang Ding
Xinxia Feng
Mei Liu
Qiang Li
MDE
90
2
0
25 Apr 2024
Self-Supervised Monocular Depth Estimation in the Dark: Towards Data Distribution Compensation
Haolin Yang
Chaoqiang Zhao
Lu Sheng
Yang Tang
MDE
88
2
0
22 Apr 2024
Socialized Learning: A Survey of the Paradigm Shift for Edge Intelligence in Networked Systems
Xiaofei Wang
Yunfeng Zhao
Chao Qiu
Qinghua Hu
Victor C. M. Leung
89
7
0
20 Apr 2024
Motion-adaptive Separable Collaborative Filters for Blind Motion Deblurring
Chengxu Liu
Xuan Wang
Xiangyu Xu
Ruhao Tian
Shuai Li
Xueming Qian
Ming-Hsuan Yang
107
14
0
19 Apr 2024
State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend
Fei Cui
Jiaojiao Fang
Xiaojiang Wu
Zelong Lai
Mengke Yang
Menghan Jia
Guizhong Liu
59
0
0
17 Apr 2024
Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured
Hanlin Mo
Guoying Zhao
OOD
71
0
0
17 Apr 2024
High-Resolution Detection of Earth Structural Heterogeneities from Seismic Amplitudes using Convolutional Neural Networks with Attention layers
L. Schirmer
Guilherme Gonçalves Schardong
V. Silva
Rogério Santos
Helio Lopes
55
0
0
15 Apr 2024
Differentiable and Stable Long-Range Tracking of Multiple Posterior Modes
Ali Younis
Erik B. Sudderth
61
4
0
12 Apr 2024
VMambaMorph: a Multi-Modality Deformable Image Registration Framework based on Visual State Space Model with Cross-Scan Module
Ziyang Wang
Jian-Qing Zheng
Chao Ma
Tao Guo
Mamba
77
3
0
07 Apr 2024
HiLo: Detailed and Robust 3D Clothed Human Reconstruction with High-and Low-Frequency Information of Parametric Models
Yifan Yang
Dong Liu
Shuhai Zhang
Zeshuai Deng
Zixiong Huang
Mingkui Tan
3DH
93
9
0
07 Apr 2024
MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor and Connector
Junbo Li
Keyan Chen
Gengju Tian
Lu Li
Z. Shi
74
1
0
05 Apr 2024
Unsegment Anything by Simulating Deformation
Jiahao Lu
Xingyi Yang
Xinchao Wang
104
4
0
03 Apr 2024
Entity-Centric Reinforcement Learning for Object Manipulation from Pixels
Dan Haramati
Tal Daniel
Aviv Tamar
LM&Ro
OffRL
OCL
88
14
0
01 Apr 2024
STBA: Towards Evaluating the Robustness of DNNs for Query-Limited Black-box Scenario
Renyang Liu
Kwok-Yan Lam
Wei Zhou
Sixing Wu
Jun Zhao
Dongting Hu
Mingming Gong
AAML
104
0
0
30 Mar 2024
Learned Scanpaths Aid Blind Panoramic Video Quality Assessment
Kanglong Fan
Wen Wen
Mu Li
Yifan Peng
Kede Ma
66
2
0
30 Mar 2024
Neighbor-Environment Observer: An Intelligent Agent for Immersive Working Companionship
Zhe Sun
Qixuan Liang
Meng Wang
Zhenliang Zhang
47
4
0
27 Mar 2024
On permutation-invariant neural networks
Masanari Kimura
Ryotaro Shimizu
Yuki Hirakawa
Ryosuke Goto
Yuki Saito
OOD
AAML
94
12
0
26 Mar 2024
Medical Image Registration and Its Application in Retinal Images: A Review
Qiushi Nie
Xiaoqing Zhang
Yan Hu
Mingdao Gong
Jiang-Dong Liu
96
3
0
25 Mar 2024