ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1506.02025
  4. Cited By
Spatial Transformer Networks
v1v2v3 (latest)

Spatial Transformer Networks

5 June 2015
Max Jaderberg
Karen Simonyan
Andrew Zisserman
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Spatial Transformer Networks"

50 / 2,879 papers shown
Title
DiffPop: Plausibility-Guided Object Placement Diffusion for Image
  Composition
DiffPop: Plausibility-Guided Object Placement Diffusion for Image Composition
Jiacheng Liu
Hang Zhou
Shida Wei
Rui Ma
114
4
0
12 Jun 2024
Metric Convolutions: A Unifying Theory to Adaptive Convolutions
Metric Convolutions: A Unifying Theory to Adaptive Convolutions
Thomas Dagès
M. Lindenbaum
A. Bruckstein
96
1
0
08 Jun 2024
MambaDepth: Enhancing Long-range Dependency for Self-Supervised
  Fine-Structured Monocular Depth Estimation
MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation
Ionuţ Grigore
Călin-Adrian Popa
MambaMDE
122
1
0
06 Jun 2024
Evaluating Durability: Benchmark Insights into Multimodal Watermarking
Evaluating Durability: Benchmark Insights into Multimodal Watermarking
Jielin Qiu
William Jongwon Han
Xuandong Zhao
Shangbang Long
Christos Faloutsos
Lei Li
119
1
0
06 Jun 2024
Degrees of Freedom Matter: Inferring Dynamics from Point Trajectories
Degrees of Freedom Matter: Inferring Dynamics from Point Trajectories
Yan Zhang
Sergey Prokudin
Marko Mihajlovic
Qianli Ma
Siyu Tang
88
2
0
05 Jun 2024
Region-aware Grasp Framework with Normalized Grasp Space for Efficient
  6-DoF Grasping
Region-aware Grasp Framework with Normalized Grasp Space for Efficient 6-DoF Grasping
Siang Chen
Pengwei Xie
Wei Tang
Dingchang Hu
Yixiang Dai
Guijin Wang
104
0
0
03 Jun 2024
Generalized Jersey Number Recognition Using Multi-task Learning With
  Orientation-guided Weight Refinement
Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement
Yung-Hui Lin
Yu-Wen Chang
H. Shih
Takahiro Ogawa
53
0
0
03 Jun 2024
Self-Supervised Geometry-Guided Initialization for Robust Monocular
  Visual Odometry
Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry
Takayuki Kanai
Igor Vasiljevic
Vitor Campagnolo Guizilini
Kazuhiro Shintani
MDESSL
73
1
0
03 Jun 2024
W-Net: A Facial Feature-Guided Face Super-Resolution Network
W-Net: A Facial Feature-Guided Face Super-Resolution Network
Hao Liu
Yang Yang
Yunxia Liu
109
2
0
02 Jun 2024
Infinite 3D Landmarks: Improving Continuous 2D Facial Landmark Detection
Infinite 3D Landmarks: Improving Continuous 2D Facial Landmark Detection
Prashanth Chandran
Gaspard Zoss
Paulo F. U. Gotardo
Derek Bradley
CVBM
78
1
0
30 May 2024
Multi-View People Detection in Large Scenes via Supervised View-Wise
  Contribution Weighting
Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting
Qi Zhang
Yunfei Gong
Daijie Chen
Antoni B. Chan
Hui-dan Huang
87
4
0
30 May 2024
VisTA-SR: Improving the Accuracy and Resolution of Low-Cost Thermal
  Imaging Cameras for Agriculture
VisTA-SR: Improving the Accuracy and Resolution of Low-Cost Thermal Imaging Cameras for Agriculture
Heesup Yun
Sassoum Lo
C. Diepenbrock
Brian N Bailey
J. M. Earles
64
1
0
29 May 2024
LLaMA-Reg: Using LLaMA 2 for Unsupervised Medical Image Registration
LLaMA-Reg: Using LLaMA 2 for Unsupervised Medical Image Registration
Mingrui Ma
Yu Yang
LM&MA
74
2
0
29 May 2024
MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any
  Resolution
MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution
Wenzhuo Liu
Fei Zhu
Shijie Ma
Cheng-Lin Liu
79
5
0
28 May 2024
DualContrast: Unsupervised Disentangling of Content and Transformations with Implicit Parameterization
DualContrast: Unsupervised Disentangling of Content and Transformations with Implicit Parameterization
M. R. Uddin
Min Xu
159
0
0
27 May 2024
DAFFNet: A Dual Attention Feature Fusion Network for Classification of
  White Blood Cells
DAFFNet: A Dual Attention Feature Fusion Network for Classification of White Blood Cells
Yuzhuo Chen
Zetong Chen
Yunuo An
Chenyang Lu
Xu Qiao
62
2
0
25 May 2024
Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation
  Learning
Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning
Zhenyu Wei
Yujie He
Zhanchuan Cai
MDE
89
0
0
23 May 2024
Rethink Predicting the Optical Flow with the Kinetics Perspective
Rethink Predicting the Optical Flow with the Kinetics Perspective
Yuhao Cheng
Si-Jia Zhang
Yiqiang Yan
91
0
0
21 May 2024
GuidedRec: Guiding Ill-Posed Unsupervised Volumetric Recovery
GuidedRec: Guiding Ill-Posed Unsupervised Volumetric Recovery
A. Cafaro
A. Leroy
Guillaume Beldjoudi
Pauline Maury
Charlotte Robert
Eric Deutsch
Vincent Grégoire
Vincent Lepetit
Nikos Paragios
42
0
0
20 May 2024
Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field
  Video Reconstruction
Stereo-Knowledge Distillation from dpMV to Dual Pixels for Light Field Video Reconstruction
Aryan Garg
Raghav Mallampali
Akshat Joshi
Shrisudhan Govindarajan
Kaushik Mitra
86
0
0
20 May 2024
GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic
  Hand Gesture Recognition
GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition
Mallika Garg
Debashis Ghosh
P. M. Pradhan
SLRViT
85
5
0
18 May 2024
Eddeep: Fast eddy-current distortion correction for diffusion MRI with deep learning
Eddeep: Fast eddy-current distortion correction for diffusion MRI with deep learning
Antoine Legouhy
Ross Callaghan
W. Stee
Philippe Peigneux
H. Azadbakht
Hui Zhang
MedIm
53
0
0
17 May 2024
MrRegNet: Multi-resolution Mask Guided Convolutional Neural Network for
  Medical Image Registration with Large Deformations
MrRegNet: Multi-resolution Mask Guided Convolutional Neural Network for Medical Image Registration with Large Deformations
Ruizhe Li
Grazziela Figueredo
Dorothee Auer
Christian Wagner
Xin Chen
MedIm
46
1
0
16 May 2024
Dance Any Beat: Blending Beats with Visuals in Dance Video Generation
Dance Any Beat: Blending Beats with Visuals in Dance Video Generation
Xuanchen Wang
Heng Wang
Dongnan Liu
Weidong Cai
84
5
0
15 May 2024
A Comprehensive Survey on Data Augmentation
A Comprehensive Survey on Data Augmentation
Zaitian Wang
Pengfei Wang
Kunpeng Liu
Pengyang Wang
Yanjie Fu
Chang-Tien Lu
Charu Aggarwal
Jian Pei
Yuanchun Zhou
ViT
175
28
0
15 May 2024
Boosting House Price Estimations with Multi-Head Gated Attention
Boosting House Price Estimations with Multi-Head Gated Attention
A. Sellam
C. Distante
Abdelmalik Taleb-Ahmed
P. Mazzeo
47
2
0
13 May 2024
Vision-Language Modeling with Regularized Spatial Transformer Networks
  for All Weather Crosswind Landing of Aircraft
Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft
Debabrata Pal
Anvita Singh
Saumya Saumya
Shouvik Das
54
0
0
09 May 2024
Tilt your Head: Activating the Hidden Spatial-Invariance of Classifiers
Tilt your Head: Activating the Hidden Spatial-Invariance of Classifiers
Johann Schmidt
Sebastian Stober
93
1
0
06 May 2024
Latent Fingerprint Matching via Dense Minutia Descriptor
Latent Fingerprint Matching via Dense Minutia Descriptor
Zhiyu Pan
Yongjie Duan
Xiongjun Guan
Jianjiang Feng
Jie Zhou
62
4
0
02 May 2024
UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via
  Multi-scale Generation and Registration Enhancement
UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement
Ruiquan Ge
Zhaojie Fang
Pengxue Wei
Zhanghao Chen
Hongyang Jiang
Ahmed Elazab
Wangting Li
Xiang Wan
Shaochong Zhang
Changmiao Wang
MedIm
45
5
0
01 May 2024
Guiding Attention in End-to-End Driving Models
Guiding Attention in End-to-End Driving Models
Diego Porres
Yi Xiao
Gabriel Villalonga
Alexandre Levy
Antonio M. López
68
0
0
30 Apr 2024
The Third Monocular Depth Estimation Challenge
The Third Monocular Depth Estimation Challenge
Jaime Spencer
Fabio Tosi
Matteo Poggi
Ripudaman Singh Arora
Chris Russell
...
Albert Luginov
Muhammad Shahzad
Seyed Hosseini
Aleksander Trajcevski
James H. Elder
MDE
111
8
0
25 Apr 2024
MonoPCC: Photometric-invariant Cycle Constraint for Monocular Depth
  Estimation of Endoscopic Images
MonoPCC: Photometric-invariant Cycle Constraint for Monocular Depth Estimation of Endoscopic Images
Zhiwei Wang
Ying Zhou
Shiquan He
Ting Li
Fan Huang
Qiang Ding
Xinxia Feng
Mei Liu
Qiang Li
MDE
90
2
0
25 Apr 2024
Self-Supervised Monocular Depth Estimation in the Dark: Towards Data
  Distribution Compensation
Self-Supervised Monocular Depth Estimation in the Dark: Towards Data Distribution Compensation
Haolin Yang
Chaoqiang Zhao
Lu Sheng
Yang Tang
MDE
88
2
0
22 Apr 2024
Socialized Learning: A Survey of the Paradigm Shift for Edge
  Intelligence in Networked Systems
Socialized Learning: A Survey of the Paradigm Shift for Edge Intelligence in Networked Systems
Xiaofei Wang
Yunfeng Zhao
Chao Qiu
Qinghua Hu
Victor C. M. Leung
89
7
0
20 Apr 2024
Motion-adaptive Separable Collaborative Filters for Blind Motion
  Deblurring
Motion-adaptive Separable Collaborative Filters for Blind Motion Deblurring
Chengxu Liu
Xuan Wang
Xiangyu Xu
Ruhao Tian
Shuai Li
Xueming Qian
Ming-Hsuan Yang
107
14
0
19 Apr 2024
State-space Decomposition Model for Video Prediction Considering
  Long-term Motion Trend
State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend
Fei Cui
Jiaojiao Fang
Xiaojiang Wu
Zelong Lai
Mengke Yang
Menghan Jia
Guizhong Liu
59
0
0
17 Apr 2024
Achieving Rotation Invariance in Convolution Operations: Shifting from
  Data-Driven to Mechanism-Assured
Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured
Hanlin Mo
Guoying Zhao
OOD
71
0
0
17 Apr 2024
High-Resolution Detection of Earth Structural Heterogeneities from
  Seismic Amplitudes using Convolutional Neural Networks with Attention layers
High-Resolution Detection of Earth Structural Heterogeneities from Seismic Amplitudes using Convolutional Neural Networks with Attention layers
L. Schirmer
Guilherme Gonçalves Schardong
V. Silva
Rogério Santos
Helio Lopes
55
0
0
15 Apr 2024
Differentiable and Stable Long-Range Tracking of Multiple Posterior
  Modes
Differentiable and Stable Long-Range Tracking of Multiple Posterior Modes
Ali Younis
Erik B. Sudderth
61
4
0
12 Apr 2024
VMambaMorph: a Multi-Modality Deformable Image Registration Framework
  based on Visual State Space Model with Cross-Scan Module
VMambaMorph: a Multi-Modality Deformable Image Registration Framework based on Visual State Space Model with Cross-Scan Module
Ziyang Wang
Jian-Qing Zheng
Chao Ma
Tao Guo
Mamba
77
3
0
07 Apr 2024
HiLo: Detailed and Robust 3D Clothed Human Reconstruction with High-and
  Low-Frequency Information of Parametric Models
HiLo: Detailed and Robust 3D Clothed Human Reconstruction with High-and Low-Frequency Information of Parametric Models
Yifan Yang
Dong Liu
Shuhai Zhang
Zeshuai Deng
Zixiong Huang
Mingkui Tan
3DH
93
9
0
07 Apr 2024
MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor
  and Connector
MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor and Connector
Junbo Li
Keyan Chen
Gengju Tian
Lu Li
Z. Shi
74
1
0
05 Apr 2024
Unsegment Anything by Simulating Deformation
Unsegment Anything by Simulating Deformation
Jiahao Lu
Xingyi Yang
Xinchao Wang
104
4
0
03 Apr 2024
Entity-Centric Reinforcement Learning for Object Manipulation from
  Pixels
Entity-Centric Reinforcement Learning for Object Manipulation from Pixels
Dan Haramati
Tal Daniel
Aviv Tamar
LM&RoOffRLOCL
88
14
0
01 Apr 2024
STBA: Towards Evaluating the Robustness of DNNs for Query-Limited
  Black-box Scenario
STBA: Towards Evaluating the Robustness of DNNs for Query-Limited Black-box Scenario
Renyang Liu
Kwok-Yan Lam
Wei Zhou
Sixing Wu
Jun Zhao
Dongting Hu
Mingming Gong
AAML
104
0
0
30 Mar 2024
Learned Scanpaths Aid Blind Panoramic Video Quality Assessment
Learned Scanpaths Aid Blind Panoramic Video Quality Assessment
Kanglong Fan
Wen Wen
Mu Li
Yifan Peng
Kede Ma
66
2
0
30 Mar 2024
Neighbor-Environment Observer: An Intelligent Agent for Immersive
  Working Companionship
Neighbor-Environment Observer: An Intelligent Agent for Immersive Working Companionship
Zhe Sun
Qixuan Liang
Meng Wang
Zhenliang Zhang
47
4
0
27 Mar 2024
On permutation-invariant neural networks
On permutation-invariant neural networks
Masanari Kimura
Ryotaro Shimizu
Yuki Hirakawa
Ryosuke Goto
Yuki Saito
OODAAML
94
12
0
26 Mar 2024
Medical Image Registration and Its Application in Retinal Images: A
  Review
Medical Image Registration and Its Application in Retinal Images: A Review
Qiushi Nie
Xiaoqing Zhang
Yan Hu
Mingdao Gong
Jiang-Dong Liu
96
3
0
25 Mar 2024
Previous
12345...565758
Next