1512.00567
Cited By
v1
v2
v3 (latest)
Rethinking the Inception Architecture for Computer Vision
2 December 2015
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DV
BDL
Papers citing
"Rethinking the Inception Architecture for Computer Vision"
50 / 6,586 papers shown
Federated Learning for Medical Image Classification: A Comprehensive Benchmark
Zhekai Zhou
Guibo Luo
Mingzhi Chen
Zhenyu Weng
Yuesheng Zhu
FedML
59
1
0
07 Apr 2025
PartStickers: Generating Parts of Objects for Rapid Prototyping
Mo Zhou
Josh Myers-Dean
Danna Gurari
102
0
0
07 Apr 2025
Generative Adversarial Networks with Limited Data: A Survey and Benchmarking
Omar de Mitri
Ruyu Wang
Marco F. Huber
102
0
0
07 Apr 2025
Your Image Generator Is Your New Private Dataset
Nicolo Resmini
Eugenio Lomurno
Cristian Sbrolli
Matteo Matteucci
122
0
0
06 Apr 2025
Loss Functions in Deep Learning: A Comprehensive Review
Omar Elharrouss
Yasir Mahmood
Yassine Bechqito
Mohamed Adel Serhani
E. Badidi
Jamal Riffi
Hamid Tairi
128
0
0
05 Apr 2025
Embedding Hidden Adversarial Capabilities in Pre-Trained Diffusion Models
Lucas Beerens
D. Higham
DiffM
WIGM
91
0
0
05 Apr 2025
Mapping at First Sense: A Lightweight Neural Network-Based Indoor Structures Prediction Method for Robot Autonomous Exploration
Haojia Gao
Haohua Que
Kunrong Li
Weihao Shan
Mingkai Liu
Rong Zhao
Lei Mu
Xinghua Yang
Qi Wei
Fei Qiao
64
0
0
05 Apr 2025
Detecting underdetermination in parameterized quantum circuits
Marie Kempkes
Jakob Spiegelberg
Evert van Nieuwenburg
Vedran Dunjko
91
0
0
04 Apr 2025
Leveraging Generalizability of Image-to-Image Translation for Enhanced Adversarial Defense
Haibo Zhang
Zhihua Yao
Kouichi Sakurai
Takeshi Saitoh
AAML
93
0
0
02 Apr 2025
A Survey on Music Generation from Single-Modal, Cross-Modal, and Multi-Modal Perspectives
Shuyu Li
Shulei Ji
Zihao Wang
Songruoyao Wu
Jiaxing Yu
Kai Zhang
MGen
VGen
295
1
0
01 Apr 2025
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Bernard Ghanem
140
0
0
01 Apr 2025
Spatiotemporal Attention Learning Framework for Event-Driven Object Recognition
Tiantian Xie
Pengpai Wang
Rosa H. M. Chan
91
0
0
01 Apr 2025
Data Cleansing for GANs
Naoyuki Terashita
Hiroki Ohashi
Satoshi Hara
AAML
215
0
0
01 Apr 2025
Bridge the Gap Between Visual and Linguistic Comprehension for Generalized Zero-shot Semantic Segmentation
Xiaoqing Guo
Wenbo Li
Yixuan Yuan
119
0
0
31 Mar 2025
LATex: Leveraging Attribute-based Text Knowledge for Aerial-Ground Person Re-Identification
Xiang Hu
Yuhao Wang
Pingping Zhang
Huchuan Lu
VLM
133
0
0
31 Mar 2025
THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models
Yujin Huang
Zhi Zhang
Qingchuan Zhao
Lizhen Qu
Chunyang Chen
68
0
0
31 Mar 2025
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
124
0
0
31 Mar 2025
CA^2ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition
Jongseo Lee
Joohyun Chang
Dongho Lee
Jinwoo Choi
251
0
0
30 Mar 2025
Multi-label classification for multi-temporal, multi-spatial coral reef condition monitoring using vision foundation model with adapter learning
Xinlei Shao
Hongruixuan Chen
Fan Zhao
Kirsty Magson
Jundong Chen
Peiran Li
Jingchao Wang
Jun Sasaki
125
0
0
29 Mar 2025
T-CIL: Temperature Scaling using Adversarial Perturbation for Calibration in Class-Incremental Learning
Seong-Hyeon Hwang
Minsu Kim
Steven Euijong Whang
68
0
0
28 Mar 2025
Optimal Stepsize for Diffusion Sampling
Jianning Pei
Han Hu
Shuyang Gu
109
2
0
27 Mar 2025
Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance
Jaywon Koo
J. Hernandez
Moayed Haji-Ali
Ziyan Yang
Vicente Ordonez
EGVM
117
0
0
27 Mar 2025
OminiAdapt: Learning Cross-Task Invariance for Robust and Environment-Aware Robotic Manipulation
Yanjie Wang
Weiyun Yi
Xinhao Kong
Wanting Li
90
0
0
27 Mar 2025
Enabling Heterogeneous Adversarial Transferability via Feature Permutation Attacks
Tao Wu
Tie Luo
AAML
174
0
0
26 Mar 2025
Demand Estimation with Text and Image Data
Giovanni Compiani
Ilya Morozov
Stephan Seiler
DiffM
555
4
0
26 Mar 2025
TraNCE: Transformative Non-linear Concept Explainer for CNNs
Ugochukwu Ejike Akpudo
Yongsheng Gao
J. Zhou
Andrew Lewis
113
0
0
26 Mar 2025
Debiasing Kernel-Based Generative Models
Tian Qin
Wei-Min Huang
192
0
0
26 Mar 2025
Hierarchical Label Propagation: A Model-Size-Dependent Performance Booster for AudioSet Tagging
Ludovic Tuncay
Etienne Labbé
Thomas Pellegrini
VLM
87
0
0
26 Mar 2025
Dolphin: A Large-Scale Automatic Speech Recognition Model for Eastern Languages
Yangyang Meng
Jinpeng Li
Guodong Lin
Yu Pu
G. Wang
Hu Du
Zhiming Shao
Yukai Huang
Ke Li
Wei-Qiang Zhang
ObjD
148
0
0
26 Mar 2025
Wavelet-based Global-Local Interaction Network with Cross-Attention for Multi-View Diabetic Retinopathy Detection
Yihan Hu
Yuxin Lin
Chengliang Liu
Xiaoling Luo
Xiaoyan Dou
Qihao Xu
Yong-mei Xu
68
0
0
25 Mar 2025
Panorama Generation From NFoV Image Done Right
Dian Zheng
Cheng Zhang
Xiao-Ming Wu
Cao Li
Chengfei Lv
Jian-Fang Hu
Wei-Shi Zheng
DiffM
129
2
0
24 Mar 2025
k-NN as a Simple and Effective Estimator of Transferability
Moein Sorkhei
Christos Matsoukas
Johan Fredin Haslum
Emir Konuk
Kevin Smith
94
0
0
24 Mar 2025
On Symmetries in Convolutional Weights
B. Alsallakh
Timothy Wroge
Vivek Miglani
Narine Kokhlikyan
197
0
0
24 Mar 2025
PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models
Tadeusz Dziarmaga
Marcin Kądziołka
Artur Kasymov
Marcin Mazur
EGVM
182
0
0
24 Mar 2025
Vehicular Road Crack Detection with Deep Learning: A New Online Benchmark for Comprehensive Evaluation of Existing Algorithms
Nachuan Ma
Zhengfei Song
Qiang Hu
Chuang-Wei Liu
Yu Han
Yanting Zhang
Rui Fan
Lihua Xie
108
0
0
23 Mar 2025
GOAL: Global-local Object Alignment Learning
Hyungyu Choi
Young Kyun Jang
Chanho Eom
VLM
408
0
0
22 Mar 2025
Classifier-guided CLIP Distillation for Unsupervised Multi-label Classification
Dongseob Kim
Hyunjung Shim
VLM
121
0
0
21 Mar 2025
Stack Transformer Based Spatial-Temporal Attention Model for Dynamic Multi-Culture Sign Language Recognition
Koki Hirooka
Abu Saleh Musa Miah
Tatsuya Murakami
Yuto Akiba
Yong Seok Hwang
Jungpil Shin
SLR
61
0
0
21 Mar 2025
MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving
Haiguang Wang
Daqi Liu
Hongwei Xie
Haisong Liu
Enhui Ma
Kaicheng Yu
Limin Wang
Bing Wang
VGen
119
2
0
20 Mar 2025
From Head to Tail: Efficient Black-box Model Inversion Attack via Long-tailed Learning
Ziang Li
Hongguang Zhang
Juan Wang
Meihui Chen
Hongxin Hu
Wenzhe Yi
Xiaoyang Xu
Mengda Yang
Chenjun Ma
144
0
0
20 Mar 2025
Structured-Noise Masked Modeling for Video, Audio and Beyond
Aritra Bhowmik
Fida Mohammad Thoker
Carlos Hinojosa
Bernard Ghanem
Cees G. M. Snoek
VGen
108
0
0
20 Mar 2025
GAIR: Improving Multimodal Geo-Foundation Model with Geo-Aligned Implicit Representations
Ziqiang Liu
Fan Zhang
Junfeng Jiao
Ni Lao
Gengchen Mai
91
2
0
20 Mar 2025
A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli
Pengyu Liu
Guohua Dong
D. Guo
Kun Li
Fengling Li
Xun Yang
Meng Wang
Xiaomin Ying
AI4CE
83
0
0
20 Mar 2025
Manifold Learning for Hyperspectral Images
Fethi Harkat
Tiphaine Deuberet
Guillaume Gey
V. Perrier
K. Polisano
72
0
0
19 Mar 2025
DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition
Caoshuo Li
Tanzhe Li
Xiaobin Hu
Donghao Luo
Taisong Jin
93
1
0
19 Mar 2025
Texture-Aware StarGAN for CT data harmonisation
Francesco Di Feola
Ludovica Pompilio
Cecilia Assolito
V. Guarrasi
Paolo Soda
MedIm
69
0
0
19 Mar 2025
DPImageBench: A Unified Benchmark for Differentially Private Image Synthesis
Chen Gong
Kecen Li
Zinan Lin
Tianhao Wang
215
5
0
18 Mar 2025
Fibonacci-Net: A Lightweight CNN model for Automatic Brain Tumor Classification
Santanu Roy
Ashvath Suresh
Archit Gupta
Shubhi Tiwari
Palak Sahu
Prashant Adhikari
Yuvraj S. Shekhawat
106
0
0
18 Mar 2025
TarPro: Targeted Protection against Malicious Image Editing
Kaixin Shen
Ruijie Quan
Jiaxu Miao
Jun Xiao
Yi Yang
111
1
0
18 Mar 2025
SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model
Xinqing Li
Ruiqi Song
Qingyu Xie
Ye Wu
Nanxin Zeng
Yunfeng Ai
VGen
SyDa
105
2
0
18 Mar 2025