Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1512.00567
Cited By
v1
v2
v3 (latest)
Rethinking the Inception Architecture for Computer Vision
2 December 2015
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DV
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Rethinking the Inception Architecture for Computer Vision"
50 / 6,589 papers shown
Title
Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies
Xubin Wang
Weijia Jia
169
2
0
08 Jan 2025
Powerful Design of Small Vision Transformer on CIFAR10
Gent Wu
ViT
102
0
0
07 Jan 2025
WhACC: Whisker Automatic Contact Classifier with Expert Human-Level Performance
Phillip Maire
Samson G. King
Jonathan Andrew Cheung
Stefanie Walker
Samuel Andrew Hires
194
0
0
06 Jan 2025
Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks
Yang Wang
Chenghua Lin
ELM
195
0
0
05 Jan 2025
Facial Attractiveness Prediction in Live Streaming: A New Benchmark and Multi-modal Method
Haoyang Li
Xiaoyu Ren
Hongjiu Yu
Huiyu Duan
Kai Li
Ying Chen
Libo Wang
Xiongkuo Min
Guangtao Zhai
Xu Liu
CVBM
172
0
0
05 Jan 2025
Boosting Adversarial Transferability with Spatial Adversarial Alignment
Zhaoyu Chen
Haijing Guo
Kaixun Jiang
Jiyuan Fu
Xinyu Zhou
Dingkang Yang
Hao Tang
Yue Liu
Wenqiang Zhang
AAML
69
0
0
03 Jan 2025
Generalizing Trust: Weak-to-Strong Trustworthiness in Language Models
Martin Pawelczyk
Lillian Sun
Zhenting Qi
Aounon Kumar
Himabindu Lakkaraju
158
2
0
03 Jan 2025
TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions
Vriksha Srihari
R. Bhavya
Shruti Jayaraman
V. Mary Anita Rajam
DiffM
VGen
128
0
0
02 Jan 2025
Adaptive Hardness-driven Augmentation and Alignment Strategies for Multi-Source Domain Adaptations
Yang Yuxiang
Zeng Xinyi
Zeng Pinxian
Zu Chen
Yan Binyu
Zhou Jiliu
Wang Yan
124
0
0
02 Jan 2025
Two Heads Are Better Than One: Averaging along Fine-Tuning to Improve Targeted Transferability
Hui Zeng
Sanshuai Cui
Biwei Chen
Anjie Peng
AAML
121
0
0
31 Dec 2024
Attribution for Enhanced Explanation with Transferable Adversarial eXploration
Zhiyu Zhu
Jiayu Zhang
Zhibo Jin
Huaming Chen
Jianlong Zhou
Fang Chen
AAML
ViT
88
0
0
27 Dec 2024
Establishing Reality-Virtuality Interconnections in Urban Digital Twins for Superior Intelligent Road Inspection
Yikang Zhang
Chuang-Wei Liu
Jiahang Li
Yingbing Chen
Jie Cheng
Rui Fan
77
0
0
23 Dec 2024
Predicting Satisfied User and Machine Ratio for Compressed Images: A Unified Approach
Qi Zhang
Shanshe Wang
Xinfeng Zhang
Siwei Ma
Jingshan Pan
Wen Gao
75
0
0
23 Dec 2024
Unity is Strength: Unifying Convolutional and Transformeral Features for Better Person Re-Identification
Yuhao Wang
Pingping Zhang
Xuehu Liu
Zhengzheng Tu
Huchuan Lu
80
3
0
23 Dec 2024
Lightweight Design and Optimization methods for DCNNs: Progress and Futures
Hanhua Long
Wenbin Bi
Jian Sun
105
0
0
22 Dec 2024
GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space
Souhaib Attaiki
Paul Guerrero
Duygu Ceylan
Niloy J. Mitra
M. Ovsjanikov
149
0
0
21 Dec 2024
Prior2Posterior: Model Prior Correction for Long-Tailed Learning
S Divakar Bhat
Amit More
Mudit Soni
Surbhi Agrawal
123
0
0
21 Dec 2024
IMVB7t: A Multi-Modal Model for Food Preferences based on Artificially Produced Traits
Mushfiqur Rahman Abir
Md. Tanzib Hosain
Md. Abdullah-Al-Jubair
M. F. Mridha
98
4
0
21 Dec 2024
Non-Uniform Parameter-Wise Model Merging
Albert Manuel Orozco Camacho
Stefan Horoi
Guy Wolf
Eugene Belilovsky
MoMe
FedML
140
0
0
20 Dec 2024
SeagrassFinder: Deep Learning for Eelgrass Detection and Coverage Estimation in the Wild
Jannik Elsäßer
Laura Weihl
Veronika Cheplygina
Lisbeth Tangaa Nielsen
182
0
0
20 Dec 2024
Extreme Multi-label Completion for Semantic Document Labelling with Taxonomy-Aware Parallel Learning
Julien Audiffren
Christophe Broillet
Ljiljana Dolamic
Philippe Cudré-Mauroux
112
0
0
18 Dec 2024
MATCHED: Multimodal Authorship-Attribution To Combat Human Trafficking in Escort-Advertisement Data
V. Saxena
Benjamin Bashpole
Gijs van Dijck
Gerasimos Spanakis
91
0
0
18 Dec 2024
Optimized two-stage AI-based Neural Decoding for Enhanced Visual Stimulus Reconstruction from fMRI Data
Lorenzo Veronese
Andrea Moglia
Luca Mainardi
Pietro Cerveri
DiffM
128
0
0
17 Dec 2024
What is YOLOv6? A Deep Insight into the Object Detection Model
Athulya Sundaresan Geetha
3DH
VLM
ObjD
115
1
0
17 Dec 2024
PT: A Plain Transformer is Good Hospital Readmission Predictor
Zhenyi Fan
Jiaqi Li
Dongyu Luo
Yuqi Yuan
153
0
0
17 Dec 2024
CRoF: CLIP-based Robust Few-shot Learning on Noisy Labels
Shizhuo Deng
Bowen Han
Jiaqi Chen
Hao Wang
Dongyue Chen
Tong Jia
VLM
NoLa
151
0
0
17 Dec 2024
A Survey of Calibration Process for Black-Box LLMs
Liangru Xie
Hui Liu
Jingying Zeng
Xianfeng Tang
Yan Han
Chen Luo
Jing Huang
Zhen Li
Suhang Wang
Qi He
142
4
0
17 Dec 2024
Multilabel Classification for Lung Disease Detection: Integrating Deep Learning and Natural Language Processing
Maria Efimovich
Jayden Lim
Vedant Mehta
Ethan Poon
103
0
0
16 Dec 2024
MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt
Yuhao Wang
Xuehu Liu
T. Yan
Yebin Liu
Aihua Zheng
Pingping Zhang
Huchuan Lu
138
6
0
14 Dec 2024
DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification
Yuhao Wang
Yebin Liu
Aihua Zheng
Pingping Zhang
152
8
0
14 Dec 2024
A Novel Ensemble-Based Deep Learning Model with Explainable AI for Accurate Kidney Disease Diagnosis
Md. Arifuzzaman
Iftekhar Ahmed
M. Chowdhury
Shadman Sakib
Mohammad Shoaib Rahman
Md. Ebrahim Hossain
Shakib Absar
83
0
0
12 Dec 2024
Analysis of Object Detection Models for Tiny Object in Satellite Imagery: A Dataset-Centric Approach
Kailas PS
Selvakumaran R
Palani Murugan
Ramesh Kumar V
Malaya Kumar Biswal M
126
1
0
12 Dec 2024
Post-Training Non-Uniform Quantization for Convolutional Neural Networks
Ahmed Luqman
Khuzemah Qazi
Imdadullah Khan
MQ
102
0
0
10 Dec 2024
ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet
Andrei-Robert Alexandrescu
Razvan-Gabriel Petec
Alexandru Manole
Laura-Silvia Diosan
DiffM
112
0
0
09 Dec 2024
Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment
Kim Sung-Bin
Arda Senocak
Hyunwoo Ha
Tae-Hyun Oh
DiffM
219
0
0
09 Dec 2024
Language Model as Visual Explainer
Xingyi Yang
Xinchao Wang
VLM
76
0
0
08 Dec 2024
LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors
Yusuf Dalva
Yuezun Li
Qing Liu
Nanxuan Zhao
Jianming Zhang
Zhe Lin
Pinar Yanardag
AI4CE
131
2
0
05 Dec 2024
MACAW: A Causal Generative Model for Medical Imaging
Vibujithan Vigneshwaran
Erik Ohara
Matthias Wilms
Nils D. Forkert
OOD
CML
MedIm
127
1
0
03 Dec 2024
Performance Comparison of Deep Learning Techniques in Naira Classification
Ismail Ismail Tijjani
Ahmad Abubakar Mustapha
Ismaíl Tijjani Idris
77
0
0
03 Dec 2024
Fire-Image-DenseNet (FIDN) for predicting wildfire burnt area using remote sensing data
Bo Pang
Sibo Cheng
Yuhan Huang
Yufang Jin
Yike Guo
I. Colin Prentice
Sandy P. Harrison
Rossella Arcucci
AI4CE
117
2
0
02 Dec 2024
Multimodal Fusion Learning with Dual Attention for Medical Imaging
Joy Dhar
Nayyar Zaidi
Maryam Haghighat
Puneet Goyal
Sudipta Roy
Azadeh Alavi
Vikas Kumar
84
3
0
02 Dec 2024
Token Cropr: Faster ViTs for Quite a Few Tasks
Benjamin Bergner
C. Lippert
Aravindh Mahendran
ViT
VLM
104
1
0
01 Dec 2024
AdaScale: Dynamic Context-aware DNN Scaling via Automated Adaptation Loop on Mobile Devices
Yuzhan Wang
Sicong Liu
Bin Guo
Boqi Zhang
Ke Ma
Yasan Ding
Hao Luo
Yao Li
Zhiwen Yu
122
3
0
01 Dec 2024
Sketch-Guided Motion Diffusion for Stylized Cinemagraph Synthesis
H. Jin
Hengyuan Chang
Xiaoxuan Xie
Zhengyang Wang
Xusheng Du
Shaojun Hu
H. Xie
DiffM
VGen
107
0
0
01 Dec 2024
Robust Testing for Deep Learning using Human Label Noise
Gordon Lim
Stefan Larson
Kevin Leach
NoLa
114
0
0
29 Nov 2024
Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation
Michele De Vita
Vasileios Belagiannis
DiffM
151
1
0
29 Nov 2024
CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections
Mohamed Fazli Mohamed Imam
Rufael Fedaku Marew
Jameel Hassan
Mustansar Fiaz
Alham Fikri Aji
Hisham Cholakkal
VLM
546
1
0
28 Nov 2024
BadScan: An Architectural Backdoor Attack on Visual State Space Models
Om Suhas Deshmukh
Sankalp Nagaonkar
A. Tripathi
Ashish Mishra
Mamba
126
0
0
26 Nov 2024
Synthesising Handwritten Music with GANs: A Comprehensive Evaluation of CycleWGAN, ProGAN, and DCGAN
Elona Shatri
Kalikidhar Palavala
George Fazekas
152
0
0
25 Nov 2024
Brain-like emergent properties in deep networks: impact of network architecture, datasets and training
Niranjan Rajesh
Georgin Jacob
SP Arun
OOD
111
0
0
25 Nov 2024
Previous
1
2
3
...
6
7
8
...
130
131
132
Next