1512.00567
v3 (latest)
Rethinking the Inception Architecture for Computer Vision
2 December 2015
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DV
BDL
Papers citing "Rethinking the Inception Architecture for Computer Vision"
50 / 6,612 papers shown
Title
ForamViT-GAN: Exploring New Paradigms in Deep Learning for Micropaleontological Image Analysis
Ivan Ferreira-Chacua
A. Koeshidayatullah
58
2
0
09 Apr 2023
SparseFormer: Sparse Visual Recognition via Limited Latent Tokens
Ziteng Gao
Zhan Tong
Limin Wang
Mike Zheng Shou
60
10
0
07 Apr 2023
AutoQNN: An End-to-End Framework for Automatically Quantizing Neural Networks
Cheng Gong
Ye Lu
Surong Dai
Deng Qian
Chenkun Du
Tao Li
MQ
57
0
0
07 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
105
43
0
07 Apr 2023
Graph Attention for Automated Audio Captioning
Feiyang Xiao
Jian Guan
Qiaoxi Zhu
Wenwu Wang
64
8
0
07 Apr 2023
PSLT: A Light-weight Vision Transformer with Ladder Self-Attention and Progressive Shift
Gaojie Wu
Weishi Zheng
Yutong Lu
Q. Tian
ViT
84
15
0
07 Apr 2023
Rethinking Evaluation Protocols of Visual Representations Learned via Self-supervised Learning
Jaehoon Lee
Doyoung Yoon
Byeongmoon Ji
Kyungyul Kim
Sangheum Hwang
SSL
75
3
0
07 Apr 2023
Diffusion Models as Masked Autoencoders
Chen Wei
K. Mangalam
Po-Yao (Bernie) Huang
Yanghao Li
Haoqi Fan
Hu Xu
Huiyu Wang
Cihang Xie
Alan Yuille
Christoph Feichtenhofer
DiffM
SyDa
100
53
0
06 Apr 2023
Face Animation with an Attribute-Guided Diffusion Model
Bo-Wen Zeng
Xuhui Liu
Sicheng Gao
Boyu Liu
Hong Li
Jianzhuang Liu
Baochang Zhang
86
33
0
06 Apr 2023
PopulAtion Parameter Averaging (PAPA)
Alexia Jolicoeur-Martineau
Emy Gervais
Kilian Fatras
Yan Zhang
Simon Lacoste-Julien
MoMe
110
21
0
06 Apr 2023
Tensor Slicing and Optimization for Multicore NPUs
R. Sousa
M. Pereira
Yongin Kwon
Taeho Kim
Namsoon Jung
Chang Soo Kim
Michael Frank
Guido Araujo
86
6
0
06 Apr 2023
Convolutional neural networks for crack detection on flexible road pavements
Hermann Tapamo
Anna Sergeevna Bosman
James Maina
E. Horak
21
1
0
06 Apr 2023
Efficient Audio Captioning Transformer with Patchout and Text Guidance
Thodoris Kouzelis
Grigoris Bastas
Athanasios Katsamanis
Alexandros Potamianos
ViT
88
6
0
06 Apr 2023
MULLER: Multilayer Laplacian Resizer for Vision
Zhengzhong Tu
P. Milanfar
Hossein Talebi
78
4
0
06 Apr 2023
UNICORN: A Unified Backdoor Trigger Inversion Framework
Zhenting Wang
Kai Mei
Juan Zhai
Shiqing Ma
LLMSV
81
47
0
05 Apr 2023
What Affects Learned Equivariance in Deep Image Recognition Models?
Robert-Jan Bruintjes
Tomasz Motyka
Jan van Gemert
114
8
0
05 Apr 2023
DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images
Bo Qian
Haoxing Chen
Xiangning Wang
Haoxuan Che
Gitaek Kwon
...
Xiaokang Yang
Yiyu Cai
Weiping Jia
Huating Li
Bin Sheng
80
5
0
05 Apr 2023
SMPConv: Self-moving Point Representations for Continuous Convolution
Sanghyeon Kim
Eunbyung Park
3DPC
73
13
0
05 Apr 2023
GINA-3D: Learning to Generate Implicit Neural Assets in the Wild
Bokui Shen
Xinchen Yan
C. Qi
Mahyar Najibi
Boyang Deng
Leonidas Guibas
Yin Zhou
Drago Anguelov
3DV
96
21
0
04 Apr 2023
Effective Theory of Transformers at Initialization
Emily Dinan
Sho Yaida
Susan Zhang
89
16
0
04 Apr 2023
Revisiting the Evaluation of Image Synthesis with GANs
Mengping Yang
Ceyuan Yang
Yichi Zhang
Qingyan Bai
Yujun Shen
Bo Dai
EGVM
74
7
0
04 Apr 2023
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation
Mayu Otani
Riku Togashi
Yu Sawai
Ryosuke Ishigami
Yuta Nakashima
Esa Rahtu
J. Heikkilä
Shin'ichi Satoh
108
65
0
04 Apr 2023
Beyond Unimodal: Generalising Neural Processes for Multimodal Uncertainty Estimation
M. Jung
He Zhao
Joanna Dipnall
Lan Du
UQCV
BDL
72
8
0
04 Apr 2023
Improved Visual Fine-tuning with Natural Language Supervision
Junyan Wang
Yuanhong Xu
Juhua Hu
Ming Yan
Jitao Sang
Qi Qian
64
8
0
04 Apr 2023
A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NAS
Juan R. Terven
Diana-Margarita Córdova-Esparza
166
1,368
0
02 Apr 2023
Information Recovery-Driven Deep Incomplete Multiview Clustering Network
Chengliang Liu
Jie Wen
Zhihao Wu
Xiaoling Luo
Chao Huang
Yong-mei Xu
97
48
0
02 Apr 2023
Hierarchical Vision Transformers for Cardiac Ejection Fraction Estimation
Lhuqita Fazry
Asep Haryono
Nuzulul Khairu Nissa
Sunarno
Naufal Muhammad Hirzi
M. F. Rachmadi
W. Jatmiko
MedIm
76
17
0
31 Mar 2023
Social Honeypot for Humans: Luring People through Self-managed Instagram Pages
Sara Bardi
Mauro Conti
Luca Pajola
Pier Paolo Tricomi
58
1
0
31 Mar 2023
Generating Adversarial Samples in Mini-Batches May Be Detrimental To Adversarial Robustness
T. Redgrave
Colton R. Crum
AAML
40
0
0
30 Mar 2023
Consistent View Synthesis with Pose-Guided Diffusion Models
Hung-Yu Tseng
Qinbo Li
Changil Kim
Suhib Alsisan
Jia-Bin Huang
Johannes Kopf
DiffM
104
101
0
30 Mar 2023
Neglected Free Lunch -- Learning Image Classifiers Using Annotation Byproducts
Dongyoon Han
Junsuk Choe
Dante Chun
John Joon Young Chung
Minsuk Chang
Sangdoo Yun
Jean Y. Song
Seong Joon Oh
OOD
654
4
1
30 Mar 2023
SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger
Yuting Gao
Jinfeng Liu
Zi-Han Xu
Tong Wu
Wen Liu
Jie Yang
Keren Li
Xingen Sun
CLIP
VLM
64
47
0
30 Mar 2023
Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment
Kim Sung-Bin
Arda Senocak
H. Ha
Andrew Owens
Tae-Hyun Oh
DiffM
VGen
86
39
0
30 Mar 2023
LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation
Guangcong Zheng
Xianpan Zhou
Xuewei Li
Zhongang Qi
Ying Shan
Xi Li
DiffM
98
190
0
30 Mar 2023
Mixed Autoencoder for Self-supervised Visual Representation Learning
Kai Chen
Zhili Liu
Lanqing Hong
Hang Xu
Zhenguo Li
Dit-Yan Yeung
SSL
123
45
0
30 Mar 2023
A comparative evaluation of image-to-image translation methods for stain transfer in histopathology
I. Zingman
Sergio Frayle
Ivan Tankoyeu
Sergey Sukhanov
Fabian Heinemann
MedIm
30
13
0
29 Mar 2023
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Limin Wang
Bingkun Huang
Zhiyu Zhao
Zhan Tong
Yinan He
Yi Wang
Yali Wang
Yu Qiao
VGen
153
363
0
29 Mar 2023
Bi-directional Training for Composed Image Retrieval via Text Prompt Learning
Zheyuan Liu
Weixuan Sun
Yicong Hong
Damien Teney
Stephen Gould
126
34
0
29 Mar 2023
WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models
Konstantina Nikolaidou
George Retsinas
Vincent Christlein
Mathias Seuret
Giorgos Sfikas
Elisa Barney Smith
Hamam Mokayed
Marcus Liwicki
DiffM
74
16
0
29 Mar 2023
Implicit Diffusion Models for Continuous Super-Resolution
Sicheng Gao
Xuhui Liu
Bo-Wen Zeng
Sheng Xu
Yanjing Li
Xiaonan Luo
Jianzhuang Liu
Xiantong Zhen
Baochang Zhang
DiffM
111
232
0
29 Mar 2023
A Comprehensive and Versatile Multimodal Deep Learning Approach for Predicting Diverse Properties of Advanced Materials
Shun Muroga
Yasuaki Miki
Kenji Hata
AI4CE
29
18
0
29 Mar 2023
LMDA-Net: A lightweight multi-dimensional attention network for general EEG-based brain-computer interface paradigms and interpretability
Zhengqing Miao
Xin Zhang
Mei-rong Zhao
Dong Ming
44
6
0
29 Mar 2023
InceptionNeXt: When Inception Meets ConvNeXt
Weihao Yu
Pan Zhou
Shuicheng Yan
Xinchao Wang
189
141
0
29 Mar 2023
Dice Semimetric Losses: Optimizing the Dice Score with Soft Labels
Zifu Wang
Teodora Popordanoska
J. Bertels
Robin Lemmens
Matthew B. Blaschko
60
10
0
28 Mar 2023
UVCGAN v2: An Improved Cycle-Consistent GAN for Unpaired Image-to-Image Translation
D. Torbunov
Yi Huang
Huan-Hsin Tseng
Haiwang Yu
Jin-zhi Huang
Shinjae Yoo
Meifeng Lin
B. Viren
Yihui Ren
82
12
0
28 Mar 2023
Large-scale Training Data Search for Object Re-identification
Yue Yao
Huan Lei
Tom Gedeon
Liang Zheng
103
11
0
28 Mar 2023
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Kunchang Li
Yali Wang
Yizhuo Li
Yi Wang
Yinan He
Limin Wang
Yu Qiao
VGen
136
169
0
28 Mar 2023
Information-Theoretic GAN Compression with Variational Energy-based Model
Minsoo Kang
Hyewon Yoo
Eunhee Kang
Sehwan Ki
Hyong-Euk Lee
Bohyung Han
GAN
72
3
0
28 Mar 2023
Transferable Adversarial Attacks on Vision Transformers with Token Gradient Regularization
Jianping Zhang
Yizhan Huang
Weibin Wu
Michael R. Lyu
AAML
ViT
82
54
0
28 Mar 2023
Improving the Transferability of Adversarial Samples by Path-Augmented Method
Jianping Zhang
Jen-tse Huang
Wenxuan Wang
Yichen Li
Weibin Wu
Xiaosen Wang
Yuxin Su
Michael R. Lyu
AAML
111
52
0
28 Mar 2023