Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1409.0575
Cited By
v1
v2
v3 (latest)
ImageNet Large Scale Visual Recognition Challenge
1 September 2014
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
Sean Ma
Zhiheng Huang
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ImageNet Large Scale Visual Recognition Challenge"
50 / 11,076 papers shown
Title
k-NN as a Simple and Effective Estimator of Transferability
Moein Sorkhei
Christos Matsoukas
Johan Fredin Haslum
Emir Konuk
Kevin Smith
102
0
0
24 Mar 2025
Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding
Xiangrui Liu
Yan Shu
Zhengyang Liang
Ao Li
Yang Tian
Bo Zhao
VGen
VLM
290
9
0
24 Mar 2025
BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors
Yu Wang
Junxian Mu
Hongzhi Huang
Qilong Wang
Pengfei Zhu
Q. Hu
253
1
0
22 Mar 2025
IceBench: A Benchmark for Deep Learning based Sea Ice Type Classification
Samira Alkaee Taleghan
A. Barrett
Walter N. Meier
F. Banaei-Kashani
VLM
105
0
0
22 Mar 2025
Region Masking to Accelerate Video Processing on Neuromorphic Hardware
Sreetama Sarkar
S. Shrestha
Yue Che
L. Campos-Macias
Gourav Datta
Peter A. Beerel
137
0
0
21 Mar 2025
Restoring Forgotten Knowledge in Non-Exemplar Class Incremental Learning through Test-Time Semantic Evolution
Haori Lu
Xusheng Cao
Linlan Huang
Enguang Wang
Fei Yang
Xialei Liu
CLL
92
0
0
21 Mar 2025
Beyond Accuracy: What Matters in Designing Well-Behaved Models?
Robin Hesse
Doğukan Bağcı
Bernt Schiele
Simone Schaub-Meyer
Stefan Roth
VLM
123
0
0
21 Mar 2025
Improving Adversarial Transferability on Vision Transformers via Forward Propagation Refinement
Yuchen Ren
Zhengyu Zhao
Chenhao Lin
Bo Yang
Zhe Liu
Jiafei Wu
Chao Shen
ViT
94
2
0
19 Mar 2025
Object-Centric Pretraining via Target Encoder Bootstrapping
Nikola Đukić
Tim Lebailly
Tinne Tuytelaars
OCL
129
0
0
19 Mar 2025
LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding
Amirhossein Kazerouni
Soroush Mehraban
Michael Brudno
Babak Taati
90
2
0
19 Mar 2025
Language-based Image Colorization: A Benchmark and Beyond
Yongqian Li
Shuai Yang
Jiaying Liu
DiffM
VLM
102
0
0
19 Mar 2025
DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition
Caoshuo Li
Tanzhe Li
Xiaobin Hu
Donghao Luo
Taisong Jin
103
1
0
19 Mar 2025
TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection
Qiang Qi
Xiao Wang
ViT
571
0
0
18 Mar 2025
Operational Change Detection for Geographical Information: Overview and Challenges
Nicolas Gonthier
89
0
0
18 Mar 2025
DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers
Mert Bulent Sariyildiz
Philippe Weinzaepfel
Thomas Lucas
Pau de Jorge
Diane Larlus
Yannis Kalantidis
112
0
0
18 Mar 2025
Dynamic Accumulated Attention Map for Interpreting Evolution of Decision-Making in Vision Transformer
Yi Liao
Yongsheng Gao
Weichuan Zhang
84
3
0
18 Mar 2025
RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment
Chao Wang
Giulio Franzese
A. Finamore
Pietro Michiardi
267
0
0
18 Mar 2025
Scale-Aware Contrastive Reverse Distillation for Unsupervised Medical Anomaly Detection
Chunlei Li
Yilei Shi
Jingliang Hu
Xiao Xiang Zhu
Lichao Mou
MedIm
126
0
0
18 Mar 2025
Panoramic Distortion-Aware Tokenization for Person Detection and Localization Using Transformers in Overhead Fisheye Images
Nobuhiko Wakai
Satoshi Sato
Yasunori Ishii
Takayoshi Yamashita
124
0
0
18 Mar 2025
FeNeC: Enhancing Continual Learning via Feature Clustering with Neighbor- or Logit-Based Classification
Kamil Książek
Hubert Jastrzębski
Bartosz Trojan
Krzysztof Pniaczek
Michał Karp
Jacek Tabor
CLL
251
0
0
18 Mar 2025
An interpretable approach to automating the assessment of biofouling in video footage
Evelyn J. Mannix
Bartholomew A. Woodham
201
0
0
17 Mar 2025
AI-Driven Rapid Identification of Bacterial and Fungal Pathogens in Blood Smears of Septic Patients
Agnieszka Sroka-Oleksiak
Adam Pardyl
Dawid Rymarczyk
Aldona Olechowska-Jarząb
Katarzyna Biegun-Drożdż
...
Tomasz Gosiewski
Miłosz Adamczyk
Henryk Telega
Bartosz Zieliñski
Monika Brzychczy-Włoch
88
0
0
17 Mar 2025
Scale Efficient Training for Large Datasets
Qing Zhou
Junyu Gao
Qi Wang
DD
126
0
0
17 Mar 2025
SAUCE: Selective Concept Unlearning in Vision-Language Models with Sparse Autoencoders
Qing Li
Jiahui Geng
Derui Zhu
Fengyu Cai
Chenyang Lyu
Fakhri Karray
MU
105
2
0
16 Mar 2025
Unifying Perplexing Behaviors in Modified BP Attributions through Alignment Perspective
Guanhua Zheng
Jitao Sang
Changsheng Xu
AAML
FAtt
123
0
0
14 Mar 2025
AugGen: Synthetic Augmentation Can Improve Discriminative Models
Parsa Rahimi
Damien Teney
S´ebastien Marcel
140
2
0
14 Mar 2025
Open-Set Plankton Recognition
Joona Kareinen
Annaliina Skyttä
T. Eerola
K. Kraft
L. Lensu
S. Suikkanen
Maiju Lehtiniemi
Heikki Kälviäinen
113
1
0
14 Mar 2025
Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages
Matteo Farina
Massimiliano Mancini
Giovanni Iacca
Elisa Ricci
VLM
102
0
0
14 Mar 2025
Similarity-Aware Token Pruning: Your VLM but Faster
Ahmadreza Jeddi
Negin Baghbanzadeh
Elham Dolatabadi
Babak Taati
3DV
VLM
124
2
0
14 Mar 2025
JPEG Compliant Compression for Both Human and Machine, A Report
Linfeng Ye
3DH
AAML
89
0
0
13 Mar 2025
ReSi: A Comprehensive Benchmark for Representational Similarity Measures
Max Klabunde
Tassilo Wald
Tobias Schumacher
Klaus H. Maier-Hein
Markus Strohmaier
Adriana Iamnitchi
AI4TS
VLM
248
6
0
13 Mar 2025
ChatGPT Encounters Morphing Attack Detection: Zero-Shot MAD with Multi-Modal Large Language Models and General Vision Models
Haoyu Zhang
Raghavendra Ramachandra
Kiran Raja
C. Busch
113
0
0
13 Mar 2025
Learning Interpretable Logic Rules from Deep Vision Models
Chuqin Geng
Yuhe Jiang
Ziyu Zhao
Haolin Ye
Zhaoyue Wang
X. Si
NAI
FAtt
VLM
116
1
0
13 Mar 2025
Poly-MgNet: Polynomial Building Blocks in Multigrid-Inspired ResNets
Antonia van Betteray
Matthias Rottmann
Karsten Kahl
189
1
0
13 Mar 2025
A New Benchmark for Few-Shot Class-Incremental Learning: Redefining the Upper Bound
Shiwon Kim
Dongjun Hwang
Sungwon Woo
Rita Singh
CLL
140
0
0
13 Mar 2025
DNA Origami Nanostructures Observed in Transmission Electron Microscopy Images can be Characterized through Convolutional Neural Networks
Xingfei Wei
Qiankun Mo
Chi Chen
Mark Bathe
Rigoberto Hernandez
59
0
0
13 Mar 2025
Evaluating Visual Explanations of Attention Maps for Transformer-based Medical Imaging
Minjae Chung
Jong Bum Won
Ganghyun Kim
Yujin Kim
Utku Ozbulak
MedIm
206
0
0
12 Mar 2025
Finding the Muses: Identifying Coresets through Loss Trajectories
M. Nagaraj
Deepak Ravikumar
Efstathia Soufleri
Kaushik Roy
88
0
0
12 Mar 2025
AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks
Jin Li
Ziqiang He
Anwei Luo
Jian-Fang Hu
Zhong Wang
Xiangui Kang
DiffM
128
0
0
12 Mar 2025
Mapping fMRI Signal and Image Stimuli in an Artificial Neural Network Latent Space: Bringing Artificial and Natural Minds Together
Cesare Maria Dalbagno
Manuel de Castro Ribeiro Jardim
Mihnea Angheluţă
194
0
0
12 Mar 2025
Medical Large Language Model Benchmarks Should Prioritize Construct Validity
Ahmed M. Alaa
Thomas Hartvigsen
Niloufar Golchini
Shiladitya Dutta
Frances Dean
Inioluwa Deborah Raji
Travis Zack
AI4MH
ELM
LM&MA
86
7
0
12 Mar 2025
2HandedAfforder: Learning Precise Actionable Bimanual Affordances from Human Videos
Marvin Heidinger
Snehal Jauhri
V. Prasad
Georgia Chalvatzaki
130
0
0
12 Mar 2025
Group-robust Machine Unlearning
Thomas De Min
Subhankar Roy
Stéphane Lathuilière
Elisa Ricci
Massimiliano Mancini
MU
OOD
170
1
0
12 Mar 2025
Seeing What's Not There: Spurious Correlation in Multimodal LLMs
Parsa Hosseini
Sumit Nawathe
Mazda Moayeri
S. Balasubramanian
Soheil Feizi
LRM
115
3
0
11 Mar 2025
A Theoretical Framework for Preventing Class Collapse in Supervised Contrastive Learning
Chungpa Lee
Jeongheon Oh
Kibok Lee
Jy-yong Sohn
SSL
170
2
0
11 Mar 2025
Birds look like cars: Adversarial analysis of intrinsically interpretable deep learning
Hubert Baniecki
P. Biecek
AAML
128
0
0
11 Mar 2025
Explaining Human Preferences via Metrics for Structured 3D Reconstruction
Jack Langerman
Denys Rozumnyi
Yuzhong Huang
Dmytro Mishkin
HAI
106
0
0
11 Mar 2025
LongProLIP: A Probabilistic Vision-Language Model with Long Context Text
Sanghyuk Chun
Sangdoo Yun
VLM
111
2
0
11 Mar 2025
MIGA: Mutual Information-Guided Attack on Denoising Models for Semantic Manipulation
Guanghao Li
Mingzhi Chen
Hao Yu
Shuting Dong
Wenhao Jiang
Ming Tang
Chun Yuan
DiffM
AAML
96
0
0
10 Mar 2025
Iterative Prompt Relocation for Distribution-Adaptive Visual Prompt Tuning
Chikai Shang
Mengke Li
Yiqun Zhang
Zhen Chen
Jinlin Wu
Fangqing Gu
Yang Lu
Yiu-ming Cheung
VLM
121
0
0
10 Mar 2025
Previous
1
2
3
...
5
6
7
...
220
221
222
Next