1409.0575
Cited By
v1
v2
v3 (latest)
ImageNet Large Scale Visual Recognition Challenge
1 September 2014
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
Sean Ma
Zhiheng Huang
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
Papers citing
"ImageNet Large Scale Visual Recognition Challenge"
50 / 11,105 papers shown
Title
Bayesian Comparisons Between Representations
Heiko H. Schütt
FAtt
523
0
0
13 Nov 2024
Convergence Rate Analysis of LION
Yiming Dong
Huan Li
Zhouchen Lin
96
3
0
12 Nov 2024
Feature Selection Based on Wasserstein Distance
Fuwei Li
66
0
0
11 Nov 2024
Decoding Visual Experience and Mapping Semantics through Whole-Brain Analysis Using fMRI Foundation Models
Yanchen Wang
Adam Turnbull
Tiange Xiang
Yunlong Xu
Sa Zhou
Adnan Masoud
Shekoofeh Azizi
F. Lin
Ehsan Adeli
89
1
0
11 Nov 2024
ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
Zanlin Ni
Yulin Wang
Renping Zhou
Yizeng Han
Jiayi Guo
Zhiyuan Liu
Yuan Yao
Gao Huang
107
5
0
11 Nov 2024
Model Fusion through Bayesian Optimization in Language Model Fine-Tuning
Chaeyun Jang
Hyungi Lee
Jungtaek Kim
Juho Lee
MoMe
171
4
0
11 Nov 2024
Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional Data
Alex Havrilla
Wenjing Liao
104
12
0
11 Nov 2024
AV-PedAware: Self-Supervised Audio-Visual Fusion for Dynamic Pedestrian Awareness
Yizhuo Yang
Shenghai Yuan
Muqing Cao
Jianfei Yang
Lihua Xie
270
9
0
11 Nov 2024
Feature Fusion Transferability Aware Transformer for Unsupervised Domain Adaptation
Xiaowei Yu
Zhe Huang
Zao Zhang
ViT
76
3
0
10 Nov 2024
Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing
Kaixuan Lu
Ruiqian Zhang
Xiao Huang
Yuxing Xie
Xiaogang Ning
Hanchao Zhang
Mengke Yuan
Pan Zhang
Tao Wang
Tongkui Liao
93
2
0
09 Nov 2024
GCI-ViTAL: Gradual Confidence Improvement with Vision Transformers for Active Learning on Label Noise
Moseli Motsóehli
Kyungim Baek
94
1
0
08 Nov 2024
Analyzing The Language of Visual Tokens
David M. Chan
Rodolfo Corona
J. S. Park
Cheol Jun Cho
Yutong Bai
Trevor Darrell
45
4
0
07 Nov 2024
HourVideo: 1-Hour Video-Language Understanding
Keshigeyan Chandrasegaran
Agrim Gupta
Lea M. Hadzic
Taran Kota
Jimming He
Cristobal Eyzaguirre
Zane Durante
Pengfei Yu
Jiajun Wu
L. Fei-Fei
VLM
115
50
0
07 Nov 2024
AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
Anil Kag
Huseyin Coskun
Jierun Chen
Junli Cao
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
Jian Ren
103
3
0
07 Nov 2024
DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning
G. Zhou
Hengkai Pan
Yann LeCun
Lerrel Pinto
VGen
LM&Ro
OffRL
115
32
0
07 Nov 2024
Pseudo-labeling with Keyword Refining for Few-Supervised Video Captioning
Ping Li
Tao Wang
Xinkui Zhao
Xianghua Xu
Mingli Song
81
4
0
06 Nov 2024
Local vs distributed representations: What is the right basis for interpretability?
Julien Colin
L. Goetschalckx
Thomas Fel
Victor Boutin
Jay Gopal
Thomas Serre
Nuria Oliver
HAI
96
2
0
06 Nov 2024
Deferred Poisoning: Making the Model More Vulnerable via Hessian Singularization
Yuhao He
Jinyu Tian
Xianwei Zheng
Li Dong
Yuanman Li
L. Zhang
AAML
88
0
0
06 Nov 2024
Adaptive Consensus Gradients Aggregation for Scaled Distributed Training
Yoni Choukroun
Shlomi Azoulay
P. Kisilev
98
0
0
06 Nov 2024
These Maps Are Made by Propagation: Adapting Deep Stereo Networks to Road Scenarios with Decisive Disparity Diffusion
Chuang-Wei Liu
Yikang Zhang
Qijun Chen
Ioannis Pitas
Rui Fan
3DV
74
3
0
06 Nov 2024
DiMSUM: Diffusion Mamba -- A Scalable and Unified Spatial-Frequency Method for Image Generation
Hao Phung
Quan Dao
T. Dao
Hoang Phan
Dimitris Metaxas
Anh Tran
Mamba
184
5
0
06 Nov 2024
On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models
Tariq Berrada Ifriqi
Pietro Astolfi
Melissa Hall
Reyhane Askari Hemmat
Yohann Benchetrit
...
Matthew Muckley
Karteek Alahari
Adriana Romero Soriano
Jakob Verbeek
M. Drozdzal
AI4CE
VLM
150
4
0
05 Nov 2024
Self-supervised cross-modality learning for uncertainty-aware object detection and recognition in applications which lack pre-labelled training data
Irum Mehboob
Li Sun
Alireza Rastegarpanah
Rustam Stolkin
UQCV
89
0
0
05 Nov 2024
HumanVLM: Foundation for Human-Scene Vision-Language Model
Dawei Dai
Xu Long
Li Yutang
Zhang YuanHui
Shuyin Xia
VLM
MLLM
159
2
0
05 Nov 2024
INQUIRE: A Natural World Text-to-Image Retrieval Benchmark
Edward Vendrow
Omiros Pantazis
Alexander Shepard
Gabriel J. Brostow
Kate E. Jones
Oisin Mac Aodha
Sara Beery
Grant Van Horn
VLM
111
7
0
04 Nov 2024
Learning Where to Edit Vision Transformers
Yunqiao Yang
Long-Kai Huang
Shengzhuang Chen
Kede Ma
Ying Wei
KELM
96
1
0
04 Nov 2024
Expanding Sparse Tuning for Low Memory Usage
Shufan Shen
Junshu Sun
Xiangyang Ji
Qingming Huang
Shuhui Wang
118
0
0
04 Nov 2024
Learning from Convolution-based Unlearnable Datasets
Dohyun Kim
Pedro Sandoval-Segura
MU
196
1
0
04 Nov 2024
MamT^4: Multi-view Attention Networks for Mammography Cancer Classification
Alisher Ibragimov
Sofya Senotrusova
Arsenii Litvinov
E. Ushakov
E. Karpulevich
Yury Markin
74
0
0
03 Nov 2024
Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment
Chengting Yu
Fengzhao Zhang
Ruizhe Chen
Zuozhu Liu
Shurun Tan
Er-ping Li
Aili Wang
112
2
0
03 Nov 2024
Contrasting with Symile: Simple Model-Agnostic Representation Learning for Unlimited Modalities
A. Saporta
A. Puli
Mark Goldstein
Rajesh Ranganath
SSL
98
1
0
01 Nov 2024
Does the Definition of Difficulty Matter? Scoring Functions and their Role for Curriculum Learning
Simon Rampp
M. Milling
Andreas Triantafyllopoulos
Björn Schuller
86
1
0
01 Nov 2024
Tracking one-in-a-million: Large-scale benchmark for microbial single-cell tracking with experiment-aware robustness metrics
J. Seiffarth
L. Blöbaum
R. D. Paul
N. Friederich
A. J. Yamachui Sitcheu
R. Mikut
H. Scharr
A. Grünberger
K. Nöh
100
3
0
01 Nov 2024
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Bram Adams
Ahmed E. Hassan
VLM
149
0
0
01 Nov 2024
Certified Robustness for Deep Equilibrium Models via Serialized Random Smoothing
Weizhi Gao
Zhichao Hou
Han Xu
Xiaorui Liu
AAML
85
0
0
01 Nov 2024
Constrained Diffusion Implicit Models
V. Jayaram
Ira Kemelmacher-Shlizerman
Steven M. Seitz
John Thickstun
DiffM
109
0
0
01 Nov 2024
Inducing Semi-Structured Sparsity by Masking for Efficient Model Inference in Convolutional Networks
David A. Danhofer
77
0
0
01 Nov 2024
CLIP-RT: Learning Language-Conditioned Robotic Policies from Natural Language Supervision
Gi-Cheon Kang
Junghyun Kim
Kyuhwan Shim
Jun Ki Lee
Byoung-Tak Zhang
LM&Ro
353
2
1
01 Nov 2024
Group Crosscoders for Mechanistic Analysis of Symmetry
Liv Gorton
92
1
0
31 Oct 2024
Bayesian-guided Label Mapping for Visual Reprogramming
C. Cai
Zesheng Ye
Lei Feng
Jianzhong Qi
Feng Liu
157
5
0
31 Oct 2024
MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval
Haiwen Li
Fei Su
Zhicheng Zhao
81
0
0
31 Oct 2024
Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach
Mathilde Caron
Alireza Fathi
Cordelia Schmid
Ahmet Iscen
69
2
0
31 Oct 2024
Transparent Trade-offs between Properties of Explanations
Hiwot Belay Tadesse
Alihan Hüyük
Yaniv Yacoby
Weiwei Pan
Finale Doshi-Velez
FAtt
150
0
0
31 Oct 2024
ResiDual Transformer Alignment with Spectral Decomposition
Lorenzo Basile
Valentino Maiorca
Luca Bortolussi
Emanuele Rodolà
Francesco Locatello
193
2
0
31 Oct 2024
Zero-shot Class Unlearning via Layer-wise Relevance Analysis and Neuronal Path Perturbation
Wenhan Chang
Tianqing Zhu
Ping Xiong
Yufeng Wu
Faqian Guan
Wanlei Zhou
MU
87
0
0
31 Oct 2024
ProTransformer: Robustify Transformers via Plug-and-Play Paradigm
Zhichao Hou
Weizhi Gao
Yuchen Shen
Feiyi Wang
Xiaorui Liu
VLM
86
2
0
30 Oct 2024
CausAdv: A Causal-based Framework for Detecting Adversarial Examples
Hichem Debbi
CML
AAML
84
1
0
29 Oct 2024
Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation
Jintao Tong
Yixiong Zou
Yuhua Li
Ruixuan Li
111
6
0
29 Oct 2024
Advancing Efficient Brain Tumor Multi-Class Classification -- New Insights from the Vision Mamba Model in Transfer Learning
Yinyi Lai
Anbo Cao
Yuan Gao
Jiaqi Shang
Zongyu Li
Jia Guo
Mamba
93
1
0
29 Oct 2024
Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation
Ruihao Xia
Yu Liang
Peng-Tao Jiang
Hao Zhang
Yue Liu
Yang Tang
Pan Zhou
DiffM
VLM
87
1
0
29 Oct 2024