Title
Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead Maurizio Capra Beatrice Bussolino Alberto Marchisio Guido Masera Maurizio Martina Muhammad Shafique BDL 141 147 0 21 Dec 2020
Robust Federated Learning with Noisy Labels Seunghan Yang Hyoungseob Park Junyoung Byun Changick Kim FedML NoLa 69 80 0 03 Dec 2020
Grafit: Learning fine-grained image representations with coarse labels Hugo Touvron Alexandre Sablayrolles Matthijs Douze Matthieu Cord Hervé Jégou SSL 91 68 0 25 Nov 2020
Insights From A Large-Scale Database of Material Depictions In Paintings Hubert Lin Mitchell J. P. van Zuijlen M. Wijntjes S. Pont Kavita Bala 124 6 0 24 Nov 2020
HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms Mahmoud Afifi Marcus A. Brubaker M. S. Brown GAN 118 105 0 23 Nov 2020
One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection Tasks Kemal Oksuz Baris Can Cam Sinan Kalkan Emre Akbas 93 33 0 21 Nov 2020
Open-Vocabulary Object Detection Using Captions Alireza Zareian Kevin Dela Rosa Derek Hao Hu Shih-Fu Chang VLM ObjD 215 436 0 20 Nov 2020
Image Representations Learned With Unsupervised Pre-Training Contain Human-like Biases Ryan Steed Aylin Caliskan SSL 107 162 0 28 Oct 2020
Webly Supervised Image Classification with Metadata: Automatic Noisy Label Correction via Visual-Semantic Graph Jingkang Yang Weirong Chen Xue Jiang Xiaopeng Yan Huabin Zheng Wayne Zhang NoLa 77 13 0 12 Oct 2020
CAPTION: Correction by Analyses, POS-Tagging and Interpretation of Objects using only Nouns L. Ferreira Douglas De Rizzo Meneghetti P. Santos 26 2 0 02 Oct 2020
Asymmetric Loss For Multi-Label Classification Emanuel Ben-Baruch T. Ridnik Nadav Zamir Asaf Noy Itamar Friedman M. Protter Lihi Zelnik-Manor 114 549 0 29 Sep 2020
VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning Xiaowei Hu Xi Yin Kevin Qinghong Lin Lijuan Wang Lefei Zhang Jianfeng Gao Zicheng Liu VLM 110 57 0 28 Sep 2020
MimicDet: Bridging the Gap Between One-Stage and Two-Stage Object Detection Xin Lu Quanquan Li Buyu Li Junjie Yan ObjD 66 54 0 24 Sep 2020
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks Zhiqiang Shen Marios Savvides 92 63 0 17 Sep 2020
BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation Haisheng Su Weihao Gan Wei Wu Yu Qiao Junjie Yan 152 125 0 15 Sep 2020
Adaptive Label Smoothing Ujwal Krothapalli A. Lynn Abbott 98 10 0 14 Sep 2020
Denoising Large-Scale Image Captioning from Alt-text Data using Content Selection Models Khyathi Chandu Piyush Sharma Soravit Changpinyo Ashish V. Thapliyal Radu Soricut DiffM VLM 88 3 0 10 Sep 2020
1st Place Solution of LVIS Challenge 2020: A Good Box is not a Guarantee of a Good Mask Jingru Tan Gang Zhang Hanming Deng Changbao Wang Lewei Lu Quanquan Li Jifeng Dai 82 18 0 03 Sep 2020
A Cost-Effective Person-Following System for Assistive Unmanned Vehicles with Deep Learning at the Edge A. Boschi Francesco Salvetti Vittorio Mazzia Marcello Chiaberge 64 13 0 31 Aug 2020
Soliciting Human-in-the-Loop User Feedback for Interactive Machine Learning Reduces User Trust and Impressions of Model Accuracy Donald R. Honeycutt Mahsan Nourani Eric D. Ragan HAI 95 63 0 28 Aug 2020
DeepSOCIAL: Social Distancing Monitoring and Infection Risk Assessment in COVID-19 Pandemic Mahdi Rezaei Mohsen Azarmi 87 155 0 26 Aug 2020
Object Detection with a Unified Label Space from Multiple Datasets Xiangyu Zhao S. Schulter Gaurav Sharma Yi-Hsuan Tsai Manmohan Chandraker Ying Nian Wu ObjD 87 72 0 15 Aug 2020
Guided Collaborative Training for Pixel-wise Semi-Supervised Learning Zhanghan Ke Di Qiu Kaican Li Qiong Yan Rynson W. H. Lau 98 254 0 12 Aug 2020
BREEDS: Benchmarks for Subpopulation Shift Shibani Santurkar Dimitris Tsipras Aleksander Madry OOD 85 175 0 11 Aug 2020
Polysemy Deciphering Network for Robust Human-Object Interaction Detection Xubin Zhong Changxing Ding X. Qu Dacheng Tao 124 59 0 07 Aug 2020
Multiple instance learning on deep features for weakly supervised object detection with extreme domain shifts Nicolas Gonthier Saïd Ladjal Y. Gousseau WSOD 81 29 0 03 Aug 2020
Spatially Aware Multimodal Transformers for TextVQA Yash Kant Dhruv Batra Peter Anderson Alex Schwing Devi Parikh Jiasen Lu Harsh Agrawal 100 86 0 23 Jul 2020
Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020 Haisheng Su Jinyuan Feng Hao Shao Zhenyu Jiang Manyuan Zhang Wei Wu Yu Liu Hongsheng Li Junjie Yan 40 0 0 20 Jul 2020
Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer Yuanyi Zhong Jianfeng Wang Jian-wei Peng Lei Zhang 87 50 0 15 Jul 2020
Deep learning for scene recognition from visual data: a survey Alina Matei A. Glavan Estefanía Talavera 87 18 0 03 Jul 2020
Measuring Robustness to Natural Distribution Shifts in Image Classification Rohan Taori Achal Dave Vaishaal Shankar Nicholas Carlini Benjamin Recht Ludwig Schmidt OOD 134 549 0 01 Jul 2020
Recurrent Relational Memory Network for Unsupervised Image Captioning Dan Guo Yang Wang Peipei Song Meng Wang GAN 83 40 0 24 Jun 2020
Large image datasets: A pyrrhic win for computer vision? Vinay Uday Prabhu Abeba Birhane 127 367 0 24 Jun 2020
Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks Avi Schwarzschild Micah Goldblum Arjun Gupta John P. Dickerson Tom Goldstein AAML TDI 114 164 0 22 Jun 2020
UniT: Unified Knowledge Transfer for Any-shot Object Detection and Segmentation Siddhesh Khandelwal Raghav Goyal Leonid Sigal VLM 117 2 0 12 Jun 2020
Rethinking Pre-training and Self-training Barret Zoph Golnaz Ghiasi Nayeon Lee Huayu Chen Hanxiao Liu E. D. Cubuk Quoc V. Le SSeg 115 656 0 11 Jun 2020
Multimodal grid features and cell pointers for Scene Text Visual Question Answering Lluís Gómez Ali Furkan Biten Rubèn Pérez Tito Andrés Mafla Marçal Rusiñol Ernest Valveny Dimosthenis Karatzas 68 21 0 01 Jun 2020
Large-Scale Object Detection in the Wild from Imbalanced Multi-Labels Junran Peng Xingyuan Bu Ming Sun Zhaoxiang Zhang Tieniu Tan Junjie Yan VLM ObjD 82 60 0 18 May 2020
Cross-media Structured Common Space for Multimedia Event Extraction Manling Li Alireza Zareian Qi Zeng Spencer Whitehead Di Lu Heng Ji Shih-Fu Chang 80 103 0 05 May 2020
Monitoring COVID-19 social distancing with person detection and tracking via fine-tuned YOLO v3 and Deepsort techniques Narinder Singh Punn S. K. Sonbhadra Sonali Agarwal Gaurav Rai 117 240 0 04 May 2020
Clue: Cross-modal Coherence Modeling for Caption Generation Malihe Alikhani Piyush Sharma Shengjie Li Radu Soricut Matthew Stone 122 57 0 02 May 2020
Real-Time Apple Detection System Using Embedded Systems With Hardware Accelerators: An Edge AI Application Vittorio Mazzia Francesco Salvetti Aleem Khaliq Marcello Chiaberge 72 154 0 28 Apr 2020
Global Wheat Head Detection (GWHD) dataset: a large and diverse dataset of high resolution RGB labelled images to develop and benchmark wheat head detection methods Etienne David S. Madec Pouria Sadeghi-Tehran H. Aasen Bangyou Zheng ... A. Hund S. Chapman F. Baret I. Stavness Wei Guo 78 206 0 25 Apr 2020
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks Xiujun Li Xi Yin Chunyuan Li Pengchuan Zhang Xiaowei Hu ... Houdong Hu Li Dong Furu Wei Yejin Choi Jianfeng Gao VLM 244 1,955 0 13 Apr 2020
Google Landmarks Dataset v2 -- A Large-Scale Benchmark for Instance-Level Recognition and Retrieval Tobias Weyand A. Araújo Bingyi Cao Jack Sim 105 373 0 03 Apr 2020
GPS-Net: Graph Property Sensing Network for Scene Graph Generation Xin Lin Changxing Ding Jinquan Zeng Dacheng Tao 136 284 0 29 Mar 2020
Weakly Supervised 3D Human Pose and Shape Reconstruction with Normalizing Flows Andrei Zanfir Eduard Gabriel Bazavan Hongyi Xu Bill Freeman Rahul Sukthankar C. Sminchisescu 3DH 97 136 0 23 Mar 2020
CPS++: Improving Class-level 6D Pose and Shape Estimation From Monocular Images With Self-Supervised Learning Fabian Manhardt Gu Wang Benjamin Busam M. Nickel Sven Meier Luca Minciullo Xiangyang Ji Nassir Navab 62 13 0 12 Mar 2020
PANDA: A Gigapixel-level Human-centric Video Dataset Xueyan Wang Xiya Zhang Yinheng Zhu Yuchen Guo Xiaoyun Yuan ... Zerun Wang Guiguang Ding D. Brady Qionghai Dai Lu Fang VGen 100 82 0 10 Mar 2020
Optimizing JPEG Quantization for Classification Networks Zhijing Li Christopher De Sa Adrian Sampson VLM 52 12 0 05 Mar 2020