Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1811.00982
Cited By
v1
v2 (latest)
The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale
2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
ObjD
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"
50 / 356 papers shown
Title
Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead
Maurizio Capra
Beatrice Bussolino
Alberto Marchisio
Guido Masera
Maurizio Martina
Mohamed Bennai
BDL
141
147
0
21 Dec 2020
Robust Federated Learning with Noisy Labels
Seunghan Yang
Hyoungseob Park
Junyoung Byun
Changick Kim
FedML
NoLa
69
80
0
03 Dec 2020
Grafit: Learning fine-grained image representations with coarse labels
Hugo Touvron
Alexandre Sablayrolles
Matthijs Douze
Matthieu Cord
Hervé Jégou
SSL
91
68
0
25 Nov 2020
Insights From A Large-Scale Database of Material Depictions In Paintings
Hubert Lin
Mitchell J. P. van Zuijlen
M. Wijntjes
S. Pont
Kavita Bala
124
6
0
24 Nov 2020
HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms
Mahmoud Afifi
Marcus A. Brubaker
M. S. Brown
GAN
118
105
0
23 Nov 2020
One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection Tasks
Kemal Oksuz
Baris Can Cam
Sinan Kalkan
Emre Akbas
93
33
0
21 Nov 2020
Open-Vocabulary Object Detection Using Captions
Alireza Zareian
Kevin Dela Rosa
Derek Hao Hu
Shih-Fu Chang
VLM
ObjD
215
436
0
20 Nov 2020
Image Representations Learned With Unsupervised Pre-Training Contain Human-like Biases
Ryan Steed
Aylin Caliskan
SSL
107
162
0
28 Oct 2020
Webly Supervised Image Classification with Metadata: Automatic Noisy Label Correction via Visual-Semantic Graph
Jingkang Yang
Weirong Chen
Xue Jiang
Xiaopeng Yan
Huabin Zheng
Wayne Zhang
NoLa
77
13
0
12 Oct 2020
CAPTION: Correction by Analyses, POS-Tagging and Interpretation of Objects using only Nouns
L. Ferreira
Douglas De Rizzo Meneghetti
P. Santos
26
2
0
02 Oct 2020
Asymmetric Loss For Multi-Label Classification
Emanuel Ben-Baruch
T. Ridnik
Nadav Zamir
Asaf Noy
Itamar Friedman
M. Protter
Lihi Zelnik-Manor
114
549
0
29 Sep 2020
VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning
Xiaowei Hu
Xi Yin
Kevin Qinghong Lin
Lijuan Wang
Lefei Zhang
Jianfeng Gao
Zicheng Liu
VLM
110
57
0
28 Sep 2020
MimicDet: Bridging the Gap Between One-Stage and Two-Stage Object Detection
Xin Lu
Quanquan Li
Buyu Li
Junjie Yan
ObjD
66
54
0
24 Sep 2020
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks
Zhiqiang Shen
Marios Savvides
92
63
0
17 Sep 2020
BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation
Haisheng Su
Weihao Gan
Wei Wu
Yu Qiao
Junjie Yan
152
125
0
15 Sep 2020
Adaptive Label Smoothing
Ujwal Krothapalli
A. Lynn Abbott
98
10
0
14 Sep 2020
Denoising Large-Scale Image Captioning from Alt-text Data using Content Selection Models
Khyathi Chandu
Piyush Sharma
Soravit Changpinyo
Ashish V. Thapliyal
Radu Soricut
DiffM
VLM
88
3
0
10 Sep 2020
1st Place Solution of LVIS Challenge 2020: A Good Box is not a Guarantee of a Good Mask
Jingru Tan
Gang Zhang
Hanming Deng
Changbao Wang
Lewei Lu
Quanquan Li
Jifeng Dai
82
18
0
03 Sep 2020
A Cost-Effective Person-Following System for Assistive Unmanned Vehicles with Deep Learning at the Edge
A. Boschi
Francesco Salvetti
Vittorio Mazzia
Marcello Chiaberge
64
13
0
31 Aug 2020
Soliciting Human-in-the-Loop User Feedback for Interactive Machine Learning Reduces User Trust and Impressions of Model Accuracy
Donald R. Honeycutt
Mahsan Nourani
Eric D. Ragan
HAI
95
63
0
28 Aug 2020
DeepSOCIAL: Social Distancing Monitoring and Infection Risk Assessment in COVID-19 Pandemic
Mahdi Rezaei
Mohsen Azarmi
87
155
0
26 Aug 2020
Object Detection with a Unified Label Space from Multiple Datasets
Xiangyu Zhao
S. Schulter
Gaurav Sharma
Yi-Hsuan Tsai
Manmohan Chandraker
Ying Nian Wu
ObjD
87
72
0
15 Aug 2020
Guided Collaborative Training for Pixel-wise Semi-Supervised Learning
Zhanghan Ke
Di Qiu
Kaican Li
Qiong Yan
Rynson W. H. Lau
98
254
0
12 Aug 2020
BREEDS: Benchmarks for Subpopulation Shift
Shibani Santurkar
Dimitris Tsipras
Aleksander Madry
OOD
85
175
0
11 Aug 2020
Polysemy Deciphering Network for Robust Human-Object Interaction Detection
Xubin Zhong
Changxing Ding
X. Qu
Dacheng Tao
124
59
0
07 Aug 2020
Multiple instance learning on deep features for weakly supervised object detection with extreme domain shifts
Nicolas Gonthier
Saïd Ladjal
Y. Gousseau
WSOD
81
29
0
03 Aug 2020
Spatially Aware Multimodal Transformers for TextVQA
Yash Kant
Dhruv Batra
Peter Anderson
Alex Schwing
Devi Parikh
Jiasen Lu
Harsh Agrawal
100
86
0
23 Jul 2020
Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020
Haisheng Su
Jinyuan Feng
Hao Shao
Zhenyu Jiang
Manyuan Zhang
Wei Wu
Yu Liu
Hongsheng Li
Junjie Yan
40
0
0
20 Jul 2020
Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer
Yuanyi Zhong
Jianfeng Wang
Jian-wei Peng
Lei Zhang
87
50
0
15 Jul 2020
Deep learning for scene recognition from visual data: a survey
Alina Matei
A. Glavan
Estefanía Talavera
87
18
0
03 Jul 2020
Measuring Robustness to Natural Distribution Shifts in Image Classification
Rohan Taori
Achal Dave
Vaishaal Shankar
Nicholas Carlini
Benjamin Recht
Ludwig Schmidt
OOD
134
549
0
01 Jul 2020
Recurrent Relational Memory Network for Unsupervised Image Captioning
Dan Guo
Yang Wang
Peipei Song
Meng Wang
GAN
83
40
0
24 Jun 2020
Large image datasets: A pyrrhic win for computer vision?
Vinay Uday Prabhu
Abeba Birhane
127
367
0
24 Jun 2020
Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks
Avi Schwarzschild
Micah Goldblum
Arjun Gupta
John P. Dickerson
Tom Goldstein
AAML
TDI
114
164
0
22 Jun 2020
UniT: Unified Knowledge Transfer for Any-shot Object Detection and Segmentation
Siddhesh Khandelwal
Raghav Goyal
Leonid Sigal
VLM
117
2
0
12 Jun 2020
Rethinking Pre-training and Self-training
Barret Zoph
Golnaz Ghiasi
Nayeon Lee
Huayu Chen
Hanxiao Liu
E. D. Cubuk
Quoc V. Le
SSeg
115
656
0
11 Jun 2020
Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Lluís Gómez
Ali Furkan Biten
Rubèn Pérez Tito
Andrés Mafla
Marçal Rusiñol
Ernest Valveny
Dimosthenis Karatzas
68
21
0
01 Jun 2020
Large-Scale Object Detection in the Wild from Imbalanced Multi-Labels
Junran Peng
Xingyuan Bu
Ming Sun
Zhaoxiang Zhang
Tieniu Tan
Junjie Yan
VLM
ObjD
82
60
0
18 May 2020
Cross-media Structured Common Space for Multimedia Event Extraction
Manling Li
Alireza Zareian
Qi Zeng
Spencer Whitehead
Di Lu
Heng Ji
Shih-Fu Chang
80
103
0
05 May 2020
Monitoring COVID-19 social distancing with person detection and tracking via fine-tuned YOLO v3 and Deepsort techniques
Narinder Singh Punn
S. K. Sonbhadra
Sonali Agarwal
Gaurav Rai
117
240
0
04 May 2020
Clue: Cross-modal Coherence Modeling for Caption Generation
Malihe Alikhani
Piyush Sharma
Shengjie Li
Radu Soricut
Matthew Stone
122
57
0
02 May 2020
Real-Time Apple Detection System Using Embedded Systems With Hardware Accelerators: An Edge AI Application
Vittorio Mazzia
Francesco Salvetti
Aleem Khaliq
Marcello Chiaberge
72
154
0
28 Apr 2020
Global Wheat Head Detection (GWHD) dataset: a large and diverse dataset of high resolution RGB labelled images to develop and benchmark wheat head detection methods
Etienne David
S. Madec
Pouria Sadeghi-Tehran
H. Aasen
Bangyou Zheng
...
A. Hund
S. Chapman
F. Baret
I. Stavness
Wei Guo
78
206
0
25 Apr 2020
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Xiujun Li
Xi Yin
Chunyuan Li
Pengchuan Zhang
Xiaowei Hu
...
Houdong Hu
Li Dong
Furu Wei
Yejin Choi
Jianfeng Gao
VLM
244
1,955
0
13 Apr 2020
Google Landmarks Dataset v2 -- A Large-Scale Benchmark for Instance-Level Recognition and Retrieval
Tobias Weyand
A. Araújo
Bingyi Cao
Jack Sim
105
373
0
03 Apr 2020
GPS-Net: Graph Property Sensing Network for Scene Graph Generation
Xin Lin
Changxing Ding
Jinquan Zeng
Dacheng Tao
136
284
0
29 Mar 2020
Weakly Supervised 3D Human Pose and Shape Reconstruction with Normalizing Flows
Andrei Zanfir
Eduard Gabriel Bazavan
Hongyi Xu
Bill Freeman
Rahul Sukthankar
C. Sminchisescu
3DH
97
136
0
23 Mar 2020
CPS++: Improving Class-level 6D Pose and Shape Estimation From Monocular Images With Self-Supervised Learning
Fabian Manhardt
Gu Wang
Benjamin Busam
M. Nickel
Sven Meier
Luca Minciullo
Xiangyang Ji
Nassir Navab
62
13
0
12 Mar 2020
PANDA: A Gigapixel-level Human-centric Video Dataset
Xueyan Wang
Xiya Zhang
Yinheng Zhu
Yuchen Guo
Xiaoyun Yuan
...
Zerun Wang
Guiguang Ding
D. Brady
Qionghai Dai
Lu Fang
VGen
100
82
0
10 Mar 2020
Optimizing JPEG Quantization for Classification Networks
Zhijing Li
Christopher De Sa
Adrian Sampson
VLM
52
12
0
05 Mar 2020
Previous
1
2
3
4
5
6
7
8
Next