ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.00982
  4. Cited By
The Open Images Dataset V4: Unified image classification, object
  detection, and visual relationship detection at scale

The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
    ObjD
    VLM
ArXivPDFHTML

Papers citing "The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"

50 / 246 papers shown
Title
Achieving Human Parity on Visual Question Answering
Achieving Human Parity on Visual Question Answering
Ming Yan
Haiyang Xu
Chenliang Li
Junfeng Tian
Bin Bi
...
Ji Zhang
Songfang Huang
Fei Huang
Luo Si
Rong Jin
32
12
0
17 Nov 2021
A Survey of Visual Transformers
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
77
332
0
11 Nov 2021
Resource-Efficient Federated Learning
Resource-Efficient Federated Learning
A. Abdelmoniem
Atal Narayan Sahu
Marco Canini
Suhaib A. Fahmy
FedML
32
55
0
01 Nov 2021
Multi-label Classification with Partial Annotations using Class-aware
  Selective Loss
Multi-label Classification with Partial Annotations using Class-aware Selective Loss
Emanuel Ben-Baruch
T. Ridnik
Itamar Friedman
Avi Ben-Cohen
Nadav Zamir
Asaf Noy
Lihi Zelnik-Manor
34
38
0
21 Oct 2021
Noisy Annotation Refinement for Object Detection
Noisy Annotation Refinement for Object Detection
Jiafeng Mao
Qing Yu
Yoko Yamakata
Kiyoharu Aizawa
NoLa
42
10
0
20 Oct 2021
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
272
1,026
0
13 Oct 2021
Aura: Privacy-preserving Augmentation to Improve Test Set Diversity in
  Speech Enhancement
Aura: Privacy-preserving Augmentation to Improve Test Set Diversity in Speech Enhancement
Xavier Gitiaux
Aditya Khant
Ebrahim Beyrami
Chandan K. A. Reddy
J. Gupchup
Ross Cutler
22
0
0
08 Oct 2021
Inferring Offensiveness In Images From Natural Language Supervision
Inferring Offensiveness In Images From Natural Language Supervision
P. Schramowski
Kristian Kersting
32
2
0
08 Oct 2021
PASS: An ImageNet replacement for self-supervised pretraining without
  humans
PASS: An ImageNet replacement for self-supervised pretraining without humans
Yuki M. Asano
Christian Rupprecht
Andrew Zisserman
Andrea Vedaldi
VLM
SSL
21
57
0
27 Sep 2021
Visual Scene Graphs for Audio Source Separation
Visual Scene Graphs for Audio Source Separation
Moitreya Chatterjee
Jonathan Le Roux
Narendra Ahuja
A. Cherian
26
36
0
24 Sep 2021
Pix2seq: A Language Modeling Framework for Object Detection
Pix2seq: A Language Modeling Framework for Object Detection
Ting-Li Chen
Saurabh Saxena
Lala Li
David J. Fleet
Geoffrey E. Hinton
MLLM
ViT
VLM
244
344
0
22 Sep 2021
Deep Joint Source-Channel Coding for Multi-Task Network
Deep Joint Source-Channel Coding for Multi-Task Network
Mengyang Wang
Zhicong Zhang
Jiahui Li
Mengyao Ma
Xiaopeng Fan
17
28
0
13 Sep 2021
Panoptic Narrative Grounding
Panoptic Narrative Grounding
Cristina González
Nicolás Ayobi
Isabela Hernández
José Hernández
Jordi Pont-Tuset
Pablo Arbeláez
82
22
0
10 Sep 2021
Learning to Generate Scene Graph from Natural Language Supervision
Learning to Generate Scene Graph from Natural Language Supervision
Yiwu Zhong
Jing Shi
Jianwei Yang
Chenliang Xu
Yin Li
SSL
42
77
0
06 Sep 2021
DVM-CAR: A large-scale automotive dataset for visual marketing research
  and applications
DVM-CAR: A large-scale automotive dataset for visual marketing research and applications
JingMin Huang
Bowei Chen
Lan Luo
Shigang Yue
I. Ounis
28
15
0
10 Aug 2021
Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Yuren Cong
Wentong Liao
H. Ackermann
Bodo Rosenhahn
M. Yang
ViT
22
122
0
26 Jul 2021
PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior
  for Joint Image-Text Modeling
PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling
Xiaoxue Zang
Lijuan Liu
Maria Wang
Yang Song
Hao Zhang
Jindong Chen
VLM
35
55
0
06 Jul 2021
Web-Scale Generic Object Detection at Microsoft Bing
Web-Scale Generic Object Detection at Microsoft Bing
S. Chen
Saurajit Mukherjee
Unmesh Phadke
Tingting Wang
Junwon Park
Ravi Theja Yada
ObjD
VLM
19
0
0
05 Jul 2021
CBNet: A Composite Backbone Network Architecture for Object Detection
CBNet: A Composite Backbone Network Architecture for Object Detection
Tingting Liang
Xiao Chu
Yudong Liu
Yongtao Wang
Zhi Tang
Wei Chu
Jingdong Chen
Haibin Ling
ObjD
15
161
0
01 Jul 2021
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and
  Generation
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation
Jing Liu
Xinxin Zhu
Fei Liu
Longteng Guo
Zijia Zhao
...
Weining Wang
Hanqing Lu
Shiyu Zhou
Jiajun Zhang
Jinqiao Wang
36
37
0
01 Jul 2021
Rail-5k: a Real-World Dataset for Rail Surface Defects Detection
Rail-5k: a Real-World Dataset for Rail Surface Defects Detection
Zihao Zhang
Shaozuo Yu
Siwei Yang
Yu Zhou
Bingchen Zhao
17
11
0
28 Jun 2021
Extreme Multi-label Learning for Semantic Matching in Product Search
Extreme Multi-label Learning for Semantic Matching in Product Search
Wei-Cheng Chang
Daniel Jiang
Hsiang-Fu Yu
C. Teo
Jiong Zhang
...
Qie Hu
Nikhil Shandilya
Vyacheslav Ievgrafov
Japinder Singh
Inderjit S. Dhillon
45
59
0
23 Jun 2021
Tracking Instances as Queries
Tracking Instances as Queries
Shusheng Yang
Yuxin Fang
Xinggang Wang
Yu Li
Ying Shan
Bin Feng
Wenyu Liu
30
10
0
22 Jun 2021
GAIA: A Transfer Learning System of Object Detection that Fits Your
  Needs
GAIA: A Transfer Learning System of Object Detection that Fits Your Needs
Xingyuan Bu
Junran Peng
Junjie Yan
Tieniu Tan
Zhaoxiang Zhang
ObjD
VLM
31
53
0
21 Jun 2021
MSN: Efficient Online Mask Selection Network for Video Instance
  Segmentation
MSN: Efficient Online Mask Selection Network for Video Instance Segmentation
Vidit Goel
Jiachen Li
Shubhika Garg
Harsh Maheshwari
Humphrey Shi
19
7
0
19 Jun 2021
Efficient Self-supervised Vision Transformers for Representation
  Learning
Efficient Self-supervised Vision Transformers for Representation Learning
Chunyuan Li
Jianwei Yang
Pengchuan Zhang
Mei Gao
Bin Xiao
Xiyang Dai
Lu Yuan
Jianfeng Gao
ViT
40
209
0
17 Jun 2021
Compositional Sketch Search
Compositional Sketch Search
Alexander Black
Tu Bui
Long Mai
Hailin Jin
John Collomosse
27
1
0
15 Jun 2021
Provably Robust Detection of Out-of-distribution Data (almost) for free
Provably Robust Detection of Out-of-distribution Data (almost) for free
Alexander Meinke
Julian Bitterwolf
Matthias Hein
OODD
33
22
0
08 Jun 2021
Rethinking Pseudo Labels for Semi-Supervised Object Detection
Rethinking Pseudo Labels for Semi-Supervised Object Detection
Hengduo Li
Zuxuan Wu
Abhinav Shrivastava
Larry S. Davis
16
78
0
01 Jun 2021
Linguistic Structures as Weak Supervision for Visual Scene Graph
  Generation
Linguistic Structures as Weak Supervision for Visual Scene Graph Generation
Keren Ye
Adriana Kovashka
29
52
0
28 May 2021
The Challenge of Variable Effort Crowdsourcing and How Visible Gold Can
  Help
The Challenge of Variable Effort Crowdsourcing and How Visible Gold Can Help
Danula Hettiachchi
M. Schaekermann
Tristan McKinney
Matthew Lease
54
19
0
20 May 2021
Waste detection in Pomerania: non-profit project for detecting waste in
  environment
Waste detection in Pomerania: non-profit project for detecting waste in environment
Sylwia Majchrowska
Agnieszka Mikołajczyk
M. Ferlin
Zuzanna Klawikowska
Marta A. Plantykow
Arkadiusz Kwasigroch
K. Majek
30
125
0
12 May 2021
Event Camera Simulator Design for Modeling Attention-based Inference
  Architectures
Event Camera Simulator Design for Modeling Attention-based Inference Architectures
Md Jubaer Hossain Pantho
Joel Mandebi Mbongue
Pankaj Bhowmik
C. Bobda
42
8
0
03 May 2021
Joint Representation Learning and Novel Category Discovery on Single-
  and Multi-modal Data
Joint Representation Learning and Novel Category Discovery on Single- and Multi-modal Data
Xu Jia
Kai Han
Yukun Zhu
Bradley Green
155
57
0
26 Apr 2021
A Survey of Modern Deep Learning based Object Detection Models
A Survey of Modern Deep Learning based Object Detection Models
Syed Sahil Abbas Zaidi
M. S. Ansari
Asra Aslam
N. Kanwal
M. Asghar
Brian Lee
VLM
ObjD
69
730
0
24 Apr 2021
ImageNet-21K Pretraining for the Masses
ImageNet-21K Pretraining for the Masses
T. Ridnik
Emanuel Ben-Baruch
Asaf Noy
Lihi Zelnik-Manor
SSeg
VLM
CLIP
187
690
0
22 Apr 2021
Rethinking Text Line Recognition Models
Rethinking Text Line Recognition Models
Daniel Hernandez Diaz
Siyang Qin
R. Ingle
Yasuhisa Fujii
Alessandro Bissacco
VLM
38
51
0
15 Apr 2021
Factors of Influence for Transfer Learning across Diverse Appearance
  Domains and Task Types
Factors of Influence for Transfer Learning across Diverse Appearance Domains and Task Types
Thomas Mensink
J. Uijlings
Alina Kuznetsova
Michael Gygli
V. Ferrari
VLM
43
81
0
24 Mar 2021
Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and
  Benchmark
Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and Benchmark
Joakim Bruslund Haurum
T. Moeslund
20
60
0
19 Mar 2021
Efficient Online ML API Selection for Multi-Label Classification Tasks
Efficient Online ML API Selection for Multi-Label Classification Tasks
Lingjiao Chen
Matei A. Zaharia
James Zou
40
17
0
18 Feb 2021
Differential Privacy and Byzantine Resilience in SGD: Do They Add Up?
Differential Privacy and Byzantine Resilience in SGD: Do They Add Up?
R. Guerraoui
Nirupam Gupta
Rafael Pinot
Sébastien Rouault
John Stephan
33
30
0
16 Feb 2021
Reviving Iterative Training with Mask Guidance for Interactive
  Segmentation
Reviving Iterative Training with Mask Guidance for Interactive Segmentation
Konstantin Sofiiuk
Ilya A. Petrov
Anton Konushin
38
214
0
12 Feb 2021
Copycat CNN: Are Random Non-Labeled Data Enough to Steal Knowledge from
  Black-box Models?
Copycat CNN: Are Random Non-Labeled Data Enough to Steal Knowledge from Black-box Models?
Jacson Rodrigues Correia-Silva
Rodrigo Berriel
C. Badue
Alberto F. de Souza
Thiago Oliveira-Santos
MLAU
15
14
0
21 Jan 2021
Hardware and Software Optimizations for Accelerating Deep Neural
  Networks: Survey of Current Trends, Challenges, and the Road Ahead
Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead
Maurizio Capra
Beatrice Bussolino
Alberto Marchisio
Guido Masera
Maurizio Martina
Muhammad Shafique
BDL
59
140
0
21 Dec 2020
Grafit: Learning fine-grained image representations with coarse labels
Grafit: Learning fine-grained image representations with coarse labels
Hugo Touvron
Alexandre Sablayrolles
Matthijs Douze
Matthieu Cord
Hervé Jégou
SSL
34
68
0
25 Nov 2020
Insights From A Large-Scale Database of Material Depictions In Paintings
Insights From A Large-Scale Database of Material Depictions In Paintings
Hubert Lin
Mitchell J. P. van Zuijlen
M. Wijntjes
S. Pont
Kavita Bala
26
6
0
24 Nov 2020
HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color
  Histograms
HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms
Mahmoud Afifi
Marcus A. Brubaker
M. S. Brown
GAN
35
105
0
23 Nov 2020
One Metric to Measure them All: Localisation Recall Precision (LRP) for
  Evaluating Visual Detection Tasks
One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection Tasks
Kemal Oksuz
Baris Can Cam
Sinan Kalkan
Emre Akbas
32
32
0
21 Nov 2020
Open-Vocabulary Object Detection Using Captions
Open-Vocabulary Object Detection Using Captions
Alireza Zareian
Kevin Dela Rosa
Derek Hao Hu
Shih-Fu Chang
VLM
ObjD
44
418
0
20 Nov 2020
Image Representations Learned With Unsupervised Pre-Training Contain
  Human-like Biases
Image Representations Learned With Unsupervised Pre-Training Contain Human-like Biases
Ryan Steed
Aylin Caliskan
SSL
30
156
0
28 Oct 2020
Previous
12345
Next