ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.00982
  4. Cited By
The Open Images Dataset V4: Unified image classification, object
  detection, and visual relationship detection at scale

The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
    ObjD
    VLM
ArXivPDFHTML

Papers citing "The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"

50 / 243 papers shown
Title
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments
Luca Barsellotti
Roberto Bigazzi
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
98
1
0
20 Feb 2025
Efficient Progressive Image Compression with Variance-aware Masking
Efficient Progressive Image Compression with Variance-aware Masking
Alberto Presta
Enzo Tartaglione
Attilio Fiandrotti
Marco Grangetto
Pamela Cosman
34
0
0
15 Nov 2024
Label Convergence: Defining an Upper Performance Bound in Object Recognition through Contradictory Annotations
Label Convergence: Defining an Upper Performance Bound in Object Recognition through Contradictory Annotations
David Tschirschwitz
Volker Rodehorst
31
1
0
14 Sep 2024
BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation
BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation
Peng Hao
Xiaobing Wang
Yingying Jiang
Hanchao Jia
Xiaoshuai Hao
Shaowei Cui
Junhang Wei
Xiaoshuai Hao
57
3
0
26 Jul 2024
BIV-Priv-Seg: Locating Private Content in Images Taken by People With Visual Impairments
BIV-Priv-Seg: Locating Private Content in Images Taken by People With Visual Impairments
Yu-Yun Tseng
Tanusree Sharma
Lotus Zhang
Abigale Stangl
Leah Findlater
Yang Wang
Danna Gurari
81
0
0
25 Jul 2024
Error Detection and Constraint Recovery in Hierarchical Multi-Label Classification without Prior Knowledge
Error Detection and Constraint Recovery in Hierarchical Multi-Label Classification without Prior Knowledge
Joshua Shay Kricheli
Khoa Vo
Aniruddha Datta
Spencer Ozgur
Paulo Shakarian
40
2
0
21 Jul 2024
Learning Visual Grounding from Generative Vision and Language Model
Learning Visual Grounding from Generative Vision and Language Model
Shijie Wang
Dahun Kim
A. Taalimi
Chen Sun
Weicheng Kuo
ObjD
36
5
0
18 Jul 2024
Cross-Architecture Auxiliary Feature Space Translation for Efficient
  Few-Shot Personalized Object Detection
Cross-Architecture Auxiliary Feature Space Translation for Efficient Few-Shot Personalized Object Detection
F. Barbato
Umberto Michieli
J. Moon
Pietro Zanuttigh
Mete Ozay
42
2
0
01 Jul 2024
Controlling Rate, Distortion, and Realism: Towards a Single
  Comprehensive Neural Image Compression Model
Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model
Shoma Iwai
Tomo Miyazaki
S. Omachi
55
11
0
27 May 2024
Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models
Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models
Katherine Xu
Lingzhi Zhang
Jianbo Shi
58
12
0
23 May 2024
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion
  Models
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
Qinghe Wang
Baolu Li
Xiaomin Li
Bing Cao
Liqian Ma
Huchuan Lu
Xu Jia
DiffM
44
6
0
24 Apr 2024
Video Relationship Detection Using Mixture of Experts
Video Relationship Detection Using Mixture of Experts
A. Shaabana
Zahra Gharaee
Paul Fieguth
39
1
0
06 Mar 2024
Precise Extraction of Deep Learning Models via Side-Channel Attacks on
  Edge/Endpoint Devices
Precise Extraction of Deep Learning Models via Side-Channel Attacks on Edge/Endpoint Devices
Younghan Lee
Sohee Jun
Yungi Cho
Woorim Han
Hyungon Moon
Y. Paek
AAML
31
2
0
05 Mar 2024
Enhancing Vision-Language Pre-training with Rich Supervisions
Enhancing Vision-Language Pre-training with Rich Supervisions
Yuan Gao
Kunyu Shi
Pengkai Zhu
Edouard Belval
Oren Nuriel
Srikar Appalaraju
Shabnam Ghadar
Vijay Mahadevan
Zhuowen Tu
Stefano Soatto
VLM
CLIP
67
12
0
05 Mar 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
Chris Liu
Renrui Zhang
Longtian Qiu
Siyuan Huang
Weifeng Lin
...
Hao Shao
Pan Lu
Hongsheng Li
Yu Qiao
Peng Gao
MLLM
130
109
0
08 Feb 2024
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy
  Labels
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels
Jichang Li
Guanbin Li
Hui Cheng
Zicheng Liao
Yizhou Yu
FedML
35
15
0
19 Dec 2023
Enhancing Scene Graph Generation with Hierarchical Relationships and
  Commonsense Knowledge
Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge
Bowen Jiang
Zhijun Zhuang
Shreyas S. Shivakumar
Camillo J Taylor
33
7
0
21 Nov 2023
SniffyArt: The Dataset of Smelling Persons
SniffyArt: The Dataset of Smelling Persons
Mathias Zinnen
Azhar Hussian
Hang Tran
Prathmesh Madhu
Andreas Maier
Vincent Christlein
29
9
0
20 Nov 2023
Florence-2: Advancing a Unified Representation for a Variety of Vision
  Tasks
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Bin Xiao
Haiping Wu
Weijian Xu
Xiyang Dai
Houdong Hu
Yumao Lu
Michael Zeng
Ce Liu
Lu Yuan
VLM
50
143
0
10 Nov 2023
Zone Evaluation: Revealing Spatial Bias in Object Detection
Zone Evaluation: Revealing Spatial Bias in Object Detection
Zhaohui Zheng
Yuming Chen
Qibin Hou
Xiang Li
Ping Wang
Ming-Ming Cheng
ObjD
27
3
0
20 Oct 2023
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
Chengyang Zhao
Songlin Yang
Zhenfang Chen
Mingyu Ding
Chuang Gan
54
15
0
10 Oct 2023
DreamCom: Finetuning Text-guided Inpainting Model for Image Composition
DreamCom: Finetuning Text-guided Inpainting Model for Image Composition
Lingxiao Lu
Jiangtong Li
Bo Zhang
Li Niu
DiffM
28
11
0
27 Sep 2023
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary
  Instance Segmentation
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie
Wei Li
Xiangtai Li
Ziwei Liu
Yew-Soon Ong
Chen Change Loy
DiffM
VLM
72
35
0
22 Sep 2023
ReShader: View-Dependent Highlights for Single Image View-Synthesis
ReShader: View-Dependent Highlights for Single Image View-Synthesis
Avinash Paliwal
Brandon Nguyen
Andrii Tsarov
N. Kalantari
35
3
0
19 Sep 2023
Foreground Object Search by Distilling Composite Image Feature
Foreground Object Search by Distilling Composite Image Feature
Bo Zhang
Jiacheng Sui
Li Niu
30
5
0
09 Aug 2023
Distributionally Robust Classification on a Data Budget
Distributionally Robust Classification on a Data Budget
Ben Feuer
Ameya Joshi
Minh Pham
C. Hegde
OOD
37
2
0
07 Aug 2023
Improving Scene Graph Generation with Superpixel-Based Interaction
  Learning
Improving Scene Graph Generation with Superpixel-Based Interaction Learning
Jingyi Wang
Can Zhang
Jinfa Huang
Bo Ren
Zhidong Deng
25
7
0
04 Aug 2023
Digitally-Enhanced Dog Behavioral Testing: Getting Help from the Machine
Digitally-Enhanced Dog Behavioral Testing: Getting Help from the Machine
Nareed Farhat
Teddy Lazebnik
J. Monteny
C. Moons
E. Wydooghe
Dirk van der Linden
Anna Zamansky
24
4
0
26 Jul 2023
In Defense of Clip-based Video Relation Detection
In Defense of Clip-based Video Relation Detection
Meng Wei
Long Chen
Wei Ji
Xiaoyu Yue
Roger Zimmermann
44
5
0
18 Jul 2023
End-to-End Supervised Multilabel Contrastive Learning
End-to-End Supervised Multilabel Contrastive Learning
A. Sajedi
Samir Khaki
Konstantinos N. Plataniotis
Mahdi S. Hosseini
SSL
31
8
0
08 Jul 2023
Joint Adaptive Representations for Image-Language Learning
Joint Adaptive Representations for Image-Language Learning
A. Piergiovanni
A. Angelova
VLM
34
0
0
31 May 2023
Contextual Object Detection with Multimodal Large Language Models
Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjD
VLM
MLLM
41
78
0
29 May 2023
ElasticHash: Semantic Image Similarity Search by Deep Hashing with
  Elasticsearch
ElasticHash: Semantic Image Similarity Search by Deep Hashing with Elasticsearch
Nikolaus Korfhage
M. Mühling
Bernd Freisleben
24
3
0
08 May 2023
Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label
  Learning
Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label Learning
Ming-Kun Xie
Jianxiong Xiao
Hao-Zhe Liu
Gang Niu
Masashi Sugiyama
Sheng-Jun Huang
40
16
0
04 May 2023
Controllable Image Generation via Collage Representations
Controllable Image Generation via Collage Representations
Arantxa Casanova
Marlene Careil
Adriana Romero Soriano
Christopher Pal
Jakob Verbeek
M. Drozdzal
DiffM
39
7
0
26 Apr 2023
LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization
LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization
Sheng Liu
C. P. Huynh
Congmin Chen
Maxim Arap
Raffay Hamid
33
19
0
25 Apr 2023
Building Multimodal AI Chatbots
Building Multimodal AI Chatbots
Mingyu Lee
29
3
0
21 Apr 2023
ShapeClipper: Scalable 3D Shape Learning from Single-View Images via
  Geometric and CLIP-based Consistency
ShapeClipper: Scalable 3D Shape Learning from Single-View Images via Geometric and CLIP-based Consistency
Zixuan Huang
Varun Jampani
Anh Thai
Yuanzhen Li
Stefan Stojanov
James M. Rehg
3DV
27
18
0
13 Apr 2023
Locate Then Generate: Bridging Vision and Language with Bounding Box for
  Scene-Text VQA
Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
Yongxin Zhu
Ziqiang Liu
Yukang Liang
Xin Li
Hao Liu
Changcun Bao
Linli Xu
24
6
0
04 Apr 2023
Egocentric Auditory Attention Localization in Conversations
Egocentric Auditory Attention Localization in Conversations
Fiona Ryan
Hao Jiang
Abhinav Shukla
James M. Rehg
V. Ithapu
EgoV
29
16
0
28 Mar 2023
Prismer: A Vision-Language Model with Multi-Task Experts
Prismer: A Vision-Language Model with Multi-Task Experts
Shikun Liu
Linxi Fan
Edward Johns
Zhiding Yu
Chaowei Xiao
Anima Anandkumar
VLM
MLLM
49
21
0
04 Mar 2023
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
Yanxin Long
Youpeng Wen
Jianhua Han
Hang Xu
Pengzhen Ren
Wei Zhang
Sheng Zhao
Xiaodan Liang
ObjD
VLM
20
31
0
04 Mar 2023
Modulating Pretrained Diffusion Models for Multimodal Image Synthesis
Modulating Pretrained Diffusion Models for Multimodal Image Synthesis
Cusuh Ham
James Hays
Jingwan Lu
Krishna Kumar Singh
Zhifei Zhang
Tobias Hinz
DiffM
21
24
0
24 Feb 2023
Multistage Spatial Context Models for Learned Image Compression
Multistage Spatial Context Models for Learned Image Compression
Fangzheng Lin
Heming Sun
Jinming Liu
J. Katto
30
14
0
18 Feb 2023
Contour-based Interactive Segmentation
Contour-based Interactive Segmentation
Danil Galeev
Polina Popenova
Anna Vorontsova
Anton Konushin
38
5
0
13 Feb 2023
KENGIC: KEyword-driven and N-Gram Graph based Image Captioning
KENGIC: KEyword-driven and N-Gram Graph based Image Captioning
Brandon Birmingham
A. Muscat
27
1
0
07 Feb 2023
Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text
  Retrieval
Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval
Yizhen Chen
Jie Wang
Lijian Lin
Zhongang Qi
Jin Ma
Ying Shan
VLM
33
18
0
30 Jan 2023
Cut and Learn for Unsupervised Object Detection and Instance
  Segmentation
Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Xudong Wang
Rohit Girdhar
Stella X. Yu
Ishan Misra
VLM
64
162
0
26 Jan 2023
Implicit Shape Model Trees: Recognition of 3-D Indoor Scenes and
  Prediction of Object Poses for Mobile Robots
Implicit Shape Model Trees: Recognition of 3-D Indoor Scenes and Prediction of Object Poses for Mobile Robots
Pascal Meissner
Rüdiger Dillmann
3DPC
21
0
0
25 Jan 2023
ODOR: The ICPR2022 ODeuropa Challenge on Olfactory Object Recognition
ODOR: The ICPR2022 ODeuropa Challenge on Olfactory Object Recognition
Mathias Zinnen
Prathmesh Madhu
Ronak Kosti
Peter Bell
Andreas Maier
Vincent Christlein
ObjD
44
11
0
24 Jan 2023
12345
Next