Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1811.00982
Cited By
The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale
2 November 2018
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
Jordi Pont-Tuset
Shahab Kamali
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
ObjD
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale"
50 / 243 papers shown
Title
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments
Luca Barsellotti
Roberto Bigazzi
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
98
1
0
20 Feb 2025
Efficient Progressive Image Compression with Variance-aware Masking
Alberto Presta
Enzo Tartaglione
Attilio Fiandrotti
Marco Grangetto
Pamela Cosman
34
0
0
15 Nov 2024
Label Convergence: Defining an Upper Performance Bound in Object Recognition through Contradictory Annotations
David Tschirschwitz
Volker Rodehorst
31
1
0
14 Sep 2024
BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation
Peng Hao
Xiaobing Wang
Yingying Jiang
Hanchao Jia
Xiaoshuai Hao
Shaowei Cui
Junhang Wei
Xiaoshuai Hao
57
3
0
26 Jul 2024
BIV-Priv-Seg: Locating Private Content in Images Taken by People With Visual Impairments
Yu-Yun Tseng
Tanusree Sharma
Lotus Zhang
Abigale Stangl
Leah Findlater
Yang Wang
Danna Gurari
81
0
0
25 Jul 2024
Error Detection and Constraint Recovery in Hierarchical Multi-Label Classification without Prior Knowledge
Joshua Shay Kricheli
Khoa Vo
Aniruddha Datta
Spencer Ozgur
Paulo Shakarian
40
2
0
21 Jul 2024
Learning Visual Grounding from Generative Vision and Language Model
Shijie Wang
Dahun Kim
A. Taalimi
Chen Sun
Weicheng Kuo
ObjD
36
5
0
18 Jul 2024
Cross-Architecture Auxiliary Feature Space Translation for Efficient Few-Shot Personalized Object Detection
F. Barbato
Umberto Michieli
J. Moon
Pietro Zanuttigh
Mete Ozay
42
2
0
01 Jul 2024
Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model
Shoma Iwai
Tomo Miyazaki
S. Omachi
55
11
0
27 May 2024
Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models
Katherine Xu
Lingzhi Zhang
Jianbo Shi
58
12
0
23 May 2024
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
Qinghe Wang
Baolu Li
Xiaomin Li
Bing Cao
Liqian Ma
Huchuan Lu
Xu Jia
DiffM
44
6
0
24 Apr 2024
Video Relationship Detection Using Mixture of Experts
A. Shaabana
Zahra Gharaee
Paul Fieguth
39
1
0
06 Mar 2024
Precise Extraction of Deep Learning Models via Side-Channel Attacks on Edge/Endpoint Devices
Younghan Lee
Sohee Jun
Yungi Cho
Woorim Han
Hyungon Moon
Y. Paek
AAML
31
2
0
05 Mar 2024
Enhancing Vision-Language Pre-training with Rich Supervisions
Yuan Gao
Kunyu Shi
Pengkai Zhu
Edouard Belval
Oren Nuriel
Srikar Appalaraju
Shabnam Ghadar
Vijay Mahadevan
Zhuowen Tu
Stefano Soatto
VLM
CLIP
67
12
0
05 Mar 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
Chris Liu
Renrui Zhang
Longtian Qiu
Siyuan Huang
Weifeng Lin
...
Hao Shao
Pan Lu
Hongsheng Li
Yu Qiao
Peng Gao
MLLM
130
109
0
08 Feb 2024
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels
Jichang Li
Guanbin Li
Hui Cheng
Zicheng Liao
Yizhou Yu
FedML
35
15
0
19 Dec 2023
Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge
Bowen Jiang
Zhijun Zhuang
Shreyas S. Shivakumar
Camillo J Taylor
33
7
0
21 Nov 2023
SniffyArt: The Dataset of Smelling Persons
Mathias Zinnen
Azhar Hussian
Hang Tran
Prathmesh Madhu
Andreas Maier
Vincent Christlein
29
9
0
20 Nov 2023
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Bin Xiao
Haiping Wu
Weijian Xu
Xiyang Dai
Houdong Hu
Yumao Lu
Michael Zeng
Ce Liu
Lu Yuan
VLM
50
143
0
10 Nov 2023
Zone Evaluation: Revealing Spatial Bias in Object Detection
Zhaohui Zheng
Yuming Chen
Qibin Hou
Xiang Li
Ping Wang
Ming-Ming Cheng
ObjD
27
3
0
20 Oct 2023
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
Chengyang Zhao
Songlin Yang
Zhenfang Chen
Mingyu Ding
Chuang Gan
54
15
0
10 Oct 2023
DreamCom: Finetuning Text-guided Inpainting Model for Image Composition
Lingxiao Lu
Jiangtong Li
Bo Zhang
Li Niu
DiffM
28
11
0
27 Sep 2023
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie
Wei Li
Xiangtai Li
Ziwei Liu
Yew-Soon Ong
Chen Change Loy
DiffM
VLM
72
35
0
22 Sep 2023
ReShader: View-Dependent Highlights for Single Image View-Synthesis
Avinash Paliwal
Brandon Nguyen
Andrii Tsarov
N. Kalantari
35
3
0
19 Sep 2023
Foreground Object Search by Distilling Composite Image Feature
Bo Zhang
Jiacheng Sui
Li Niu
30
5
0
09 Aug 2023
Distributionally Robust Classification on a Data Budget
Ben Feuer
Ameya Joshi
Minh Pham
C. Hegde
OOD
37
2
0
07 Aug 2023
Improving Scene Graph Generation with Superpixel-Based Interaction Learning
Jingyi Wang
Can Zhang
Jinfa Huang
Bo Ren
Zhidong Deng
25
7
0
04 Aug 2023
Digitally-Enhanced Dog Behavioral Testing: Getting Help from the Machine
Nareed Farhat
Teddy Lazebnik
J. Monteny
C. Moons
E. Wydooghe
Dirk van der Linden
Anna Zamansky
24
4
0
26 Jul 2023
In Defense of Clip-based Video Relation Detection
Meng Wei
Long Chen
Wei Ji
Xiaoyu Yue
Roger Zimmermann
44
5
0
18 Jul 2023
End-to-End Supervised Multilabel Contrastive Learning
A. Sajedi
Samir Khaki
Konstantinos N. Plataniotis
Mahdi S. Hosseini
SSL
31
8
0
08 Jul 2023
Joint Adaptive Representations for Image-Language Learning
A. Piergiovanni
A. Angelova
VLM
34
0
0
31 May 2023
Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjD
VLM
MLLM
41
78
0
29 May 2023
ElasticHash: Semantic Image Similarity Search by Deep Hashing with Elasticsearch
Nikolaus Korfhage
M. Mühling
Bernd Freisleben
24
3
0
08 May 2023
Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label Learning
Ming-Kun Xie
Jianxiong Xiao
Hao-Zhe Liu
Gang Niu
Masashi Sugiyama
Sheng-Jun Huang
40
16
0
04 May 2023
Controllable Image Generation via Collage Representations
Arantxa Casanova
Marlene Careil
Adriana Romero Soriano
Christopher Pal
Jakob Verbeek
M. Drozdzal
DiffM
39
7
0
26 Apr 2023
LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization
Sheng Liu
C. P. Huynh
Congmin Chen
Maxim Arap
Raffay Hamid
33
19
0
25 Apr 2023
Building Multimodal AI Chatbots
Mingyu Lee
29
3
0
21 Apr 2023
ShapeClipper: Scalable 3D Shape Learning from Single-View Images via Geometric and CLIP-based Consistency
Zixuan Huang
Varun Jampani
Anh Thai
Yuanzhen Li
Stefan Stojanov
James M. Rehg
3DV
27
18
0
13 Apr 2023
Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
Yongxin Zhu
Ziqiang Liu
Yukang Liang
Xin Li
Hao Liu
Changcun Bao
Linli Xu
24
6
0
04 Apr 2023
Egocentric Auditory Attention Localization in Conversations
Fiona Ryan
Hao Jiang
Abhinav Shukla
James M. Rehg
V. Ithapu
EgoV
29
16
0
28 Mar 2023
Prismer: A Vision-Language Model with Multi-Task Experts
Shikun Liu
Linxi Fan
Edward Johns
Zhiding Yu
Chaowei Xiao
Anima Anandkumar
VLM
MLLM
49
21
0
04 Mar 2023
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
Yanxin Long
Youpeng Wen
Jianhua Han
Hang Xu
Pengzhen Ren
Wei Zhang
Sheng Zhao
Xiaodan Liang
ObjD
VLM
20
31
0
04 Mar 2023
Modulating Pretrained Diffusion Models for Multimodal Image Synthesis
Cusuh Ham
James Hays
Jingwan Lu
Krishna Kumar Singh
Zhifei Zhang
Tobias Hinz
DiffM
21
24
0
24 Feb 2023
Multistage Spatial Context Models for Learned Image Compression
Fangzheng Lin
Heming Sun
Jinming Liu
J. Katto
30
14
0
18 Feb 2023
Contour-based Interactive Segmentation
Danil Galeev
Polina Popenova
Anna Vorontsova
Anton Konushin
38
5
0
13 Feb 2023
KENGIC: KEyword-driven and N-Gram Graph based Image Captioning
Brandon Birmingham
A. Muscat
27
1
0
07 Feb 2023
Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval
Yizhen Chen
Jie Wang
Lijian Lin
Zhongang Qi
Jin Ma
Ying Shan
VLM
33
18
0
30 Jan 2023
Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Xudong Wang
Rohit Girdhar
Stella X. Yu
Ishan Misra
VLM
64
162
0
26 Jan 2023
Implicit Shape Model Trees: Recognition of 3-D Indoor Scenes and Prediction of Object Poses for Mobile Robots
Pascal Meissner
Rüdiger Dillmann
3DPC
21
0
0
25 Jan 2023
ODOR: The ICPR2022 ODeuropa Challenge on Olfactory Object Recognition
Mathias Zinnen
Prathmesh Madhu
Ronak Kosti
Peter Bell
Andreas Maier
Vincent Christlein
ObjD
44
11
0
24 Jan 2023
1
2
3
4
5
Next