Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection

Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection

Papers citing "Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection"

Title
No papers