Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.05056
Cited By
v1
v2
v3 (latest)
Open-Vocabulary Animal Keypoint Detection with Semantic-feature Matching
8 October 2023
Hao Zhang
Lumin Xu
Shenqi Lai
Wenqi Shao
Nanning Zheng
Ping Luo
Yu Qiao
Kaipeng Zhang
ObjD
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Open-Vocabulary Animal Keypoint Detection with Semantic-feature Matching"
18 / 18 papers shown
Title
CatFLW: Cat Facial Landmarks in the Wild Dataset
G.A. Martvel
Nareed Farhat
I. Shimshoni
Anna Zamansky
52
9
0
07 May 2023
ViTPose++: Vision Transformer for Generic Body Pose Estimation
Yufei Xu
Jing Zhang
Qiming Zhang
Dacheng Tao
ViT
142
45
0
07 Dec 2022
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Lewei Yao
Jianhua Han
Youpeng Wen
Xiaodan Liang
Dan Xu
Wei Zhang
Zhenguo Li
Chunjing Xu
Hang Xu
CLIP
VLM
173
160
0
20 Sep 2022
Expanding Language-Image Pretrained Models for General Video Recognition
Bolin Ni
Houwen Peng
Minghao Chen
Songyang Zhang
Gaofeng Meng
Jianlong Fu
Shiming Xiang
Haibin Ling
VLM
CLIP
ViT
106
326
0
04 Aug 2022
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
Rui Qian
Yeqing Li
Zheng Xu
Ming-Hsuan Yang
Serge Belongie
Huayu Chen
VLM
50
22
0
15 Jul 2022
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection
H. Rasheed
Muhammad Maaz
Muhammad Uzair Khattak
Salman Khan
Fahad Shahbaz Khan
ObjD
VLM
107
154
0
07 Jul 2022
Language-driven Semantic Segmentation
Boyi Li
Kilian Q. Weinberger
Serge Belongie
V. Koltun
René Ranftl
VLM
124
625
0
10 Jan 2022
Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species
Changsheng Lu
Piotr Koniusz
83
39
0
12 Dec 2021
AP-10K: A Benchmark for Animal Pose Estimation in the Wild
Hang Yu
Yufei Xu
Jing Zhang
Wei Zhao
Ziyu Guan
Dacheng Tao
76
113
0
28 Aug 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
459
3,893
0
11 Feb 2021
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
434
13,108
0
26 May 2020
Deep High-Resolution Representation Learning for Visual Recognition
Jingdong Wang
Ke Sun
Tianheng Cheng
Borui Jiang
Chaorui Deng
...
Yadong Mu
Mingkui Tan
Xinggang Wang
Wenyu Liu
Bin Xiao
393
3,627
0
20 Aug 2019
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Mingxing Tan
Quoc V. Le
3DV
MedIm
153
18,179
0
28 May 2019
Simple Baselines for Human Pose Estimation and Tracking
Bin Xiao
Haiping Wu
Yichen Wei
3DH
VOT
123
1,792
0
17 Apr 2018
MobileNetV2: Inverted Residuals and Linear Bottlenecks
Mark Sandler
Andrew G. Howard
Menglong Zhu
A. Zhmoginov
Liang-Chieh Chen
204
19,333
0
13 Jan 2018
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
829
11,943
0
09 Mar 2017
RMPE: Regional Multi-person Pose Estimation
Haoshu Fang
Shuqin Xie
Yu-Wing Tai
Cewu Lu
3DH
135
1,587
0
01 Dec 2016
Stacked Hourglass Networks for Human Pose Estimation
Alejandro Newell
Kaiyu Yang
Jia Deng
3DH
119
5,037
0
22 Mar 2016
1