v1v2v3 (latest)

Enhancing Few-Shot Vision-Language Classification with Large Multimodal Model Features

28 November 2024

ArXiv (abs)PDF HTML HuggingFace (4 upvotes)

Papers citing "Enhancing Few-Shot Vision-Language Classification with Large Multimodal Model Features"

24 / 74 papers shown

Title
Finetuned Language Models Are Zero-Shot Learners Jason W. Wei Maarten Bosma Vincent Zhao Kelvin Guu Adams Wei Yu Brian Lester Nan Du Andrew M. Dai Quoc V. Le ALM UQCV 974 4,485 0 03 Sep 2021
Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models Jianmo Ni Gustavo Hernández Ábrego Noah Constant Ji Ma Keith B. Hall Daniel Cer Yinfei Yang 479 682 0 19 Aug 2021
SimCSE: Simple Contrastive Learning of Sentence EmbeddingsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021 Tianyu Gao Xingcheng Yao Danqi Chen AILaw SSL 649 3,913 0 18 Apr 2021
The Power of Scale for Parameter-Efficient Prompt TuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021 Brian Lester Rami Al-Rfou Noah Constant VPVLM 1.2K 4,813 0 18 Apr 2021
Learning Transferable Visual Models From Natural Language SupervisionInternational Conference on Machine Learning (ICML), 2021 Alec Radford Jong Wook Kim Chris Hallacy Aditya A. Ramesh Gabriel Goh ... Amanda Askell Pamela Mishkin Jack Clark Gretchen Krueger Ilya Sutskever CLIP VLM 1.9K 39,376 0 26 Feb 2021
Training Vision Transformers for Image Retrieval Alaaeldin El-Nouby Natalia Neverova Ivan Laptev Edouard Grave ViT 209 174 0 10 Feb 2021
Prefix-Tuning: Optimizing Continuous Prompts for GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021 Xiang Lisa Li Abigail Z. Jacobs 565 5,069 0 01 Jan 2021
Contrastive Learning of Medical Visual Representations from Paired Images and TextMachine Learning in Health Care (MLHC), 2020 Yuhao Zhang Hang Jiang Yasuhide Miura Christopher D. Manning C. Langlotz MedIm 550 916 0 02 Oct 2020
Learning Object Detection from Captions via Textual Scene Attributes Achiya Jerbi Roei Herzig Jonathan Berant Gal Chechik Amir Globerson 193 21 0 30 Sep 2020
VirTex: Learning Visual Representations from Textual AnnotationsComputer Vision and Pattern Recognition (CVPR), 2020 Karan Desai Justin Johnson SSL VLM 380 460 0 11 Jun 2020
Language Models are Few-Shot LearnersNeural Information Processing Systems (NeurIPS), 2020 Tom B. Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared Kaplan ... Christopher Berner Sam McCandlish Alec Radford Ilya Sutskever Dario Amodei BDL 1.8K 50,714 0 28 May 2020
Decision-Making with Auto-Encoding Variational BayesNeural Information Processing Systems (NeurIPS), 2020 Romain Lopez Pierre Boyeau Nir Yosef Michael I. Jordan Jeffrey Regier BDL 1.3K 19,430 0 17 Feb 2020
PyTorch: An Imperative Style, High-Performance Deep Learning LibraryNeural Information Processing Systems (NeurIPS), 2019 Adam Paszke Sam Gross Francisco Massa Adam Lerer James Bradbury ... Sasank Chilamkurthy Benoit Steiner Lu Fang Junjie Bai Soumith Chintala ODL 916 47,724 0 03 Dec 2019
Sentence-BERT: Sentence Embeddings using Siamese BERT-NetworksConference on Empirical Methods in Natural Language Processing (EMNLP), 2019 Nils Reimers Iryna Gurevych 1.7K 14,951 0 27 Aug 2019
Towards VQA Models That Can Read Amanpreet Singh Vivek Natarajan Meet Shah Yu Jiang Xinlei Chen Dhruv Batra Devi Parikh Marcus Rohrbach EgoV 505 1,632 0 18 Apr 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Jacob Devlin Ming-Wei Chang Kenton Lee Kristina Toutanova VLM SSL SSeg 2.8K 106,051 0 11 Oct 2018
Neural Discrete Representation Learning Aaron van den Oord Oriol Vinyals Koray Kavukcuoglu BDL SSL OCL 539 6,157 0 02 Nov 2017
EuroSAT: A Novel Dataset and Deep Learning Benchmark for Land Use and Land Cover Classification P. Helber B. Bischke Andreas Dengel Damian Borth 442 2,252 0 31 Aug 2017
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference Adina Williams Nikita Nangia Samuel R. Bowman 1.1K 4,774 0 18 Apr 2017
Generation and Comprehension of Unambiguous Object Descriptions Junhua Mao Jonathan Huang Alexander Toshev Oana-Maria Camburu Alan Yuille Kevin Patrick Murphy ObjD 524 1,534 0 07 Nov 2015
VQA: Visual Question Answering Aishwarya Agrawal Jiasen Lu Stanislaw Antol Margaret Mitchell C. L. Zitnick Dhruv Batra Devi Parikh CoGe 890 6,014 0 03 May 2015
FaceNet: A Unified Embedding for Face Recognition and Clustering Florian Schroff Dmitry Kalenichenko James Philbin 3DH 836 14,066 0 12 Mar 2015
Microsoft COCO: Common Objects in ContextEuropean Conference on Computer Vision (ECCV), 2014 Nayeon Lee Michael Maire Serge J. Belongie Lubomir Bourdev Ross B. Girshick James Hays Pietro Perona Deva Ramanan C. L. Zitnick Piotr Dollár ObjD 8.0K 48,609 0 01 May 2014
Distributed Representations of Words and Phrases and their CompositionalityNeural Information Processing Systems (NeurIPS), 2013 Tomas Mikolov Ilya Sutskever Kai Chen G. Corrado J. Dean NAI OCL 732 34,569 0 16 Oct 2013