Extending CLIP for Category-to-image Retrieval in E-commerce

21 December 2021

Mariya Hendriksen

Maurits J. R. Bleeker

Maarten de Rijke

Papers citing "Extending CLIP for Category-to-image Retrieval in E-commerce"

Title
Multi-Modality Transformer for E-Commerce: Inferring User Purchase Intention to Bridge the Query-Product Gap Srivatsa Mallapragada Ying Xie Varsha Rani Chawan Zeyad Hailat Yuanbo Wang 66 0 0 28 Jan 2025
How Much Can CLIP Benefit Vision-and-Language Tasks? Sheng Shen Liunian Harold Li Hao Tan Joey Tianyi Zhou Anna Rohrbach Kai-Wei Chang Z. Yao Kurt Keutzer CLIP VLM MLLM 234 407 0 13 Jul 2021
Category Aware Explainable Conversational Recommendation Nikolaos Kondylidis Jie Zou Evangelos Kanoulas 11 4 0 15 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision Alec Radford Jong Wook Kim Chris Hallacy Aditya A. Ramesh Gabriel Goh ... Amanda Askell Pamela Mishkin Jack Clark Gretchen Krueger Ilya Sutskever CLIP VLM 473 28,659 0 26 Feb 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale Alexey Dosovitskiy Lucas Beyer Alexander Kolesnikov Dirk Weissenborn Xiaohua Zhai ... Matthias Minderer G. Heigold Sylvain Gelly Jakob Uszkoreit N. Houlsby ViT 138 40,217 0 22 Oct 2020
Contrastive Learning of Medical Visual Representations from Paired Images and Text Yuhao Zhang Hang Jiang Yasuhide Miura Christopher D. Manning C. Langlotz MedIm 82 744 0 02 Oct 2020
Contrastive Learning for Weakly Supervised Phrase Grounding Tanmay Gupta Arash Vahdat Gal Chechik Xiaodong Yang Jan Kautz Derek Hoiem ObjD SSL 101 141 0 17 Jun 2020
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing Zihang Dai Guokun Lai Yiming Yang Quoc V. Le 59 231 0 05 Jun 2020
How to Grow a (Product) Tree: Personalized Category Suggestions for eCommerce Type-Ahead Jacopo Tagliabue Bingqing Yu Marie Beaulieu 20 15 0 26 May 2020
MPNet: Masked and Permuted Pre-training for Language Understanding Kaitao Song Xu Tan Tao Qin Jianfeng Lu Tie-Yan Liu 70 1,093 0 20 Apr 2020
A Simple Framework for Contrastive Learning of Visual Representations Ting Chen Simon Kornblith Mohammad Norouzi Geoffrey E. Hinton SSL 124 18,523 0 13 Feb 2020
Momentum Contrast for Unsupervised Visual Representation Learning Kaiming He Haoqi Fan Yuxin Wu Saining Xie Ross B. Girshick SSL 71 11,959 0 13 Nov 2019
Improving Outfit Recommendation with Co-supervision of Fashion Generation Yujie Lin Pengjie Ren Zhumin Chen Zhaochun Ren Jun Ma Maarten de Rijke 27 49 0 24 Aug 2019
The Resale Price Prediction of Secondhand Jewelry Items Using a Multi-modal Deep Model with Iterative Co-Attention Yusuke Yamaura Nobuya Kanemaki Y. Tsuboshita 27 3 0 01 Jul 2019
Multi-Label Product Categorization Using Multi-Modal Fusion Models Pasawee Wirojwatanakul A. Wangperawong 15 14 0 30 Jun 2019
Composing Text and Image for Image Retrieval - An Empirical Odyssey Nam S. Vo Lu Jiang Chen Sun Kevin Patrick Murphy Li Li Li Fei-Fei James Hays CoGe 31 362 0 18 Dec 2018
One-Shot Item Search with Multimodal Data Jonghwa Yim Junghun Kim D. Shin 31 3 0 27 Nov 2018
Representation Learning with Contrastive Predictive Coding Aaron van den Oord Yazhe Li Oriol Vinyals DRL SSL 175 10,152 0 10 Jul 2018
DeepStyle: Multimodal Search Engine for Fashion and Interior Design Ivona Tautkute Tomasz Trzciński Aleksander P. Skorupa Łukasz Brocki K. Marasek 37 54 0 08 Jan 2018
Layer Normalization Jimmy Lei Ba J. Kiros Geoffrey E. Hinton 173 10,412 0 21 Jul 2016
Gaussian Error Linear Units (GELUs) Dan Hendrycks Kevin Gimpel 125 4,934 0 27 Jun 2016
Natural Language Object Retrieval Ronghang Hu Huazhe Xu Marcus Rohrbach Jiashi Feng Kate Saenko Trevor Darrell ObjD 57 552 0 13 Nov 2015