Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.11294
Cited By
Extending CLIP for Category-to-image Retrieval in E-commerce
21 December 2021
Mariya Hendriksen
Maurits J. R. Bleeker
Svitlana Vakulenko
Nanne van Noord
E. Kuiper
Maarten de Rijke
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Extending CLIP for Category-to-image Retrieval in E-commerce"
22 / 22 papers shown
Title
Multi-Modality Transformer for E-Commerce: Inferring User Purchase Intention to Bridge the Query-Product Gap
Srivatsa Mallapragada
Ying Xie
Varsha Rani Chawan
Zeyad Hailat
Yuanbo Wang
66
0
0
28 Jan 2025
How Much Can CLIP Benefit Vision-and-Language Tasks?
Sheng Shen
Liunian Harold Li
Hao Tan
Joey Tianyi Zhou
Anna Rohrbach
Kai-Wei Chang
Z. Yao
Kurt Keutzer
CLIP
VLM
MLLM
234
407
0
13 Jul 2021
Category Aware Explainable Conversational Recommendation
Nikolaos Kondylidis
Jie Zou
Evangelos Kanoulas
11
4
0
15 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
473
28,659
0
26 Feb 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
138
40,217
0
22 Oct 2020
Contrastive Learning of Medical Visual Representations from Paired Images and Text
Yuhao Zhang
Hang Jiang
Yasuhide Miura
Christopher D. Manning
C. Langlotz
MedIm
82
744
0
02 Oct 2020
Contrastive Learning for Weakly Supervised Phrase Grounding
Tanmay Gupta
Arash Vahdat
Gal Chechik
Xiaodong Yang
Jan Kautz
Derek Hoiem
ObjD
SSL
101
141
0
17 Jun 2020
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing
Zihang Dai
Guokun Lai
Yiming Yang
Quoc V. Le
59
231
0
05 Jun 2020
How to Grow a (Product) Tree: Personalized Category Suggestions for eCommerce Type-Ahead
Jacopo Tagliabue
Bingqing Yu
Marie Beaulieu
20
15
0
26 May 2020
MPNet: Masked and Permuted Pre-training for Language Understanding
Kaitao Song
Xu Tan
Tao Qin
Jianfeng Lu
Tie-Yan Liu
70
1,093
0
20 Apr 2020
A Simple Framework for Contrastive Learning of Visual Representations
Ting-Li Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
124
18,523
0
13 Feb 2020
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
71
11,959
0
13 Nov 2019
Improving Outfit Recommendation with Co-supervision of Fashion Generation
Yujie Lin
Pengjie Ren
Zhumin Chen
Zhaochun Ren
Jun Ma
Maarten de Rijke
27
49
0
24 Aug 2019
The Resale Price Prediction of Secondhand Jewelry Items Using a Multi-modal Deep Model with Iterative Co-Attention
Yusuke Yamaura
Nobuya Kanemaki
Y. Tsuboshita
27
3
0
01 Jul 2019
Multi-Label Product Categorization Using Multi-Modal Fusion Models
Pasawee Wirojwatanakul
A. Wangperawong
15
14
0
30 Jun 2019
Composing Text and Image for Image Retrieval - An Empirical Odyssey
Nam S. Vo
Lu Jiang
Chen Sun
Kevin Patrick Murphy
Li Li
Li Fei-Fei
James Hays
CoGe
31
362
0
18 Dec 2018
One-Shot Item Search with Multimodal Data
Jonghwa Yim
Junghun Kim
D. Shin
31
3
0
27 Nov 2018
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL
SSL
175
10,152
0
10 Jul 2018
DeepStyle: Multimodal Search Engine for Fashion and Interior Design
Ivona Tautkute
Tomasz Trzciñski
Aleksander P. Skorupa
Łukasz Brocki
K. Marasek
37
54
0
08 Jan 2018
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
173
10,412
0
21 Jul 2016
Gaussian Error Linear Units (GELUs)
Dan Hendrycks
Kevin Gimpel
125
4,934
0
27 Jun 2016
Natural Language Object Retrieval
Ronghang Hu
Huazhe Xu
Marcus Rohrbach
Jiashi Feng
Kate Saenko
Trevor Darrell
ObjD
57
552
0
13 Nov 2015
1