Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.09183
Cited By
Retrieving and Highlighting Action with Spatiotemporal Reference
19 May 2020
Seito Kasai
Yuchi Ishikawa
Masaki Hayashi
Y. Aoki
Kensho Hara
Hirokatsu Kataoka
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Retrieving and Highlighting Action with Spatiotemporal Reference"
14 / 14 papers shown
Title
Adversarial Representation Learning for Text-to-Image Matching
N. Sarafianos
Xiang Xu
I. Kakadiaris
GAN
68
186
0
28 Aug 2019
Fine-Grained Action Retrieval Through Multiple Parts-of-Speech Embeddings
Michael Wray
Diane Larlus
G. Csurka
Dima Damen
77
152
0
09 Aug 2019
Use What You Have: Video Retrieval Using Representations From Collaborative Experts
Yang Liu
Samuel Albanie
Arsha Nagrani
Andrew Zisserman
61
387
0
31 Jul 2019
A Short Note on the Kinetics-700 Human Action Dataset
João Carreira
Eric Noland
Chloe Hillier
Andrew Zisserman
52
446
0
15 Jul 2019
SlowFast Networks for Video Recognition
Christoph Feichtenhofer
Haoqi Fan
Jitendra Malik
Kaiming He
146
3,244
0
10 Dec 2018
Dual Encoding for Zero-Example Video Retrieval
Jianfeng Dong
Xirong Li
Chaoxi Xu
S. Ji
Yuan He
Gang Yang
Xun Wang
93
269
0
17 Sep 2018
Learning Visually-Grounded Semantics from Contrastive Adversarial Samples
Freda Shi
Jiayuan Mao
Tete Xiao
Yuning Jiang
Jian Sun
ObjD
47
51
0
27 Jun 2018
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data
Antoine Miech
Ivan Laptev
Josef Sivic
45
234
0
07 Apr 2018
Finding beans in burgers: Deep semantic-visual embedding with localization
Martin Engilberge
Louis Chevallier
P. Pérez
Matthieu Cord
40
95
0
05 Apr 2018
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
Kensho Hara
Hirokatsu Kataoka
Y. Satoh
3DPC
112
1,926
0
27 Nov 2017
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models
Jiuxiang Gu
Jianfei Cai
Shafiq Joty
Li Niu
G. Wang
VLM
50
361
0
17 Nov 2017
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Ramprasaath R. Selvaraju
Michael Cogswell
Abhishek Das
Ramakrishna Vedantam
Devi Parikh
Dhruv Batra
FAtt
216
19,796
0
07 Oct 2016
FaceNet: A Unified Embedding for Face Recognition and Clustering
Florian Schroff
Dmitry Kalenichenko
James Philbin
3DH
281
13,079
0
12 Mar 2015
Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models
Ryan Kiros
Ruslan Salakhutdinov
R. Zemel
VLM
89
1,395
0
10 Nov 2014
1