ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.12552
  4. Cited By
Deep Learning for Video-Text Retrieval: a Review

Deep Learning for Video-Text Retrieval: a Review

24 February 2023
Cunjuan Zhu
Qi Jia
Wei Chen
Yanming Guo
Yu Liu
ArXiv (abs)PDFHTML

Papers citing "Deep Learning for Video-Text Retrieval: a Review"

14 / 64 papers shown
Title
ConvNet Architecture Search for Spatiotemporal Feature Learning
ConvNet Architecture Search for Spatiotemporal Feature Learning
Du Tran
Jamie Ray
Zheng Shou
Shih-Fu Chang
Manohar Paluri
3DPC
75
383
0
16 Aug 2017
Learnable pooling with Context Gating for video classification
Learnable pooling with Context Gating for video classification
Antoine Miech
Ivan Laptev
Josef Sivic
74
327
0
21 Jun 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
232
8,019
0
22 May 2017
Dense-Captioning Events in Videos
Dense-Captioning Events in Videos
Ranjay Krishna
Kenji Hata
F. Ren
Li Fei-Fei
Juan Carlos Niebles
136
1,244
0
02 May 2017
Modeling Relational Data with Graph Convolutional Networks
Modeling Relational Data with Graph Convolutional Networks
Michael Schlichtkrull
Thomas Kipf
Peter Bloem
Rianne van den Berg
Ivan Titov
Max Welling
GNN
191
4,821
0
17 Mar 2017
CNN Architectures for Large-Scale Audio Classification
CNN Architectures for Large-Scale Audio Classification
Shawn Hershey
Sourish Chaudhuri
D. Ellis
J. Gemmeke
A. Jansen
...
Rif A. Saurous
Bryan Seybold
M. Slaney
Ron J. Weiss
K. Wilson
123
2,500
0
29 Sep 2016
Densely Connected Convolutional Networks
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
PINN3DV
772
36,813
0
25 Aug 2016
Learning Joint Representations of Videos and Sentences with Web Image
  Search
Learning Joint Representations of Videos and Sentences with Web Image Search
Mayu Otani
Yuta Nakashima
Esa Rahtu
J. Heikkilä
N. Yokoya
49
94
0
08 Aug 2016
Convolutional Two-Stream Network Fusion for Video Action Recognition
Convolutional Two-Stream Network Fusion for Video Action Recognition
Christoph Feichtenhofer
A. Pinz
Andrew Zisserman
163
2,611
0
22 Apr 2016
A Sensitivity Analysis of (and Practitioners' Guide to) Convolutional
  Neural Networks for Sentence Classification
A Sensitivity Analysis of (and Practitioners' Guide to) Convolutional Neural Networks for Sentence Classification
Ye Zhang
Byron C. Wallace
AAML
112
1,200
0
13 Oct 2015
FaceNet: A Unified Embedding for Face Recognition and Clustering
FaceNet: A Unified Embedding for Face Recognition and Clustering
Florian Schroff
Dmitry Kalenichenko
James Philbin
3DH
370
13,145
0
12 Mar 2015
Improved Semantic Representations From Tree-Structured Long Short-Term
  Memory Networks
Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks
Kai Sheng Tai
R. Socher
Christopher D. Manning
AIMat
140
3,122
0
28 Feb 2015
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence
  Modeling
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
Junyoung Chung
Çağlar Gülçehre
Kyunghyun Cho
Yoshua Bengio
591
12,713
0
11 Dec 2014
Efficient Estimation of Word Representations in Vector Space
Efficient Estimation of Word Representations in Vector Space
Tomas Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
677
31,512
0
16 Jan 2013
Previous
12