ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1908.03477
  4. Cited By
Fine-Grained Action Retrieval Through Multiple Parts-of-Speech
  Embeddings

Fine-Grained Action Retrieval Through Multiple Parts-of-Speech Embeddings

IEEE International Conference on Computer Vision (ICCV), 2019
9 August 2019
Michael Wray
Diane Larlus
G. Csurka
Dima Damen
ArXiv (abs)PDFHTML

Papers citing "Fine-Grained Action Retrieval Through Multiple Parts-of-Speech Embeddings"

50 / 103 papers shown
UNIV: Unified Foundation Model for Infrared and Visible Modalities
UNIV: Unified Foundation Model for Infrared and Visible Modalities
Fangyuan Mao
Shuo Wang
Jilin Mei
Chen Min
Shun Lu
Fuyang Liu
Xiaokun Feng
Meiqi Wu
Yu Hu
160
0
0
19 Sep 2025
Repeating Words for Video-Language Retrieval with Coarse-to-Fine Objectives
Repeating Words for Video-Language Retrieval with Coarse-to-Fine Objectives
Haoyu Zhao
Jiaxi Gu
Shicong Wang
Xing Zhang
Hang Xu
Zuxuan Wu
Yu-Gang Jiang
197
0
0
20 Aug 2025
Zero-Shot Skeleton-Based Action Recognition With Prototype-Guided Feature Alignment
Zero-Shot Skeleton-Based Action Recognition With Prototype-Guided Feature AlignmentIEEE Transactions on Image Processing (IEEE TIP), 2025
Kai Zhou
Shuhai Zhang
Zeng You
Jinwu Hu
Mingkui Tan
Fei Liu
303
2
0
01 Jul 2025
EVA02-AT: Egocentric Video-Language Understanding with Spatial-Temporal Rotary Positional Embeddings and Symmetric Optimization
EVA02-AT: Egocentric Video-Language Understanding with Spatial-Temporal Rotary Positional Embeddings and Symmetric Optimization
Xiaoqi Wang
Yi Wang
Lap-Pui Chau
223
1
0
17 Jun 2025
Leveraging Auxiliary Information in Text-to-Video Retrieval: A Review
Leveraging Auxiliary Information in Text-to-Video Retrieval: A Review
A. Fragomeni
Dima Damen
Michael Wray
268
0
0
29 May 2025
Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?
Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?International Conference on Learning Representations (ICLR), 2024
Boshen Xu
Ziheng Wang
Yang Du
Zhinan Song
Sipeng Zheng
Qin Jin
VLM
210
2
0
21 Feb 2025
Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2024
Yang Chen
Jingcai Guo
Song Guo
Dacheng Tao
405
9
0
18 Nov 2024
Bridging the Skeleton-Text Modality Gap: Diffusion-Powered Modality Alignment for Zero-shot Skeleton-based Action Recognition
Jeonghyeok Do
Munchurl Kim
696
1
0
16 Nov 2024
Beyond Coarse-Grained Matching in Video-Text Retrieval
Beyond Coarse-Grained Matching in Video-Text RetrievalAsian Conference on Computer Vision (ACCV), 2024
Aozhu Chen
Hazel Doughty
Xirong Li
Cees G. M. Snoek
323
2
0
16 Oct 2024
Zero-Shot Skeleton-based Action Recognition with Dual Visual-Text Alignment
Zero-Shot Skeleton-based Action Recognition with Dual Visual-Text AlignmentPattern Recognition (Pattern Recogn.), 2024
Jidong Kuang
Hongsong Wang
Chaolei Han
Yang Zhang
Jie Gui
420
8
0
22 Sep 2024
SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language
  Retrieval
SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language Retrieval
Longtao Jiang
Min Wang
Zecheng Li
Yao Fang
Wen-gang Zhou
Houqiang Li
SLR
259
4
0
23 Jul 2024
SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by
  Disentangled Variational Autoencoders
SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders
Sheng-Wei Li
Zi-Xiang Wei
Wei-Jie Chen
Yi-Hsin Yu
Chih-Yuan Yang
Jane Yung-jen Hsu
DRL
435
18
0
18 Jul 2024
Detecting Subtle Differences between Human and Model Languages Using
  Spectrum of Relative Likelihood
Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood
Yang Xu
Yu Wang
Hao An
Zhichen Liu
Yongyuan Li
281
18
0
28 Jun 2024
Part-aware Unified Representation of Language and Skeleton for Zero-shot
  Action Recognition
Part-aware Unified Representation of Language and Skeleton for Zero-shot Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2024
Anqi Zhu
Qiuhong Ke
Mingming Gong
James Bailey
274
29
0
19 Jun 2024
Symmetric Multi-Similarity Loss for EPIC-KITCHENS-100 Multi-Instance
  Retrieval Challenge 2024
Symmetric Multi-Similarity Loss for EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2024
Xiaoqi Wang
Yi Wang
Lap-Pui Chau
270
1
0
18 Jun 2024
EchoGuide: Active Acoustic Guidance for LLM-Based Eating Event Analysis
  from Egocentric Videos
EchoGuide: Active Acoustic Guidance for LLM-Based Eating Event Analysis from Egocentric Videos
Vineet Parikh
Saif Mahmud
Devansh Agarwal
Ke Li
François Guimbretière
Cheng Zhang
297
8
0
15 Jun 2024
An Information Compensation Framework for Zero-Shot Skeleton-based
  Action Recognition
An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition
Haojun Xu
Yanlei Gao
Jie Li
Xinbo Gao
320
8
0
02 Jun 2024
SHE-Net: Syntax-Hierarchy-Enhanced Text-Video Retrieval
SHE-Net: Syntax-Hierarchy-Enhanced Text-Video Retrieval
Xuzheng Yu
Chen Jiang
Xingning Dong
Tian Gan
Ming Yang
Qingpei Guo
434
6
0
22 Apr 2024
Fine-Grained Side Information Guided Dual-Prompts for Zero-Shot Skeleton
  Action Recognition
Fine-Grained Side Information Guided Dual-Prompts for Zero-Shot Skeleton Action Recognition
Yang Chen
Jingcai Guo
Tian He
Ling Wang
412
19
0
11 Apr 2024
A SOUND APPROACH: Using Large Language Models to generate audio
  descriptions for egocentric text-audio retrieval
A SOUND APPROACH: Using Large Language Models to generate audio descriptions for egocentric text-audio retrieval
Andreea-Maria Oncescu
João F. Henriques
Andrew Zisserman
Samuel Albanie
A. Sophia Koepke
226
8
0
29 Feb 2024
Video Editing for Video Retrieval
Video Editing for Video Retrieval
Bin Zhu
Kevin Flanagan
A. Fragomeni
Michael Wray
Dima Damen
CLIP
247
1
0
04 Feb 2024
Training a Large Video Model on a Single Machine in a Day
Training a Large Video Model on a Single Machine in a Day
Yue Zhao
Philipp Krahenbuhl
VLM
309
24
0
28 Sep 2023
Video-adverb retrieval with compositional adverb-action embeddings
Video-adverb retrieval with compositional adverb-action embeddingsBritish Machine Vision Conference (BMVC), 2023
Thomas Hummel
Otniel-Bogdan Mercea
A. Sophia Koepke
Zeynep Akata
230
1
0
26 Sep 2023
Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning
Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive LearningACM Multimedia (ACM MM), 2023
Chen Jiang
Hong Liu
Xuzheng Yu
Qing Wang
Yuan Cheng
...
Zhongyi Liu
Qingpei Guo
Wei Chu
Ming-Hsuan Yang
Yuan Qi
449
19
0
20 Sep 2023
Multi-Semantic Fusion Model for Generalized Zero-Shot Skeleton-Based
  Action Recognition
Multi-Semantic Fusion Model for Generalized Zero-Shot Skeleton-Based Action RecognitionInternational Conference on Image and Graphics (ICIG), 2023
Ming-Zhe Li
Zhen Jia
Zheng Zhang
Zhanyu Ma
Liang Wang
256
14
0
18 Sep 2023
Zero-shot Skeleton-based Action Recognition via Mutual Information
  Estimation and Maximization
Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and MaximizationACM Multimedia (ACM MM), 2023
Yujie Zhou
Jingyao Wang
Anyi Rao
Ning Lin
Fuchun Sun
Yuan Liu
240
34
0
07 Aug 2023
Towards Video Anomaly Retrieval from Video Anomaly Detection: New
  Benchmarks and Model
Towards Video Anomaly Retrieval from Video Anomaly Detection: New Benchmarks and ModelIEEE Transactions on Image Processing (IEEE TIP), 2023
Peng Wu
Jing Liu
Xiangteng He
Yuxin Peng
Peng Wang
Yanning Zhang
472
55
0
24 Jul 2023
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the
  Backbone
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the BackboneIEEE International Conference on Computer Vision (ICCV), 2023
Shraman Pramanick
Yale Song
Sayan Nag
Kevin Qinghong Lin
Hardik Shah
Mike Zheng Shou
Ramalingam Chellappa
Pengchuan Zhang
VLM
427
149
0
11 Jul 2023
UniUD Submission to the EPIC-Kitchens-100 Multi-Instance Retrieval
  Challenge 2023
UniUD Submission to the EPIC-Kitchens-100 Multi-Instance Retrieval Challenge 2023
Alex Falcon
Giuseppe Serra
233
0
0
27 Jun 2023
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal
  Contrastive Training
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive TrainingIEEE Transactions on Image Processing (IEEE TIP), 2023
Chong Liu
Yuqi Zhang
Hongsong Wang
Weihua Chen
F. Wang
Yan Huang
Yixing Shen
Liang Wang
278
48
0
15 Jun 2023
An Overview of Challenges in Egocentric Text-Video Retrieval
An Overview of Challenges in Egocentric Text-Video Retrieval
Burak Satar
Huaiyu Zhu
Hanwang Zhang
J. Lim
EgoV
371
1
0
07 Jun 2023
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set
  Alignment
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set AlignmentInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Peng Jin
Hao Li
Ze-Long Cheng
Jinfa Huang
Zhennan Wang
Li-ming Yuan
Chang-rui Liu
Jie Chen
323
54
0
20 May 2023
Verbs in Action: Improving verb understanding in video-language models
Verbs in Action: Improving verb understanding in video-language modelsIEEE International Conference on Computer Vision (ICCV), 2023
Liliane Momeni
Mathilde Caron
Arsha Nagrani
Andrew Zisserman
Cordelia Schmid
547
89
0
13 Apr 2023
Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval
Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval
Jae Myung Kim
A. Sophia Koepke
Cordelia Schmid
Zeynep Akata
288
51
0
06 Apr 2023
Learning Action Changes by Measuring Verb-Adverb Textual Relationships
Learning Action Changes by Measuring Verb-Adverb Textual RelationshipsComputer Vision and Pattern Recognition (CVPR), 2023
Davide Moltisanti
Frank Keller
Hakan Bilen
Laura Sevilla-Lara
353
10
0
27 Mar 2023
Improving Video Retrieval by Adaptive Margin
Improving Video Retrieval by Adaptive MarginAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021
Feng He
Qi Wang
Zhifan Feng
Wenbin Jiang
Yajuan Lü
Yong Zhu
Xiao Tan
340
25
0
09 Mar 2023
Deep Learning for Video-Text Retrieval: a Review
Deep Learning for Video-Text Retrieval: a ReviewInternational Journal of Multimedia Information Retrieval (IJMIR), 2023
Cunjuan Zhu
Qi Jia
Wei Chen
Yanming Guo
Yu Liu
254
35
0
24 Feb 2023
Variational Cross-Graph Reasoning and Adaptive Structured Semantics
  Learning for Compositional Temporal Grounding
Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal GroundingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Juncheng Li
Siliang Tang
Linchao Zhu
Wenqiao Zhang
Yi Yang
Tat-Seng Chua
Fei Wu
Yueting Zhuang
BDL
267
25
0
22 Jan 2023
Transfer Knowledge from Natural Language to Electrocardiography: Can We
  Detect Cardiovascular Disease Through Language Models?
Transfer Knowledge from Natural Language to Electrocardiography: Can We Detect Cardiovascular Disease Through Language Models?Findings (Findings), 2023
Jielin Qiu
William Jongwon Han
Jiacheng Zhu
Mengdi Xu
Michael A. Rosenberg
Emerson Liu
Douglas Weber
Ding Zhao
264
29
0
21 Jan 2023
HierVL: Learning Hierarchical Video-Language Embeddings
HierVL: Learning Hierarchical Video-Language EmbeddingsComputer Vision and Pattern Recognition (CVPR), 2023
Kumar Ashutosh
Rohit Girdhar
Lorenzo Torresani
Kristen Grauman
VLMAI4TS
574
79
0
05 Jan 2023
Learning Video Representations from Large Language Models
Learning Video Representations from Large Language ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Yue Zhao
Ishan Misra
Philipp Krahenbuhl
Rohit Girdhar
VLMAI4TS
442
246
0
08 Dec 2022
Normalized Contrastive Learning for Text-Video Retrieval
Normalized Contrastive Learning for Text-Video RetrievalConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yookoon Park
Mahmoud Azab
Bo Xiong
Seungwhan Moon
Florian Metze
Gourab Kundu
Kirmani Ahmed
205
13
0
30 Nov 2022
Semantics-Consistent Cross-domain Summarization via Optimal Transport
  Alignment
Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment
Jielin Qiu
Jiacheng Zhu
Mengdi Xu
Franck Dernoncourt
Trung Bui
Zhaowen Wang
Yue Liu
Ding Zhao
Hailin Jin
150
12
0
10 Oct 2022
ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval
ConTra: (Con)text (Tra)nsformer for Cross-Modal Video RetrievalAsian Conference on Computer Vision (ACCV), 2022
A. Fragomeni
Michael Wray
Dima Damen
CLIPViT
177
4
0
09 Oct 2022
A Feature-space Multimodal Data Augmentation Technique for Text-video
  Retrieval
A Feature-space Multimodal Data Augmentation Technique for Text-video RetrievalACM Multimedia (ACM MM), 2022
Alex Falcon
G. Serra
Oswald Lanz
VGen
258
31
0
03 Aug 2022
Exploiting Semantic Role Contextualized Video Features for
  Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance
  Retrieval Challenge 2022
Exploiting Semantic Role Contextualized Video Features for Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022
Burak Satar
Erik Cambria
Hanwang Zhang
J. Lim
238
4
0
29 Jun 2022
RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video
  Retrieval
RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval
Burak Satar
Erik Cambria
Hanwang Zhang
J. Lim
202
13
0
26 Jun 2022
UniUD-FBK-UB-UniBZ Submission to the EPIC-Kitchens-100 Multi-Instance
  Retrieval Challenge 2022
UniUD-FBK-UB-UniBZ Submission to the EPIC-Kitchens-100 Multi-Instance Retrieval Challenge 2022
Alex Falcon
G. Serra
Sergio Escalera
Oswald Lanz
274
1
0
22 Jun 2022
Self-Supervised Learning for Videos: A Survey
Self-Supervised Learning for Videos: A SurveyACM Computing Surveys (ACM CSUR), 2022
Madeline Chantry Schiappa
Yogesh S Rawat
M. Shah
SSL
603
178
0
18 Jun 2022
Egocentric Video-Language Pretraining
Egocentric Video-Language PretrainingNeural Information Processing Systems (NeurIPS), 2022
Kevin Qinghong Lin
Alex Jinpeng Wang
Mattia Soldan
Michael Wray
Rui Yan
...
Hongfa Wang
Dima Damen
Guohao Li
Wei Liu
Mike Zheng Shou
VLMEgoV
317
267
0
03 Jun 2022
123
Next
Page 1 of 3