Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2012.10671
Cited By
SMART Frame Selection for Action Recognition
19 December 2020
Shreyank N. Gowda
Marcus Rohrbach
Laura Sevilla-Lara
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SMART Frame Selection for Action Recognition"
50 / 67 papers shown
Title
Is Temporal Prompting All We Need For Limited Labeled Action Recognition?
Shreyank N. Gowda
Boyan Gao
Xiao Gu
Xiaobo Jin
VLM
41
0
0
02 Apr 2025
Online Meta-learning for AutoML in Real-time (OnMAR)
Mia Gerber
Anna Sergeevna Bosman
J. D. Villiers
OffRL
41
0
0
27 Feb 2025
Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition
Yulin Wang
Haoji Zhang
Yang Yue
Shiji Song
Chao Deng
Junlan Feng
Gao Huang
79
3
0
15 Dec 2024
Video LLMs for Temporal Reasoning in Long Videos
Fawad Javed Fateh
Umer Ahmed
Hamza Khan
M. Zia
Quoc-Huy Tran
VLM
89
0
0
04 Dec 2024
A Survey of Recent Advances and Challenges in Deep Audio-Visual Correlation Learning
Luis Vilaca
Yi Yu
Paula Vinan
75
0
0
24 Nov 2024
Principles of Visual Tokens for Efficient Video Understanding
Xinyue Hao
Gen Li
Shreyank N. Gowda
Robert B Fisher
Jonathan Huang
Anurag Arnab
Laura Sevilla-Lara
98
0
0
20 Nov 2024
Continual Learning Improves Zero-Shot Action Recognition
Shreyank N. Gowda
Davide Moltisanti
Laura Sevilla-Lara
BDL
VLM
CLL
27
1
0
14 Oct 2024
Streamlining Forest Wildfire Surveillance: AI-Enhanced UAVs Utilizing the FLAME Aerial Video Dataset for Lightweight and Efficient Monitoring
Lemeng Zhao
Junjie Hu
Jianchao Bi
Yanbing Bai
Erick Mas
Shunichi Koshimura
27
0
0
31 Aug 2024
FE-Adapter: Adapting Image-based Emotion Classifiers to Videos
Shreyank N. Gowda
Boyan Gao
David A. Clifton
24
5
0
05 Aug 2024
CC-SAM: SAM with Cross-feature Attention and Context for Ultrasound Image Segmentation
Shreyank N. Gowda
David A. Clifton
MedIm
31
1
0
31 Jul 2024
AyE-Edge: Automated Deployment Space Search Empowering Accuracy yet Efficient Real-Time Object Detection on the Edge
Chao Wu
Yifan Gong
Liangkai Liu
Mengquan Li
Yushu Wu
Xuan Shen
Zhimin Li
Geng Yuan
Weisong Shi
Yanzhi Wang
23
1
0
25 Jul 2024
MARINE: A Computer Vision Model for Detecting Rare Predator-Prey Interactions in Animal Videos
Zsófia Katona
Seyed Sahand Mohamadi Ziabari
F. Karimi Nejadasl
30
0
0
25 Jul 2024
Masks and Manuscripts: Advancing Medical Pre-training with End-to-End Masking and Narrative Structuring
Shreyank N. Gowda
David A. Clifton
MedIm
31
1
0
23 Jul 2024
StreamTinyNet: video streaming analysis with spatial-temporal TinyML
Hazem Hesham Yousef Shalby
Massimo Pavan
Manuel Roveri
37
0
0
22 Jul 2024
Pose-guided multi-task video transformer for driver action recognition
Ricardo Pizarro
Roberto Valle
L. Bergasa
J. M. Buenaposada
Luis Baumela
ViT
37
0
0
18 Jul 2024
LongVLM: Efficient Long Video Understanding via Large Language Models
Yuetian Weng
Mingfei Han
Haoyu He
Xiaojun Chang
Bohan Zhuang
VLM
65
56
0
04 Apr 2024
Reimagining Reality: A Comprehensive Survey of Video Inpainting Techniques
Shreyank N. Gowda
Yash Thakre
Shashank Narayana Gowda
Xiaobo Jin
29
0
0
31 Jan 2024
Adversarial Augmentation Training Makes Action Recognition Models More Robust to Realistic Video Distribution Shifts
Kiyoon Kim
Shreyank N. Gowda
Panagiotis Eustratiadis
Antreas Antoniou
Robert B Fisher
42
2
0
21 Jan 2024
HaltingVT: Adaptive Token Halting Transformer for Efficient Video Recognition
Qian Wu
Ruoxuan Cui
Yuke Li
Haoqi Zhu
ViT
30
2
0
10 Jan 2024
Text-Conditioned Resampler For Long Form Video Understanding
Bruno Korbar
Yongqin Xian
A. Tonioni
Andrew Zisserman
Federico Tombari
32
12
0
19 Dec 2023
OmniVec: Learning robust representations with cross modal sharing
Siddharth Srivastava
Gaurav Sharma
SSL
27
64
0
07 Nov 2023
ConViViT -- A Deep Neural Network Combining Convolutions and Factorized Self-Attention for Human Activity Recognition
Rachid Reda Dokkar
F. Chaieb
Hassen Drira
Arezki Aberkane
ViT
22
2
0
22 Oct 2023
Watt For What: Rethinking Deep Learning's Energy-Performance Relationship
Shreyank N. Gowda
Xinyue Hao
Gen Li
Laura Sevilla-Lara
Shashank Narayana Gowda
HAI
13
10
0
10 Oct 2023
Telling Stories for Common Sense Zero-Shot Action Recognition
Shreyank N. Gowda
Carolina Scarton
LM&Ro
24
2
0
29 Sep 2023
Differentiable Resolution Compression and Alignment for Efficient Video Classification and Retrieval
Rui Deng
Qian Wu
Yuke Li
Haoran Fu
18
2
0
15 Sep 2023
From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications
Shreyank N. Gowda
Dheeraj Pandey
Shashank Narayana Gowda
44
3
0
30 Aug 2023
Online Overexposed Pixels Hallucination in Videos with Adaptive Reference Frame Selection
Yazhou Xing
Amrita Mazumdar
Anjul Patney
Chao Liu
Hongxu Yin
Qifeng Chen
Jan Kautz
I. Frosio
46
1
0
29 Aug 2023
IndGIC: Supervised Action Recognition under Low Illumination
Jing-Teng Zeng
27
1
0
29 Aug 2023
Audio-Visual Glance Network for Efficient Video Recognition
Muhammad Adi Nugroho
Sangmin Woo
Sumin Lee
Changick Kim
11
5
0
18 Aug 2023
JEDI: Joint Expert Distillation in a Semi-Supervised Multi-Dataset Student-Teacher Scenario for Video Action Recognition
L. Bicsi
B. Alexe
Radu Tudor Ionescu
Marius Leordeanu
22
2
0
09 Aug 2023
Reasoning over the Behaviour of Objects in Video-Clips for Adverb-Type Recognition
Amrithaa Seshadri
Alessandra Russo
16
0
0
09 Jul 2023
SpotEM: Efficient Video Search for Episodic Memory
Santhosh Kumar Ramakrishnan
Ziad Al-Halah
Kristen Grauman
VLM
28
9
0
28 Jun 2023
Optimizing ViViT Training: Time and Memory Reduction for Action Recognition
Shreyank N. Gowda
Anurag Arnab
Jonathan Huang
ViT
18
4
0
07 Jun 2023
Just a Glimpse: Rethinking Temporal Information for Video Continual Learning
Lama Alssum
Juan Carlos León Alcázar
Merey Ramazanova
Chen Zhao
Bernard Ghanem
CLL
24
6
0
28 May 2023
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception
Hassan Akbari
Dan Kondratyuk
Yin Cui
Rachel Hornung
Haoran Wang
Hartwig Adam
VLM
MoE
30
11
0
10 May 2023
Search-Map-Search: A Frame Selection Paradigm for Action Recognition
Mingjun Zhao
Yu
Xiaoli Wang
Lei Yang
Di Niu
21
5
0
20 Apr 2023
Synthetic Sample Selection for Generalized Zero-Shot Learning
Shreyank N. Gowda
22
16
0
06 Apr 2023
MITFAS: Mutual Information based Temporal Feature Alignment and Sampling for Aerial Video Action Recognition
Ruiqi Xian
Xijun Wang
Dinesh Manocha
21
10
0
05 Mar 2023
Video Action Recognition Collaborative Learning with Dynamics via PSO-ConvNet Transformer
N. H. Phong
B. Ribeiro
29
15
0
17 Feb 2023
Gated-ViGAT: Efficient Bottom-Up Event Recognition and Explanation Using a New Frame Selection Policy and Gating Mechanism
Nikolaos Gkalelis
Dimitrios Daskalakis
Vasileios Mezaris
18
4
0
18 Jan 2023
Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning
A. Piergiovanni
Weicheng Kuo
A. Angelova
ViT
36
54
0
06 Dec 2022
Spatio-Temporal Crop Aggregation for Video Representation Learning
Sepehr Sameni
Simon Jenni
Paolo Favaro
21
3
0
30 Nov 2022
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens
Sun-Kyoo Hwang
Jaehong Yoon
Youngwan Lee
Sung Ju Hwang
31
6
0
19 Nov 2022
Active Acquisition for Multimodal Temporal Data: A Challenging Decision-Making Task
Jannik Kossen
Cătălina Cangea
Eszter Vértes
Andrew Jaegle
Viorica Patraucean
Ira Ktena
Nenad Tomašev
Danielle Belgrave
30
8
0
09 Nov 2022
Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning
Yixuan Pei
Zhiwu Qing
Jun Cen
Xiang Wang
Shiwei Zhang
Yaxiong Wang
Mingqian Tang
Nong Sang
Xueming Qian
19
13
0
02 Nov 2022
Differentiable Frequency-based Disentanglement for Aerial Video Action Recognition
D. Kothandaraman
Ming-Shun Lin
Dinesh Manocha
25
6
0
15 Sep 2022
Diverse Video Captioning by Adaptive Spatio-temporal Attention
Zohreh Ghaderi
Leonard Salewski
Hendrik P. A. Lensch
13
8
0
19 Aug 2022
Frozen CLIP Models are Efficient Video Learners
Ziyi Lin
Shijie Geng
Renrui Zhang
Peng Gao
Gerard de Melo
Xiaogang Wang
Jifeng Dai
Yu Qiao
Hongsheng Li
CLIP
VLM
16
199
0
06 Aug 2022
NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition
Boyang Xia
Wenhao Wu
Haoran Wang
Rui Su
Dongliang He
Haosen Yang
Xiaoran Fan
Wanli Ouyang
17
21
0
21 Jul 2022
Temporal Saliency Query Network for Efficient Video Recognition
Boyang Xia
Zhihao Wang
Wenhao Wu
Haoran Wang
Jungong Han
45
15
0
21 Jul 2022
1
2
Next