ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.00232
  4. Cited By
Multi-Moments in Time: Learning and Interpreting Models for Multi-Action
  Video Understanding

Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding

1 November 2019
Mathew Monfort
Bowen Pan
K. Ramakrishnan
A. Andonian
Barry A. McNamara
A. Lascelles
Quanfu Fan
Dan Gutfreund
Rogerio Feris
A. Oliva
    VLM
ArXivPDFHTML

Papers citing "Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding"

19 / 19 papers shown
Title
Tarsier: Recipes for Training and Evaluating Large Video Description
  Models
Tarsier: Recipes for Training and Evaluating Large Video Description Models
Jiawei Wang
Liping Yuan
Yuchen Zhang
47
52
0
30 Jun 2024
VIEW: Visual Imitation Learning with Waypoints
VIEW: Visual Imitation Learning with Waypoints
Ananth Jonnavittula
Sagar Parekh
Dylan P. Losey
SSL
91
10
0
27 Apr 2024
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World
Yifei Huang
Guo Chen
Jilan Xu
Mingfang Zhang
Lijin Yang
...
Hongjie Zhang
Lu Dong
Yali Wang
Limin Wang
Yu Qiao
EgoV
68
38
0
24 Mar 2024
GTAutoAct: An Automatic Datasets Generation Framework Based on Game
  Engine Redevelopment for Action Recognition
GTAutoAct: An Automatic Datasets Generation Framework Based on Game Engine Redevelopment for Action Recognition
Xingyu Song
Zhan Li
Shi Chen
K. Demachi
35
1
0
24 Jan 2024
CPR-Coach: Recognizing Composite Error Actions based on Single-class
  Training
CPR-Coach: Recognizing Composite Error Actions based on Single-class Training
Shunli Wang
Qing Yu
Shuai Wang
Dingkang Yang
Liuzhen Su
Xiao Zhao
Haopeng Kuang
Pei Zhang
Peng Zhai
Lihua Zhang
41
3
0
21 Sep 2023
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Jiaxi Gu
Shicong Wang
Haoyu Zhao
Tianyi Lu
Xing Zhang
Zuxuan Wu
Songcen Xu
Wei Zhang
Yu-Gang Jiang
Hang Xu
DiffM
VGen
39
44
0
07 Sep 2023
Deep Neural Networks in Video Human Action Recognition: A Review
Deep Neural Networks in Video Human Action Recognition: A Review
Zihan Wang
Yang Yang
Zhi Liu
Y. Zheng
59
4
0
25 May 2023
The Devil is in the Wrongly-classified Samples: Towards Unified Open-set
  Recognition
The Devil is in the Wrongly-classified Samples: Towards Unified Open-set Recognition
Jun Cen
Di Luan
Shiwei Zhang
Yixuan Pei
Yingya Zhang
Deli Zhao
Shaojie Shen
Qifeng Chen
11
19
0
08 Feb 2023
Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene
  Segmentation
Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation
Jie Jiang
Zhimin Li
Jiangfeng Xiong
Rongwei Quan
Qinglin Lu
Wei Liu
36
2
0
09 Dec 2022
InternVideo: General Video Foundation Models via Generative and
  Discriminative Learning
InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Yi Wang
Kunchang Li
Yizhuo Li
Yinan He
Bingkun Huang
...
Junting Pan
Jiashuo Yu
Yali Wang
Limin Wang
Yu Qiao
VLM
VGen
57
311
0
06 Dec 2022
VLG: General Video Recognition with Web Textual Knowledge
VLG: General Video Recognition with Web Textual Knowledge
Jintao Lin
Zhaoyang Liu
Wenhai Wang
Wayne Wu
Limin Wang
39
0
0
03 Dec 2022
Human-to-Robot Imitation in the Wild
Human-to-Robot Imitation in the Wild
Shikhar Bahl
Abhi Gupta
Deepak Pathak
30
165
0
19 Jul 2022
Learned Video Compression with Residual Prediction and Loop Filter
Learned Video Compression with Residual Prediction and Loop Filter
Chao Liu
Heming Sun
J. Katto
Xiaoyang Zeng
Yibo Fan
18
5
0
19 Aug 2021
Evidential Deep Learning for Open Set Action Recognition
Evidential Deep Learning for Open Set Action Recognition
Wentao Bao
Qi Yu
Yu Kong
CML
EDL
19
135
0
21 Jul 2021
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized
  Sports Actions
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions
Yixuan Li
Lei Chen
Runyu He
Zhenzhi Wang
Gangshan Wu
Limin Wang
27
97
0
16 May 2021
A Comprehensive Study of Deep Video Action Recognition
A Comprehensive Study of Deep Video Action Recognition
Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi-Li Zhang
Joseph Tighe
R. Manmatha
Mu Li
VLM
AI4TS
38
185
0
11 Dec 2020
SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of
  Broadcast Soccer Videos
SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos
A. Deliège
A. Cioppa
Silvio Giancola
M. J. Seikavandi
J. Dueholm
Kamal Nasrollahi
Guohao Li
T. Moeslund
Marc Van Droogenbroeck
18
152
0
26 Nov 2020
Temporal Interlacing Network
Temporal Interlacing Network
Hao Shao
Shengju Qian
Yu Liu
29
92
0
17 Jan 2020
Semantic Understanding of Scenes through the ADE20K Dataset
Semantic Understanding of Scenes through the ADE20K Dataset
Bolei Zhou
Hang Zhao
Xavier Puig
Tete Xiao
Sanja Fidler
Adela Barriuso
Antonio Torralba
SSeg
253
1,833
0
18 Aug 2016
1