ResearchTrend.AI
Progressive Video Summarization via Multimodal Self-supervised Learning
arXiv:2201.02494

7 January 2022
Haopeng Li
Qiuhong Ke
Mingming Gong
Tom Drummond
    AI4TS

Papers citing "Progressive Video Summarization via Multimodal Self-supervised Learning"

11 / 11 papers shown
Video Summarization with Large Language Models
Min Jung Lee
Dayoung Gong
Minsu Cho
26
0
0
15 Apr 2025
Does SpatioTemporal information benefit Two video summarization benchmarks?
Aashutosh Ganesh
Mirela Popa
Daan Odijk
Nava Tintarev
AI4TS
27
0
0
04 Oct 2024
MMSummary: Multimodal Summary Generation for Fetal Ultrasound Video
Xiaoqing Guo
Qianhui Men
J. A. Noble
48
0
0
07 Aug 2024
SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding
Zixu Cheng
Yujiang Pu
Shaogang Gong
Parisa Kordjamshidi
Yu Kong
AI4TS
32
0
0
06 Jul 2024
Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
Jinhao Li
Haopeng Li
S. Erfani
Lei Feng
James Bailey
Feng Liu
VLM
34
3
0
05 Jun 2024
CSTA: CNN-based Spatiotemporal Attention for Video Summarization
Jaewon Son
Jaehun Park
Kwangsu Kim
AI4TS
ViT
37
8
0
20 May 2024
V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning
Hang Hua
Yunlong Tang
Chenliang Xu
Jiebo Luo
VGen
65
25
0
18 Apr 2024
DailyMAE: Towards Pretraining Masked Autoencoders in One Day
Jiantao Wu
Shentong Mo
Sara Atito
Zhenhua Feng
Josef Kittler
Muhammad Awais
35
3
0
31 Mar 2024
MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
Jielin Qiu
Jiacheng Zhu
William Jongwon Han
Aditesh Kumar
Karthik Mittal
...
Linjie Li
Jianfeng Wang
Ding Zhao
Bo Li
Lijuan Wang
VGen
16
5
0
07 Jun 2023
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
Hu Xu
Gargi Ghosh
Po-Yao (Bernie) Huang
Dmytro Okhonko
Armen Aghajanyan
Florian Metze
Luke Zettlemoyer
Christoph Feichtenhofer
CLIP
VLM
259
558
0
28 Sep 2021
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Hassan Akbari
Liangzhe Yuan
Rui Qian
Wei-Hong Chuang
Shih-Fu Chang
Huayu Chen
Boqing Gong
ViT
248
577
0
22 Apr 2021