Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.08335
Cited By
Prompt2LVideos: Exploring Prompts for Understanding Long-Form Multimodal Videos
11 March 2025
Soumya Jahagirdar
Jayasree Saha
C. V. Jawahar
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Prompt2LVideos: Exploring Prompts for Understanding Long-Form Multimodal Videos"
21 / 21 papers shown
Title
Multi-modal News Understanding with Professionally Labelled Videos (ReutersViLNews)
Shih-Han Chou
Matthew Kowal
Yasmin Niknam
Diana Moyano
Shayaan Mehdi
...
Cheng Zhang
Ian Knopke
S. Kocak
Leonid Sigal
Yalda Mohsenzadeh
137
1
0
23 Jan 2024
Grounding-Prompter: Prompting LLM with Multimodal Information for Temporal Sentence Grounding in Long Videos
Houlun Chen
Xin Wang
Hong Chen
Zihan Song
Jia Jia
Wenwu Zhu
LRM
101
10
0
28 Dec 2023
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
582
4,945
0
17 Apr 2023
Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting
Syed Talal Wasim
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
M. Shah
VLM
VPVLM
114
79
0
06 Apr 2023
Procedure-Aware Pretraining for Instructional Video Understanding
Honglu Zhou
Roberto Martín-Martín
Mubbasir Kapadia
Silvio Savarese
Juan Carlos Niebles
123
40
0
31 Mar 2023
A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT
Jules White
Quchen Fu
Sam Hays
Michael Sandborn
Carlos Olea
Henry Gilbert
Ashraf Elnashar
Jesse Spencer-Smith
Douglas C. Schmidt
LLMAG
205
1,133
0
21 Feb 2023
Unsupervised Audio-Visual Lecture Segmentation
Darshan Singh
Anchit Gupta
C. V. Jawahar
Makarand Tapaswi
VOS
75
4
0
29 Oct 2022
Revealing Single Frame Bias for Video-and-Language Learning
Jie Lei
Tamara L. Berg
Joey Tianyi Zhou
96
115
0
07 Jun 2022
Prompting Visual-Language Models for Efficient Video Understanding
Chen Ju
Tengda Han
Kunhao Zheng
Ya Zhang
Weidi Xie
VPVLM
VLM
115
384
0
08 Dec 2021
PEEK: A Large Dataset of Learner Engagement with Educational Videos
Sahan Bulathwela
Maria Perez-Ortiz
Erik Novak
Emine Yilmaz
John Shawe-Taylor
109
11
0
03 Sep 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLM
SyDa
284
4,047
0
28 Jul 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
700
4,119
0
18 Apr 2021
Movie Summarization via Sparse Graph Construction
Pinelopi Papalampidi
Frank Keller
Mirella Lapata
98
32
0
14 Dec 2020
VLEngagement: A Dataset of Scientific Video Lectures for Evaluating Population-based Engagement
Sahan Bulathwela
Maria Perez-Ortiz
Emine Yilmaz
John Shawe-Taylor
135
11
0
02 Nov 2020
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Andrew Rouditchenko
Angie Boggust
David Harwath
Brian Chen
D. Joshi
...
Rogerio Feris
Brian Kingsbury
M. Picheny
Antonio Torralba
James R. Glass
SSL
88
142
0
16 Jun 2020
Dense Passage Retrieval for Open-Domain Question Answering
Vladimir Karpukhin
Barlas Oğuz
Sewon Min
Patrick Lewis
Ledell Yu Wu
Sergey Edunov
Danqi Chen
Wen-tau Yih
RALM
244
3,811
0
10 Apr 2020
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips
Antoine Miech
Dimitri Zhukov
Jean-Baptiste Alayrac
Makarand Tapaswi
Ivan Laptev
Josef Sivic
VGen
130
1,212
0
07 Jun 2019
Towards Automatic Learning of Procedures from Web Instructional Videos
Luowei Zhou
Chenliang Xu
Jason J. Corso
EgoV
92
836
0
28 Mar 2017
Learning Language-Visual Embedding for Movie Understanding with Natural-Language
Atousa Torabi
Niket Tandon
Leonid Sigal
81
98
0
26 Sep 2016
Unsupervised Learning from Narrated Instruction Videos
Jean-Baptiste Alayrac
Piotr Bojanowski
Nishant Agrawal
Josef Sivic
Ivan Laptev
Simon Lacoste-Julien
SSL
99
289
0
30 Jun 2015
A Dataset for Movie Description
Anna Rohrbach
Marcus Rohrbach
Niket Tandon
Bernt Schiele
VGen
126
503
0
12 Jan 2015
1