2411.02327
Cited By
v1
v2 (latest)
PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance
4 November 2024
Ruyang Liu, Haoran Tang, Haibo Liu, Yixiao Ge, Ying Shan, Chen Li, Jiankun Yang
VLM
ArXiv (abs)
PDF
HTML
Github (130★)
Papers citing "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"
3 / 3 papers shown
Title
One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory
Chenhao Zheng, Jieyu Zhang, Mohammadreza Salehi, Ziqi Gao, Vishnu Iyengar, Norimasa Kobori, Quan Kong, Ranjay Krishna
51
0
0
29 May 2025
Aligning Multimodal LLM with Human Preference: A Survey
Tao Yu, Yize Zhang, Chaoyou Fu, Junkang Wu, Jinda Lu, ..., Qingsong Wen, Zheng Zhang, Yan Huang, Liang Wang, Tieniu Tan
443
4
0
18 Mar 2025
Valley: Video Assistant with Large Language model Enhanced abilitY
Ruipu Luo, Ziwang Zhao, Min Yang, Junwei Dong, Da Li, Pengcheng Lu, Tao Wang, Linmei Hu, Ming-Hui Qiu
MLLM
150
209
0
12 Jun 2023