2403.06070
Reframe Anything: LLM Agent for Open World Video Reframing
10 March 2024
Jiawang Cao
Yongliang Wu
Weiheng Chi
Wenbo Zhu
Ziyue Su
Jay Wu
Papers citing
"Reframe Anything: LLM Agent for Open World Video Reframing"
15 / 15 papers shown
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
Junyang Wang
Haiyang Xu
Jiabo Ye
Mingshi Yan
Weizhou Shen
Ji Zhang
Fei Huang
Jitao Sang
114
126
0
29 Jan 2024
Segment Anything in High Quality
Lei Ke
Mingqiao Ye
Martin Danelljan
Yifan Liu
Yu-Wing Tai
Chi-Keung Tang
Fisher Yu
VLM
104
333
0
02 Jun 2023
TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs
Yaobo Liang
Chenfei Wu
Ting Song
Wenshan Wu
Yan Xia
...
Shaoguang Mao
Yuntao Wang
Linjun Shou
Ming Gong
Nan Duan
LLMAG
CLL
73
201
0
29 Mar 2023
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline
Tiantian Geng
Teng Wang
Jinming Duan
Runmin Cong
Feng Zheng
50
34
0
22 Mar 2023
The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing
Dawit Mureja Argaw
Fabian Caba Heilbron
Joon-Young Lee
Markus Woodson
In So Kweon
VGen
76
24
0
20 Jul 2022
Texture-guided Saliency Distilling for Unsupervised Salient Object Detection
Huajun Zhou
Bo Qiao
Lingxiao Yang
Jianhuang Lai
Xiaohua Xie
56
32
0
13 Jul 2022
Flamingo: a Visual Language Model for Few-Shot Learning
Jean-Baptiste Alayrac
Jeff Donahue
Pauline Luc
Antoine Miech
Iain Barr
...
Mikolaj Binkowski
Ricardo Barreira
Oriol Vinyals
Andrew Zisserman
Karen Simonyan
MLLM
VLM
385
3,542
0
29 Apr 2022
A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection
Yukun Su
Jingliang Deng
Rui Sun
Guosheng Lin
Qingyao Wu
ViT
63
83
0
09 Mar 2022
Unsupervised Domain Adaptive Salient Object Detection Through Uncertainty-Aware Pseudo-Label Learning
Pengxiang Yan
Ziyi Wu
Meng-Shu Liu
K. Zeng
Liang Lin
Guanbin Li
50
29
0
26 Feb 2022
MSeg: A Composite Dataset for Multi-domain Semantic Segmentation
John Lambert
Zhuang Liu
Ozan Sener
James Hays
V. Koltun
VLM
73
202
0
27 Dec 2021
Scaling Open-Vocabulary Image Segmentation with Image-Level Labels
Golnaz Ghiasi
Xiuye Gu
Huayu Chen
Nayeon Lee
VLM
122
382
0
22 Dec 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
445
3,856
0
11 Feb 2021
GAZED- Gaze-guided Cinematic Editing of Wide-Angle Monocular Video Recordings
K. L. B. Moorthy
Moneish Kumar
Ramanathan Subramanian
Vineet Gandhi
53
29
0
22 Oct 2020
Enhanced-alignment Measure for Binary Foreground Map Evaluation
Deng-Ping Fan
Cheng Gong
Yang Cao
Bo Ren
Ming-Ming Cheng
Ali Borji
107
1,226
0
26 May 2018
Structure-measure: A New Way to Evaluate Foreground Maps
Deng-Ping Fan
Ming-Ming Cheng
Yun-Hai Liu
Tao Li
Ali Borji
136
1,371
0
02 Aug 2017