ResearchTrend.AI
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric

25 November 2024
Zhichao Zhang
Wei Sun
Xinyue Li
Yunhao Li
Qihang Ge
Jun Jia
Zicheng Zhang
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guangtao Zhai
    EGVM

Papers citing "Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric"

50 / 78 papers shown
Breaking Annotation Barriers: Generalized Video Quality Assessment via Ranking-based Self-Supervision
Linhan Cao
Wei Sun
Kaiwei Zhang
Yicong Peng
Guangtao Zhai
Xiongkuo Min
135
0
0
06 May 2025
A Survey of AI-Generated Video Evaluation
Xiao Liu
Xinhao Xiang
Zizhong Li
Yongheng Wang
Zhuoheng Li
Zhuosheng Liu
Weidi Zhang
Weiqi Ye
Jiawei Zhang
EGVM
67
4
0
24 Oct 2024
LLaVA-OneVision: Easy Visual Task Transfer
Bo Li
Yuanhan Zhang
Dong Guo
Renrui Zhang
Feng Li
Hao Zhang
Kaichen Zhang
Yanwei Li
Ziwei Liu
Chunyuan Li
MLLM, SyDa, VLM
174
865
0
06 Aug 2024
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models
Feng Li
Renrui Zhang
Hao Zhang
Yuanhan Zhang
Bo Li
Wei Li
Zejun Ma
Chunyuan Li
MLLM, VLM
152
233
0
10 Jul 2024
Large Multi-modality Model Assisted AI-Generated Image Quality Assessment
Puyi Wang
Wei Sun
Zicheng Zhang
Jun Jia
Yanwei Jiang
Zhichao Zhang
Xiongkuo Min
Guangtao Zhai
EGVM
91
13
0
27 Apr 2024
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Shenghai Yuan
Jinfa Huang
Yujun Shi
Yongqi Xu
Ruijie Zhu
Bin Lin
Xinhua Cheng
Li-xin Yuan
Jiebo Luo
VGen
180
36
0
07 Apr 2024
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Roberto Henschel
Levon Khachatryan
Daniil Hayrapetyan
Hayk Poghosyan
Vahram Tadevosyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM, VGen
241
89
0
21 Mar 2024
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Lik-Hang Lee
Tae-Ho Kim
Choong Seon Hong
Chaoning Zhang
EGVM, VGen
109
43
0
08 Mar 2024
Genie: Generative Interactive Environments
Jake Bruce
Michael Dennis
Ashley D. Edwards
Jack Parker-Holder
Yuge Shi
...
Konrad Zolna
Jeff Clune
Nando de Freitas
Satinder Singh
Tim Rocktaschel
VGen, VLM
164
188
0
23 Feb 2024
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Haoxin Chen
Yong Zhang
Xiaodong Cun
Menghan Xia
Xintao Wang
Chao-Liang Weng
Ying Shan
VGen, DiffM
264
323
0
17 Jan 2024
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Ziqiang Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffM, VGen
291
280
0
05 Jan 2024
Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
Haoning Wu
Zicheng Zhang
Weixia Zhang
Chaofeng Chen
Liang Liao
...
Wenxiu Sun
Qiong Yan
Xiongkuo Min
Guangtao Zhai
Weisi Lin
90
163
0
28 Dec 2023
VILA: On Pre-training for Visual Language Models
Ji Lin
Hongxu Yin
Ming-Yu Liu
Yao Lu
Pavlo Molchanov
Andrew Tao
Huizi Mao
Jan Kautz
Mohammad Shoeybi
Song Han
MLLM, VLM
187
430
0
12 Dec 2023
VBench: Comprehensive Benchmark Suite for Video Generative Models
Ziqi Huang
Yinan He
Jiashuo Yu
Fan Zhang
Chenyang Si
...
Xinyuan Chen
Limin Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
209
451
0
29 Nov 2023
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
Yanwei Li
Chengyao Wang
Jiaya Jia
VLM, MLLM
101
301
0
28 Nov 2023
Make Pixels Dance: High-Dynamic Video Generation
Yan Zeng
Guoqiang Wei
Jiani Zheng
Jiaxin Zou
Yang Wei
Yuchen Zhang
Hang Li
DiffM, VGen
90
102
0
18 Nov 2023
FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
Yuanxin Liu
Lei Li
Shuhuai Ren
Rundong Gao
Shicheng Li
Sishuo Chen
Xu Sun
Lu Hou
VGen
69
70
0
03 Nov 2023
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Haoxin Chen
Menghan Xia
Yin-Yin He
Yong Zhang
Xiaodong Cun
...
Yaofang Liu
Qifeng Chen
Xintao Wang
Chao-Liang Weng
Ying Shan
DiffM
111
314
0
30 Oct 2023
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
Yaofang Liu
Xiaodong Cun
Xuebo Liu
Xintao Wang
Yong Zhang
Haoxin Chen
Yang Liu
Tieyong Zeng
Raymond H. F. Chan
Ying Shan
VGen, EGVM
128
144
0
17 Oct 2023
Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models
Wei Sun
Wen Wen
Xiongkuo Min
Long Lan
Guangtao Zhai
Kede Ma
91
26
0
26 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
174
883
0
10 Jul 2023
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
Muhammad Maaz
H. Rasheed
Salman Khan
Fahad Shahbaz Khan
MLLM
166
662
0
08 Jun 2023
AGIQA-3K: An Open Database for AI-Generated Image Quality Assessment
Chunyi Li
Zicheng Zhang
Haoning Wu
Wei Sun
Xiongkuo Min
Xiaohong Liu
Guangtao Zhai
Weisi Lin
EGVM
82
124
0
07 Jun 2023
Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation
Yuval Kirstain
Adam Polyak
Uriel Singer
Shahbuland Matiana
Joe Penna
Omer Levy
EGVM
265
420
0
02 May 2023
AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation
Zerui Li
Zuo-Liang Zhu
Linghao Han
Qibin Hou
Chunle Guo
Ming-Ming Cheng
101
108
0
19 Apr 2023
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Limin Wang
Bingkun Huang
Zhiyu Zhao
Zhan Tong
Yinan He
Yi Wang
Yali Wang
Yu Qiao
VGen
160
364
0
29 Mar 2023
Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective
Weixia Zhang
Guangtao Zhai
Ying Wei
Xiaokang Yang
Kede Ma
VLM
112
186
0
27 Mar 2023
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Levon Khachatryan
A. Movsisyan
Vahram Tadevosyan
Roberto Henschel
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
VGen
119
581
0
23 Mar 2023
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
Sheng-Siang Yin
Chenfei Wu
Huan Yang
Jianfeng Wang
Xiaodong Wang
...
Gong Ming
Lijuan Wang
Zicheng Liu
Houqiang Li
Nan Duan
VGen
91
137
0
22 Mar 2023
Structure and Content-Guided Video Synthesis with Diffusion Models
Patrick Esser
Johnathan Chiu
Parmida Atighehchian
Jonathan Granskog
Anastasis Germanidis
DiffM, VGen
200
539
0
06 Feb 2023
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Jay Zhangjie Wu
Yixiao Ge
Xintao Wang
Weixian Lei
Yuchao Gu
Yufei Shi
Wynne Hsu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
VGen
192
754
0
22 Dec 2022
Reproducible scaling laws for contrastive language-image learning
Mehdi Cherti
Romain Beaumont
Ross Wightman
Mitchell Wortsman
Gabriel Ilharco
Cade Gordon
Christoph Schuhmann
Ludwig Schmidt
J. Jitsev
VLM, CLIP
150
825
0
14 Dec 2022
InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Yi Wang
Kunchang Li
Yizhuo Li
Yinan He
Bingkun Huang
...
Junting Pan
Jiashuo Yu
Yali Wang
Limin Wang
Yu Qiao
VLM, VGen
193
332
0
06 Dec 2022
Latent Video Diffusion Models for High-Fidelity Long Video Generation
Yin-Yin He
Tianyu Yang
Yong Zhang
Ying Shan
Qifeng Chen
DiffM, VGen
114
243
0
23 Nov 2022
MagicVideo: Efficient Video Generation With Latent Diffusion Models
Daquan Zhou
Weimin Wang
Hanshu Yan
Weiwei Lv
Yizhe Zhu
Jiashi Feng
DiffM, VGen
131
391
0
20 Nov 2022
Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives
Haoning Wu
Erli Zhang
Liang Liao
Chaofeng Chen
Jingwen Hou
Annan Wang
Wenxiu Sun
Qiong Yan
Weisi Lin
91
171
0
09 Nov 2022
Phenaki: Variable Length Video Generation From Open Domain Textual Description
Ruben Villegas
Mohammad Babaeizadeh
Pieter-Jan Kindermans
Hernan Moraldo
Han Zhang
M. Saffar
Santiago Castro
Julius Kunze
D. Erhan
DiffM, VGen
173
396
0
05 Oct 2022
Imagen Video: High Definition Video Generation with Diffusion Models
Jonathan Ho
William Chan
Chitwan Saharia
Jay Whang
Ruiqi Gao
...
Diederik P. Kingma
Ben Poole
Mohammad Norouzi
David J. Fleet
Tim Salimans
VGen
212
1,550
0
05 Oct 2022
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer
Adam Polyak
Thomas Hayes
Xiaoyue Yin
Jie An
...
Oron Ashual
Oran Gafni
Devi Parikh
Sonal Gupta
Yaniv Taigman
DiffM, VGen
132
1,441
0
29 Sep 2022
Exploring CLIP for Assessing the Look and Feel of Images
Jianyi Wang
Kelvin C. K. Chan
Chen Change Loy
VLM
174
586
0
25 Jul 2022
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
Chenfei Wu
Jian Liang
Xiaowei Hu
Zhe Gan
Jianfeng Wang
Lijuan Wang
Zicheng Liu
Yuejian Fang
Nan Duan
VGen
89
74
0
20 Jul 2022
FAST-VQA: Efficient End-to-end Video Quality Assessment with Fragment Sampling
Haoning Wu
Chaofeng Chen
Jingwen Hou
Liang Liao
Annan Wang
Wenxiu Sun
Qiong Yan
Weisi Lin
146
181
0
06 Jul 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
380
633
0
29 May 2022
A Deep Learning based No-reference Quality Assessment Model for UGC Videos
Wei Sun
Xiongkuo Min
Wei Lu
Guangtao Zhai
101
174
0
29 Apr 2022
FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment
Jinglin Xu
Yongming Rao
Xumin Yu
Guangyi Chen
Jie Zhou
Jiwen Lu
84
97
0
07 Apr 2022
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer
Songwei Ge
Thomas Hayes
Harry Yang
Xiaoyue Yin
Guan Pang
David Jacobs
Jia-Bin Huang
Devi Parikh
ViT
185
223
0
07 Apr 2022
Video Diffusion Models
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM, VGen
427
1,650
0
07 Apr 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
Guosheng Lin
MLLM, BDL, VLM, CLIP
591
4,444
0
28 Jan 2022
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
Chenfei Wu
Jian Liang
Lei Ji
Fan Yang
Yuejian Fang
Daxin Jiang
Nan Duan
ViT, VGen
88
296
0
24 Nov 2021
Blindly Assess Quality of In-the-Wild Videos via Quality-aware Pre-training and Motion Perception
Bowen Li
Weixia Zhang
Meng Tian
Guangtao Zhai
Xianpei Wang
106
125
0
19 Aug 2021