Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2411.16718
Cited By
v1
v2
v3
v4
v5 (latest)
Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification
22 November 2024
S P Sharan
Minkyu Choi
Sahil Shah
Harsh Goel
Mohammad Omama
Sandeep Chinchali
EGVM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification"
35 / 35 papers shown
Title
A Challenge to Build Neuro-Symbolic Video Agents
Sahil Shah
Harsh Goel
Sai Shankar Narasimhan
Minkyu Choi
S P Sharan
Oguzhan Akcin
Sandeep Chinchali
AI4TS
67
0
0
20 May 2025
Real-Time Privacy Preservation for Robot Visual Perception
Minkyu Choi
Yunhao Yang
N. Bhatt
Kushagra Gupta
Sahil Shah
Aditya Rai
David Fridovich-Keil
Ufuk Topcu
Sandeep Chinchali
90
1
0
08 May 2025
We'll Fix it in Post: Improving Text-to-Video Generation with Neuro-Symbolic Feedback
Minkyu Choi
S P Sharan
Harsh Goel
Sahil Shah
Sandeep Chinchali
DiffM
VGen
143
1
0
24 Apr 2025
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design
Jiachen Li
Qian Long
Jian Zheng
Xiaofeng Gao
Robinson Piramuthu
Wenhu Chen
William Yang Wang
VGen
101
26
0
08 Oct 2024
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
Zhuoyi Yang
Jiayan Teng
Wendi Zheng
Ming Ding
Shiyu Huang
...
Weihan Wang
Yean Cheng
Xiaotao Gu
Yuxiao Dong
Jie Tang
DiffM
VGen
239
565
0
12 Aug 2024
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Xuan He
Dongfu Jiang
Ge Zhang
Max Ku
Achint Soni
...
Yaswanth Narsupalli
Rongqi Fan
Zhiheng Lyu
Yuchen Lin
Wenhu Chen
EGVM
VGen
ALM
115
56
0
21 Jun 2024
Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs
Krista Opsahl-Ong
Michael J Ryan
Josh Purtell
David Broman
Christopher Potts
Matei A. Zaharia
Omar Khattab
84
41
0
17 Jun 2024
TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation
Weixi Feng
Jiachen Li
Michael Stephen Saxon
Tsu-Jui Fu
Wenhu Chen
William Yang Wang
EGVM
VGen
61
10
0
12 Jun 2024
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback
Jiachen Li
Weixi Feng
Tsu-Jui Fu
Xinyi Wang
Sugato Basu
Wenhu Chen
William Y. Wang
VGen
82
34
0
29 May 2024
The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective
Andrew Shin
Yusuke Mori
Kunitake Kaneko
VGen
EGVM
49
2
0
13 May 2024
Sora Detector: A Unified Hallucination Detection for Large Text-to-Video Models
Zhixuan Chu
Lei Zhang
Yichen Sun
Siqiao Xue
Peng Kuang
Zhan Qin
Kui Ren
HILM
EGVM
53
14
0
07 May 2024
Reward Guided Latent Consistency Distillation
Jiachen Li
Weixi Feng
Wenhu Chen
William Y. Wang
EGVM
80
15
0
16 Mar 2024
Towards Neuro-Symbolic Video Understanding
Minkyu Choi
Harsh Goel
Mohammad Omama
Yunhao Yang
Sahil Shah
Sandeep Chinchali
70
10
0
16 Mar 2024
Towards A Better Metric for Text-to-Video Generation
Jay Zhangjie Wu
Guian Fang
Haoning Wu
Xintao Wang
Yixiao Ge
...
Rui Zhao
Weisi Lin
Wynne Hsu
Ying Shan
Mike Zheng Shou
VGen
107
37
0
15 Jan 2024
PEEKABOO: Interactive Video Generation via Masked-Diffusion
Yash Jain
Anshul Nasery
Vibhav Vineet
Harkirat Singh Behl
VGen
75
35
0
12 Dec 2023
VBench: Comprehensive Benchmark Suite for Video Generative Models
Ziqi Huang
Yinan He
Jiashuo Yu
Fan Zhang
Chenyang Si
...
Xinyuan Chen
Limin Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
195
451
0
29 Nov 2023
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Shiwei Zhang
Jiayu Wang
Yingya Zhang
Kang Zhao
Hangjie Yuan
Zhan Qin
Xiang Wang
Deli Zhao
Jingren Zhou
DiffM
VGen
116
230
0
07 Nov 2023
FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation
Yuanxin Liu
Lei Li
Shuhuai Ren
Rundong Gao
Shicheng Li
Sishuo Chen
Xu Sun
Lu Hou
VGen
47
70
0
03 Nov 2023
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
Yaofang Liu
Xiaodong Cun
Xuebo Liu
Xintao Wang
Yong Zhang
Haoxin Chen
Yang Liu
Tieyong Zeng
Raymond H. F. Chan
Ying Shan
VGen
EGVM
91
144
0
17 Oct 2023
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Omar Khattab
Arnav Singhvi
Paridhi Maheshwari
Zhiyuan Zhang
Keshav Santhanam
...
Thomas T. Joshi
Hanna Moazam
Heather Miller
Matei A. Zaharia
Christopher Potts
RALM
95
280
0
05 Oct 2023
Specification-Driven Video Search via Foundation Models and Formal Verification
Yunhao Yang
Jean-Raphael Gaglione
Sandeep Chinchali
Ufuk Topcu
88
7
0
18 Sep 2023
Measuring the Quality of Text-to-Video Model Outputs: Metrics and Dataset
Iya Chivileva
Philip Lynch
Tomás E. Ward
Alan F. Smeaton
EGVM
65
17
0
14 Sep 2023
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
Muhammad Maaz
H. Rasheed
Salman Khan
Fahad Shahbaz Khan
MLLM
135
660
0
08 Jun 2023
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Hang Zhang
Xin Li
Lidong Bing
MLLM
178
1,061
0
05 Jun 2023
NL2TL: Transforming Natural Languages to Temporal Logics using Large Language Models
Yongchao Chen
Rujul Gandhi
Yang Zhang
Chuchu Fan
83
56
0
12 May 2023
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGS
VGen
222
1,104
0
18 Apr 2023
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
571
4,925
0
17 Apr 2023
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
1.5K
14,761
0
15 Mar 2023
Data-Efficient Learning of Natural Language to Linear Temporal Logic Translators for Robot Task Specification
Jiayi Pan
Glen Chou
Dmitry Berenson
95
39
0
09 Mar 2023
Grounding Complex Natural Language Commands for Temporal Tasks in Unseen Environments
Jason Liu
Ziyi Yang
Ifrah Idrees
Sam Liang
Benjamin Schornstein
Stefanie Tellex
Ankit Parag Shah
LM&Ro
75
42
0
22 Feb 2023
Structure and Content-Guided Video Synthesis with Diffusion Models
Patrick Esser
Johnathan Chiu
Parmida Atighehchian
Jonathan Granskog
Anastasis Germanidis
DiffM
VGen
180
538
0
06 Feb 2023
Make It Move: Controllable Image-to-Video Generation with Text Descriptions
Yaosi Hu
Chong Luo
Zhenzhong Chen
VGen
65
89
0
06 Dec 2021
PyTorchVideo: A Deep Learning Library for Video Understanding
Haoqi Fan
Tullie Murrell
Heng Wang
Kalyan Vasudev Alwala
Yanghao Li
...
Ross B. Girshick
Matt Feiszli
Aaron B. Adcock
Wan-Yen Lo
Christoph Feichtenhofer
VLM
ViT
90
53
0
18 Nov 2021
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
759
18,408
0
19 Jun 2020
Microsoft COCO Captions: Data Collection and Evaluation Server
Xinlei Chen
Hao Fang
Nayeon Lee
Ramakrishna Vedantam
Saurabh Gupta
Piotr Dollar
C. L. Zitnick
224
2,497
0
01 Apr 2015
1