Neighbor communities
0 / 0 papers shown
Top Contributors
| Name | # Papers | # Citations |
|---|---|---|
Social Events
| Date | Location | Event |
|---|---|---|
| Name | # Papers | # Citations |
|---|---|---|
| Date | Location | Event |
|---|---|---|
Assessing the quality and realism of models that generate images or videos. Aids in improving generative models for content creation and simulation.
Omni-Judge: Can Omni-LLMs Serve as Human-Aligned Judges for Text-Conditioned Audio-Video Generation? Susan Liang Chao Huang Filippos Bellos Yolo Yunlong Tang Qianxiang Shen Jing Bi Luchuan Song Zeliang Zhang Jason Corso Chenliang Xu | |||
SVBench: Evaluation of Video Generation Models on Social Reasoning Wenshuo Peng Gongxuan Wang Tianmeng Yang Chuanhao Li Xiaojie Xu Hui He Kaipeng Zhang | |||
MTAVG-Bench: A Comprehensive Benchmark for Evaluating Multi-Talker Dialogue-Centric Audio-Video Generation Yang-Hao Zhou Haitian Li Rexar Lin Heyan Huang Jinxing Zhou ...Jingyun Liao Yi-Ming Cheng Xuefeng Chen Xian-Ling Mao Yousheng Feng | |||
Unified Personalized Reward Model for Vision Generation Yibin Wang Yuhang Zang Feng Han Jiazi Bu Yujie Zhou Cheng Jin Jiaqi Wang | |||
Visual Personalization Turing Test Rameen Abdal James Burgess Sergey Tulyakov Kuan-Chieh Jackson Wang | |||
Understanding Frechet Speech Distance for Synthetic Speech Quality Evaluation June-Woo Kim Dhruv Agarwal Federica Cerina | |||
Artifact-Aware Evaluation for High-Quality Video Generation Chen Zhu Jiashu Zhu Yanxun Li Meiqi Wu Bingze Song Chubin Chen Jiahong Wu Xiangxiang Chu Yangang Wang | |||
The Script is All You Need: An Agentic Framework for Long-Horizon Dialogue-to-Cinematic Video Generation Chenyu Mu Xin He Qu Yang Wanshun Chen Jiadi Yao ...Erkun Yang Cheng Deng Zhaopeng Tu Xiaolong Li Linus | |||
ColorConceptBench: A Benchmark for Probabilistic Color-Concept Understanding in Text-to-Image Models Chenxi Ruan Yu Xiao Yihan Hou Guosheng Hu Wei Zeng | |||
Reward-Forcing: Autoregressive Video Generation with Reward Feedback Jingran Zhang Ning Li Yuanhao Ban Andrew Bai Justin Cui | |||
Iterative Refinement Improves Compositional Image Generation Shantanu Jaiswal Mihir Prabhudesai Nikash Bhardwaj Zheyang Qin Amir Zadeh Chuan Li Katerina Fragkiadaki Deepak Pathak | |||
TempViz: On the Evaluation of Temporal Knowledge in Text-to-Image Models Carolin Holtermann Nina Krebs Anne Lauscher | |||
Rethinking Video Generation Model for the Embodied World Yufan Deng Zilin Pan Hongyu Zhang Xiaojie Li Ruoqing Hu Yufei Ding Yiming Zou Yan Zeng Daquan Zhou | |||
Scientific Image Synthesis: Benchmarking, Methodologies, and Downstream Utility Honglin Lin Chonghan Qin Zheng Liu Qizhi Pei Yu Li Zhanping Zhong Xin Gao Yanfeng Wang Conghui He Lijun Wu | |||
Human detectors are surprisingly powerful reward models Kumar Ashutosh XuDong Wang Xi Yin Kristen Grauman Adam Polyak Ishan Misra Rohit Girdhar | |||
The Algorithmic Gaze: An Audit and Ethnography of the LAION-Aesthetics Predictor Model Jordan Taylor William Agnew Maarten Sap Sarah E. Fox Haiyi Zhu | |||
SlidesGen-Bench: Evaluating Slides Generation via Computational and Quantitative Metrics Yunqiao Yang Wenbo Li Houxing Ren Zimu Lu Ke Wang Zhiyuan Huang Zhuofan Zong Mingjie Zhan Hongsheng Li | |||
Motion Attribution for Video Generation Xindi Wu Despoina Paschalidou Jun Gao Antonio Torralba Laura Leal-Taixé Olga Russakovsky Sanja Fidler Jonathan Lorraine | |||
Thinking with Frames: Generative Video Distortion Evaluation via Frame Reward Model Yuan Wang Borui Liao Huijuan Huang Jinda Lu Ouxiang Li Kuien Liu Meng Wang Xiang Wang | |||
Understanding Reward Hacking in Text-to-Image Reinforcement Learning Yunqi Hong Kuei-Chun Kao Hengguang Zhou Cho-Jui Hsieh | |||
DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving Yang Zhou Hao Shao Letian Wang Zhuofan Zong Hongsheng Li Steven L. Waslander | |||
Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning Chubin Chen Sujie Hu Jiashu Zhu Meiqi Wu Jintao Chen ...Nisha Huang Chengyu Fang Jiahong Wu Xiangxiang Chu Xiu Li | |||
Direct Diffusion Score Preference Optimization via Stepwise Contrastive Policy-Pair Supervision Dohyun Kim Seungwoo Lyu Seung Wook Kim Paul Hongsuck Seo | |||
FinPercep-RM: A Fine-grained Reward Model and Co-evolutionary Curriculum for RL-based Real-world Super-Resolution Yidi Liu Zihao Fan Jie Huang Jie Xiao Dong Li Wenlong Zhang Lei Bai Xueyang Fu Zheng-Jun Zha | |||
CritiFusion: Semantic Critique and Spectral Alignment for Faithful Text-to-Image Generation ZhenQi Chen TsaiChing Ni YuanFu Yang | |||
Self-Evaluation Unlocks Any-Step Text-to-Image Generation Xin Yu Xiaojuan Qi Zhengqi Li Kai Zhang Richard Zhang Zhe Lin Eli Shechtman Tianyu Wang Yotam Nitzan | |||
DiverseGRPO: Mitigating Mode Collapse in Image Generation via Diversity-Aware GRPO Henglin Liu Huijuan Huang Jing Wang Chang Liu Xiu Li Xiangyang Ji | |||
VisionDirector: Vision-Language Guided Closed-Loop Refinement for Generative Image Synthesis Meng Chu Senqiao Yang Haoxuan Che Suiyun Zhang Xichen Zhang ...Haokun Gui Zhefan Rao Dandan Tu Rui Liu Jiaya Jia | |||
PixelArena: A benchmark for Pixel-Precision Visual Intelligence Feng Liang Sizhe Cheng Chenqi Yi Yong Wang | |||
Evaluation of Generative Models for Emotional 3D Animation Generation in VR Kiran Chhatre Renan Guarese Andrii Matviienko Christopher Peters | |||
MMGR: Multi-Modal Generative Reasoning Zefan Cai Haoyi Qiu Tianyi Ma Haozhe Zhao Gengze Zhou ...Minjia Zhang Wen Xiao Jiuxiang Gu Nanyun Peng Junjie Hu | |||
MineTheGap: Automatic Mining of Biases in Text-to-Image Models Noa Cohen Nurit Spingarn-Eliezer Inbar Huberman-Spiegelglas Tomer Michaeli | |||
MR-FlowDPO: Multi-Reward Direct Preference Optimization for Flow-Matching Text-to-Music Generation Alon Ziv Sanyuan Chen Andros Tjandra Yossi Adi Wei-Ning Hsu Bowen Shi | |||
Chain-of-Image Generation: Toward Monitorable and Controllable Image Generation Young Kyung Kim Oded Schlesinger Yuzhou Zhao J. Matias Di Martino Guillermo Sapiro | |||
AgentComp: From Agentic Reasoning to Compositional Mastery in Text-to-Image Models Arman Zarei Jiacheng Pan Matthew Gwilliam Soheil Feizi Zhenheng Yang | |||
Generating Storytelling Images with Rich Chains-of-Reasoning Xiujie Song Qi Jia Shota Watanabe Xiaoyi Pang Ruijie Chen Mengyue Wu Kenny Q. Zhu | |||
RunawayEvil: Jailbreaking the Image-to-Video Generative Models Songping Wang Rufan Qian Yueming Lyu Qinglong Liu Linzhuang Zou Jie Qin Songhua Liu Caifeng Shan | |||
| Name (-) |
|---|
| Name (-) |
|---|
| Name (-) |
|---|
| Date | Location | Event | |
|---|---|---|---|
| No social events available | |||