TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization

30 December 2024

Navonil Majumder

Bryan Catanzaro

Soujanya Poria

Papers citing "TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization"

Sat2Sound: A Unified Framework for Zero-Shot Soundscape Mapping
Subash Khanal, Srikumar Sastry, Aayush Dhakal, Adeel Ahmad, Nathan Jacobs
19 May 2025

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder
Bowen Zhang, Congchao Guo, Geng Yang, Hang Yu, Haozhe Zhang, ..., Yichen Xiao, Yiying Zhou, Yujie Zhang, Yuan Lu, Yucen He
12 May 2025

DRAGON: Distributional Rewards Optimize Diffusion Generative Models
Yatong Bai, Jonah Casebeer, Somayeh Sojoudi, Nicholas J. Bryan
Tags: DiffM, VLM
21 Apr 2025

Imagine to Hear: Auditory Knowledge Generation can be an Effective Assistant for Language Models
Suho Yoo, Hyunjong Ok, Jaeho Lee
Tags: AuLLM, RALM
21 Mar 2025