declare-lab/TangoFlux

[ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching

/ 100

Established

Uses Diffusion Transformers (DiT/MMDiT) conditioned on text and duration embeddings with rectified flow matching to learn trajectories in a VAE-compressed latent space. The three-stage training pipeline incorporates CRPO (Clap-Ranked Preference Optimization), which iteratively synthesizes preference pairs and applies DPO loss to align generated audio with human preferences. Integrates with Hugging Face (model hosting and accelerate training framework), ComfyUI for node-based workflows, and provides Python API, CLI, and web interface access.

843 stars.

No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

843

Forks

Language

Jupyter Notebook

License

—

Related tools

asigalov61/tegridy-tools

Symbolic Music NLP Artificial Intelligence Toolkit

jaschadub/harmonydagger

Make Music Unlearnable for Generative AI.

kyegomez/MORPHEUS-1

Implementation of "MORPHEUS-1" from Prophetic AI and "The world’s first multi-modal generative...

salu133445/musegan

An AI for Music Generation

FORARTfe/HyMPS

HyMPS will be a platform-indipendent software suite for advanced audio/video contents production.

Explore Generative AI Tools

All categories Trending Generative AI directory Insights