multimodal-art-projection/YuE
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
Generates full-length songs from lyrics using a two-stage autoregressive architecture: the first stage produces semantic tokens and the second produces audio codec tokens. Style transfer and voice cloning are supported through in-context learning. Multilingual models (English, Chinese, Japanese, Korean) are available on Hugging Face with configurable session counts for verse/chorus generation. Integrates with community tools such as Gradio UIs and quantized inference frameworks for resource-constrained deployment (8 GB+ VRAM).
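To make the two-stage idea concrete, here is a toy sketch of the data flow only. It is not YuE's code: both "models" are deterministic pseudo-random stand-ins, and every name and constant (`stage1_semantic`, `stage2_codec`, `SEMANTIC_VOCAB`, `CODEC_VOCAB`, `CODEBOOKS`) is hypothetical and chosen for illustration.

```python
import random
from typing import List, Tuple

# Hypothetical toy sizes; these are NOT YuE's real vocabulary or codebook settings.
SEMANTIC_VOCAB = 16
CODEC_VOCAB = 32
CODEBOOKS = 2  # codec tokens emitted per semantic token in this toy


def stage1_semantic(lyrics: str, n_tokens: int, seed: int = 0) -> List[int]:
    """Stand-in for stage 1: emit semantic tokens one at a time, conditioned on lyrics."""
    rng = random.Random(f"{seed}:{lyrics}")  # the lyrics act as the conditioning signal
    tokens: List[int] = []
    for _ in range(n_tokens):
        prev = tokens[-1] if tokens else 0
        # Autoregression: each new token depends on the previously emitted token.
        tokens.append((prev + rng.randrange(1, SEMANTIC_VOCAB)) % SEMANTIC_VOCAB)
    return tokens


def stage2_codec(semantic: List[int], seed: int = 0) -> List[List[int]]:
    """Stand-in for stage 2: expand each semantic token into a frame of codec tokens."""
    rng = random.Random(seed)
    frames: List[List[int]] = []
    prev = 0
    for s in semantic:
        frame: List[int] = []
        for _ in range(CODEBOOKS):
            # Each codec token depends on the semantic token and the previous codec token.
            prev = (prev + s + rng.randrange(CODEC_VOCAB)) % CODEC_VOCAB
            frame.append(prev)
        frames.append(frame)
    return frames


def generate(lyrics: str, n_tokens: int = 8) -> Tuple[List[int], List[List[int]]]:
    """Run both stages: lyrics -> semantic tokens -> codec token frames."""
    semantic = stage1_semantic(lyrics, n_tokens)
    return semantic, stage2_codec(semantic)
```

In a real system each stage would be a trained language model and the codec frames would be decoded to a waveform; the sketch only shows that stage 2 consumes stage 1's output.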
6,083 stars. No commits in the last 6 months.
Stars: 6,083
Forks: 716
Language: Python
License: Apache-2.0
Category:
Last pushed: Jun 04, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/multimodal-art-projection/YuE"
Open to everyone: 100 requests/day with no key, or 1,000 requests/day with a free key.
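The same request can be made from Python with only the standard library. This is a minimal sketch: the endpoint path is copied from the curl command above, while the helper names `quality_url` and `fetch_quality` are hypothetical, and the assumption that the response body is JSON is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything else here is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def quality_url(repo: str) -> str:
    """Build the endpoint URL for an 'owner/name' repository string."""
    owner, name = repo.split("/", 1)
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(name, safe='')}"


def fetch_quality(repo: str, timeout: float = 10.0):
    """Fetch the record for a repository (assumes, unverified, a JSON response)."""
    with urllib.request.urlopen(quality_url(repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("multimodal-art-projection/YuE")` issues one request, which counts against the daily limit noted above.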
Related models
hugofloresgarcia/vampnet
music generation with masked transformers!
sedthh/BeatLearning
Open Source Generative AI Models for Automatic Rhythm Game Beatmap Generation (for acoustic people)
asigalov61/SuperPiano
Absolutely amazing SOTA Google Colab (Jupyter) Notebooks for creating/training SOTA Music AI...
steinbergmedia/libmusictok
C++ Library for tokenizing MIDI files, designed to be compatible with the MIDITok python library
AlekseyKorshuk/huggingartists
Lyrics generation with GPT2-based Transformer