ComfyUI Music Models - Local FLAC Songs with Vocals (Real Workflow)
Vladimir Chopine [GeekatPlay] via YouTube
Overview
Syllabus
High-resolution FLAC music from ComfyUI on a local machine (newest models)
What this video covers + where to find all resources and links
Install my latest custom nodes (git clone into ComfyUI custom_nodes)
Sonic Holiday repo overview + required components and models
Models we’ll test: Tango Flux, Stability/OpenAI, and a new Facebook release
Why installation can be confusing + using the included installers (Windows/Linux)
Where files go: checkpoints, safetensors chunks, codecs, and configs
Subscribe/like to stay updated when code or models change
Restart ComfyUI + how to find nodes by searching “sonic”
MusicGen Melody node: generate music from humming (44kHz, mono/stereo, sizes)
Microphone setup + duration control + press/hold to record humming
Save output in multiple formats (MP3/FLAC/WAV)
Text prompt mode: pick a model and specify a style (example: K-pop)
Run a quick test + watch generation progress
GPU memory usage explanation (models staying in VRAM) + cleanup tips
Recommendation: restart ComfyUI after you pick a workflow you like
Switch to Tango Flux “Sonic DJ” for lyric-synced song generation
Style/voice/duration settings + Bark text-to-voice in the workflow
Speed demo: ~12 seconds generation + key settings (steps, CFG)
Best-quality surprise model: tricky install + Python glue code to assemble pieces
Choose genre/mood/vocals + structured lyrics with tags (verse/chorus/intro/outro)
Two-minute song generation + waveform preview and save options
VRAM check: models still loaded + why a restart helps before longer runs
Full run timing: ~2–2.5 minutes + ~22GB VRAM noted
Play the result + voice quality and overall impressions
Links in description + star the repo + closing goodbye
Taught by
Vladimir Chopine [GeekatPlay]