Create
The Create tab is where you turn an MP3 into a finished video.

Two flows
The shape of the Create tab depends on one setting:
- AI Lyric Generator off (default) — Create opens to a drop zone. Bring an MP3 and render. Described below. Walk-through: Make your first video.
- AI Lyric Generator on — Create gains two steps in front of the drop zone: Lyrics (topic → scene → four candidates) and Song (the chosen lyrics plus a Suno-ready style prompt, with a Skip to Audio button to drop the resulting MP3). Turn the feature on in Settings → Preferences. Walk-through: Make your first video with AI lyrics. Feature reference: AI Lyric Generator.
The rest of this page describes the MP3 → render portion, which is the same in both flows.
Dropping in a song
Drag an MP3 file onto the drop zone, or click Browse to pick one. The app reads:
- Title and artist from the file’s metadata.
- Lyrics from the
lyrics-engmetadata tag if present.
You can edit any of these before generating.
Song analysis
Before rendering, the app splits the song into sections (verses, choruses, bridges) by reading the lyrics. Each section (“act”) gets its own background image — so a verse might have a quieter scene and the chorus a bigger, more dramatic one.
AI prompt editor
Want to steer the look of a specific section? Click the prompt icon next to it in the Create panel. You can type a custom prompt like:
“Dark highway at night with red taillights, cinematic”
This overrides the auto-generated prompt for that one section only.

Disabling an act
Each background slot has an eye icon in the top-right corner. Click it to disable that act — the slot grays out (eye-slash), the render skips that section entirely, and the previous act’s background extends over the disabled slice of the song. Click the icon again to re-enable.
At least one act must stay enabled.
Regenerating backgrounds and existing renders
Regenerating a background on Create changes what your next render uses — it does not change anything about renders already in the Library. Each render is locked to its own set of backgrounds, so you can safely iterate on the Create workspace without touching finished videos.
Generating the video
Click Generate to kick off the render. A new Library card is created for this render attempt. Progress is shown in the Queue tab, and you can render the same song repeatedly — each attempt produces its own independent card.
What happens under the hood
- Lyrics cleanup — section headers like “Verse 1” are stripped.
- Lyric transcription — each word is aligned to a timestamp in the audio.
- Prompt generation — the local AI prompt model writes a visual description for each enabled section.
- Background fetch — each prompt becomes a 4K image via the AI image service.
- Beat detection — the app finds the song’s BPM so the visualizer pulses on-beat.
- Color matching — the dominant color of the background tints the visualizer.
- Render — the renderer stitches everything into a 1080p MP4, saved to the render’s own folder.
You don’t need to understand any of this — it all just happens.