Blog

2026-06-10

How to write AI music prompts with BPM, key, and song structure

A useful AI music prompt does more than name a mood; it gives the track a pulse, tonal center, and structure you can test.

A generated track can sound promising on its own and still fail the moment you put it under a video, vocal, or client review. The beat drifts against the edit, the chorus arrives after the logo has already appeared, or the melody sits in the wrong range. The prompt may have said uplifting, warm, or cinematic, but it never told the music how to move.

BPM means beats per minute, and in an AI music prompt it gives the track a practical pulse. Key gives the music a tonal center, though it does not guarantee that every melody will suit every singer. Song structure names the order of sections such as intro, verse, pre-chorus, chorus, bridge, and outro. Together, these details turn a vague request into something you can revise.

kaivorMusic.AI is an AI music creation tool that helps creators turn clear briefs into listenable drafts they can preview, compare, and refine. If you are writing for a voiceover, ad, lesson, game loop, or short video, begin with the intended use on the AI Music Generator page instead of starting with mood alone: https://kaivormusic.ai/ai-music-generator.

Set the job before you set the number. Is the track a music bed under speech, a full song with vocals, a loop for a game menu, a podcast opener, or a 15-second cue for an end card? Then choose a BPM range that fits the job: 70 to 85 for calm confidence, 90 to 110 for medium movement, 115 to 130 for social or dance energy, or slower if speech clarity matters most. A range is often more useful than pretending one exact tempo will solve the brief.

Use key as a direction, not a magic switch. You might write: gentle minor color without heavy sadness, warm major feel without cheesy celebration, or leave key flexible when texture matters more than harmony. For vocal work, range is more practical than theory language: comfortable mid-range female vocal, low conversational male vocal, no long high notes before the final chorus.

Next, map the structure in time. A workable cue brief might say: immediate 2-second intro, restrained verse for 12 seconds, short lift, clear hook around 25 seconds, lighter final lift, short edit-friendly ending. That map tells the model and the reviewer what success means. It also keeps you from approving a beautiful track that cannot be cut cleanly.

A reusable prompt could read: warm electronic pop bed for a 45-second explainer video, around 96 BPM, optimistic major feel, clean drums that do not crowd the voice, soft bass, light piano, airy synth layers, clear room for English voiceover, direct opening, simple original hook near the middle, short cut-friendly ending, no artist or song imitation. If you are working with a generation path that exposes BPM, key, scale, or structure controls, the Google Lyria how-to page in kaivorMusic.AI is a relevant next stop: https://kaivormusic.ai/google-lyria-ai-music-generator/how-to.

Three reusable ideas help immediately: create a constraint card with use case, length, BPM range, key direction, section map, and no-go elements; request a tempo ladder such as 92, 100, and 108 BPM instead of repeating the same prompt; keep a revision log that explains why one take worked, such as chorus hits the edit or bass leaves the narration clear. These notes turn prompting into a production process.

Common mistakes include choosing an exact BPM before checking the edit, picking a key because it sounds sophisticated, writing a full pop-song structure for a tiny cue, forgetting vocal range, and assuming AI-generated music is automatically cleared for every commercial use. Treat rights, tool terms, platform rules, and client requirements as separate checks. For sensitive releases, ask a qualified professional rather than relying on a generic prompt.

FAQ: Do I always need to specify BPM? No, but a range helps when timing matters. Is key important for background music? Sometimes less than instrumentation and space, but it helps when melody or vocals matter. Should every AI music prompt include a full structure? Not for short cues; a simple timing map may be enough. The takeaway: write the feeling, then give it a pulse, a tonal center, and a form you can actually test.