Skip to content

Jingles

Jingles are short audio clips that play over regular audio with automatic volume ducking.

When a jingle plays:

  1. The target player or zone volume is reduced (ducked)
  2. The jingle audio plays through the target
  3. When the jingle ends, the original volume fades back up

Fade-in and fade-out transitions are configurable.

Ducking lowers each source relative to its current volume, so a source that is already muted stays silent for the duration of the jingle — playing a jingle never makes a muted player audible. Sources at full volume duck to the configured duck level as usual.

Jingles can target:

  • Specific players — Duck and overlay on selected players
  • Specific zones — Duck and overlay on selected zones

Instead of supplying a pre-recorded audio file, you can create a jingle by typing text. MZAP synthesizes the speech to an audio file, and from that point on the TTS jingle behaves exactly like a file-based jingle — it plays through the same ducking pipeline, can be scheduled by a JinglePlay action, and appears in the jingle list (with a speech-bubble icon and the spoken text in its subtitle).

  1. On the Jingles tab, click TTS.
  2. In the modal, set a name, pick a provider and voice, type the text, and adjust the speaking-rate slider.
  3. Click Preview to hear the voice before saving — the preview audio is never stored on the jingle.
  4. Save. The synthesized audio is generated and the jingle is created.

Editing a TTS jingle’s text, voice, or rate re-synthesizes the audio automatically. Cosmetic edits (volume, duck level, fade) skip re-synthesis.

Text is capped at a 5000-character soft limit (about six minutes of speech) on every synthesis entry point.

ProviderTypeSetupLicense
WindowsOfflineNone — uses the voices installed on the host machineIncluded
Azure SpeechCloudYour own Azure key + regionCloud TTS feature
ElevenLabsCloudYour own ElevenLabs key (no region)Cloud TTS feature

The Windows provider works offline and free, using the voices installed via Windows Settings → Time & language → Speech. The two cloud providers — Azure Speech (~140 neural voices across 90+ languages) and ElevenLabs (premium multilingual voices) — require the Cloud TTS license feature and an API key you supply yourself. See Settings — Text-to-speech for the step-by-step key setup.

Cloud-provider voices are multilingual or grouped by language, and failed synthesis (invalid key, region typo, quota error) is reported inline in the modal — no jingle is created if synthesis fails.

  • Store announcements over background music
  • Time signals or alerts
  • Scheduled welcome messages
  • DJ drops and station IDs
  • Spoken closing-time or safety announcements via TTS