SYSTEM · ONLINE// SOUND_OFFBUILD · 2.5.β--:--:-- UTC

// SOUND_OFF

Designing for the sound-off default

Roughly 78% of TikTok ads are watched without sound. If your creative doesn't make sense muted, it doesn't make sense. The rubric calls this 'sound-off comprehension' — a percentage score, weighted into native feel, of how much of your ad's value lands without audio.

Why it's the default

TikTok is the rare social platform where users actively scroll with sound on (~22% of sessions, depending on which study). But for ads specifically, the platform's feed-and-discovery context favors silence: users in public, multitasking, ad-fatigued. The platform knows this — its captioning tools, sticker library, and on-screen text editor are first-class because they have to be.

A polished commercial-look ad with a great voiceover and no captions can still hook on Instagram Reels (different audience posture, different sound defaults). On TikTok it routinely underperforms a UGC clip with sticky captions and a flat read.

The three failure modes

  • No captions, voiceover-only. Most common failure mode. Script is great, audio is great, but the muted viewer sees a person mouthing words. They scroll before the hook lands.
  • Captions exist but lag. Auto-captions that come in 1–2 seconds late blow the hook window. The viewer needs the words landing in real-time on the first frame to read it before scroll instinct kicks in.
  • Captions exist but compete. Caption text in a small grey box at the bottom of the frame, fighting with the TikTok UI. The eye doesn't find it. Compare with sticky text in a high-contrast color, mid-frame, treated as a visual element — that reads instantly.

What strong sound-off design looks like

The test: turn the ad on, mute it, hide the audio waveform, and ask someone who's never seen it to tell you what the product is and what the offer is in 5 seconds. If they can, you're passing. If they can't, the ad has a sound-off comprehension problem regardless of how good the script is.

Concrete tactics that score well in the production-analysis section:

  • Sticky on-screen texton every frame. Hook line on frame 1. Product name visible by frame 3. CTA text on the closing frame even when it's also voiced over.
  • High-contrast caption styling. White or yellow on a black or dark-magenta background, never grey-on-busy. TikTok's native caption picker is fine; just don't leave it on the default settings.
  • Visual proof, not described proof. Don't say "before and after" in voiceover — show the before and after, side-by-side. Sound-off comprehension hates ads that describe what an audio version would have shown.
  • Treat the soundtrack as a bonus. Trending audio, voiceover, jingles all add lift on top of a sound-off-coherent ad. They don't compensate for one that doesn't read muted. The ad has to work without them.

What about voiceover, then?

Voiceover still matters — once a viewer with sound on engages, voice quality drives hold rate and CTA conversion. The point isn't "don't voice your ads." It's "design as if voiceover doesn't exist, then layer voiceover on top."

The TokBench rubric scores voiceover style separately under production analysis (creator / professional / AI synthetic / none). A creator-voiced UGC read scores higher native feel than a studio-recorded VO of the same script. Synthetic AI voices score lowest — the audience clocks them and the trust drops.

// PUT_IT_TO_WORK

Run one of your own ads through the rubric and see the scores against everything you just read.

ANALYZE_AD →