TikTok measures a watched view. Meta measures a stopped scroll.
TikTok's hook rate, in its native reporting and in the Creative Center, is computed against six-second video views, the platform treats six seconds as the threshold where a user has chosen to watch, not merely failed to swipe past. Meta's thumb-stop ratio is the share of impressions where the user stayed for three seconds, a much weaker bar, often satisfied by a single legible frame and a beat of motion.
The denominators also differ. TikTok's denominator behaves like impressions in a continuous-play feed, where the platform is auto-playing video in the user's face. Meta's denominator is built on a scroll surface where the user can never actually see the ad, they swipe past the frame before three seconds elapse. The asymmetry matters: a thumb-stop is a victory over scroll. A six-second view is a victory over the next video.
"A thumb-stop is a victory over scroll. A six-second view is a victory over the next video. Same word, different war."
The numbers do not move together.
We scored 2,000+ ads against public engagement signals across TikTok, Meta and YouTube between January 2025 and May 2026. The median scroll-stop time, the window in which most viewers either commit or abandon, varies sharply by surface.
| Surface | Median scroll-stop | Sample size | Beat the Skip penalty if late |
|---|---|---|---|
| TikTok | 1.9s | n = 700 | −22 |
| Meta Reels | 2.3s | n ≈ 410 | −18 |
| YouTube Shorts | 2.6s | n = 403 | −12 |
| YouTube In-Stream | 5.1s | n ≈ 280 | −15 |
Read the penalty column as: how much our model docks Beat the Skip when the creative places its first meaningful payload, face, stakes, or spoken claim, after the surface's median scroll-stop. TikTok punishes lateness hardest because the platform's whole grammar is built on the next video being one swipe away.
Winning the first second does not save the next five.
A consistent pattern across the sample: ads that score above 70 on hook can still collapse on retention. In one cluster of 84 TikTok beauty ads, mean hook was 71 and mean retention was 38. The first frame did its job, a face, a clean stake, a pattern-break, but seconds two through six contained no escalating reason to keep watching.
The implication for edit decisions is uncomfortable: hook and retention are not the same problem and rarely share a fix. A stronger hook will not patch a flat middle. A tighter middle will not save a creative whose first frame fails to stop the thumb.
"In 84 beauty ads: mean hook 71, mean retention 38. The first frame did its job. The next five seconds did not."
Four edit moves that actually move hook.
From the structural patterns in the highest-scoring 200 ads against Beat the Skip:
- Visual stakes in frame 1. A face mid-reaction, a thing about to fall, a hand mid-pour. Static product photography in frame 1 underperforms by roughly 30 percent in predicted hook on TikTok.
- Spoken promise before brand. Voice-over that names the user's problem within 1.2 seconds correlates with a +14 lift in predicted hook. Brand mention before promise correlates with a small negative.
- Pattern-break audio. A sharp non-musical opening sound, a snap, a beep, a voice from off-screen, triggers attention re-allocation even when the visual is mid-strength.
- Caption-first composition. Roughly 78 percent of TikTok viewing in our sample is in a context where the user can see captions but may not hear audio cleanly. Frame 1 with a legible caption hook outperforms voice-only by a meaningful margin.
It depends on whether you are editing the creative or buying the media.
If you are creative-led, pre-spend
Optimise structurally, for the things that produce thumb-stop irrespective of platform. The four moves above are platform-agnostic precisely because they encode how human attention works on vertical short-form, not how a specific ad platform happens to count it this quarter. Pre-spend, you should not be reading hook rate at all, there is no hook rate yet.
If you are media-led, post-spend
Use the platform's own metric as feedback, but read it with the right denominator in mind. A 1.5x lift in TikTok hook rate and a 1.5x lift in Meta thumb-stop ratio are not the same achievement. The TikTok lift is harder because the bar is higher and the next-video pressure is constant.
What this guide is not.
SaliencyLab predicts engagement and click-intent signals against public outcome data. The Spearman correlations we publish, ρ +0.30 to +0.32 out-of-sample, held out, are calibrated against TikTok engagement, TikTok CTR percentile, and YouTube view counts. They are not calibrated against attributed sales, ROAS, or measured brand recall, and we do not claim they are.
Heatmaps in the product are predicted attention, not measured eye-tracking. Score values are model predictions, not measurements. The right way to read a Beat the Skip score of 72 is "this creative shares structural properties with ads that historically held the first six seconds on TikTok," not "72% of viewers will watch six seconds."
"We predict engagement and click intent. We do not predict ROAS, sales, or brand recall. Anyone who says otherwise is selling you the wrong instrument."
The other five guides in this hub.
The first 2 seconds, frame by frame
Why frame 1 visual stakes change the outcome more than any other edit lever.
Where brand cues belong
Timing, position, and density rules from 2,000+ ads, by category.
How to read a SaliencyLab score
What 72 means. What 48 means. What no number means at all.
Synthetic users vs live tests
When simulation is the right call, and when it is not.
Why our benchmark is credible
500 ads × 43 attributes = 21,500 data points. The full defence.
Questions we get from creative strategists.
Are hook rate and thumb-stop the same metric?
No. TikTok's hook rate (six-second views / impressions) and Meta's thumb-stop ratio (three-second views / impressions) use different time thresholds and different denominators. A creative can win on one and lose on the other.
Which metric should I optimise against if I am editing the creative?
Optimise structurally for thumb-stop, frame 1 visual stakes, spoken promise before brand, pattern-break audio, caption-first composition. Use the platform's own metric as media-side feedback once the ad is live.
Does a high hook rate guarantee retention?
No. In our 2,000+ ad sample, plenty of ads scored hook above 70 and retention below 40. The first frame can win attention while the next five seconds lose it.
How does SaliencyLab predict hook performance?
We score Beat the Skip, a model prediction calibrated against public TikTok and YouTube engagement and click-intent signals. Held-out OOS Spearman ρ is +0.30 to +0.32. It is not measured eye-tracking, and it does not predict ROAS.