EDIT & PACING BLOG / 9 MIN READ

How do I structure a YouTube video?

There is a structure that almost every video worth watching follows: hook, promise, body, payoff, exit. Here is what each part does, how long it should run, and how to tell when yours is leaking viewers.

5parts to every video
3sto land the hook
<30sto make the promise
0–100craft score

By Thomas, founder of CutScore · Updated June 2026

STRUCTURE CHECK · my_video.mp4
A clapperboard resting on an editing workspace beside a monitor, where a YouTube video gets cut into its hook, body and ending before it is published.
CRAFT SCORE
FIXES ADVISED
hook · body · ending, mapped
Hook lands early · strong first 3s
Body sags mid-section · tighten 03:40–05:1003:40
Outro runs long · trim to ~6s09:12
The 30-second answer Structure a YouTube video in five parts, in this order: a hook in the first three seconds, a short promise within thirty seconds that tells the viewer what they will get, the body delivered in clear chapters or steps, the payoff that actually fulfils the promise, and a quick exit with one next step. Keep the intro under twenty seconds, put your strongest material early, and make the ending short. Most of the runtime lives in the body, so that is where pacing decides whether people stay. If you want to see exactly where your structure leaks viewers, that is part of what CutScore measures in one pass.
WHY STRUCTURE EVEN MATTERS

I have shipped videos with great footage and no shape, and they died quietly. The problem was never the camera. It was that the first thirty seconds gave nobody a reason to stay, the middle wandered, and the ending dribbled out over a minute of "so yeah, that was the video." Good footage in a bad structure still loses.

Structure is not a creativity tax. It is the skeleton that lets your idea actually reach someone. A viewer decides whether to keep watching in the first few seconds, and then re-decides every time the pace dips. Each of those moments is a seam, and a video with no structure is mostly seams. People do not "click away" because they are cruel. They leave because, at that exact second, nothing was pulling them forward.

The fix is boring and it works. Decide what each part of the video is for, then keep each part doing only its job. The hook hooks. The body delivers. The ending ends. Below is the structure I use, the timing for each piece, and the checks that tell you when a section is leaking.

THE STRUCTURE

The five parts of a YouTube video.

Every video worth finishing has these five sections, in this order. Skip one and you can usually feel the gap, even if you cannot name it.

PartRough timingIts one job
1 · Hookfirst 3 secondsGive one clear reason to keep watching before the viewer's thumb moves.
2 · Promisewithin ~30sTell them plainly what they get and why this video is worth their time.
3 · Bodymost of the runtimeDeliver the value in clear chapters or steps, each one earning the next.
4 · Payoffnear the endActually fulfil the promise: the result, the reveal, the answer they came for.
5 · Exita few secondsOne next step, then stop. No sixty-second goodbye, no five outros.
The order is the pointThe biggest structural mistake I see is burying the promise. Your most interesting moment is sitting at 0:45, behind a logo sting and a slow "hey guys, welcome back." Move a slice of that moment to second one, and half your retention problem disappears before you change anything else.
SEE WHERE YOURS LEAKS

Mapping a video by hand, then watching for the spots where it sags, takes ages. CutScore runs the pacing and hook checks in one pass and points at the timestamps.

Join the waitlist
PART BY PART

What each section actually has to do.

1. The hook: earn the first three seconds

Open with the most interesting thing you have, not the throat-clear. The first three seconds decide whether anyone sees the rest, so they cannot be a logo and a slow inhale. Show the result, ask the sharp question, or drop the viewer mid-action. If your best moment is at 0:40, a flash of it belongs at 0:01. Watch your own opening as if you were scrolling past a stranger's video. If you would keep going, it works. If you would not, neither will they.

2. The promise: tell them what they get

Right after the hook, say plainly what this video gives them. Not your life story. One sentence: by the end of this, you will know how to do the thing. The promise sets the contract, and the payoff later pays it. Keep it inside the first thirty seconds, and keep it short, because every second of preamble is a second someone uses to leave. A clear promise also makes the body easier to cut, since anything that does not serve it is now obviously off-topic.

3. The body: chapters that earn each other

This is where most of the runtime lives, so this is where structure does the heavy lifting. Break it into clear chapters or steps, and make sure each one creates a small reason to watch the next. The single clearest signal of body pace is average shot length: how long a shot holds before you cut. When a section drags, it is almost always one shot held three seconds too long, repeated. A clean jump cut removes the dead air without a reshoot.

An edit desk with a timeline open on screen, where the body of a video gets cut into chapters and the slow sections are tightened.
The body is where structure earns its keep: chapters, pace, and the cuts that remove dead air. Photo: Jakub Żerdzicki / Pexels.

4. The payoff: actually keep the promise

Near the end, deliver the thing you promised at the top. The result, the reveal, the finished build, the answer. This sounds obvious, and yet a surprising number of videos either forget to land it or bury it under a recap nobody asked for. If the promise was "I will show you the final cut," show the final cut, and give it a moment to breathe. A payoff that feels rushed reads as a let-down, even when the content was good. This is the part that decides whether someone leaves satisfied or vaguely cheated.

5. The exit: one next step, then stop

End fast. The most common ending mistake is a minute of "so yeah, thanks for watching, don't forget to" while the energy drains out of the room. Pick one next step, a related video, a subscribe, a single ask, say it, and cut. Filler words tend to pile up here, in that nervous wind-down, so this is a good place to tighten. A clean exit leaves people on a high. A long one teaches them that your endings are where the boring part lives.

RATHER SEE IT THAN READ IT?

Here is a real CutScore coaching report for an everyday video: the hook, the pacing across the body, and the ending, all scored with timestamps.

See a sample report
IF YOU ONLY FIX THREE THINGS

The three structure fixes that matter most.

Most of the gap between a video that holds people and one that leaks them comes from these three. Fix them before you touch anything else.

1
EDITNARRATIVE
Move your best moment to the front
If your strongest second is at 0:45, a piece of it belongs at 0:01. Cutting the slow intro and leading with the payoff, or a promise of it, does more for retention than any other single edit. The hook is the part of the structure with the highest return.
How Re-cut the opening so a reason to stay lands before second three. See the hook.
2
EDITPACING
Tighten the section that sags
Every video has one stretch in the body where the pace dies. Find it, then cut. Usually it is shots held a few seconds too long, or a tangent that does not serve the promise. Shortening your average shot length in that one region often saves the whole middle.
How Watch at normal speed and mark the first place you got bored. Start cutting there. See average shot length.
3
QUICKEXIT
Cut your ending in half
Whatever your outro is, it is probably twice as long as it should be. One next step, said once, then stop. A long goodbye drains the energy you built and trains people to skip the last thirty seconds of everything you make.
How Trim the outro to a handful of seconds. If it still feels long, trim again.
CHECKING THE SHAPE

Three ways to test your structure.

OPTION 01

Watch it cold

Leave the edit alone for a day, then watch it once without touching the keyboard. Note every second you wanted to skip. Those notes are your structure problems. The catch is the usual one: after forty passes in the edit, you have stopped seeing the video and started remembering it.

OPTION 02

Map it on paper

Write your five parts with their timestamps. Where does the hook end? When does the promise land? How long is the outro? Seeing it as a list makes a bloated intro or a missing payoff obvious. It works, but it is manual, and you have to do it honestly for every video.

OPTION 03

Let a coach map it

Hand the file or a link to CutScore. It checks whether the hook lands early, measures pace across the body, flags where it sags, and notes a long ending, with timestamps and a 0 to 100 craft score. No mapping by hand. See a sample report.

What CutScore can and cannot tell you about structure CutScore is an AI video quality coach for the craft of a video, not a growth or SEO tool. It cannot decide whether your idea is interesting, but it can measure the mechanical signals of structure: whether the hook lands in the first seconds, how your average shot length moves across the body, where the pace drags, and how clean the ending is. You get one 0 to 100 score, the timestamped evidence, and a prioritised list of fixes, before anyone else watches it. More on the method and the standards.
QUESTIONS

Frequently asked.

Five parts in order: a hook in the first three seconds, a short promise that says what the viewer gets, the body delivered in clear chapters or steps, the actual payoff that fulfils the promise, and a quick exit with one next step. The body is where most of the runtime lives, so that is where pacing matters most.
There is no fixed clock, but a rough guide works. The hook is the first three seconds, the promise lands inside the first thirty, and everything after that is the body and payoff. On a ten-minute video, the intro should be over in under twenty seconds, and your outro should be a handful of seconds, not a sixty-second goodbye.
Two places, mostly. The first is the opening few seconds, where a weak hook loses people before the video has started. The second is anywhere the body sags: a section held too long, a tangent, or a shot that overstays. Structure does not fix bad content, but it stops good content from leaking viewers at the seams.
Partly, yes. A tool cannot judge whether your idea is interesting, but it can measure the mechanical signals of structure: whether the hook lands early, how long your average shot runs, where the pace drags, and how clean your ending is. CutScore reports those with timestamps so you can see where the structure breaks.
EARLY ACCESS

Find out where your structure leaks.

CutScore maps your hook, your pacing and your ending, then tells you exactly what to tighten, with the timestamps to back it up. Join the waitlist for early access.

Join the waitlist