ON-SCREEN TEXT BLOG / 8 MIN READ

What is the best font size for video captions?

Roughly half your audience watches on mute, so your captions are the video. Here are concrete size targets for YouTube, TikTok and Reels, the safe-zone rules that keep text on screen, and the one test that beats any number.

4–5 %of frame height
~44pxcap height at 1080p
2lines max on screen
0–100craft score

By Thomas, founder of CutScore · Updated June 2026

CAPTION CHECK · subtitles.mp4
An editing desk with a timeline open on the monitor, the place where caption size, contrast and placement get checked before a video goes out.
CRAFT SCORE
FIXES ADVISED
reading the captions, not the vibe
Caption too small · ~3% of frame, raise to 4.5%00:12
Contrast good · white on dark stroke
Text in unsafe zone · clips behind UI buttons00:31
The 30-second answer The best font size for video captions is roughly 4 to 5 percent of the frame height. On a 1080p frame, that is about 44 to 54 pixels for the cap height of the text, which lands near 32 to 42 point in most editors. On a vertical 1080 by 1920 short, the same percentage means bigger numbers because the frame is taller, so short-form captions usually sit around 70 to 100 pixels. Keep it to one or two short lines, high contrast (white text with a dark stroke or backing), and inside the platform safe zone. Then do the only test that matters: read it on a phone at arm's length. If you squint, go bigger. If you would rather not eyeball it, that is the exact job CutScore does in one pass.
WHY THIS QUESTION IS ANNOYING

There is no single magic number, and that is exactly why people keep searching for one. A caption that looks perfectly readable on your 27-inch monitor can turn into mouse type on a phone, on a bus, in the sun. The size that works is not measured in points. It is measured as a share of the frame, because that is the only thing that stays constant when your video gets watched at every size from a watch to a TV.

I learned this the slow way. Early on I shipped a tutorial with captions I had set in points, judged at 100 percent zoom on a big screen, and felt very pleased with myself. Then I watched it back on my phone the next morning and could barely read my own words. The text was technically there. It was just too small to be useful, which for a muted viewer is the same as not being there at all.

So the real question is not "what point size." It is "what percentage of the frame, and is it still readable on the smallest screen anyone will use." Get those two right and the point number sorts itself out. Here are the targets.

THE SIZE TARGETS

The best font size for captions, by platform.

Think in percentage of frame height first, then convert to pixels and points for your editor. These are starting points you confirm with the arm's-length test, not laws.

Where it playsFrameCaption cap height to aim forRoughly
YouTube · horizontal1920 × 1080~44–54 px (4–5%)32–42 pt in most editors
YouTube · 4K master3840 × 2160~88–108 px (4–5%)scale up with the frame
Shorts / TikTok / Reels1080 × 1920~70–100 pxthe chunky look you already see
Square feed post1080 × 1080~48–64 pxslightly bigger than 16:9
Why cap height, not point sizePoint size includes invisible space around the letters, and it changes meaning between fonts. The cap height (the height of a capital letter in pixels) is what a viewer actually sees, so it is the honest number to measure against your frame. A bold condensed font and a light wide font can share a point size and read completely differently.
SKIP THE GUESSWORK

Measuring caption size as a share of the frame on every clip gets old fast. CutScore reads it for you and flags the exact moments where text is too small, too low-contrast, or out of the safe zone.

Join the waitlist
SIZE IS ONLY HALF OF IT

Four things that decide if captions actually read.

1. Size as a percentage, set on the smallest screen

Pick your target from the table, then judge it at the size a real viewer uses. Not your editing monitor at 50 percent zoom, where everything looks generous. The cleanest habit is to size captions to about 4 to 5 percent of frame height and then immediately preview on a phone. If your editor shows percentage of frame, use it. If it only shows points, set the size once, export a ten-second test, and read it on your phone before you commit to the whole video. This habit fixes most caption problems before they exist, and it is a big part of what we analyze.

2. Contrast, because size cannot save thin grey text

A perfectly sized caption still fails if it blends into the background. White text with a dark outline or a semi-transparent box behind it stays readable over almost anything, including a bright sky or a busy room. Thin grey captions on a light background are the classic mistake, and bumping the size does not fix them. If you want to be precise about it, run a quick text contrast check on your most cluttered shots, not your cleanest one.

The same scene viewed on a large screen and on a phone side by side, showing why caption text that reads fine on a TV can turn to mouse type on a handset.
The same caption on a TV and a phone are two different reading tests. Size for the phone. Photo: Andrea Piacquadio / Pexels.

3. Lines on screen, because nobody reads a paragraph that flashes by

Big text means fewer words fit, and that is a feature. Keep one or two short lines on screen at a time. A caption that wraps onto four lines is not readable no matter how clean the font is, because it scrolls away before the eye finishes line two. If a sentence is long, break it across two caption cards rather than shrinking the font to cram it in. Two readable lines always beat one giant line that covers the speaker's face.

4. Safe zone, so the size you chose survives the platform

You can nail the size and still lose, if the caption sits where the app paints its own buttons. On a vertical short, the bottom fifth of the frame is a war zone of like counts, share icons, and the caption text the platform adds. Park your burned-in captions above that, in the lower-middle of the frame, and check the safe zones for Shorts for the exact margins. A 90-pixel caption hidden behind a share button is a 0-pixel caption, as far as the viewer is concerned.

RATHER SEE IT THAN READ IT?

Here is a real CutScore coaching report for an everyday video: caption size, contrast and safe-zone placement scored, with timestamps and the exact fixes.

See a sample report
SHORT ON TIME

If you only fix three things about your captions.

Most of the jump from "I can barely read this" to "clean and obvious" comes from these three. Fix them first.

1
2-MIN FIXTEXT
Set captions to about 4–5% of frame height
On a 1080p frame that is roughly a 44 to 54 pixel cap height; on a vertical short, more like 70 to 100. Stop sizing by point alone and start sizing by share of the frame, because that is the number that survives every screen your video lands on.
How Set the size, export ten seconds, and read it on your phone before committing the whole edit. Or let CutScore measure it and tell you the exact shortfall.
2
QUICKCONTRAST
Give the text a dark stroke or backing
White letters with a dark outline read over a bright sky, a busy kitchen, or a white shirt without you babysitting every shot. Thin grey text on a light background is the single most common reason captions fail, and going bigger will not rescue it.
How Add a 2 to 4 pixel outline or a semi-transparent box, then check it on your busiest, brightest shot, not your cleanest.
3
PLACEMENTSAFE ZONE
Keep captions out of the interface zone
On vertical video, the bottom of the frame belongs to the platform's buttons and its own caption overlay. Lift your burned-in text into the lower-middle so a share icon never lands on top of a word. A great font size hidden behind UI is wasted.
How Preview inside the app's safe-zone guides, or read your text placement rules and keep clear of the edges.
THREE WAYS TO CHECK CAPTION SIZE

By eye, by ruler, or in one pass.

OPTION 01

By eye, on a phone

Free, and the most useful test there is. Export the video, watch it on a phone held at arm's length in a bright room, and read every caption at a glance. If you slow down or squint, the text is too small, too thin, or too low-contrast. The catch is you have to be honest with yourself.

OPTION 02

By measuring the frame

Accurate and a little tedious. Drop a guide at 5 percent of frame height and match your cap height to it, then confirm the caption sits inside the safe zone. It works, but you are doing it on every clip, in every aspect ratio, by hand. Great if you like precision. Most people drift.

OPTION 03

With a coach in one pass

Hand the file (or a link) to CutScore. It measures caption size as a share of the frame, checks contrast and safe-zone placement, and gives you a 0 to 100 score with timestamped evidence and the fixes. No guides to drop. See a sample report.

How CutScore checks your captions for you CutScore is an AI video quality coach for pre-publish QC. On-screen text is one of the families it measures: it reads caption size relative to the frame, checks contrast against the background behind each line, and flags any text that strays into the platform safe zone, with the timestamps that fail. You get one score, the evidence, and a prioritised list of fixes, before anyone else sees the video. It judges the craft of the video itself, captions included, so it sits next to a growth tool rather than competing with one. More on the method and the standards.
QUESTIONS

Frequently asked.

Size your captions to roughly 4 to 5 percent of the frame height, then test them. On a 1080p frame that is about 44 to 54 pixels for the cap height of the text, which lands near 32 to 42 point in most editors. On a vertical 1080 by 1920 short, that means bigger numbers again because the frame is taller. The honest test beats any number: hold a phone at arm's length and read the captions. If you squint, go bigger.
On a vertical 1080 by 1920 frame, aim for caption text whose cap height is around 70 to 100 pixels, which is the chunky look most short-form creators already use. Keep one or two short lines on screen at a time, never a paragraph, and park the whole caption block above the bottom interface buttons so nothing gets covered.
No. Past a point, oversized captions eat the frame, cover faces, and force long phrases to wrap onto three or four lines that flash by too fast to read. The sweet spot is big enough to read on a phone at arm's length but small enough to leave the subject visible. Two readable lines beat one giant line that hides the shot.
Watch the exported video on a phone, held where a real viewer holds it, in a bright room. If you can read every caption at a glance without leaning in, the size and contrast are working. If you slow down or squint, the text is too small, too thin, or too low-contrast against the background. CutScore checks caption size, contrast and safe-zone placement automatically and flags the timestamps that fail.
EARLY ACCESS

Stop guessing whether your captions read.

CutScore checks caption size, contrast and placement on every clip and tells you exactly what to fix, with the timestamps to back it up. Join the waitlist for early access.

Join the waitlist