The spec for TikTok
Design for vertical first. Keep it punchy, legible, and built for quick retention.
- Aspect ratio: 9:16 vertical is the standard. Render at 1080 x 1920.
- Duration cap: TikTok supports long uploads on many accounts, but short-form performs best at 7-30 seconds. Plan a 15-30 second cut for most posts, and create a 60 second variant only if the content truly needs it.
- Frame rate: 30 fps is safe and smooth. 24 fps for a cinematic look. 60 fps works for fast motion or gaming captures.
- Audio: Sound-on is the default cultural norm. Lead with audio that adds value, but ensure the story still works muted.
- Captions: Use in-app auto-captions for accessibility, and burn in high-contrast key phrases. Keep your video description short, keyword rich, and specific.
- Safe zones: Keep essential text and CTAs away from the top 150 px and bottom 250 px to avoid UI overlays. Center your lower-thirds.
- Encoding: H.264 in MP4 or MOV, 8-12 Mbps bitrate for 1080p, stereo AAC 44.1 kHz audio.
The structure that works
On TikTok, structure is a retention machine. Build beats that deliver value quickly and make every cut intentional.
15-second cut
- 0-2s - Hook: A visual jolt plus a one-line payoff preview.
- 2-6s - Context: Show the problem or high-contrast before state.
- 6-12s - Value: Demonstrate the solution in motion. One key tip or one step.
- 12-15s - CTA: A clear next action and teaser for the comments or part 2.
30-second cut
- 0-3s - Hook: Specific promise, bold claim, or pattern break.
- 3-7s - Proof: Quick credential, metric, or quick before-after.
- 7-20s - Teach or demo: 1-3 crisp steps. Each step gets its own on-screen label.
- 20-27s - Payoff: Show the result or the over-the-shoulder view of the final state.
- 27-30s - CTA: Invite a save, follow, or comment with a polarizing question.
Editing rules of thumb:
- Open with motion in the first 200 ms. If you talk straight-to-camera, start mid-sentence visually, then backfill the context.
- Cut every 1-2 seconds unless a visual action demands a longer hold.
- Add micro beats: subtle zooms, push-in on impact words, pop-up text for nouns and numbers, and a whoosh to mark transitions.
- End clean. Do not fade. Hard cut with a small audio sting to encourage rewatches.
Hooks that earn attention
Hooks are formulas that collapse curiosity and value into the first breath. Pair each with a matching visual.
Formula 1: Number + Outcome + Timeline
- Example: "3 edits that cut your TikTok bounce rate in half in 24 hours."
- Visual: Swipe through a retention graph, cut to you adding captions, then a quick overlay of the results.
Formula 2: Pattern break with a "wrong" start
- Example: "Stop using trending sounds like this... do it like this to get reach without reposts."
- Visual: Show a chaotic feed scroll, then switch to a clean audio stack.
Formula 3: Before-After-Bridge in one line
- Example: "Before: boring facecam. After: clean talking head with captions. Here's the 60 second setup."
- Visual: Side-by-side split for 0.8 seconds, then jump to your quick setup.
Formula 4: Call out the viewer
- Example: "If you post 3 times a week and still sit under 1k views, this is why."
- Visual: Bold on-screen "3x/week < 1k" in high-contrast text.
Formula 5: Counterintuitive how-to
- Example: "Record in noisy rooms for better audio. Yes, really - here's the trick."
- Visual: Show a lav mic placement and a quick noise gate toggle with on-screen labels.
Brand + voice
One viral post is a lottery ticket. A consistent brand is compound interest. Audiences return for a voice they recognize - colors, fonts, cadence, and a point of view that makes even simple tips feel like part of a system.
- Brand kit: Lock your palette, type, lower thirds, and motion treatments. Keep 2-3 reusable templates for talking head, showcase, and tutorial formats.
- Voice: Decide on your stance. Are you tactical and blunt, or explanatory and friendly. Write 3 rules you never break and 3 phrases you always use.
- Cadence: Establish a posting rhythm that matches your production capacity. It is better to be reliably weekly than to sprint and stall.
Tools help here. HyperVids lets you attach a per-project brand kit - colors, fonts, logo lockups, lower-thirds, and caption styling - so every output stays on-brand without hunting through old files. You set the tone once and the templates keep it consistent, clip after clip.
Captions + accessibility
Design your edit so it is easy to follow with audio on or off, on small screens, and in noisy environments.
- Always-on captions: Turn on in-app auto-captions, then clean spellings and proper nouns. For critical beats, also burn in 1-2 impact words.
- Contrast: Use high-contrast text. White text with a 2-3 px black stroke, or white on a semi-opaque dark box. Aim for at least 4.5:1 contrast ratio.
- Size and lines: 60-80 px font for 1080 x 1920, 2 lines max, about 28-32 characters per line. Break on phrase boundaries, not mid-word.
- Placement: Keep captions above the bottom UI. Stay clear of the right-side action icons. Test on a physical phone before publishing.
- Motion hygiene: Do not animate captions more than one subtle pop-in per beat. Prefers fade or slight scale over slides that steal attention.
- Language: If you have multilingual audiences, ship two variants - one in each language - instead of cramming both into one frame.
Accessibility improves watch time and completion rate. Your captions are a design element, not an afterthought.
A sample HyperVids prompt
Here is a practical one-liner to generate a 30 second TikTok tutorial in a vertical talking-head format with dynamic captions:
"Show a fast 3-step TikTok edit workflow to boost retention from 40 percent to 65 percent: 1 - open with a pattern-breaking hook, 2 - add high-contrast captions with 2 lines max, 3 - end with a save-and-comment CTA. Keep it 30 seconds, vertical 9:16, clean studio audio, and use brand colors for lower-thirds."
Feed that into HyperVids with your brand kit attached. The app uses your colors, fonts, logo, and caption style, drafts a tight script with beat markers, builds a /hyperframes timeline, and renders a vertical cut with motion-safe captions. It works with your existing Claude CLI subscription to keep prompts lean and outputs dependable.
Production checklist for TikTok
Set up once, reuse forever. This is the minimal kit and settings that consistently ship tight short-form:
- Camera: Your phone's back camera or a mirrorless on a vertical mount. Lock 1/60 shutter at 30 fps, auto ISO with a cap, and autofocus with face tracking.
- Audio: A wired lav mic to phone or camera. Monitor levels -12 dB peaks. Apply a light noise gate and compression in post.
- Lighting: A key light at 45 degrees, a fill at 1:3 ratio, and a small back light for separation. Set white balance manually to 5000K or match your lights.
- Background: Clean and static. Remove micro flicker sources. Add one brand color accent in frame.
- B-roll: Shoot 3-5 inserts per point. 2 second takes with handle room for transitions.
- Export: 1080 x 1920, H.264, 30 fps, high profile, AAC audio 256 kbps. Name with a short slug and date for easy tracking.
CTAs that fit TikTok
CTAs should feel native and low-friction, not hard sells.
- Save for later: "If you will try this, tap save - you will need the steps when you edit."
- Comment prompt: "Type 'caption' and I will DM the exact template I use."
- Follow loop: "Follow for the 60 second part 2 where I show the color grading."
- Micro challenge: "Post your version and tag me - I will highlight the best ones."
Common failure modes
Most flops are avoidable. Here is what kills retention and reach on TikTok:
- Soft openings: Starting with your logo sting or a long hello. You need motion and a promise in the first second.
- Payoff buried: Teasing a result you never show. Deliver a proof moment by the 7-10 second mark.
- Overstuffed edits: Too many text layers, stickers, and transitions. One idea per frame wins.
- Low contrast captions: Thin fonts on busy backgrounds are unreadable on small screens.
- Covered UI: Placing text under the description area or behind the right-side icons.
- Sloppy audio: Clipping, background hum, or mismatched levels between A-roll and B-roll.
- Trendy without context: Using a sound or meme that your audience does not recognize or that does not advance the message.
- Generic hashtags: #fyp and #viral waste characters. Use 2-3 specific tags tied to your niche and the video's keywords.
- No CTA: Ending with a fade and silence. You need a tiny next step to convert interest into engagement.
- Inconsistent brand voice: Jumping between styles every post so nothing becomes familiar or repeatable.
Post-publish optimization
Shipping is half the job. The other half is feedback loops.
- First hour watch: If the hold rate dips before 3 seconds, test a different opening shot or a faster caption pop-in.
- Retention graph: Look for sharp drops. Patch those beats with tighter cuts or clearer on-screen labels.
- Comment mining: Turn the top comment into tomorrow's hook. Your audience tells you what they want next.
- Versioning: Keep an A and B cut with different hooks. Post the weaker one as a story or recycle to a different time slot.
Why tooling matters
Repeatability beats inspiration. Templates, beat markers, and a reliable caption stack let you move fast without sacrificing quality. HyperVids helps you spin consistent vertical cuts that stay on-brand and retain viewers. If your process takes longer than the idea you are trying to share, you will stop posting.
Conclusion
Short-form on TikTok rewards clarity, speed, and consistency. Start with the right specs, build a repeatable beat structure, open with a strong hook, keep captions readable, and ship on a predictable cadence. A tight brand kit and a disciplined process turn one-off hits into a library that compounds. If you lock these fundamentals and iterate on the first second, you will see completion rates rise, comments grow, and your backlog of ideas finally make it to the feed.
FAQ
How long should a TikTok short-form video be for best performance
7-30 seconds is the sweet spot for most educational and explainer content. If you truly need more time for a clear payoff, cut a 30 second version and a 60 second version, then test both. Lead with the shortest version that still lands the result.
Should I use trending sounds or original audio
Use original audio when your voice drives the value, and layer a low-volume trending track if it fits the niche. Keep the track under your voice and choose sounds that do not fight your message. Always verify rights inside TikTok's library.
How many hashtags should I add
Two to three specific, keyword-aligned hashtags are enough. Avoid generic tags. Match the exact nouns in your video: tool name, technique, and outcome. This helps TikTok categorize your content and find the right viewers.