Thirty Shorts per day sounds like a content farm. And for most tools, it would be. But the strategy is not to publish thirty Shorts on one channel -- it is to extract the best moments from your long-form recordings and distribute them as standalone pieces across your publishing schedule. A 30-minute coding session contains 3-5 genuinely interesting moments. Ten recordings give you 30-50 potential Shorts.
How Shorts Generators Identify Highlights
The technical challenge is not creating the Short -- cropping to 9:16 and adding captions is trivial with FFmpeg. The challenge is identifying which 30-60 second segments are worth extracting. This is where tools diverge dramatically in quality.
Engagement-based detection analyzes the content for moments that would hold attention: a surprising result, a clean solution to a hard problem, a visible "aha" moment, or a dramatic error and fix. This requires understanding what is happening in the content, not just detecting visual or audio changes.
Activity-based detection looks for spikes in visual activity -- rapid typing, lots of screen changes, cursor movement. This is easier to implement but produces worse highlights because activity does not equal interest. The most engaging moment in a coding tutorial might be the 10 seconds of stillness where the developer reads an error message and then says "oh, I see the problem."
Audio-based detection identifies segments with clear speech, natural emphasis, and complete thoughts. This works well for talking-head content but misses screen-recording moments where the visual tells the story.
The Extraction Pipeline
For developer content, VidNo's Shorts extraction works in four stages:
# Stage 1: Identify candidate segments
# Analyzes OCR, git diffs, and narration for "highlight" patterns:
# - Bug discovery and fix
# - Function completion
# - Test passing after failure
# - Significant refactor
# Stage 2: Score and rank candidates
# Each candidate scored on:
# - Narrative completeness (does it tell a mini-story?)
# - Visual clarity (is the code readable at 9:16?)
# - Duration fit (30-60 seconds)
# Stage 3: Crop and caption
# - Smart crop to 9:16 focusing on active code region
# - Word-by-word captions synced to narration
# - Title overlay at top
# Stage 4: Generate metadata
# - Short title (under 100 chars)
# - Hashtags
# - Description linking to full video
Caption Styles That Perform
YouTube Shorts live or die on captions. Most viewers watch without audio, and even those with audio on process captions faster than speech. The caption style matters:
- Word-by-word highlight -- Each word lights up as it is spoken. High engagement but can feel frenetic for technical content.
- Phrase-by-phrase -- 3-5 words appear together, replaced by the next phrase. Easier to read for complex explanations.
- Keyword emphasis -- Full sentences with technical terms highlighted in a different color. Works well for developer content where specific terms carry meaning.
Volume Strategy
Publishing 30 Shorts per day is extreme. Most channels see diminishing returns after 3-5 Shorts per day. But having 30 in your queue means you can schedule them across a week, A/B test different moments from the same recording, and maintain consistent publishing without daily effort.
The math works like this: if you record 5 coding sessions per week (about 2.5 hours of raw footage), and each session yields 3-5 quality Short candidates, you have 15-25 Shorts per week. Schedule 3 per day and you have content for a full week from one batch of recordings.
Shorts drive channel discovery. Long-form drives watch time and revenue. The two formats feed each other when the Shorts link back to the full tutorials. A well-executed Shorts strategy can double your channel's subscriber growth rate without adding production time, because the Shorts are extracted automatically from content you already created.