Turn 45-90 minute podcast episodes into 5-15 vertical TikTok clips with audiogram backgrounds, burned captions, and hook-first edits. Workflow plus Kompozy automation.
Podcasts are audio-first. TikTok is visual-first with a muted-autoplay default. The bridge between them is the audiogram — a vertical visual frame with the speaker's face, a waveform, and burned captions. Done well, a podcast-to-TikTok clip outperforms most native TikTok content for one reason: the source material is denser than what TikTok-native creators produce.
The practical work is in moment-mining and visual scaffolding. A 60-minute podcast has roughly 8-12 standalone moments worth clipping. Each becomes a 30-90 second TikTok with the same audio bed plus a visual treatment that survives muted autoplay. The visual treatment is where most podcast accounts fail — they ship audio-with-captions and get crushed by audiograms that include a head-and-shoulders video frame.
Podcasters making this move are usually solo or co-hosted interview/commentary shows with no native video. TikTok becomes a discovery surface that drives podcast subscriptions on Apple/Spotify. The conversion ratio matters: one 60-minute episode yields 8-12 TikToks, each a separate top-of-funnel impression.
Podcast audio is MP3 or WAV at 44.1 or 48kHz stereo. Episode length varies (15-180 minutes); the standard repurpose source is 45-90 minutes. The show notes are usually available and serve as a transcript scaffold; for accurate timestamps, run a Whisper pass on the WAV file.
TikTok caps at 10 minutes (60 minutes for select accounts), 287 MB, 9:16 native. Captions native and editable. Audio-only uploads are accepted but under-rank against video — every podcast clip needs a visual frame to compete. The TikTok in-app library does not have podcast-specific templates; build a custom audiogram template per show.
| Issue | Fix |
|---|---|
| Audio-only upload (no visual frame) gets crushed in For-You | Always build an audiogram visual frame, even if the source has no video — face photo + waveform + captions is the minimum. |
| Captions misread niche vocabulary or guest names | Run Whisper at the highest model size and manually fix proper nouns before burning. |
| Show notes used as captions (too long, descriptive not punchy) | Rewrite per clip — show notes are SEO, TikTok captions are hooks. |
| Bio link points to one podcast platform, losing the other audience | Use a smart link that routes iOS to Apple and others to Spotify. |
| Audio levels inconsistent across clips (podcast LUFS varies by episode) | Apply -16 LUFS normalization to every export to match TikTok's in-app loudness. |
| Posting 8-12 clips in 3 days saturates follower feed | Stagger across 3-4 weeks; one per day maximum. |
Following the workflow above by hand: trimming, reframing, captioning, writing copy, publishing.
Paste the source URL or upload the file. Kompozy handles transcript, scoring, reframe, captions, copy, and publish.
A 60-minute episode typically yields 8-12 standalone clips. Interview formats yield more; solo monologue episodes yield fewer.
Yes. Audio-only TikTok uploads under-rank against video. Build a static-photo + waveform + captions visual frame as the minimum.
30-90 seconds. Sub-30 second clips lack the meat to land an idea; over-90 second clips lose completion rate.
TikTok bio supports one URL. Use a smart-link service that routes iOS users to Apple Podcasts and Android/web to Spotify.
No. Podcast clips that are well-edited (good audiogram, sharp captions, punchy hook) perform competitively with native TikTok. The platform does not deprioritize the category.
Yes — Kompozy ingests the podcast file, runs Whisper, scores moments, renders 8-12 audiograms with face+waveform+caption frames, drafts captions per clip, and queues across 3-4 weeks. Typical run: 5 minutes per 60-minute episode.
Browse all repurposing pairs · See Kompozy pricing · Start your trial →