
Podcast to YouTube: Turn Audio Episodes Into Watchable Video

Convert podcast audio into YouTube-ready video with waveform, transcript captions, chapter markers, and thumbnail strategy. Workflow and gotchas.

Last verified · 2026-05-21 · by Moe Ameen

Podcasters who don't upload to YouTube are leaving 30-50% of their addressable audience on the table. YouTube has become the second-largest podcast platform globally, ahead of Apple Podcasts in some markets — but it punishes audio-only uploads. A static-image-with-waveform video is the minimum acceptable format; everything else (chapters, captions, search-optimized titles, thumbnails) is what actually moves discovery.

The technical conversion is the easy part: a 60-minute MP3, a 1920x1080 still image, and a caption file make a publishable YouTube video. The harder part is the editorial layer: chapter markers, search-optimized titles, and thumbnails that compete with native YouTube content for click-through. The workflow below handles both.

Platform specs

// Source
Podcast (audio file)
Category: audio
Aspect ratios: n/a (audio only)
Max length: 4 h
Typical: 30 min to 1 h
Max file size: 2 GB
Captions: srt-upload
Caption chars: 4,000
Audio: MP3 or WAV, 44.1/48 kHz stereo
Post freq: 1-2/week
Generic audio source; covers Spotify/Apple/Buzzsprout episode files.

// Destination
YouTube
Category: video-long
Aspect ratios: 16:9, 4:3
Max length: 12 h
Typical: 8 min to 20 min
Max file size: 256 GB
Captions: srt-upload
Caption chars: 5,000
Audio: AAC-LC, 48 kHz stereo, 384 kbps
Post freq: 1-3/week
Verified: max length 12 h / 256 GB per support.google.com/youtube/answer/71673 (2026-05-21).
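
The limits above lend themselves to a quick pre-upload check. The following is a minimal sketch, not part of any official tooling: the `PlatformLimits` type and `check_limits` helper are hypothetical names, and a gigabyte is assumed to mean 10**9 bytes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PlatformLimits:
    """Length and size caps for one platform (hypothetical helper type)."""
    name: str
    max_seconds: int
    max_bytes: int


# Values copied from the spec table; 1 GB = 10**9 bytes is an assumption.
PODCAST_SOURCE = PlatformLimits("Podcast (audio file)", 4 * 3600, 2 * 10**9)
YOUTUBE = PlatformLimits("YouTube", 12 * 3600, 256 * 10**9)


def check_limits(duration_seconds: float, size_bytes: int,
                 limits: PlatformLimits) -> list[str]:
    """Return a list of human-readable problems; empty means within limits."""
    problems = []
    if duration_seconds > limits.max_seconds:
        problems.append(
            f"{limits.name}: duration {duration_seconds:.0f}s exceeds "
            f"{limits.max_seconds}s"
        )
    if size_bytes > limits.max_bytes:
        problems.append(
            f"{limits.name}: size {size_bytes} bytes exceeds "
            f"{limits.max_bytes} bytes"
        )
    return problems
```

A 60-minute, 500 MB render passes `check_limits(3600, 500 * 10**6, YOUTUBE)` with an empty list; a 13-hour file returns one problem string.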

Why repurpose Podcast (audio file) to YouTube

Podcasters with even a modest YouTube presence get the highest compounding return from this pair. YouTube search keeps surfacing an episode for years, while discovery in podcast apps mostly happens in the first week. Cross-posting captures long-tail traffic the original audio never sees.

About the source: Podcast (audio file)

A podcast source file is typically MP3 or WAV, 30-90 minutes, stereo. Most podcasters have show notes and a transcript (or can generate one cheaply). Both feed the YouTube upload: the show notes go into the description and the transcript becomes the SRT side-car.
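
If the transcript is available as timed segments, turning it into an SRT side-car is mostly string formatting. The sketch below assumes a hypothetical input shape of `(start_seconds, end_seconds, text)` tuples; real speech-to-text tools use their own output formats and many can export SRT directly.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as the SRT timestamp HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: list[tuple[float, float, str]]) -> str:
    """Build SRT text: numbered cues, a time range line, then the caption."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(start)} --> {srt_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(blocks)
```

Writing the result of `to_srt(...)` to a `.srt` file gives a caption file that can be uploaded alongside the video.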

About the destination: YouTube

YouTube long-form, 16:9, up to 12 hours. SRT side-car captions accepted. Chapters supported via timestamped lines in the description. Thumbnails are the single biggest CTR lever.
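
Chapter lines for the description can be generated from a list of start times and titles. This is a minimal sketch with hypothetical helper names. The validation reflects YouTube's published chapter rules as understood here (first timestamp at 00:00, at least three timestamps in ascending order, each chapter at least 10 seconds); both thresholds are parameters in case those rules change.

```python
def format_timestamp(seconds: int) -> str:
    """Format seconds as MM:SS, or H:MM:SS once the hour mark is passed."""
    hours, rem = divmod(seconds, 3600)
    minutes, secs = divmod(rem, 60)
    if hours:
        return f"{hours}:{minutes:02d}:{secs:02d}"
    return f"{minutes:02d}:{secs:02d}"


def chapter_lines(chapters: list[tuple[int, str]],
                  min_chapters: int = 3,
                  min_length_seconds: int = 10) -> str:
    """Return one 'timestamp title' line per chapter, or raise ValueError."""
    if len(chapters) < min_chapters:
        raise ValueError(f"need at least {min_chapters} chapters")
    if chapters[0][0] != 0:
        raise ValueError("first chapter must start at 00:00")
    # A gap below the minimum also catches out-of-order timestamps.
    for (start, _), (next_start, _) in zip(chapters, chapters[1:]):
        if next_start - start < min_length_seconds:
            raise ValueError("chapters must ascend and meet the minimum length")
    return "\n".join(f"{format_timestamp(s)} {title}" for s, title in chapters)
```

For example, `chapter_lines([(0, "Intro"), (272, "Why we built X"), (730, "The pivot moment")])` yields three lines starting `00:00`, `04:32`, and `12:10`, ready to paste into the description.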

The workflow

  1. Generate or pull the episode transcript. Use the same transcript that powers your show notes. If you don't have one, run the MP3 through any STT tool — output as SRT for YouTube.
  2. Build a 1920x1080 still or waveform video. Minimum: a static guest-photo or show-art image at 1920x1080 with the audio. Better: a waveform animation. Best: 4-6 b-roll cuts that match section transitions.
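  3. Write chapter markers in the description. Put each chapter on its own line as a timestamp followed by a title, e.g. "00:00 Intro", "04:32 Why we built X", "12:10 The pivot moment". YouTube's chapter chip in the player needs at least three timestamps in ascending order, the first at 00:00, with each chapter at least 10 seconds long; five or more is a sensible target for an hour-long episode.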
  4. Upload SRT as side-car captions. Don't rely on YouTube auto-caption for an hour-long podcast — accuracy drops on guest names and jargon. Side-car SRT from the show transcript is night-and-day better.
  5. Write a search-optimized title. Podcast episode titles are conversational. YouTube titles need to match search intent. Rewrite "Ep 47 — Sarah Smith on growth" as "How Sarah Smith Grew Her SaaS to $10M ARR — Podcast 47".
  6. Design a thumbnail that competes with native YouTube. Guest face + 3-5 word callout text. The default "podcast cover art" thumbnail loses 60-80% CTR to a custom thumbnail.
  7. Publish on the same day as the audio drop. Same-day cross-posting captures launch-day momentum on YouTube's freshness signals.
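
Step 2 in its minimum form (a still image plus the episode audio) can be sketched as an ffmpeg invocation built from Python. This is one common approach and not necessarily what any particular tool does. The file names are placeholders, the image is assumed to already be 16:9 (scaling would otherwise distort it), and the audio settings follow the destination spec table (AAC, 48 kHz, 384 kbps).

```python
def still_image_video_cmd(audio_path: str, image_path: str,
                          output_path: str) -> list[str]:
    """Build an ffmpeg argument list for a still-image video with audio."""
    return [
        "ffmpeg",
        "-loop", "1",              # repeat the single image indefinitely
        "-framerate", "2",         # low frame rate is enough for a still
        "-i", image_path,
        "-i", audio_path,
        "-vf", "scale=1920:1080,format=yuv420p",  # 1080p, widely playable pixel format
        "-c:v", "libx264",
        "-tune", "stillimage",     # x264 tuning for static content
        "-c:a", "aac",
        "-b:a", "384k",
        "-ar", "48000",
        "-shortest",               # stop when the audio ends
        output_path,
    ]


# Usage (requires ffmpeg on PATH; paths are hypothetical):
#   import subprocess
#   subprocess.run(still_image_video_cmd("episode.mp3", "cover.png", "episode.mp4"),
#                  check=True)
```

Building the command as a list rather than a shell string avoids quoting problems with file names that contain spaces. A waveform animation or b-roll cuts would need a more involved filter graph than this sketch covers.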

Platform-pair gotchas

Issue: YouTube auto-captions mangle guest names.
Fix: Upload an SRT side-car from your show transcript.

Issue: Default podcast cover art as thumbnail costs 60-80% of CTR.
Fix: Design a custom thumbnail with guest face + callout text.

Issue: No chapter markers means no chapter chips and lower watch-time.
Fix: Add timestamped chapters in the description (at least three, ideally 5+), first at 00:00.

Issue: Podcast episode title under-performs in YouTube search.
Fix: Rewrite for search intent; lead with the value, not the episode number.

Issue: 90-minute static-image video tanks engagement.
Fix: Add waveform animation or section b-roll cuts at minimum.

Issue: Audio mastered for podcast apps plays quieter than native YouTube content.
Fix: Re-master to -14 LUFS for YouTube; keep -16 LUFS for podcast apps.

Manual vs Kompozy

// Manual workflow
90 min / conversion

Following the workflow above by hand: trimming, reframing, captioning, writing copy, publishing.

// With Kompozy
5 min / conversion

Paste the source URL or upload the file. Kompozy handles transcript, scoring, reframe, captions, copy, and publish.

Frequently asked questions

Do I need video to upload a podcast to YouTube?

Yes. YouTube does not accept audio-only files such as MP3 or WAV, so the audio has to be wrapped in a video file. A static image + waveform is the minimum format.

Should I also pull short clips for Shorts and TikTok?

Yes. Each podcast episode typically yields 8-15 standalone moments suitable for Shorts/Reels/TikTok. Cross-post separately.

What's the right audio level for YouTube vs podcast apps?

-14 LUFS for YouTube, -16 LUFS for podcast apps. Re-master if you're uploading the same file to both.
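
The arithmetic behind the re-master is a simple difference between the target and the measured integrated loudness. The sketch below uses hypothetical helper names; the filter string targets ffmpeg's `loudnorm` filter via its `I` (integrated loudness) and `TP` (true peak) options, and the -1.0 dBTP ceiling is an assumption, not a figure from this page.

```python
YOUTUBE_TARGET_LUFS = -14.0
PODCAST_TARGET_LUFS = -16.0


def gain_to_target(measured_lufs: float, target_lufs: float) -> float:
    """Gain in dB needed to move measured integrated loudness to the target."""
    return target_lufs - measured_lufs


def loudnorm_filter(target_lufs: float, true_peak_db: float = -1.0) -> str:
    """Build an ffmpeg loudnorm filter string (true-peak ceiling is an assumption)."""
    return f"loudnorm=I={target_lufs}:TP={true_peak_db}"


# A file mastered to the podcast target needs +2 dB to reach the YouTube target.
print(gain_to_target(PODCAST_TARGET_LUFS, YOUTUBE_TARGET_LUFS))
```

Because the YouTube master is 2 dB louder than the podcast master, raising the level can push peaks into clipping. A loudness-aware normalizer with a peak ceiling is a safer choice than a plain volume boost.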

Are chapter markers worth the effort?

Yes — they unlock YouTube's in-player chapter chips, which measurably lift watch-time and session retention.

How does Kompozy handle this?

Upload the audio and Kompozy generates a waveform video, pulls the transcript as SRT, writes chapters from transcript breakpoints, drafts a search-optimized title, and designs a thumbnail. About 5 minutes per episode.
