guides

Interview Podcast Editing Workflow: From Raw Recording to Published Episode

PodRewind Team
7 min read
audio waveform editing interface on computer screen
Photo via Unsplash

TL;DR: Efficient interview editing follows a predictable workflow: organize files, process audio, make structural edits, add music and intros, then export. Most interview podcasts need light editing—removing obvious mistakes and long pauses while preserving conversational authenticity. Batch processing and templates save hours per episode.


Table of Contents


The Interview Editing Philosophy

Interview editing differs fundamentally from music or narrative podcast editing.

Here's the thing: interviews are conversations. Over-editing makes them feel artificial. The goal is removing distractions while preserving the natural rhythm of human interaction.

What Editing Should Do

  • Remove technical problems (connection drops, background noise, audio artifacts)
  • Cut obvious mistakes that break listener focus
  • Tighten pacing without eliminating natural conversation flow
  • Balance audio levels between participants
  • Add standard elements (intro, outro, music)

What Editing Should Not Do

  • Remove all verbal pauses and "ums" (some are natural)
  • Create artificial smoothness (real conversation has texture)
  • Restructure conversation order (except in extreme cases)
  • Make guests sound different than they actually sounded
  • Spend hours perfecting a single episode

Time Targets

For a typical 45-60 minute interview:

Light edit (recommended): 1-2 hours Medium edit: 2-3 hours Heavy edit: 4+ hours

If you're consistently spending more than 2x the episode length on editing, you're likely over-editing.


Organizing Your Raw Files

Consistent organization prevents mistakes and speeds up every step.

File Naming Convention

Create a systematic naming structure:

[Date]_[Guest]_[Track].extension

Examples:
2026-01-30_Jane_Smith_host.wav
2026-01-30_Jane_Smith_guest.wav
2026-01-30_Jane_Smith_combined.mp4

Folder Structure

/Podcast Production/
  /Episodes/
    /EP001_Jane_Smith_2026-01-30/
      /Raw/
        host.wav
        guest.wav
        backup.wav
      /Working/
        edit_v1.sesx (or your DAW project file)
      /Final/
        EP001_Jane_Smith_Final.mp3
        EP001_Jane_Smith_Final.wav (archival)
      /Assets/
        cover_art.jpg
        show_notes.txt
  /Templates/
    podcast_template.sesx
    intro.wav
    outro.wav
  /Music/
    theme_music.wav
    transition_sounds/

Track Preparation

Before editing content:

  1. Import all tracks into your DAW
  2. Sync tracks if they weren't recorded together (use a clap or verbal sync point)
  3. Verify tracks are complete (check beginning and end)
  4. Back up raw files before any processing

Audio Processing Fundamentals

Apply these processing steps to each track before content editing.

Step 1: Noise Reduction

Address consistent background noise:

What to remove:

  • Room tone/hum
  • Computer fan noise
  • Air conditioning rumble
  • Electrical interference

Tools:

  • Adobe Audition: Noise Reduction effect with noise print
  • Audacity: Noise Reduction with noise profile
  • iZotope RX: Industry standard but expensive

Approach: Capture a sample of "silence" (room tone without speaking), then remove that profile from the full track. Be conservative—aggressive noise reduction creates artifacts.

Step 2: EQ (Equalization)

Shape the frequency balance of each voice:

Common adjustments:

  • High-pass filter at 80-100Hz (removes low rumble)
  • Slight reduction around 200-400Hz (reduces muddiness)
  • Slight boost around 3-5kHz (adds clarity/presence)

Preset starting points: Most DAWs include "voice" or "podcast" EQ presets. Start there and adjust to taste.

Warning: Don't overdo EQ. If it sounds "processed," back off. Natural is better than heavily shaped.

Step 3: Compression

Even out volume variations:

What compression does: Reduces the difference between loud and quiet parts, making speech more consistently audible.

Typical settings:

  • Ratio: 2:1 to 4:1
  • Threshold: Catch peaks but not normal speech
  • Attack: 10-30ms (fast enough to catch plosives)
  • Release: 100-200ms (natural decay)

Podcast-specific approach: Light compression works for conversation. Heavily compressed audio sounds squashed and unnatural.

Step 4: Normalization

Bring tracks to consistent loudness:

Target levels:

  • LUFS: -16 to -19 LUFS for stereo podcasts (iTunes prefers -16)
  • Peak: Keep below -1dB to prevent clipping

Process: Most DAWs offer loudness normalization. Apply to the final mix, not individual tracks.

Processing Order

Apply effects in this sequence:

  1. Noise reduction
  2. EQ
  3. Compression
  4. Normalization (on final mix)

This order prevents each step from interfering with others.


Content Editing: What to Cut

The art of interview editing is knowing what to remove.

Always Cut

Technical problems:

  • Connection drops and glitches
  • Audio artifacts
  • Long silences from technical issues
  • Cross-talk that's incomprehensible

Obvious mistakes:

  • "Let me start that again..."
  • Coughing, sneezing (if distracting)
  • Phone ringing, doorbell interruptions
  • Off-topic tangents that went nowhere

Filler that adds nothing:

  • Long pauses (longer than 3-4 seconds)
  • Repeated words at sentence starts
  • False starts when someone restarts a thought

Maybe Cut (Use Judgment)

"Um" and "uh": Some verbal fillers are natural and shouldn't be removed. Cut only when they're excessive or distracting.

Pauses: Natural thinking pauses add authenticity. Cut only if they feel awkward or break flow.

Tangents: Interesting tangents are part of conversation's charm. Cut only if completely off-topic and uninteresting.

Guest stumbles: Don't make guests sound artificially polished. Cut clear errors; leave authentic speech patterns.

Rarely Cut

Complete thought changes: If a guest changed their mind mid-answer, that's often interesting.

Host questions: Cutting questions to shorten episodes removes context. If time is the issue, cut the answer, not the question.

Disagreements or awkwardness: Real conversation includes these. Unless truly uncomfortable, they add texture.


Assembling the Final Episode

Once tracks are processed and content is edited, assemble the complete episode.

Episode Structure Assembly

Standard structure:

1. [Cold open - optional] (0:00-0:30)
2. [Intro music] (0:00-0:15)
3. [Host introduction] (0:15-1:00)
4. [Guest introduction] (1:00-2:00)
5. [Main interview] (2:00-45:00)
6. [Closing] (45:00-47:00)
7. [Outro music + CTA] (47:00-48:00)

Adding Music and Intros

Theme music placement:

  • Under opening intro (ducked low enough to hear speaking)
  • Brief sting at transitions (if using)
  • Under closing/outro

Volume levels: Music should be clearly audible when alone, then duck significantly (-12dB to -18dB) under speech.

Fading:

  • Fade in music over 0.5-2 seconds
  • Fade out music over 1-3 seconds
  • Crossfade between music and speech

Pre-Recorded Elements

Create once, use forever:

  • Show intro (who you are, what the show is about)
  • Sponsor reads (update as needed)
  • Call-to-action (subscribe, review, visit website)
  • Outro (thanks for listening, see you next week)

Save these as separate audio files and drop into each episode at the appropriate spots.


Efficiency Techniques

Editing faster means more consistent publishing and less burnout.

Create Templates

Build a DAW project template with:

  • Track layout (host, guest, music, sfx)
  • Standard processing chains already applied
  • Intro and outro already placed
  • Markers for standard insertion points

Starting each edit from this template saves 15-30 minutes.

Batch Process

When editing multiple episodes:

  • Process all raw files first (noise reduction, EQ, compression)
  • Make content cuts across all episodes
  • Assemble all episodes
  • Export all at once

Staying in one mode (processing, then cutting, then assembling) is faster than doing complete episodes one at a time.

Keyboard Shortcuts

Learn your DAW's shortcuts for:

  • Ripple delete (remove selection and close gap)
  • Split at playhead
  • Zoom in/out on waveform
  • Jump to marker
  • Add marker

The difference between mouse-clicking and keyboard shortcuts compounds over hours of editing.

Edit While Listening at 1.5x

Many editors listen at increased playback speed:

  • Faster scan through content
  • Pauses and issues stand out more
  • Return to normal speed for precise cuts

This requires practice but can reduce editing time by 20-30%.

Use Transcripts

Editing with a transcript alongside audio:

  • Scan content visually instead of listening
  • Search for specific words or phrases
  • Mark cuts in text, then locate in audio
  • Dramatically faster than audio-only editing

Transcripts transform editing from a listening task to a reading task—which is much faster.


Quality Control Checklist

Before exporting, verify:

Audio Quality

  • No distortion or clipping
  • Consistent volume between speakers
  • Background noise acceptable
  • Music levels appropriate
  • No audio glitches

Content

  • Introduction is clear and engaging
  • No awkward cuts or jumps
  • Flow feels natural
  • Closing is complete with CTA
  • Episode length matches target

Technical

  • Correct export format (usually 128kbps MP3)
  • ID3 tags filled in (title, artist, artwork)
  • File name follows convention
  • Backup saved

Final Listen

Listen to:

  • First 2 minutes (first impression matters most)
  • Middle 2 minutes (spot-check main content)
  • Last 2 minutes (ending should be clean)

If these samples sound good, the full episode likely does too.


FAQ

How long should editing an interview podcast take?

For a 60-minute interview, target 1-2 hours for a light edit. This includes audio processing, content trimming, and adding intro/outro. If editing consistently takes longer, you're either over-editing or need to improve recording quality so less cleanup is required.

Should I remove all the "ums" and "ahs" from interviews?

No. Natural speech includes verbal fillers. Remove only when they're distracting—like multiple ums in a row or a very long pause filled with ums. Over-removing makes conversation sound robotic and makes cuts more obvious.

What software should I use for podcast editing?

For beginners: Audacity (free) or GarageBand (free on Mac). For serious podcasters: Adobe Audition ($23/month), Hindenburg Journalist ($95/year), or Descript (starts free, paid tiers for more features). Descript is particularly useful because it lets you edit audio by editing text.

Should I edit out content that makes my guest look bad?

Context matters. Genuine stumbles and self-corrections are human. Statements that could be defamatory, factually incorrect in harmful ways, or that the guest explicitly asked to remove should be cut. When uncertain, ask the guest before publishing.

How do I balance audio between host and guest when one is much louder?

Process each track separately. Apply compression to even out each person's volume individually, then adjust track levels so they match each other. Normalization should happen on the final mix, not individual tracks.



Editing Gets Easier

Every interview you edit teaches you something. You develop instincts for what to cut, ear for what sounds wrong, and speed in using your tools.

The biggest efficiency gain comes from better recording. Clean source audio requires less editing. Invest in your recording setup and process before investing more time in editing skills.

When editing interview podcasts, having a transcript makes the process dramatically faster. Instead of scrubbing through audio to find a specific moment, you can search and scroll. Automatic transcription turns hours of editing into minutes of reading.

Try PodRewind free and edit interviews faster with searchable transcripts.

editing
workflow
production
post-production

Ready to Get Started?

Search your podcast transcripts, chat with your archive, and turn episodes into content. Start for free today.

Try PodRewind free