Six steps · iPhone & iPad · iOS 26.4

From a paste of text
to a finished M4A.

The whole workflow lives on four screens — Script, Voices, Generate, Production — plus a Mix tab at the end. Here's what happens on each, in the order you'll do them. Nothing here requires audio engineering knowledge.

Step 1

Paste your script

On the Script Import screen, paste any dialogue script. Expand the format guide if you want a quick reference. Tap `Review Script` and the parser runs.

Recognized formats: `HOST: line`, `[Host] line`, `(Host) line`, standalone name lines
Audio tags: `[SFX: prompt (Ns)]`, `[Music: prompt (Ns)]`
Built-in sample script for first-time users

Step 2

Review speakers

Turns needing review are highlighted with confidence indicators. Tap one to confirm or reassign. Batch-assign unreviewed turns or merge duplicate speakers from the toolbar.

Green / orange / red confidence per attribution
Suggestions for near-matches (`Did you mean 'HOST'?`)
Color badges per speaker for at-a-glance scanning

Step 3

Assign voices

Open the voice library, search and preview, then map a voice to each speaker. Expand per-speaker settings to dial in stability, similarity, style, and speed. Quick Preview streams a sample of your cast before you commit.

Search and category filters across the ElevenLabs catalog
Inline preview on every voice row
`Find Similar` from any voice's detail view
Optional pronunciation rules per project

Step 4

Generate audio

A progress ring shows percentage, current turn, and estimated time remaining. Cancel any time without losing what's already generated. If something fails, `Resume` picks up from where it stopped.

Speech, SFX, and music all generate in one run
Dialogue mode batches multi-speaker turns for natural flow
Automatic quality checks flag corrupt or silent files

Step 5

Edit in post-production

Play each turn individually. Swipe to regenerate the ones that need work, compare variants side by side, adjust per-turn pauses, and exclude any segments that shouldn't ship.

Single-turn regeneration without touching the rest
Variant picker with `Use` and `Keep Only Active`
Per-turn pause sliders, 0 to 2 seconds
Edit dialogue text and regenerate inline

Step 6

Mix & export

All active, non-excluded turns mix into a single M4A. Listen to the full episode with the waveform scrubber, then share via the iOS share sheet — Files, AirDrop, email, anywhere audio goes.

M4A / AAC output, ready for any podcast host
Waveform scrubber and 15-second skip controls
iOS share sheet — no proprietary export pipeline

If something fails

A dropped connection costs you a tap, not the run.

If generation is interrupted or cancelled, the app detects which turns already have audio and offers a one-tap resume that skips completed turns. Your project stays intact.

If you change your mind

Re-parsing is destructive. The app says so first.

Editing the raw script and re-parsing replaces existing speaker assignments and deletes any generated audio. The app shows you a confirmation before it does. Everything else — variants, pauses, exclusions — is reversible.

Try the workflow first

Demo mode runs every screen, end to end, with no API key.

When no ElevenLabs key is configured, flexCast uses a mock TTS service that returns silent WAV audio with realistic durations. You can paste a script, parse it, assign voices, generate, edit, mix, and export — exactly like you would in the real app. The only thing missing is the voices themselves.

Ask about the beta

Step you can't skip

Demo mode lets you walk the workflow end to end before deciding the app is worth an ElevenLabs account.
Same persistence

Projects, speakers, and pause settings save in SwiftData the same way they would with real audio.
Same export path

A demo mix exports as a real M4A — with silent audio. The pipeline you'll use is the pipeline you tested.

Want the rest of the answers?

The FAQ covers script formats, voice limits, SFX and music duration, the Keychain story, and what flexCast does and doesn't collect.

Read the FAQ → See every feature →

From a paste of text to a finished M4A.

A dropped connection costs you a tap, not the run.

Re-parsing is destructive. The app says so first.

Demo mode runs every screen, end to end, with no API key.

Want the rest of the answers?

From a paste of text
to a finished M4A.