Uploading Audio

There are three ways to upload your session audio to GM Assistant, depending on how you recorded. Pick whichever matches your recording.

Limits:Files are limited to 4 GB and sessions are limited to 10 hours long.

Single File

One audio file containing the whole session and all speakers. E.g. a phone in the middle of the table, a single mic capturing everyone, a Discord call recorded as one combined stream.

Open New Session in your campaign
Leave the audio toggle on Single Audio File
Pick your file and continue

Multi-Part Audio

Sometimes a recording gets split mid-session: a Discord call drops, a recorder hits a file size limit, you restart partway through. You don't need to merge the files yourself.

Switch the audio toggle to Multiple Audio Files
Add every piece to the same track using Add Part. The parts concatenate in order.

Multi-part also works inside multi-track: each track can have its own parts.

Multi-Track and Speaker Labels

If your session audio has speakers separated across tracks, GM Assistant can label every line in your transcript with who said what and use that information to produce a noticeably more accurate analysis.

Two benefits:

Speaker labels in your transcripts. All transcript formats (plain text, SRT, VTT, JSON) will have speaker labels.
Here's a plain text example:

GM: A goblin steps out from behind the rocks, drawing a rusted shortsword.
Gandalf: I light my staff and call out to it. Stand down, creature.
Frodo: I keep my hand on the hilt of Sting, watching the treeline.
GM: The goblin hisses, glances over its shoulder, and starts backing away.
Gandalf: Don't let it run. We need to know who it answers to.

More accurate analysis. Recaps, memorable moments, and other notes all benefit because GM Assistant isn't guessing who said what during overlap or rapid exchanges. Attribution of actions and decisions to the right character is noticeably better.

When multi-track works (and when it doesn't)

The rule: Each track should hold one or more known speakers, but a given speaker should only appear on one track.

✓Works

One track per player headset on a Discord call.
A couple sharing a single mic, with both names assigned to that track.
The GM voicing every NPC on the GM track.

✗Does NOT work

Multiple mics in the same room (everyone sitting around a table, two phones recording from different angles, etc.). Every mic picks up every speaker, so there's no segregation for multi-track to attribute.

If your setup falls into the right column, a single audio file is the right choice. Pick the best recording or pre-mix them into a single track before uploading.

About background noise

Tracks don't have to be perfectly clean.

Kids in the next room, dice rolls, snacks, ambient music, a player briefly chiming in on someone else's mic.... all fine. The system uses your speaker labels as the source of truth for attribution; minor cross-talk or noise on a track won't confuse it.

To capture per-speaker audio in the first place, see the Multi-Track Recording section of the recording guide for setups that work with Craig bot, Zencastr, OBS, and per-player mics.

Docs

Uploading Audio

Single File

Multi-Part Audio

Multi-Track and Speaker Labels

When multi-track works (and when it doesn't)

About background noise