Skip to main content
If you have recordings from a source that isn’t directly integrated with CallVault — or you just have a file on your computer — you can upload it directly. CallVault will transcribe it and generate an AI summary just like any other call.

Uploading a file

  1. Open the workspace or folder where you want the call to live
  2. Click Upload in the top-right corner
  3. Select your file from your computer (or drag and drop it onto the upload area)
  4. Give the call a title if you’d like — otherwise CallVault uses the filename
  5. Click Upload
Transcription starts automatically. Depending on the file length, it usually takes 1–5 minutes.

Supported file formats

FormatExtensionNotes
MP4.mp4Most common video format — works great
M4A.m4aAudio exported from Apple devices
MP3.mp3Widely supported audio format
WAV.wavUncompressed audio — high quality
WebM.webmCommon browser-recorded format
OGG.oggOpen audio format
MOV.movQuickTime video files
MKV.mkvMatroska video container
If your file is in a less common format and upload fails, convert it to MP4 or MP3 using a free tool like Handbrake or VLC before uploading.

File size and length

  • Maximum file size: 2 GB per file
  • Maximum recording length: There’s no hard cap on duration, but very long recordings (3+ hours) may take longer to transcribe
For multi-hour recordings like all-day workshops, consider splitting the file into segments for faster processing and easier navigation.

Transcription time

Transcription typically runs faster than real-time:
Recording lengthApproximate transcription time
Under 30 minutes1–2 minutes
30–60 minutes2–4 minutes
1–2 hours4–8 minutes
2+ hours8–15 minutes
You’ll receive a notification when transcription is complete and the call is ready to review.

Bulk upload

You can upload multiple files at once by selecting more than one file in the upload dialog. Each file becomes a separate call record and is queued for transcription individually.

After upload

Once transcription completes, the call record will have:
  • A full text transcript with speaker labels
  • An AI-generated summary with action items and key topics
  • All the same features as calls imported from Fathom or Zoom
Speaker labels from uploaded files are inferred by CallVault’s transcription engine and may be less accurate than integrations where participant data is available. You can manually correct speaker names in the transcript view.