Twitter transcription — Spaces, video posts, voice notes to text

Twitter transcription.Spaces, videos, voice notes to text.

Drop the MP3 from a recorded Twitter Space — or a video, or a DM voice note. Get speaker labels, timestamps, and an SRT in 99 languages. No X Premium needed.

Drop your audio or video

MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously

Paste a link, we’ll fetch the audio

YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more

Record straight from your browser

No card required~90s per 60-min fileSRT · VTT · DOCX · TXTFiles auto-delete in 24h

Space recording in. Labeled transcript out.

X exports a Space recording as a single mixed MP3 — every speaker on one channel. We use acoustic diarization tuned for 6-12 rotating mic holders, the usual Spaces shape.

X Space recording (MP3)REC 5 speakers · 1:14:22

auto-detected en-US44.1 kHz mono · 96 kbps

~90s

Transcript · streaming92% accuracy

Welcome back everyone — we've got about 600 listeners now. Jess, you wanted to jump in on the Solana point?

Yeah, so the throughput numbers from last week are misleading without context on the validator set.

Can I push back on that? Because the mainnet beta data tells a different story.

Go ahead, Mike — keep it tight, we've got two more speakers in the queue.

92% on Spaces MP3SRT · DOCX · TXT · JSON

This is what loads when the job finishes.

Same layout as the real dashboard — Summary, full Transcript, Speakers tab, Exports. Key points and action items extracted automatically. Auto-tags on every job.

app.transcription.solutions / interview-202.mp3Export

Summary 5Transcript 1,420Speakers 2Exports

interview-202.mp347:08128 kbps CBR2 speakersen-US auto-detected

Founders need post-call content, not just transcripts. Tools force them to stitch 5 apps together.

Sample preview from a founder interview about post-call workflow. Real transcripts look exactly like this — same tabs, same summary block, same key-points / action-items split, same auto-tag chips.

Key points

Gap exists between raw recordings and shippable content — tools stop at transcript.

Show notes, social clips, blog drafts all expected by call's end, not next-day.

Current tooling fragmented across 5 apps — no single pipeline.

Conversion-rate signal flipped a buyer-segment assumption at week 3.

40% of original hypothesis survived — the shape held, mechanics rebuilt.

Action items

Speaker 1Investigate single-pipeline approach to replace 5-app stitch.

Speaker 2Mock how show-notes draft could flow from the transcript.

Speaker 2Pull conversion-rate by segment, Monday EOD.

Speaker 1Map the 5-app stitch & list which steps actually need a human.

Auto-taggedfounder interviewpost-call contenttooling fragmentationsingle pipeline

Try it on your own file — it's free

Option 01

X live captions

Real-time captions inside the Spaces UI. Nothing to download, nothing to search.

RequiresLive attendance

Speaker labelsNo

LanguagesEN + a few others

ExportNone — captions only

Post-Space accessLost when Space ends

CostFree with X account

Best forListeners who need accessibility in the moment and don't care about a transcript after.

Option 02

Transcription.Solutions

Drop the Space MP3 or paste the Space URL. Speaker labels, SRT, summary — every plan.

RequiresMP3 download or Space URL

Speaker labelsAcoustic, 2-12 speakers

Languages99, auto-detected

ExportSRT · DOCX · TXT · JSON

AI summaryKey points + topic tags

Cost · per min$0.03

Best forHosts repurposing Spaces into blog posts, podcasts, or YouTube videos with burned-in captions.

Option 03

Otter / Fireflies

Calendar bots designed for Zoom. To capture a Space you have to route audio into a fake meeting.

RequiresAudio loopback rig

Speaker labelsOften collapses to one

LanguagesEN-tuned, others degrade

ExportTXT, DOCX (paid)

AI summaryPaid tier

Cost$17/user/mo

Best forPeople already paying for Otter who want a rough live capture and don't mind setup friction.

Pricing and feature flags accurate as of May 2026. X Spaces caption rollout still varies by region and account type.

8 things people ask about Twitter transcription.

01Can you transcribe a Space that's still live?+

Not in real time. We work from the recording. Wait for the Space to end, download the MP3 from your X dashboard (Spaces → Recorded → Download audio), then drop the file. Most Spaces are available for 30 days after.

02What about a Space that wasn't recorded?+

If the host didn't toggle recording on, X has no file and neither do we. Some third-party tools capture Spaces externally — if you have that MP3 or MP4, we'll take it.

03Can you pull from a Space URL directly?+

Yes, if the Space is still public on X and recording was enabled. Paste the URL on the job form. If X has expired or unlisted it, you'll need the downloaded MP3 instead.

04Do you handle X video posts and Vine-style clips too?+

Yes. Drop the MP4 or paste the post URL. Short clips under 30 seconds are charged at our 1-minute minimum. Longer videos transcribe at the standard $0.03/min.

05What about voice DMs?+

Voice notes from X DMs work — export the audio file from the conversation and drop it. They're usually 30-60 seconds and one speaker, so accuracy is high (94%+) and cost is the per-minute minimum.

06How do speaker labels work when 10 people are on mic?+

We assign generic labels (Speaker 1, Speaker 2…) acoustically. After the transcript loads, you rename them once — usually a 2-3 minute pass against the Space's guest list. Renames apply throughout the file.

07Does the AI summary catch crypto / Web3 terminology?+

Mostly yes — protocol names, L1/L2, common tickers ($BTC, $ETH, $SOL) and slang (gm, wagmi) are in our vocabulary. For obscure projects or new launches, add them to Custom vocabulary before processing.

08Can I get burned-in captions for repurposing a Space as a YouTube video?+

We return SRT or VTT, which you import into your editor (Descript, Premiere, CapCut, DaVinci). We don't render burned-in MP4 ourselves — the SRT is the bridge to whatever video tool you already use.

Twitter transcription.Spaces, videos, voice notes to text.

Drop your audio or video

Paste a link, we’ll fetch the audio

Record straight from your browser

Space recording in. Labeled transcript out.

This is what loads when the job finishes.

Founders need post-call content, not just transcripts. Tools force them to stitch 5 apps together.

X's own captions. Otter. Or us.

X live captions

Transcription.Solutions

Otter / Fireflies

Four things generic transcribers miss on Spaces.

What goes wrong

What to flip here

Recommended job settings for X Spaces

92% on clean Spaces. Lower when Bluetooth shows up.

8 things people ask about Twitter transcription.

Drop your Space MP3. See what comes out.