Siri / Google live dictation
Press and hold the keyboard mic icon. Words appear as you speak. No history, no editing buffer, no file upload.
Upload an iPhone voice memo, an Android recording, a WhatsApp voice note — or record live in the browser. Get your voice transcribed to searchable text in minutes.
MP3 · WAV · M4A · MP4 · MOV · MKV · OGG · OPUS · FLAC · WEBM — up to 100 MB anonymously
YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ more
↓ From voice note to text
Drop an iPhone voice memo, a WhatsApp voice note, a Telegram voice message, or hit record live in your browser. Mobile codecs compress voice aggressively — we handle the format quirks server-side with ffmpeg.
Hey, quick voice note — wanted to capture this before I forget. The thing about voice memos is everyone's recording from a different platform.
iPhone uses M4A. WhatsApp uses Opus. Telegram uses Ogg. Android does AMR or M4A depending on the keyboard app. Same voice, four different codecs.
Don't worry about the format — drop it in, transcribe runs the same way. The accuracy ceiling is the mobile bitrate, not the
↓ This loads after you drop a voice memo
iPhone M4A, WhatsApp Opus, Telegram Ogg, Android M4A — all four land in the same dashboard with the same Summary / Transcript / Speakers / Exports tabs. Mobile recording quirks (compression, low bitrate, narrowband) handled at the audio-extraction layer.
Sample preview from a WhatsApp voice note about cross-platform voice-memo workflows. Mirrors what loads in your account: summary, transcript, format-aware export, voice-specific accuracy notes.
Three ways to convert voice to text · honest comparison
Three real ways to get text from a voice recording in 2026. Live dictation gets you through one sentence at a time. AI tools batch a 20-minute voice memo in 30 seconds. Manual typing is what everyone falls back to when the other two fail.
Press and hold the keyboard mic icon. Words appear as you speak. No history, no editing buffer, no file upload.
Drop a voice memo, paste a recording link, or record live. ~30× realtime. Punctuation, paragraph breaks, AI summary, mobile codecs handled.
Listen, pause, type. Slowest. Highest accuracy if the audio is hard or the speaker code-switches frequently.
Siri/Google figures from public iOS / Android speech API benchmarks. Manual typing speed from US/UK transcriber productivity surveys.
Mobile voice formats · what's actually in your file
Four formats cover ~95% of voice memos. All four are accepted directly — we extract the audio with ffmpeg server-side, you never convert manually. Accuracy ceiling varies by codec / bitrate, not by app.
| Format | Extension | Bitrate | Notes |
|---|---|---|---|
| iPhone Voice Memos | .m4a · AAC | 64 kbps | Apple's default. Decent quality for monologue voice notes. Drag-drop from the macOS Voice Memos app works directly. Accuracy ceiling ~95% on clean recording. |
| WhatsApp voice notes | .opus · Ogg container | 24 kbps | Aggressive compression for messaging-grade audio. Lossy by design — accuracy ceiling ~92% even on clean voice. Forward to email or download via WhatsApp web, then upload. |
| Telegram voice messages | .oga / .ogg · Opus | 32 kbps | Slightly higher bitrate than WhatsApp. Accuracy ceiling ~93%. Save the message file from Telegram desktop, drop directly into the upload card. |
| Android voice recorder | .m4a / .amr | 32–128 kbps | Varies by manufacturer — Samsung defaults to M4A 128 kbps, older Android uses AMR 12 kbps. Pixel Recorder app exports clean M4A and reaches 95%+ accuracy. |
Accuracy · real-world numbers
Modern transcription reaches 95%+ word accuracy on clear English at 128 kbps and above, comparable to a human transcriber on the same recording. The audio coming in sets the ceiling — cleaner source, cleaner transcript. The breakdown below covers the recordings we actually see in production.
USB or studio microphone in a treated room. Single speaker at conversational distance. The headline number.
Podcast masters, interview recordings, well-mic'd meetings. The sweet spot for most professional work.
Field-recorded interviews, podcast episodes at 64–128 kbps, multi-speaker recordings. Usable for editorial without a review pass.
Ceiling mic, omnidirectional capture, mild reverb, multiple speakers at distance. Plan a rename pass on the speaker chips.
Common questions
30 free minutes per month, no card required. Drop a voice memo, paste a URL, or record live — see the result on your own audio first.
Start free