Open a browser tab, drop a file or hit record, and watch text appear. No app, no extension, no Chrome-only restriction. Works on iPhone, iPad, Android, Mac, Windows, Linux. Free for first 60 minutes a month.
Free per month. No credit card. No "15-day trial then auto-charge". Sixty minutes is enough to evaluate the result on real audio of yours, or to handle the long tail of one-off transcripts.
Speech-to-text online means converting recorded speech to text directly in your web browser, with no software install. Transcription.Solutions does this three ways: upload a saved audio or video file, paste a URL from YouTube / TikTok / Instagram / 1,500 other sites, or click record and capture from your microphone right in the page. The recording is uploaded as it finishes, transcribed on our servers, and the text comes back within minutes — typically faster than realtime. The source audio is permanently deleted from our infrastructure within 24 hours.
Three ways to start. Pick the one that matches what you have right now — a file, a link, or a microphone.
Drag any audio or video file into the page. Or paste a public link — YouTube, TikTok, Instagram, Vimeo, Twitter, Facebook, podcast feeds. We handle the upload, audio extraction, and transcription server-side. You don't need to convert formats.
Click the record button. The browser asks for mic permission once. Speak, then click stop. The recording uploads automatically and transcription begins. Useful for quick voice memos, dictation, and meeting notes when you don't have a recording app.
The transcript appears in the browser. Edit speaker turns inline, copy paragraphs, run an AI summary, or export to TXT, SRT, VTT, or DOCX. Everything stays in your account; the source audio is wiped within 24 hours.
Captures from your microphone using the standard MediaRecorder API. Works on iOS Safari, Android Chrome, desktop everywhere. No app to install, no extension to permission-creep.
Multi-file drop zone. Resumable uploads on large files (the connection drops; you don't lose progress). Free tier accepts files up to 100 MB; Pro 500 MB; Business 2 GB.
YouTube, TikTok, Instagram Reels, Vimeo, Twitter, Facebook, podcast feeds — paste any public URL. We resolve and extract the audio server-side.
On Pro and Business plans we separate two or more voices automatically. Manual rename per speaker. Free tier has the transcript without diarization.
Long files show partial results as chunks complete. You start reading before the file is finished — no blank loading screen.
If you'd rather not have a UI between you and the transcription engine: REST endpoints for upload, URL, recorder. JWT auth. Same backend, no UI overhead.
The browser flow is good, but for batch jobs and automation you want the API. Three lines of bash to upload a file, three more to paste a URL. JWT auth, webhook callbacks for completion.
# Upload a file → get a job ID
curl -X POST https://api.transcription.solutions/api/v1/jobs/upload \
-H "Authorization: Bearer $TS_API_KEY" \
-F "file=@interview.mp3" \
-F "diarize=true"
# Or paste a URL
curl -X POST https://api.transcription.solutions/api/v1/jobs \
-H "Authorization: Bearer $TS_API_KEY" \
-H "Content-Type: application/json" \
-d '{"source_url": "https://youtu.be/...", "diarize": true}'
# Webhook fires on completion with the transcript URL.
# Or poll: GET /api/v1/jobs/{id} until status = "done".Per-key rate limits apply. JWT auth. Webhook signatures use HMAC-SHA256. Available on every plan including Free for evaluation. Same pricing as the dashboard — no API surcharge.
Full endpoint reference →Auto-detection picks the language; force a specific one in advanced settings if a multilingual file confuses detection. The 8 below are tier-1 — production-grade quality on real-world audio. We support 99 total.
Browser-recorded audio is typically 16 kHz mono via the laptop or phone mic. That's good enough for 95%+ on a quiet room. Bus stops and coffee shops are harder.
MacBook or iPhone mic, indoor, no music in the background. The default condition for a voice memo, dictation, or one-person podcast take.
Phone mic outdoors, conversation while walking, 1–2 metres from the speaker. Most words right; punctuation occasionally drifts on long pauses.
On some browsers, audio recording pauses when you switch tabs. Leave the tab in the foreground for long recordings. Or use a dedicated recorder app and upload the file — same accuracy, no tab risk.
iOS Low Power Mode and Android battery savers can cut recording short. Disable them for long sessions, or plug in.
If you deny mic permission, the recorder won't work — no fallback. Check site permissions in your browser settings, or use the file-upload path instead.
Hardcore privacy extensions occasionally break our auth flow. If recording works but transcripts don't save, try a private window or a different browser.
“I had a Whisper.cpp install on my laptop, three Python venvs, and a habit of "setting it up properly" instead of using it. Closing all of that and pasting a URL into a browser tab sounded like a downgrade. Two weeks in, I haven't opened the laptop install once.”
60 free minutes per month. No card, no app install, no extension. Drop a file, paste a link, or click record.
Start free