Start free

Transcribe
voice recordingsaudio and videoYouTube videosaudio filesvideo filesMP4 videosZoom meetingsMicrosoft TeamsGoogle MeetinterviewspodcastslecturesTikTok videosWhatsApp voicevoice memosMP3 filesphone callssermons
into text. In seconds

Speech-to-text & AI transcription software for audio and video. Convert MP3, MP4, or voice to text with speaker labels and AI summary, usually faster than realtime.

Drop your audio or video

MP3 · MP4 · WAV · M4A · MOV · up to 10 hours per file

Paste a link, we'll fetch the audio

YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify · 50+ méi

Direkt aus dem Browser ophuelen

Aschreiwen dauert 30 Sekonnen — d'Opnam mécht direkt duerno op, am Dashboard.

Free 30 min/moNo card100+ 100+ languagesSprieche-Labels (Pro+)Files auto-delete in 24h

Free tier: 30 minutes per month, up to 30 min per file. No card required.

100+
Sproochen automatesch erkannt
Auto-detect with manual override.
95%+
Genauegkeet bei propperem Audio
Most major languages, one or two speakers.
10h
Max. Dateilängt um Business
10 St. um Pro · 30 Min. um Free.
~30×
Méi séier wéi an Echtzäit
Eng 60-Min-Datei ass typesch an 2–3 Min zréck.
This is the dashboard

Click around. It's the real thing

Tabs funktionéieren. To-dos kann een ëmschalten. Genee dat lued an dengem Kont nodeems e Job fäerdeg ass — selwecht Layout, selwecht Controlen.

app.transcription.solutions / jobs / interview-ari-2026-04-26

Summary

auto-snapshot · saved
TL;DR

Grënner brauche Content no engem Call, net just Transkriptiounen. Tools forcéiere se, 5 Apps zesummen ze flécken.

318words2speakers · 58 / 425topics

Key points 3

  • 01Gap exists between raw recordings and shippable content
  • 02Show notes, social clips, blog drafts — expected by call's end
  • 03Aktuell Tools opgesplécktert iwwer 5+ Appen

Action items 2

  • Investigate single-pipeline approach to replace 5-app stitch
  • Skizz, wéi e Show-Note-Draft aus dëser Transkriptioun ausgesäit
Topicsfounder workflowContent nom Calltooling fragmentationshow noteseng eenzeg Pipeline

Transkriptioun mat Spriecher

4 Zeilen · 2 Speaker · 30s Clip
00:12Spriecher ASo what I keep hearing from founders is this gap between raw recordings and content you can actually ship.
00:27Speaker BGenau. Kee wëll nach eng Transkriptioun — si wëllen eng Show Note, e Clip, e Blog-Draft, bis den Uruff eriwwer ass.
00:41Spriecher ARight, and the tooling right now forces you to stitch five apps together to get there.
00:54Speaker BOne pipeline, one place. That's the bet.

Speaker analysis

Stereo-Kanalsplit · Diarisatioun op Mono
Spriecher A
58% airtime
2
Turns
14s
Talk time
…this gap between raw recordings and content you can actually ship.
Speaker B
42% airtime
2
Turns
10s
Talk time
One pipeline, one place. That's the bet.

Export formats

All Plang, all Format · 7 outputs · no watermarks · TXT · SRT · MD · JSON · VTT · DOCX · PDF
TXT

Plain text

Propperen Textdump · all Pläng

SRT

SubRip-Ënnertitel

Timestamped subtitle · all plans

MD

Markdown

Speaker headers + summary · all plans

JSON

Structured JSON

Ëffentlecht Schema · fir API-Workflows · all Pläng

VTT

WebVTT subtitle

HTML5-Videoplayer-Format · all Pläng

DOCX

Word document

Speaker-Header + Zäitstempel · all Pläng

PDF

Gebrandet PDF

Print-ready · summary & speakers · all plans

DEMO · OUNI TOUN
0:18 / 1:00
Sample output · 30 seconds of a podcast clip

One file. Aacht Saachen zréck

Fuer mat der Maus oder tipp op all Output fir ze gesinn, wéi en richteg ausgesäit. Selwechte 30-Sekonnen-Podcast-Clip an der Mëtt, aacht Artefakte doraus.

Transkriptioun

Punctuated · timestamped

00:12 Spriecher A
Wat ech vu Founder ëmmer erëm héieren, ass dës Lück …
AI-Resumé

TL;DR · key points

Founder brauchen Post-Call Inhalt, not just transcripts. Tools force them to stitch 5 apps together.
Speakers

Diarization · Pro+

Stereo-Kanalsplit fir Gespréicher mat zwee. Mono-Diarisatioun fir alles anescht.
100+ languages

Automatesch erkennen

Research-grade ASR. Force a specific language if auto-detect picks the wrong one.
interview-ari-2026-04-26.mp3
30-Sekonne-Clip · 2 Speaker
100+ Sproochen · auto-erkennt · 95%+ Genauegkeet
Transcript · 30s window
00:12
ASo what I keep hearing from founders is this gap.
00:14
ADen Uruff ass eriwwer, déi richteg Aarbecht fänkt un.
00:18
BRight — post-call eats the day.
00:21
ATools assume the transcript is the deliverable.
00:24
AEt ass den Input.
00:27
BSo you stitch five apps together by hand.
AI-Resumé
TL;DR: Founder brauchen Post-Call Inhalt, not raw transcripts. Today's tools force a 5-app workflow.
Key points
  • Transcript is the input, not the deliverable
  • To-dos schloen de Rohtext
  • One pipeline beats stitched-together SaaS
Diarizatioun · 2 Speaker erkannt
Spriecher A
Speaker B
0:000:150:30
Stereo-Kanal-Split · 62 % / 38 % Riedaarbecht
Language detection
English (en-US)99.2%
Aner Kandidaten
en-GB English (UK)0.6%
en-AU Englesch (AU)0.2%
Beim Upload erkannt · jidder Zäit iwwerschreiwen · 100+ Sproochen
Exports · 7 formats · no watermarks
TXT interview-ari-2026-04-26.txt34 KB
SRT interview-ari-2026-04-26.srt52 KB
VTT interview-ari-2026-04-26.vtt51 KB
MD interview-ari-2026-04-26.md38 KB
JSON interview-ari-2026-04-26.json71 KB
DOCX interview-ari-2026-04-26.docx91 KB
PDF interview-ari-2026-04-26.pdf146 KB
URL-Import · 1500+ Säite ënnerstëtzt
youtube.com/watch?v=Hk8L4mD2pXv
Fetch metadata0.3s
Audio eroflueden4.2 MB
Extract speechstereo · 44 kHz
An der ASR-Schlaang
REC00:42 / 60:00
Safari um iPhone · Chrome um Desktop
Auto-stops at 60 min — upload longer files
Live-Job-Status
Eroplueden0:08
Audio-Extrakt0:02
ASR · AssemblyAI U-247%
Diarizationan der Schlaang
AI-Resuméan der Schlaang
Export-Renderingan der Schlaang
Status pushed step-by-step · no refresh needed
Exports

7 formats · no watermarks

TXTSRTMDJSONVTTDOCXPDF
URL-Import

YouTube · TikTok · Instagram

Paste any video link. We download once, transcribe, and discard the source.
Browser record

Mic in iPhone Safari · Chrome

Hit record, talk, hit stop. No app install. Up to 60 min per recording.
Echtzäit-Fortschrëtt

WebSocket-Job-Status

Live status from upload → ASR → diarization → done. No polling, no waiting blind.
Wien dat benotzt

Transcription software built for the people who actually do the work

Three patterns we see weekly. The pipeline doesn't change — what you ship after it does.

01Podcaster

Episode show notes shipped

Eng laang Interview gëtt zu enger 5-Zeilen-Zesummefaassung, véier Kapitelen, enger Transkriptioun mat Sprieche-Labelen an enger SRT fir kuerz Clipsen — ee Job, all Output, deen s du wierklech benotz.

7 FormaterTXT · SRT · MD · JSON
VTT · DOCX · PDF
02Fuerscher

Long-form interviews, cited by timestamp

Dräi-Stonnen-Zoom-Opzeechnunge mat zwou Stëmmen, Enn-zu-Enn. Speaker-Diarizatioun um Pro. Zitéier no Zäitstempel aus dem DOCX-Export. Schluss mam „wou hu si dat gesot …"-Scrubben.

95%+ASR-Genauegkeet
op proppere Toun
03Small teams

Recordings action items assignees

No auto-join, no calendar permissions, no "agent in your meeting." Drop the recording, share the transcript. Action items extracted, named, ready for triage.

2,500Minutten pro Mount
um Business-Plang
Inputs we accept

Drop a file, paste a link,
or call our API

Six ways in, working today. Each pill is a real ingest path that ships in production right now.

YouTubeTikTokInstagramDirect media URLPublic REST APIWebhooksYouTubeTikTokInstagramDirect media URLPublic REST APIWebhooksYouTubeTikTokInstagramDirect media URLPublic REST APIWebhooksYouTubeTikTokInstagramDirect media URLPublic REST APIWebhooks
Pricing

Pläng déi
actually fit

All plans include diarization-quality ASR. Higher tiers unlock larger files, queue priority, and AI summary.

MéintlechAnnual −50%
Free
$0forever
Keng Kaart · keng Trial-Aaflafzäit

For trying out, occasional one-offs, short clips.

  • 30 minutes per month
  • Bis zu 30 Min pro Datei
  • All 7 Export-Formater · keng Wasserzeechen
  • Low-priority queue
Start free →
E-Mail-Verifikatioun erfuerderlech
Most popular
Pro
$19$19/ Mount
Cancel anytime · $0.04 / min overage

For people running interviews, podcasts, or repeated long-form work.

  • 600 Minutten pro Mount
  • Bis zu 10 Stonnen pro Datei
  • Sprieche-Labels + AI-Zesummefaassung
  • To-dos + Theme-Tags
  • “Make readable” paragraph polish
  • Translation · webhook delivery
  • Standard queue priority
Pro wielen →
Iwwerschoss $0.04 / Min · zu all Moment kënnegbar
Business
$49$49/ Mount
Zu all Moment kënnegbar · $0.02 / Min Iwwerschoss

Fir Teams, Agencen an Ops-Crewen, déi mat Volumen schaffen.

  • 2 500 Minutten pro Mount
  • Up to 10 hours per file
  • Alles aus Pro · 50 Iwwersetzungen / Mount
  • Prioritéits-Queue
  • Public REST API · per-key rate-limit tier
  • Prioritäre E-Mail-Support
Choose Business →
Overage $0.02 / min · cancel anytime

Joresofschloss spuert 50% · Réckerstattungspolitik · No card required for Free

Same audio · two outputs

Free gëtt der d'Wierder.
Pro ships deliverables.

Same audio, same model. The difference is everything we do after the transcription finishes.

Free output

So what I keep hearing from founders is this gap between raw recordings and the content they can actually ship. Exactly, nobody wants another transcript, they want a show note, a clip, a blog draft, by the time the call ends. Right, and the tooling right now forces you to stitch five apps together to get there. One pipeline, one place. That's the bet. We've been seeing this pattern for months — the audio comes in clean, but the workflow downstream is held together with screenshots and copy-paste between Notion and Otter and Zapier and wat soss grad an engem aneren Tab op ass, wann den Uruff op ass an den Deadline an zwanzeg Minutten leeft …

Plain transcriptKeng Speaker-LabelsKeng ZesummefaassungAll 7 formats

Duerno: iergendwou paaschten, strukturéieren, d'Zesummefaassung selwer schreiwen, To-dos mat der Hand erauszéien.

Pro output
TL;DR

Founder brauche keng Transkriptiounen — si brauche Post-Processing. Eng Pipeline schléit fënnef Appen zesummenzeflécken.

00:12 Spriecher ASo what I keep hearing from founders is this gap between raw recordings and content you can actually ship.
00:27 Speaker BGenau. Kee wëll nach eng Transkriptioun — si wëllen eng Show Note, e Clip, e Blog-Draft, bis den Uruff eriwwer ass.
00:41 Spriecher ARight, and the tooling right now forces you to stitch five apps together to get there.
00:54 Speaker BOne pipeline, one place. That's the bet.
Action items · 2
  1. Try a unified pipeline — audio in, notes & exports out, one job.
  2. Replace the Otter + Notion + Zapier stack before the next call.
TL;DR · 1 ZeilSpeaker · diariséiertAction items · 2“Make readable” polish

Duerno: copy TL;DR into Slack, attach the DOCX to email, ship the clip. Done before the call notes get cold.

— Selwecht Audio · Selwecht Modell · Den Ënnerscheed läit am Post-Processing —

An der Praxis

Wat eis Benotzer kritt de Mond net zou about

Unprompted reviews from signed-in users. We don't run review-incentive campaigns. Hover to pause.

MR
Maya Reyes
@mayarcuts · podcaster

Podcaster opens 5 tabs to ship one episode. Eng Datei eran — Show Notes, Transkriptioun, Clip-fäerdeg SRT eraus. Méi net.

Apr 181 job in
DA
Dr. Diego Alarcón
@diegoalarcon · researcher

14 long-form interviews through diarization. DER 0.95 op propperem Audio ass real. DOCX-Exporter ginn direkt an den Paper-Draft.

Apr 22DER 0.95
SO
Sora Okafor
@sorawrites · Schrëftstellerin

26 voice memos. 3 TikTok URLs. Newsletter draft outline in 11 minutes. Try beating that with Otter — I'll wait.

Apr 1911 min
MR
Maya Reyes
@mayarcuts · podcaster

Podcaster opens 5 tabs to ship one episode. Eng Datei eran — Show Notes, Transkriptioun, Clip-fäerdeg SRT eraus. Méi net.

Apr 181 job in
DA
Dr. Diego Alarcón
@diegoalarcon · researcher

14 long-form interviews through diarization. DER 0.95 op propperem Audio ass real. DOCX-Exporter ginn direkt an den Paper-Draft.

Apr 22DER 0.95
SO
Sora Okafor
@sorawrites · Schrëftstellerin

26 voice memos. 3 TikTok URLs. Newsletter draft outline in 11 minutes. Try beating that with Otter — I'll wait.

Apr 1911 min
JV
Jules Verstappen
@julesverops · Ops

Webhook + action-items extraction killed our weekly-recap-doc thing. Whole loop is 2 Minutten elo.

Apr 232 min loop
RK
Rohan Kapoor
@rohan_legal · counsel

Opnamen vun Deposéierungen → Transkriptioun mat Spriecher → PDF mat Zitatien. Mir hu fréier dat no baussen ginn. Elo ass et one upload.

24. Abr1 upload
EM
Elena Marchetti
@elenamarch · sales

Italian sales calls → English summaries. My team finally reads them. Tiny detail, huge impact.

Apr 27IT → EN
JV
Jules Verstappen
@julesverops · Ops

Webhook + action-items extraction killed our weekly-recap-doc thing. Whole loop is 2 Minutten elo.

Apr 232 min loop
RK
Rohan Kapoor
@rohan_legal · counsel

Opnamen vun Deposéierungen → Transkriptioun mat Spriecher → PDF mat Zitatien. Mir hu fréier dat no baussen ginn. Elo ass et one upload.

24. Abr1 upload
EM
Elena Marchetti
@elenamarch · sales

Italian sales calls → English summaries. My team finally reads them. Tiny detail, huge impact.

Apr 27IT → EN
TN
Tomi Nakamura
@tominaka · translator

Japanesch automatesch erkennen leeft einfach. Den Serif-Italic op dëser Säit ass awer e komplett anert Designverbriechen, dat ech respektéieren.

Apr 21auto-detect
PL
Priya Lakshmi
@priyalbuilds · founder

REST API + per-key rate-limit = our internal voice-memo pipeline. Took 30 minutes fir alles unzeschléissen. $19/Mount fir d'ganzt Team.

Apr 2519 $/Mount
FA
Fatima Al-Rashid
@fatima_writes · journalist

24h auto-delete is the feature I wousst net, datt ech dat wollt bis ech d'Privatsphär-Säit vu jidwer Konkurrent gekuckt hunn.

26. Abr.24h Läsche
TN
Tomi Nakamura
@tominaka · translator

Japanesch automatesch erkennen leeft einfach. Den Serif-Italic op dëser Säit ass awer e komplett anert Designverbriechen, dat ech respektéieren.

Apr 21auto-detect
PL
Priya Lakshmi
@priyalbuilds · founder

REST API + per-key rate-limit = our internal voice-memo pipeline. Took 30 minutes fir alles unzeschléissen. $19/Mount fir d'ganzt Team.

Apr 2519 $/Mount
FA
Fatima Al-Rashid
@fatima_writes · journalist

24h auto-delete is the feature I wousst net, datt ech dat wollt bis ech d'Privatsphär-Säit vu jidwer Konkurrent gekuckt hunn.

26. Abr.24h Läsche
FAQ

Questions people actually ask

Wéi genee ass d'Transkriptioun?+

Bei klorem Toun mat een oder zwee Spriecher kënnt d'Genauegkeet op 95%+ an de meeschte gréisseren Sproochen. D'Qualitéit fält bei Hannergrondsgeräischer, staarken Akzenter oder iwwerlappendem Schwätzen.

Wéi eng Sproochen?+

100+ languages with auto-detect. You can also force a specific language if auto-detect picks the wrong one. UI is English-only — multi-language interface is on the planned list.

How long do you keep my files?+

Source media (the audio/video you uploaded) is deleted from our infrastructure within 24 hours after transcription completes. The transcript and summary stay in your account until you delete them — or 30 days after you delete your account. Our speech-to-text providers (AssemblyAI primary, OpenAI fallback) process audio under their own retention policies — see /privacy fir déi voll Lëscht vun den Subprocessoren.

Do you train models on my recordings?+

No. Our upstream ASR provider has training opt-out by default for paid endpoints — we use those. We add nothing on top: no own models trained on your transcripts, no shadow analytics.

What happens if a job fails?+

Your minutes are not deducted. Most failures (private URL, file too long, codec we don't support) come with a clear error message and retry guidance.

Kann ech kënnegen?+

Yes — anytime in the Stripe customer portal. You keep your plan through the paid period, then drop to Free at the next renewal date.

Wéi ass d'Réckerstattungspolitik?+

Full refund within 7 days if you've used less than 10% of your plan minutes. After that, pro-rated refunds for the unused portion. Email support@transcription.solutions.

Hutt der eng API?+

Yes — REST API is live, webhooks too. API key auth is on the next-up list. Rate limits per plan tier. Docs at /docs/api once you have an account.

Sécherheet & Privatsphär

The boring stuff, handled

Keen SOC 2-Sticker. Wa mir e Control nach net ausgeliwwert hunn, setze mir keen Badge drop.

100%
Automatesch Läschen
of source files within 24 hours, every time
0
Tracker · Reklammen · Weiderverkaf
Däin Audio gëtt ni benotzt fir Modeller ze trainéieren
1×
Klick fir ze läschen
Account + all data wiped within 30 days

Source files erased in 24h

Audio and video you upload disappear within 24 hours of the job finishing. Hard contract, not a setting.

No training on your data

Eisen ASR-Provider huet d'Training-Opt-out par défaut — mir benotzen genee déi Endpunkten. Mir setzen näischt uewendrop.

AES-256 + TLS 1.3

Encryption at rest and in transit, since day one. HSTS enforced.

GDPR-aligned

EU access / deletion / portability rights honored. DPA on request.

One-click deletion

Astellungen → Kont läschen. All Donnéeë bannent 30 Deeg geläscht. Kee Support-Ticket néideg.

Lëscht vun de Subprocessoren

Full vendor list with purpose at /privacy. No surprise vendors.

— READY WHEN YOU ARE

Drop a file.
Get a transcript
ier däi Kaffi kal gëtt

30 free minutes a month, up to 30 min per file. No credit card, no card-after-trial, no asterisks. Cancel any plan anytime in one click.

Free / month30 min
Languages100+
Export formats7
MP3MP4WAVM4AMOVMKVWEBMYOUTUBETIKTOKINSTAGRAMBROWSER RECORD