Start free

Transcribe
voice recordingsaudio and videoYouTube videosaudio filesvideo filesMP4 videosZoom meetingsMicrosoft TeamsGoogle MeetinterviewspodcastslecturesTikTok videosWhatsApp voicevoice memosMP3 filesphone callssermons
转成文字。 In seconds

Speech-to-text & AI transcription software for audio and video. Convert MP3, MP4, or voice to text with speaker labels and AI summary, usually faster than realtime.

Drop your audio or video

MP3 · MP4 · WAV · M4A · MOV · 单文件最长 10 小时

粘贴链接,我们来抓取音频

YouTube · TikTok · Vimeo · Twitter · SoundCloud · Spotify ·还有 50 多个

直接在浏览器里录音

注册只要 30 秒——之后直接在控制台里开始录音。

Free 30 min/mo无需信用卡100+ 100+ languages说话人标注(Pro 及以上)文件 24 小时后自动删除

免费版:每月 30 分钟,单文件最长 30 分钟。无需信用卡。

100+
Languages auto-detected
Auto-detect with manual override.
95%+
Accuracy on clean audio
Most major languages, one or two speakers.
10h
Max file length on Business
Pro 每月 10 小时 · Free 每月 30 分钟。
~30×
比实时还快
60 分钟的文件通常 2–3 分钟就能拿到结果。
This is the dashboard

Click around. 这才是真东西

标签页可切换,待办事项可展开。任务完成后在你账号里看到的就是这个界面——同样的布局,同样的控件。

app.transcription.solutions / jobs / interview-ari-2026-04-26

Summary

auto-snapshot · saved
TL;DR

创始人要的是会后产出物,不只是文字稿。市面工具逼着他们把 5 个应用拼起来用。

318words2speakers · 58 / 425话题

Key points 3

  • 01原始录音和可发布内容之间存在断层
  • 02节目笔记、社媒剪辑、博客初稿——通话结束前就要拿到
  • 03Current tooling fragmented across 5+ apps

待办事项 2

  • Investigate single-pipeline approach to replace 5-app stitch
  • Mock how show-note draft would look from this transcript
Topics创始人工作流会后产出物tooling fragmentation节目笔记single pipeline

带说话人标注的转录稿

4 行 · 2 speakers · 30s clip
00:12说话人 A我从创始人那里反复听到一个问题:原始录音和真正能发出去的内容之间,有一道鸿沟。
00:27Speaker BExactly. Nobody wants another transcript — they want a show note, a clip, a blog draft, by the time the call ends.
00:41说话人 ARight, and the tooling right now forces you to stitch five apps together to get there.
00:54Speaker B一条流水线,一个地方搞定。这就是我们押的方向。

说话人分析

立体声分轨 · 单声道说话人识别
说话人 A
占 58% 发言时长
2
Turns
14s
Talk time
…this gap between raw recordings and content you can actually ship.
Speaker B
42% airtime
2
Turns
10s
Talk time
一条流水线,一个地方搞定。这就是我们押的方向。

Export formats

Every plan, every format · 7 种输出 · 无水印 · TXT · SRT · MD · JSON · VTT · DOCX · PDF
TXT

Plain text

纯文本导出 · 全部套餐支持

SRT

SubRip subtitle

Timestamped subtitle · all plans

MD

Markdown

Speaker headers + summary · all plans

JSON

Structured JSON

公开 schema · 适配 API 工作流 · 全部套餐支持

VTT

WebVTT 字幕

HTML5 video player format · all plans

DOCX

Word document

Speaker headers + timestamps · all plans

PDF

Branded PDF

可直接打印 · 含摘要和说话人 · 全部套餐

演示 · 已静音
0:18 / 1:00
Sample output · 30 seconds of a podcast clip

One file. 拿到 8 样东西

Hover or tap any output to see what it actually looks like. Same 30-second podcast clip in the center, eight artifacts derived from it.

Transcript

Punctuated · timestamped

00:12 说话人 A
So what I keep hearing from founders is this gap…
AI 摘要

TL;DR · key points

Founders need post-call content, not just transcripts. Tools force them to stitch 5 apps together.
Speakers

说话人识别 · Pro+

双人通话用立体声分轨。其他场景用单声道说话人识别。
100+ languages

自动识别

Research-grade ASR. Force a specific language if auto-detect picks the wrong one.
interview-ari-2026-04-26.mp3
30-second clip · 2 speakers
100+ 种语言 · 自动识别 · 95%+ 准确率
转录稿 · 30 秒窗口
00:12
ASo what I keep hearing from founders is this gap.
00:14
AThe call ends, the actual work begins.
00:18
BRight — post-call eats the day.
00:21
ATools assume the transcript is the deliverable.
00:24
AIt's the input.
00:27
BSo you stitch five apps together by hand.
AI 摘要
TL;DR: Founders need post-call content,而不是原始转录稿。今天的工具逼你走 5 个 App 的工作流。
Key points
  • 转录稿是原料,不是成品
  • 待办事项比纯文本更有用
  • 一条流水线,胜过一堆拼起来的 SaaS
Diarization · 2 speakers detected
说话人 A
Speaker B
0:000:150:30
Stereo channel split · 62% / 38% turn share
Language detection
English (en-US)99.2%
Other candidates
en-GB English (UK)0.6%
en-AU English (AU)0.2%
Detected on upload · override anytime · 100+ languages
导出 · 7 种格式 · 无水印
TXT interview-ari-2026-04-26.txt34 KB
SRT interview-ari-2026-04-26.srt52 KB
VTT interview-ari-2026-04-26.vtt51 KB
MD interview-ari-2026-04-26.md38 KB
JSON interview-ari-2026-04-26.json71 KB
DOCX interview-ari-2026-04-26.docx91 KB
PDF interview-ari-2026-04-26.pdf146 KB
链接导入 · 支持 1500+ 站点
youtube.com/watch?v=Hk8L4mD2pXv
抓取元数据0.3s
Download audio4.2 MB
Extract speechstereo · 44 kHz
进入 ASR 队列
REC00:42 / 60:00
iPhone 上的 Safari · 桌面端 Chrome
Auto-stops at 60 min — upload longer files
Live job status
上传0:08
Audio extract0:02
ASR · AssemblyAI U-247%
Diarization排队中
AI 摘要排队中
导出渲染排队中
状态逐步推送 · 无需刷新
导出

7 formats · no watermarks

TXTSRTMDJSONVTTDOCXPDF
URL ingest

YouTube · TikTok · Instagram

Paste any video link. We download once, transcribe, and discard the source.
浏览器录制

Mic in iPhone Safari · Chrome

点录制、说话、点停止。无需安装。单次录制最长 60 分钟。
实时进度

WebSocket job status

Live status from upload → ASR → diarization → done. No polling, no waiting blind.
谁在用

Transcription software built for the people who 把活儿干完

Three patterns we see weekly. The pipeline doesn't change — what you ship after it does.

01Podcasters

Episode 节目笔记 shipped

一段长访谈,一次任务,拿到 5 行摘要、四个章节、带说话人标注的转录稿,还有用来剪短视频的 SRT——全是你真正会用到的产出。

7 种格式TXT · SRT · MD · JSON
VTT · DOCX · PDF
02Researchers

长篇访谈, 按时间戳引用

Three-hour Zoom recordings with two voices, end-to-end. Speaker diarization on Pro. Cite by timestamp from the DOCX export. No more "where did they say that…" scrubbing.

95%+ASR 准确率
音质清晰时
03Small teams

Recordings action items assignees

No auto-join, no calendar permissions, no "agent in your meeting." Drop the recording, share the transcript. Action items extracted, named, ready for triage.

2,500Minutes per month
on Business plan
支持的输入方式

Drop a file, paste a link,
或调用我们的 API

六种入口,今天就能用。每个标签都是一条已经在生产环境跑着的真实接入路径。

YouTubeTikTokInstagramDirect media URLPublic REST APIWebhooksYouTubeTikTokInstagramDirect media URLPublic REST APIWebhooksYouTubeTikTokInstagramDirect media URLPublic REST APIWebhooksYouTubeTikTokInstagramDirect media URLPublic REST APIWebhooks
Pricing

Plans that
刚好合适

All plans include diarization-quality ASR. Higher tiers unlock larger files, queue priority, and AI summary.

MonthlyAnnual −50%
Free
$0forever
No card · no trial expiry

For trying out, occasional one-offs, short clips.

  • 30 minutes per month
  • 单文件最长 30 分钟
  • 7 种导出格式 · 无水印
  • Low-priority queue
Start free →
Email verification required
最受欢迎
Pro
$19$19/ 月
Cancel anytime · $0.04 / min overage

For people running interviews, podcasts, or repeated long-form work.

  • 600 minutes per month
  • Up to 10 hours per file
  • 说话人标注 + AI 摘要
  • 待办事项 + 主题标签
  • 「读起来更顺」段落润色
  • Translation · webhook delivery
  • Standard queue priority
选择 Pro →
超量 $0.04/分钟 · 随时取消
Business
$49$49/ 月
随时取消 · 超量 $0.02/分钟

为有量的团队、代理机构和运营准备。

  • 2,500 minutes per month
  • Up to 10 hours per file
  • Pro 所有功能 · 每月 50 次翻译
  • High-priority queue
  • Public REST API · per-key rate-limit tier
  • Priority email support
选择 Business →
Overage $0.02 / min · cancel anytime

按年付费可省 50% · Refund policy · No card required for Free

同一段音频 · 两种产出

Free 给你文字。
Pro 直接交付成品。

Same audio, same model. The difference is everything we do after the transcription finishes.

Free output

我从创始人那里反复听到一个问题:原始录音和真正能发出去的内容之间,有一道鸿沟。没错,没人想要又一份转录稿,他们要的是节目笔记、剪辑片段、博客初稿——通话一结束就能拿到。对,现在的工具链逼你把五个 App 拼起来才能做到这些。一条流水线,一个地方搞定,这就是我们押的方向。这种情况我们已经看了好几个月——音频进来干干净净,但后面的工作流全靠在 Notion、Otter、Zapier 之间截图和复制粘贴勉强撑住,还有 whatever else happens to be open in another tab when the call wraps and the deadline is in twenty minutes…

Plain transcriptNo speaker labels不含摘要All 7 formats

接下来: 得自己粘到别处、整理结构、写摘要、一条条挑出待办事项。

Pro output
TL;DR

Founders don't need transcripts — they need post-processing. One pipeline beats stitching five apps.

00:12 说话人 A我从创始人那里反复听到一个问题:原始录音和真正能发出去的内容之间,有一道鸿沟。
00:27 Speaker BExactly. Nobody wants another transcript — they want a show note, a clip, a blog draft, by the time the call ends.
00:41 说话人 ARight, and the tooling right now forces you to stitch five apps together to get there.
00:54 Speaker B一条流水线,一个地方搞定。这就是我们押的方向。
Action items · 2
  1. Try a unified pipeline — audio in, notes & exports out, one job.
  2. 下一次通话之前,就能替掉 Otter + Notion + Zapier 这一套。
一句话总结 · 1 行Speakers · diarizedAction items · 2“Make readable” polish

接下来: TL;DR 复制到 Slack,DOCX 附件发邮件,剪辑片段发出去。会议笔记还没凉就全做完了。

— Same audio · Same model · The difference is in the post-processing —

真实使用场景

用户怎么说 won't shut up about

Unprompted reviews from signed-in users. We don't run review-incentive campaigns. Hover to pause.

MR
Maya Reyes
@mayarcuts · podcaster

Podcaster opens 5 tabs to ship one episode. 一次提交 — show notes, transcript, clip-ready SRT out. That's it.

Apr 181 个任务进入
DA
Dr. Diego Alarcón
@diegoalarcon · 研究员

14 long-form interviews through diarization. DER 0.95 on clean audio is real. DOCX exports go straight into the paper draft.

4 月 22 日DER 0.95
SO
Sora Okafor
@sorawrites · writer

26 voice memos. 3 TikTok URLs. Newsletter draft outline in 11 minutes。让 Otter 试试看能不能比这更快——我等着。

Apr 1911 min
MR
Maya Reyes
@mayarcuts · podcaster

Podcaster opens 5 tabs to ship one episode. 一次提交 — show notes, transcript, clip-ready SRT out. That's it.

Apr 181 个任务进入
DA
Dr. Diego Alarcón
@diegoalarcon · 研究员

14 long-form interviews through diarization. DER 0.95 on clean audio is real. DOCX exports go straight into the paper draft.

4 月 22 日DER 0.95
SO
Sora Okafor
@sorawrites · writer

26 voice memos. 3 TikTok URLs. Newsletter draft outline in 11 minutes。让 Otter 试试看能不能比这更快——我等着。

Apr 1911 min
JV
Jules Verstappen
@julesverops · ops

Webhook 加上待办提取,直接干掉了我们每周的会议纪要文档。整个循环 2 分钟 now.

Apr 232 min loop
RK
Rohan Kapoor
@rohan_legal · 法务

庭审录音 → 区分说话人的转录稿 → 带引用的 PDF。以前都外包到海外做,现在 one upload.

4 月 24 日1 upload
EM
Elena Marchetti
@elenamarch · 销售

Italian sales calls → English summaries. My team finally reads them. Tiny detail, huge impact.

Apr 27IT → EN
JV
Jules Verstappen
@julesverops · ops

Webhook 加上待办提取,直接干掉了我们每周的会议纪要文档。整个循环 2 分钟 now.

Apr 232 min loop
RK
Rohan Kapoor
@rohan_legal · 法务

庭审录音 → 区分说话人的转录稿 → 带引用的 PDF。以前都外包到海外做,现在 one upload.

4 月 24 日1 upload
EM
Elena Marchetti
@elenamarch · 销售

Italian sales calls → English summaries. My team finally reads them. Tiny detail, huge impact.

Apr 27IT → EN
TN
Tomi Nakamura
@tominaka · translator

日语自动识别 开箱即用. The serif italic on this site is, however, an unrelated design crime I respect.

4 月 21 日auto-detect
PL
Priya Lakshmi
@priyalbuilds · founder

REST API + per-key rate-limit = our internal voice-memo pipeline. Took 30 minutes 即可接入。整个团队每月 19 美元。

Apr 25$19/mo
FA
Fatima Al-Rashid
@fatima_writes · 记者

24h auto-delete is the feature I didn't know I wanted until I checked the privacy page of every competitor.

Apr 2624h delete
TN
Tomi Nakamura
@tominaka · translator

日语自动识别 开箱即用. The serif italic on this site is, however, an unrelated design crime I respect.

4 月 21 日auto-detect
PL
Priya Lakshmi
@priyalbuilds · founder

REST API + per-key rate-limit = our internal voice-memo pipeline. Took 30 minutes 即可接入。整个团队每月 19 美元。

Apr 25$19/mo
FA
Fatima Al-Rashid
@fatima_writes · 记者

24h auto-delete is the feature I didn't know I wanted until I checked the privacy page of every competitor.

Apr 2624h delete
常见问题

Questions people 真的 ask

转录准不准?+

在音质清晰、一两个说话人的情况下,主流语言准确率可达 95% 以上。背景噪音、口音重或多人抢话会影响效果。

支持哪些语言?+

100+ languages with auto-detect. You can also force a specific language if auto-detect picks the wrong one. UI is English-only — multi-language interface is on the planned list.

我的文件你们保留多久?+

Source media (the audio/video you uploaded) is deleted from our infrastructure within 24 hours after transcription completes. The transcript and summary stay in your account until you delete them — or 30 days after you delete your account. Our speech-to-text providers (AssemblyAI primary, OpenAI fallback) process audio under their own retention policies — see /privacy 查看完整的子处理方列表。

Do you train models on my recordings?+

No. Our upstream ASR provider has training opt-out by default for paid endpoints — we use those. We add nothing on top: no own models trained on your transcripts, no shadow analytics.

What happens if a job fails?+

Your minutes are not deducted. Most failures (private URL, file too long, codec we don't support) come with a clear error message and retry guidance.

可以取消吗?+

Yes — anytime in the Stripe customer portal. You keep your plan through the paid period, then drop to Free at the next renewal date.

What's the refund policy?+

如果你已用套餐时长不到 10%,7 天内可全额退款。超过后,按未使用部分按比例退款。邮件联系 support@transcription.solutions。

有 API 吗?+

可以——REST API 已经上线,Webhook 也有。API 密钥鉴权在下个迭代里。各套餐有不同的速率上限。注册后到 /docs/api 看文档。

安全与隐私

The boring stuff, handled

没有 SOC 2 认证标。该有的控制项没上线之前,我们不挂徽章。

100%
Auto-deletion
of source files within 24 hours, every time
0
Trackers · ads · resale
你的音频不会被用来训练模型
1×
Click to delete
账号及全部数据 30 天内清除

Source files erased in 24h

Audio and video you upload disappear within 24 hours of the job finishing. Hard contract, not a setting.

No training on your data

上游 ASR 服务默认禁用训练数据采集——我们用的就是这些接口,不会在此之上额外采集。

AES-256 + TLS 1.3

Encryption at rest and in transit, since day one. HSTS enforced.

符合 GDPR

EU access / deletion / portability rights honored. DPA on request.

One-click deletion

Settings → Delete account. All data wiped within 30 days. No support ticket required.

子处理方列表

完整供应商清单和用途见 /privacy. No surprise vendors.

— READY WHEN YOU ARE

Drop a file.
Get a transcript
咖啡还没凉就好了

每月 30 分钟免费,单文件最长 30 分钟。不要信用卡,不会试用期后偷扣,没有任何隐藏条款。任何套餐随时一键取消。

免费 / 月30 min
Languages100+
Export formats7
MP3MP4WAVM4AMOVMKVWEBMYOUTUBETIKTOKINSTAGRAMBROWSER RECORD