Is my audio uploaded anywhere?

Partly. The recording (the WebM file) stays in your browser and is never uploaded. The audio used for transcription IS routed through Google's recognition service in Chrome / Edge / Brave / Arc, because that is how the SpeechRecognition API works in those browsers. Google therefore receives the audio briefly during recognition: per their docs they don't tie it to your account or use it for training, but it does leave your device for that step.

Why does the audio play back fine but the transcript misses words?

Recognition accuracy depends on speech clarity, mic quality, background noise, and whether you picked the right language. Audio recording is a simple bytes-in / bytes-out operation that doesn't care about content quality, so the file is always faithful to what hit the mic. Transcript quality is bounded by the recognition model.

Can I edit the transcript?

Yes — the output box is fully editable. Fix recognition errors, add punctuation the recognizer missed, format the text however you need before copying.

What audio format is the download?

WebM with Opus codec — the browser MediaRecorder default. Plays in Chrome, Slack, Discord, VLC, ffmpeg. Convert to MP3 / WAV with the audio-format-converter tool if your destination needs a different format.
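Browsers differ in which containers MediaRecorder can produce, so a recorder usually checks support before asking for WebM/Opus. A minimal sketch of that check, assuming a preference list of our own choosing; `CANDIDATES` and `pickRecordingMimeType` are hypothetical names, not the tool's actual code. The support check is passed in as a function so the logic can be exercised outside a browser.

```typescript
// Candidate container/codec strings, most preferred first (an assumed ordering).
const CANDIDATES: string[] = [
  "audio/webm;codecs=opus",
  "audio/webm",
  "audio/ogg;codecs=opus",
  "audio/mp4",
];

// Return the first candidate the environment reports as supported,
// or undefined so the caller can fall back to the browser default.
function pickRecordingMimeType(
  isSupported: (mimeType: string) => boolean,
  candidates: string[] = CANDIDATES
): string | undefined {
  for (let i = 0; i < candidates.length; i++) {
    if (isSupported(candidates[i])) {
      return candidates[i];
    }
  }
  return undefined;
}
```

In a browser, the check would typically be `(t) => MediaRecorder.isTypeSupported(t)`, and the result would be passed as the `mimeType` option when constructing the `MediaRecorder`.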

Why does the transcript stop when I take a long pause?

The SpeechRecognition API has a built-in silence timeout (~5-10 seconds) that ends the recognition session. The audio recording continues uninterrupted — only the transcript pauses. The tool restarts recognition automatically when you start speaking again, so the transcript resumes (with a small gap where the silence was).
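One common way to survive the silence timeout is to restart the recognizer whenever its session ends while the user has not pressed Stop. The sketch below shows that pattern under stated assumptions: `RecognizerLike` is a hypothetical interface covering only the `start`, `stop`, and `onend` members of SpeechRecognition, and `AutoRestartingRecognition` is an illustrative name. It restarts immediately on `end` rather than waiting for speech, and it is not necessarily how this tool does it.

```typescript
// Minimal shape of the SpeechRecognition members this sketch touches.
interface RecognizerLike {
  start(): void;
  stop(): void;
  onend: (() => void) | null;
}

// Keeps recognition alive across silence timeouts until stop() is called.
class AutoRestartingRecognition {
  private recognizer: RecognizerLike;
  private active = false;
  restarts = 0; // how many times a timed-out session was restarted

  constructor(recognizer: RecognizerLike) {
    this.recognizer = recognizer;
    recognizer.onend = () => {
      if (!this.active) {
        return; // the user pressed Stop, so let the session end
      }
      this.restarts += 1;
      this.recognizer.start(); // begin a fresh session after the timeout
    };
  }

  start(): void {
    if (this.active) {
      return; // already running
    }
    this.active = true;
    this.recognizer.start();
  }

  stop(): void {
    this.active = false;
    this.recognizer.stop();
  }
}
```

Because the audio recorder runs independently of this wrapper, the recording is unaffected by how often the recognizer restarts.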

Can I record from a USB mic or audio interface?

Yes — the mic dropdown lists every audio input your OS recognizes. Pick yours before recording.
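A mic dropdown like this is usually built from the device list the browser exposes. The sketch below assumes the list comes from `navigator.mediaDevices.enumerateDevices()` and keeps only entries whose `kind` is `"audioinput"`; `DeviceLike`, `listMicOptions`, and `micConstraints` are hypothetical names for illustration. Device labels are empty until microphone permission has been granted, so a placeholder name is substituted.

```typescript
// The subset of MediaDeviceInfo fields this sketch reads.
interface DeviceLike {
  deviceId: string;
  kind: string;
  label: string;
}

// Turn the full device list into dropdown options for microphones only.
function listMicOptions(devices: DeviceLike[]): { value: string; text: string }[] {
  return devices
    .filter((d) => d.kind === "audioinput")
    .map((d, i) => ({
      value: d.deviceId,
      // Labels are blank before permission is granted, so number the fallbacks.
      text: d.label !== "" ? d.label : "Microphone " + (i + 1),
    }));
}

// Constraints object to request exactly the chosen device via getUserMedia.
function micConstraints(deviceId: string) {
  return { audio: { deviceId: { exact: deviceId } }, video: false };
}
```

In a browser, the selected option's value would be passed through `micConstraints` to `navigator.mediaDevices.getUserMedia` before recording starts.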

Voice Note Transcriber

Record a voice note and get a text transcript instantly in the same browser tab. Copy, edit, or paste anywhere—free online tool with no sign-up required.

Updated June 2026

What it does

Record audio and transcribe it at the same time, with the live transcript appearing word by word as you speak. When you stop, you have both: the original audio file (downloadable WebM) and the time-stamped transcript (copyable text). Useful when you want a record of both what was actually said (the audio) and what the recognizer made of it (the transcript), rather than picking one and trusting it.

Common uses: quick meeting notes (record the call, have a transcript to scan afterward instead of re-watching); voice memo capture (record yourself thinking out loud, get text you can search later); interview prep / debrief (transcribe a phone screen or fact-finding call); language practice (read aloud, see what the recognizer thinks you said — exposes mispronunciations and accent issues).

Recording uses MediaRecorder for the audio (output: WebM with Opus codec). Transcription uses the browser's built-in SpeechRecognition API, which on Chrome / Edge / Brave routes audio through Google's hosted recognition service. So while the audio file stays in your browser, the transcript DOES require an internet connection, and your audio is briefly transmitted to Google for processing. If fully local recording and transcription is required (e.g. confidential meetings), use a desktop tool with Whisper running locally.
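A time-stamped transcript can be assembled by stamping each final recognition result with the time elapsed since recording began. This is a minimal sketch under that assumption; `Segment`, `ResultLike`, `appendFinals`, and `renderTranscript` are hypothetical names, and the tool's real timestamp format is not documented here.

```typescript
// One finished phrase and when it was finalized, measured from the start of recording.
interface Segment {
  atMs: number;
  text: string;
}

// Flattened stand-in for a SpeechRecognition result: in the browser,
// isFinal is results[i].isFinal and transcript is results[i][0].transcript.
interface ResultLike {
  isFinal: boolean;
  transcript: string;
}

// Append only final, non-empty results; interim ones may still be revised.
function appendFinals(segments: Segment[], results: ResultLike[], elapsedMs: number): Segment[] {
  const finals = results
    .filter((r) => r.isFinal && r.transcript.trim().length > 0)
    .map((r) => ({ atMs: elapsedMs, text: r.transcript.trim() }));
  return segments.concat(finals);
}

// Zero-pad a number to two digits.
function pad2(n: number): string {
  return (n < 10 ? "0" : "") + String(n);
}

// Format elapsed milliseconds as mm:ss.
function formatElapsed(ms: number): string {
  const totalSeconds = Math.floor(ms / 1000);
  return pad2(Math.floor(totalSeconds / 60)) + ":" + pad2(totalSeconds % 60);
}

// Render one "[mm:ss] text" line per segment.
function renderTranscript(segments: Segment[]): string {
  return segments.map((s) => "[" + formatElapsed(s.atMs) + "] " + s.text).join("\n");
}
```

A phrase finalized 3.5 seconds into the recording would render as `[00:03] hello world`. The audio itself is captured separately by MediaRecorder, so dropped or misheard words here never affect the file.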

Embed this tool on your site

Paste this snippet into any page. It loads on demand (lazy), includes no tracking scripts, and is sized to fit most dashboards. Adjust the height to fit your layout.

<iframe src="https://freetoolarena.com/embed/voice-note-transcriber" width="100%" height="720" frameborder="0" loading="lazy" title="Voice Note Transcriber" style="border:1px solid #e2e8f0;border-radius:12px;max-width:720px;"></iframe>

How to use it

Pick your spoken language from the dropdown. Picking the wrong language produces garbled transcripts.
Click Start. The browser asks for microphone permission the first time; grant it.
Speak naturally. Text appears word by word in the transcript pane; the audio waveform shows your input level.
Click Stop when you're done. The transcript box is editable, so fix any recognition errors before copying.
Two outputs: Download Audio gives you a WebM file; Copy Transcript puts the text on your clipboard. Or download both as a paired set.
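The language choice in the first step matters because the recognizer is told which language to expect through its `lang` property, a BCP 47 tag such as `en-US`. The sketch below shows how a dropdown choice might map to recognizer settings. The option list and the names `LANGUAGE_OPTIONS` and `settingsFor` are hypothetical; only `lang`, `continuous`, and `interimResults` correspond to real SpeechRecognition properties.

```typescript
// Settings that map onto SpeechRecognition properties of the same names.
interface RecognitionSettings {
  lang: string; // BCP 47 language tag, e.g. "en-US"
  continuous: boolean; // keep listening after the first phrase
  interimResults: boolean; // emit partial results for a word-by-word display
}

// Hypothetical dropdown entries; the tool's real list is not shown in the source.
const LANGUAGE_OPTIONS: { label: string; tag: string }[] = [
  { label: "English (US)", tag: "en-US" },
  { label: "Spanish (Spain)", tag: "es-ES" },
  { label: "French (France)", tag: "fr-FR" },
];

// Look up the tag for a dropdown label and build the recognizer settings.
function settingsFor(label: string): RecognitionSettings {
  const matches = LANGUAGE_OPTIONS.filter((o) => o.label === label);
  if (matches.length === 0) {
    throw new Error("Unknown language option: " + label);
  }
  return { lang: matches[0].tag, continuous: true, interimResults: true };
}
```

In a browser, each field would be copied onto the recognizer instance before calling `start()`. Interim results are what make text appear while you are still speaking.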

When to use this tool

Recording quick voice notes where you want both audio and a searchable transcript.
Capturing meeting audio + a rough transcript without a paid transcription service.
Practicing speeches and reviewing both the recording and what the recognizer thinks you said.
Language learning — read aloud, compare the transcript to your intended text.

When not to use it

Confidential / sensitive content: Chrome's transcription routes audio through Google's recognition service. For local-only transcription, use a Whisper-based desktop tool.
Multi-speaker transcripts: the API doesn't separate speakers. Use Otter, Rev, or a Whisper-based diarization tool for proper speaker turns.
Long recordings (>60 minutes): browser memory becomes the bottleneck.
Production-grade transcription: accuracy is ~85-95% in good conditions; for legal, medical, or court-quality transcripts use a professional service.

