Audio to text, free

No signup · no login · nothing to install

Drop your audio here…
  • 100% free
  • No signup
  • Audio stays local
  • Open source model

Turn any audio into text in seconds: 100% free, no signup, no payment, nothing to install. Supports MP3, WAV, OGG, M4A, FLAC and WebM up to 1 hour. Everything runs in your browser with an open-source AI model, so your audio never leaves your device. After the first load it even works without internet.

How it works in 3 steps

  1. Drop your audio

    Drag the file or click to select it. Supports MP3, WAV, OGG, M4A, FLAC and WebM up to 100 MB and 1 hour. No daily limit on how many files.

  2. Pick language and quality

    Defaults to English with the fastest model. Switch the audio language (Spanish, English, Portuguese, French) or bump quality if you need maximum accuracy.

  3. Copy or download the text

    The transcript shows up as soon as it's done. Copy it to your clipboard, download it as plain .txt or as .srt subtitles with timestamps.

Why use it

  • 100% free

    No payment, no credit card, no trial that charges you later.

  • No signup

    No email, no password, no spam. Just open it and use it.

  • Truly private

    Your audio never gets uploaded to any server. The transcription runs in your browser.

  • Nothing to install

    Works in any modern browser: Chrome, Firefox, Safari, Edge.

  • Many formats

    MP3, WAV, OGG, M4A, FLAC and WebM. Up to 1 hour and 100 MB per file.

  • Many languages

    Spanish, English, Portuguese, French. The AI handles accents.

Who it's for

  • Students

    Turn recorded lectures into editable notes. Paste the text into ChatGPT, Claude or Gemini afterwards to summarize or build flashcards.

  • Journalists and researchers

    Transcribe sensitive interviews without uploading audio to an external service. Confidential data never leaves your computer.

  • Podcasters and creators

    Generate transcripts to boost SEO on your blog or .srt subtitles to upload to YouTube, TikTok or Instagram Reels.

  • Professionals and teams

    Convert meetings, voice notes and memos to text without sharing corporate audio with cloud services. Complies with strict data policies.

  • People with hearing difficulties

    Access audio and video content as accessible text instantly, without paying accessibility subscriptions.

  • Anyone with voice notes

    Convert WhatsApp voice notes or phone memos to text quickly, without installing yet another app on your phone.

Why this converter, not the others

Most audio-to-text converters Google shows you require signup, give you a few free minutes, then ask for a credit card. Not this one.

  • Others ask for email and password; here, nothing.
  • Others upload your audio to their server; here, everything stays local.
  • Others give you a 30-minute trial then charge; here, 100% free with no usage cap.
  • Others depend on their own uptime; here, it runs in your browser even if they go down.

Frequently asked questions

Is it really free?

Yes, 100%. No credit card and no "pro" tier that charges later. You get two modes: the browser mode is unlimited, and the server mode (optional) has a free daily quota. Both are completely free and don't require signup.

Do I need to sign up?

No. No email, no password, no captcha. Just open it, drop your audio and get the text.

Does my audio get uploaded somewhere?

By default, no — the transcription runs 100% in your browser and your audio never leaves your device. Only if you open "Advanced settings" and switch to server mode does your audio pass through a free external provider (Cloudflare, Hugging Face) for faster transcription. Your choice.

What's the difference between the two modes?

Browser mode (the default) runs on your computer with an AI model downloaded once. It's private and unlimited, but speed depends on your hardware. Server mode uploads your audio to a free external provider — it's faster and more accurate for long audio, but has a daily quota per device. Pick which one you want under "Advanced settings".

What audio formats does it support?

MP3, WAV, OGG, M4A, FLAC and WebM. File size up to 100 MB and length up to 1 hour.

Does it work with non-English audio?

Yes. It recognizes Spanish, Portuguese and French too. For other languages, pick the audio language manually from the dropdown.

How long does it take?

In browser mode, it depends on your computer. A modern laptop transcribes 5 minutes of audio in under a minute; a 1-hour file can take 15–30 minutes depending on your hardware. The first run is slower because it downloads the AI model (~40 MB) once and caches it. In server mode it's ~5 seconds for a 10-minute clip.

Do I need an internet connection?

For browser mode, only the first time to download the model. After that it works offline. Server mode does need a connection every time.