Trusted by creators at startups, agencies, and Fortune 500 teams.
Free forever — no signup, no watermark, no length cap.
Why this is the fastest MP4 to MP3 converter on the web
Audio extraction is where browser ffmpeg.wasm tools fail hardest — a 1 GB video has to load entirely into the tab's memory before encoding starts, and hour-long recordings crash the WebAssembly ceiling. We rebuilt the pipeline: presigned upload to the nearest Backblaze B2 POP, native ffmpeg on our server (decoding just the audio track), output delivered through Bunny CDN. An hour-long lecture extracts in 10–20 seconds.
When you actually need MP3, not a video
Three common reasons people convert MP4 to MP3: building a podcast feed from a recorded interview, feeding a transcript through Whisper / Otter / AssemblyAI (smaller files cost less), and listening to a talk on a phone with the screen off (iOS will not reliably background-play .mp4 audio).
What quality you should expect
We export VBR MP3 at the LAME quality-2 preset (~190 kbps), the same range Spotify and Apple Music stream at. Perceptually indistinguishable from the source AAC audio that lives inside a typical MP4. Bigger files: use a desktop install.
Going to transcribe it?
If MP3 is just a pit stop on the way to a transcript, you may not need this tool at all — most transcription services accept MP4 directly. Whisper, Otter, AssemblyAI, and Rev all take video. The only reason to convert first is to cut upload time and cost, which can be worth it for hour-long files.
Sister tools
For Apple M4A sources: M4A to MP3 converter. For Loom recordings: Loom to MP3 converter. For general audio routing: Audio converter and Video to audio converter. Already have an MP3 you want to shrink? Compress audio.
Why this is the fastest MP4 to MP3 converter on the web
Audio extraction is the kind of job where browser ffmpeg.wasm tools hurt most: a 1 GB video has to load entirely into the browser tab memory before encoding even starts, and on hour-long recordings the WebAssembly memory ceiling routinely crashes the page.
We rebuilt the pipeline. Your video uploads via a presigned URL straight to the nearest Backblaze B2 storage POP — fast everywhere. Native ffmpeg on our server decodes just the audio track (the heavy video stream is never decoded), re-encodes to VBR MP3 at LAME quality-2, and delivers the result through Bunny CDN. An hour-long lecture extracts in 10–20 seconds end-to-end.
When you actually need MP3, not a video
Three common cases drive most of the searches that land on this page. First: you recorded an interview or a meeting and you only want the audio for a podcast feed. Second: you are running a transcript through a service that wants audio (and the file is smaller without video, so it costs less). Third: you want to listen to a recorded talk on a phone with the screen off, which iOS will not do reliably from an .mp4.
What quality you should expect
Native ffmpeg exports VBR MP3 at the LAME quality-2 preset, which lands around 190 kbps. That is the same range Spotify and Apple Music stream at, and indistinguishable from the source for speech, music, and screen-recording audio in any blind test. If your input audio was already low-quality (say, a Zoom recording), the output will not be magically better — but it will not be worse either.