audio-joiner
An audio joiner concatenates multiple audio files into a single output track, optionally with crossfades between segments, useful for podcast assembly, mixtape creation, audio-book chapter merging, and combining recorded segments into a single deliverable. The ZTools Audio Joiner runs entirely in the browser using Web Audio API + FFmpeg.wasm, supports MP3 / WAV / OGG / M4A inputs, drag-to-reorder, optional 0.5-3 second crossfades on segment boundaries, and exports a single combined file in your chosen format.
Use casesβ
Podcast episode assemblyβ
Combine intro music + interview + outro music into a single MP3 episode file. Crossfades smooth the transitions; final file ready for upload.
Audiobook chapter mergingβ
Multiple recorded chapters stitched into a single long file for distribution. Or split a long audiobook into per-CD-length segments β companion to the audio-cutter tool.
Music mixtapeβ
Combine tracks for a workout playlist, road trip, or party. Crossfades produce a continuous-mix feel without DJ software.
Voice memo consolidationβ
Several short voice memos on the same topic combined into a single sequential file for sharing or archiving.
How it worksβ
- Upload audio files β Drop multiple files; they load in order. Drag to reorder before joining.
- Set crossfade (optional) β 0 sec = hard cut; 0.5-1 sec = gentle blend; 2-3 sec = DJ-style mix. Default depends on use case (podcast: 0.3 sec; music: 1-2 sec).
- Pick output format β MP3 (universal), WAV (lossless), OGG / M4A. Bitrate setting for lossy formats.
- Encode β Tool decodes each file, applies crossfades, concatenates, re-encodes the result. Progress shown.
- Download β Single combined file saves locally. Original files untouched.
Examplesβ
Input: Intro 10s + interview 30 min + outro 15s, 0.5 sec crossfades
Output: Single 30:25 MP3 with smooth transitions.
Input: 12 song tracks, 1.5 sec crossfades
Output: Continuous mixtape ~50 minutes, ~70 MB at 192 kbps.
Input: 4 voice memos, hard cut (0 crossfade)
Output: Sequential file with audible breaks at boundaries β appropriate for distinct memos.
Frequently asked questionsβ
Will the formats be normalised?
Optional. The tool can normalise loudness across segments (so the intro is not louder than the body), or leave levels as-is. Normalisation matters for music; less for voice.
What if input formats differ?
Mixed inputs (MP3 + WAV + OGG) are decoded to PCM internally, then re-encoded to the chosen output. Lossy β lossy involves generation loss; minimise by using high source bitrates.
Can I trim before joining?
Trim each file separately with the audio-cutter tool, then join. Or join first and trim the result. Either workflow works.
How big a file can I produce?
Browser memory limits matter. Up to ~2 hours of joined output is realistic on modern laptops. For multi-hour audiobooks, desktop tools (Audacity) handle better.
Are the files uploaded?
No β entirely client-side. All processing in browser memory; nothing transmitted.
How do crossfades work?
The tail of segment N fades out while the head of segment N+1 fades in over the configured duration. Both play simultaneously in the overlap region.
Tipsβ
- Use consistent bitrate / sample rate across input files when possible β mismatches cause re-encoding artifacts.
- For podcast, set 0.3-0.5 sec crossfades; for music, 1.5-3 sec depending on tempo.
- Listen to the boundaries before publishing β abrupt cuts and bad crossfades are immediately audible.
- Normalise volume across segments for podcast / audiobook; preserve dynamics for music.
- For long files, split the export into smaller chunks and rejoin in a dedicated DAW if quality matters.
Try it nowβ
The full audio-joiner runs in your browser at https://ztools.zaions.com/audio-joiner β no signup, no upload, no data leaves your device.
Last updated: 2026-05-05 Β· Author: Ahsan Mahmood Β· Edit this page on GitHub