Updated March 2026 · 12 min read · ACX Submission Guide

ACX Technical Requirements for Audiobooks: The Complete 2026 Guide

Getting rejected by ACX after weeks of production work is one of the most frustrating experiences an audiobook creator can face. The ACX quality check is thorough, automated, and unforgiving — a noise floor that's 2 dB too loud or a peak level that clips at -2.9 dB will get your submission bounced, no matter how good the narration sounds to human ears.

This guide covers every technical specification ACX requires as of 2026, explains what each measurement actually means, walks through the most common rejection reasons, and shows you how to verify your files before submission. If you're using TomeVox to produce your audiobook, we'll also show you exactly which specs are handled automatically so you know what you never need to worry about.

The Core ACX Technical Specifications

ACX's technical requirements have been stable for several years, with minor updates to loudness targets in 2024. Here is the complete specification table you must meet for every audio file you submit:

Specification Required Value Notes
Sample Rate 44,100 Hz (44.1 kHz) Do not submit at 48 kHz — ACX will reject or downconvert with artifacts
Bit Depth 16-bit minimum 24-bit is acceptable for WAV masters; export to 16-bit MP3 for final delivery
Bit Rate 192 kbps minimum (MP3) Constant bit rate (CBR) preferred over VBR for ACX compatibility
Channels Mono or stereo Mono is preferred for speech; stereo is accepted
Peak Level -3 dBFS or lower No sample should exceed -3 dBFS at any point in the file
RMS Loudness -23 dBRMS to -18 dBRMS Measured across the entire program (not including room tone)
Noise Floor -60 dBRMS or lower Measured during a silent passage (room tone), not during speech
Room Tone (Opening) 0.5–1.0 seconds Each file must begin with room tone before any speech begins
Room Tone (Closing) 1.0–5.0 seconds Each file must end with room tone after the final spoken word
File Format MP3 (CBR, 192 kbps+) WAV accepted for upload; ACX converts internally

Understanding Each Specification

Sample Rate: 44.1 kHz Is Non-Negotiable

The sample rate determines how many audio samples are captured per second. CD-quality audio uses 44,100 samples per second. Video and broadcast audio commonly uses 48,000 Hz. If you record at 48 kHz and export at 48 kHz, ACX's automated system will flag the file. Always set your DAW project to 44.1 kHz before you begin recording — changing it after the fact introduces a sample rate conversion that can subtly degrade audio quality.

Noise Floor: -60 dBRMS Is the Hardest Spec to Hit

The noise floor is a measurement of how quiet your "silence" actually is. In practice, all rooms have ambient noise: HVAC hum, electrical interference from computers, traffic, building creaks. ACX measures the RMS level of a quiet passage in your audio and compares it against the -60 dBRMS threshold.

Most home recording environments without acoustic treatment fall between -50 and -55 dBRMS — which will fail ACX review. Meeting -60 dBRMS typically requires either a properly treated recording space, noise reduction software (iZotope RX is the industry standard), or both. This is consistently the number one reason ACX rejects human-narrated audiobooks.

Peak Level and RMS Loudness: Two Different Measurements

These are often confused. Peak level measures the loudest single sample in your audio — the instantaneous ceiling. ACX requires no sample exceed -3 dBFS. A peak limiter set to -3 dB handles this automatically.

RMS loudness (sometimes expressed as LUFS — Loudness Units relative to Full Scale) measures the average perceived loudness over time. ACX requires your speech audio to average between -23 and -18 dBRMS. Too quiet and the listener has to crank their volume; too loud and it clips against ACX's ceiling. Most professional narrators target -20 dBRMS as a comfortable midpoint.

Room Tone: Opening and Closing Buffers

Room tone is a brief period of ambient silence at the start and end of each audio file. It serves two purposes: it gives ACX's automated QC system a clean sample to measure your noise floor against, and it provides a natural buffer so chapter transitions don't cut abruptly. Opening room tone should be 0.5 to 1 second. Closing room tone should be 1 to 5 seconds — enough to feel like a natural pause, not so long it feels like the track died.

Chapter File Structure

ACX requires your audiobook to be submitted as individual audio files — one per chapter (or one per major section). This is different from a single continuous audio file. The structure must include:

File naming should be systematic. ACX does not enforce a specific naming convention, but clean naming prevents upload order errors. A reliable convention is:

00_opening-credits.mp3   01_chapter-01.mp3   02_chapter-02.mp3   99_closing-credits.mp3

Each chapter file should contain only one chapter. Do not concatenate multiple chapters into one file. ACX's system needs individual files to create proper chapter navigation in the Audible app.

The Most Common Rejection Reasons

ACX publishes a list of common QC failure reasons, and the same issues come up repeatedly in audiobook production communities. Here are the top offenders:

How to Measure Your Audio Before Submission

Before uploading a single file to ACX, run a self-QC pass on every chapter. Here is what to check and how to check it:

Measuring noise floor: In most DAWs (Audacity, Adobe Audition, Logic, Reaper), select a 3–5 second silent passage and run a statistics analysis. You're looking for the RMS level of that silent section. It must read -60 dBRMS or lower. In iZotope RX, the Meter module displays noise floor in real time.

Measuring peak levels: Use your DAW's built-in peak meter, or export to a loudness measurement tool like the free Youlean Loudness Meter. Look at the True Peak (TP) reading. It must be -3 dBTP or lower.

Measuring RMS loudness: Youlean Loudness Meter, the free LUFS Meter VST, or Adobe Audition's built-in Match Loudness panel will give you an integrated LUFS reading. Target -20 LUFS as a practical midpoint within ACX's -23 to -18 range.

Quick Self-QC Checklist

Before uploading: noise floor is -60 dBRMS or lower, peak is -3 dBFS or lower, integrated loudness is between -23 and -18 dBRMS, sample rate is 44.1 kHz, bit rate is 192 kbps CBR, each file has opening and closing room tone, and all files are individually chaptered.

Step-by-Step ACX Submission Walkthrough

  1. Create or claim your title on ACX Log in at acx.com, click "Add Your Title," and either link your Amazon KDP title or enter your book's metadata manually. You'll need the title, subtitle, author name, and ISBN if you have one. If you're self-publishing, you can create an ACX title without an ISBN.
  2. Choose distribution and royalty type ACX offers exclusive distribution (Audible + Amazon + Apple Books) or non-exclusive. Exclusive pays 40% royalty; non-exclusive pays 25%. If you plan to distribute to Spotify Audiobooks, Kobo, or other platforms, choose non-exclusive now — you cannot change this after going live.
  3. Prepare and verify all audio files Run your full self-QC pass on every file. Check noise floor, peak, RMS loudness, sample rate, bit rate, and room tone buffers. Fix any file that fails before uploading. One failed chapter will get the entire submission rejected.
  4. Upload files in chapter order In your ACX dashboard, navigate to your title and click "Upload Your Audiobook." Upload files one at a time in chapter order — Opening Credits first, then Chapter 1, Chapter 2, and so on through Closing Credits. ACX processes each file as it uploads.
  5. Preview each chapter after upload ACX provides an in-browser player for each uploaded file. Listen to the beginning and end of each chapter to confirm room tone is present, the audio loads cleanly, and there are no obvious artifacts from the upload process.
  6. Submit for ACX Quality Check Once all files are uploaded and you're satisfied with the previews, click "Submit for Review." ACX's automated QC system runs first (usually within 24–48 hours), followed by a human review. The full process takes 7–14 business days.
  7. Respond to QC feedback if rejected If your submission is rejected, ACX will email you with a specific reason code. Address only the flagged issues, re-export the affected files, and resubmit. You do not need to re-upload all files — only the chapters that failed.

How TomeVox Handles ACX Compliance Automatically

When you convert a book with TomeVox, the audio output is engineered to meet ACX specifications without any manual adjustment on your part. Here is what is handled for you:

ACX Requirement TomeVox Handling
Sample rate 44.1 kHz All audio rendered at 44,100 Hz by default
Bit rate 192 kbps CBR MP3 export at 192 kbps Constant Bit Rate
Peak level -3 dBFS max True peak limiting applied at -3 dBTP on master
RMS loudness -23 to -18 dBRMS Integrated loudness normalized to -20 LUFS target
Noise floor -60 dBRMS min AI-generated audio has no room noise; floor is effectively -90+ dBRMS
Opening room tone 0.5–1s 0.75s of silence prepended to each chapter file
Closing room tone 1–5s 2s of silence appended to each chapter file
Per-chapter file structure EPUB/PDF chapter detection produces one audio file per detected chapter
M4B with chapter markers M4B container with embedded chapter metadata for Audible/Apple Books

The one thing TomeVox does not generate for you is the Opening Credits and Closing Credits audio files — those require your specific copyright statement, author name pronunciation, and narrator credit. You can record those yourself (they're typically 30–60 seconds each) and add them to the upload. Every other technical requirement is met by the TomeVox output.

The hardest part of ACX for human narrators — the noise floor — is simply not a problem for AI-generated audio. There is no room, no microphone, no HVAC system. The technical floor is clean by default.

A Note on ACX's AI Narration Policy

Important: ACX (Audiobook Creation Exchange) currently requires human narration for submitted titles. External AI-generated narration — including audio from third-party tools — is not accepted by ACX regardless of disclosure. The technical specifications in this guide apply to human-narrated audio submitted through the standard ACX workflow.

Amazon does operate a separate AI narration programme called KDP Virtual Voice, which uses Amazon's own AI voices to narrate books enrolled in KDP. This is invite-only as of early 2026 and does not accept author-supplied audio files. It is a different system from ACX.

If you are producing an audiobook with AI narration and want Audible distribution, monitor ACX's policy page directly for any updates. In the meantime, platforms that openly accept AI-narrated audio include Spotify for Authors, Apple Books (via the Digital Narration programme), Google Play Books, Kobo Writing Life, and INaudio (which distributes to 30+ platforms).

Skip the technical setup entirely

TomeVox converts your EPUB, PDF, DOCX, or TXT file into an ACX-ready audiobook — correct sample rate, loudness, noise floor, chapter files, and M4B output included. Preview your first chapter free.

Try TomeVox Free