ACX Technical Requirements for Audiobooks: The Complete 2026 Guide
Getting rejected by ACX after weeks of production work is one of the most frustrating experiences an audiobook creator can face. The ACX quality check is thorough, automated, and unforgiving — a noise floor that's 2 dB too loud or a peak level that clips at -2.9 dB will get your submission bounced, no matter how good the narration sounds to human ears.
This guide covers every technical specification ACX requires as of 2026, explains what each measurement actually means, walks through the most common rejection reasons, and shows you how to verify your files before submission. If you're using TomeVox to produce your audiobook, we'll also show you exactly which specs are handled automatically so you know what you never need to worry about.
The Core ACX Technical Specifications
ACX's technical requirements have been stable for several years, with minor updates to loudness targets in 2024. Here is the complete specification table you must meet for every audio file you submit:
| Specification | Required Value | Notes |
|---|---|---|
| Sample Rate | 44,100 Hz (44.1 kHz) | Do not submit at 48 kHz — ACX will reject or downconvert with artifacts |
| Bit Depth | 16-bit minimum | 24-bit is acceptable for WAV masters; export to 16-bit MP3 for final delivery |
| Bit Rate | 192 kbps minimum (MP3) | Constant bit rate (CBR) preferred over VBR for ACX compatibility |
| Channels | Mono or stereo | Mono is preferred for speech; stereo is accepted |
| Peak Level | -3 dBFS or lower | No sample should exceed -3 dBFS at any point in the file |
| RMS Loudness | -23 dBRMS to -18 dBRMS | Measured across the entire program (not including room tone) |
| Noise Floor | -60 dBRMS or lower | Measured during a silent passage (room tone), not during speech |
| Room Tone (Opening) | 0.5–1.0 seconds | Each file must begin with room tone before any speech begins |
| Room Tone (Closing) | 1.0–5.0 seconds | Each file must end with room tone after the final spoken word |
| File Format | MP3 (CBR, 192 kbps+) | WAV accepted for upload; ACX converts internally |
Understanding Each Specification
Sample Rate: 44.1 kHz Is Non-Negotiable
The sample rate determines how many audio samples are captured per second. CD-quality audio uses 44,100 samples per second. Video and broadcast audio commonly uses 48,000 Hz. If you record at 48 kHz and export at 48 kHz, ACX's automated system will flag the file. Always set your DAW project to 44.1 kHz before you begin recording — changing it after the fact introduces a sample rate conversion that can subtly degrade audio quality.
Noise Floor: -60 dBRMS Is the Hardest Spec to Hit
The noise floor is a measurement of how quiet your "silence" actually is. In practice, all rooms have ambient noise: HVAC hum, electrical interference from computers, traffic, building creaks. ACX measures the RMS level of a quiet passage in your audio and compares it against the -60 dBRMS threshold.
Most home recording environments without acoustic treatment fall between -50 and -55 dBRMS — which will fail ACX review. Meeting -60 dBRMS typically requires either a properly treated recording space, noise reduction software (iZotope RX is the industry standard), or both. This is consistently the number one reason ACX rejects human-narrated audiobooks.
Peak Level and RMS Loudness: Two Different Measurements
These are often confused. Peak level measures the loudest single sample in your audio — the instantaneous ceiling. ACX requires no sample exceed -3 dBFS. A peak limiter set to -3 dB handles this automatically.
RMS loudness (sometimes expressed as LUFS — Loudness Units relative to Full Scale) measures the average perceived loudness over time. ACX requires your speech audio to average between -23 and -18 dBRMS. Too quiet and the listener has to crank their volume; too loud and it clips against ACX's ceiling. Most professional narrators target -20 dBRMS as a comfortable midpoint.
Room Tone: Opening and Closing Buffers
Room tone is a brief period of ambient silence at the start and end of each audio file. It serves two purposes: it gives ACX's automated QC system a clean sample to measure your noise floor against, and it provides a natural buffer so chapter transitions don't cut abruptly. Opening room tone should be 0.5 to 1 second. Closing room tone should be 1 to 5 seconds — enough to feel like a natural pause, not so long it feels like the track died.
Chapter File Structure
ACX requires your audiobook to be submitted as individual audio files — one per chapter (or one per major section). This is different from a single continuous audio file. The structure must include:
- Opening Credits file (your name, title, copyright statement, publisher)
- One file per chapter, numbered sequentially
- Closing Credits file (narrator credit, publisher info)
- Retail Audio Sample (first 5 minutes of the book, this is auto-generated by ACX from your uploaded files)
File naming should be systematic. ACX does not enforce a specific naming convention, but clean naming prevents upload order errors. A reliable convention is:
00_opening-credits.mp3 01_chapter-01.mp3 02_chapter-02.mp3 99_closing-credits.mp3
Each chapter file should contain only one chapter. Do not concatenate multiple chapters into one file. ACX's system needs individual files to create proper chapter navigation in the Audible app.
The Most Common Rejection Reasons
ACX publishes a list of common QC failure reasons, and the same issues come up repeatedly in audiobook production communities. Here are the top offenders:
- Noise floor above -60 dBRMS — Home recording environments with untreated room acoustics, HVAC noise, or computer fan noise. By far the most common rejection.
- Peak levels exceeding -3 dBFS — Usually caused by loud consonants (plosives, sibilance) that weren't caught during editing, or music/effects tracks that weren't properly limited.
- RMS loudness outside the -23 to -18 dBRMS range — Too quiet is as bad as too loud. Narrators who record at conservative levels and don't normalize frequently get rejected for being too soft.
- Wrong sample rate — Submitting at 48 kHz instead of 44.1 kHz. This happens when narrators use video-oriented DAW templates.
- Missing room tone — Files that begin immediately with speech, or end immediately after the last word, without the required buffer.
- Inconsistent audio between files — Different noise floors, loudness levels, or EQ profiles between chapters, suggesting different recording sessions or environments.
- Background music or effects with no narration — Intro music, chapter-break stingers, or ambient soundscapes that are not approved by ACX guidelines. ACX allows music only under specific conditions.
- Audible edits — Clicks, pops, or abrupt cuts from sloppy editing that weren't caught in QC.
- Wrong file format or bit rate — Submitting Variable Bit Rate (VBR) MP3 instead of Constant Bit Rate (CBR), or submitting at 128 kbps instead of 192 kbps minimum.
How to Measure Your Audio Before Submission
Before uploading a single file to ACX, run a self-QC pass on every chapter. Here is what to check and how to check it:
Measuring noise floor: In most DAWs (Audacity, Adobe Audition, Logic, Reaper), select a 3–5 second silent passage and run a statistics analysis. You're looking for the RMS level of that silent section. It must read -60 dBRMS or lower. In iZotope RX, the Meter module displays noise floor in real time.
Measuring peak levels: Use your DAW's built-in peak meter, or export to a loudness measurement tool like the free Youlean Loudness Meter. Look at the True Peak (TP) reading. It must be -3 dBTP or lower.
Measuring RMS loudness: Youlean Loudness Meter, the free LUFS Meter VST, or Adobe Audition's built-in Match Loudness panel will give you an integrated LUFS reading. Target -20 LUFS as a practical midpoint within ACX's -23 to -18 range.
Quick Self-QC Checklist
Before uploading: noise floor is -60 dBRMS or lower, peak is -3 dBFS or lower, integrated loudness is between -23 and -18 dBRMS, sample rate is 44.1 kHz, bit rate is 192 kbps CBR, each file has opening and closing room tone, and all files are individually chaptered.
Step-by-Step ACX Submission Walkthrough
-
Create or claim your title on ACX Log in at acx.com, click "Add Your Title," and either link your Amazon KDP title or enter your book's metadata manually. You'll need the title, subtitle, author name, and ISBN if you have one. If you're self-publishing, you can create an ACX title without an ISBN.
-
Choose distribution and royalty type ACX offers exclusive distribution (Audible + Amazon + Apple Books) or non-exclusive. Exclusive pays 40% royalty; non-exclusive pays 25%. If you plan to distribute to Spotify Audiobooks, Kobo, or other platforms, choose non-exclusive now — you cannot change this after going live.
-
Prepare and verify all audio files Run your full self-QC pass on every file. Check noise floor, peak, RMS loudness, sample rate, bit rate, and room tone buffers. Fix any file that fails before uploading. One failed chapter will get the entire submission rejected.
-
Upload files in chapter order In your ACX dashboard, navigate to your title and click "Upload Your Audiobook." Upload files one at a time in chapter order — Opening Credits first, then Chapter 1, Chapter 2, and so on through Closing Credits. ACX processes each file as it uploads.
-
Preview each chapter after upload ACX provides an in-browser player for each uploaded file. Listen to the beginning and end of each chapter to confirm room tone is present, the audio loads cleanly, and there are no obvious artifacts from the upload process.
-
Submit for ACX Quality Check Once all files are uploaded and you're satisfied with the previews, click "Submit for Review." ACX's automated QC system runs first (usually within 24–48 hours), followed by a human review. The full process takes 7–14 business days.
-
Respond to QC feedback if rejected If your submission is rejected, ACX will email you with a specific reason code. Address only the flagged issues, re-export the affected files, and resubmit. You do not need to re-upload all files — only the chapters that failed.
How TomeVox Handles ACX Compliance Automatically
When you convert a book with TomeVox, the audio output is engineered to meet ACX specifications without any manual adjustment on your part. Here is what is handled for you:
| ACX Requirement | TomeVox Handling |
|---|---|
| Sample rate 44.1 kHz | All audio rendered at 44,100 Hz by default |
| Bit rate 192 kbps CBR | MP3 export at 192 kbps Constant Bit Rate |
| Peak level -3 dBFS max | True peak limiting applied at -3 dBTP on master |
| RMS loudness -23 to -18 dBRMS | Integrated loudness normalized to -20 LUFS target |
| Noise floor -60 dBRMS min | AI-generated audio has no room noise; floor is effectively -90+ dBRMS |
| Opening room tone 0.5–1s | 0.75s of silence prepended to each chapter file |
| Closing room tone 1–5s | 2s of silence appended to each chapter file |
| Per-chapter file structure | EPUB/PDF chapter detection produces one audio file per detected chapter |
| M4B with chapter markers | M4B container with embedded chapter metadata for Audible/Apple Books |
The one thing TomeVox does not generate for you is the Opening Credits and Closing Credits audio files — those require your specific copyright statement, author name pronunciation, and narrator credit. You can record those yourself (they're typically 30–60 seconds each) and add them to the upload. Every other technical requirement is met by the TomeVox output.
The hardest part of ACX for human narrators — the noise floor — is simply not a problem for AI-generated audio. There is no room, no microphone, no HVAC system. The technical floor is clean by default.
A Note on ACX's AI Narration Policy
Important: ACX (Audiobook Creation Exchange) currently requires human narration for submitted titles. External AI-generated narration — including audio from third-party tools — is not accepted by ACX regardless of disclosure. The technical specifications in this guide apply to human-narrated audio submitted through the standard ACX workflow.
Amazon does operate a separate AI narration programme called KDP Virtual Voice, which uses Amazon's own AI voices to narrate books enrolled in KDP. This is invite-only as of early 2026 and does not accept author-supplied audio files. It is a different system from ACX.
If you are producing an audiobook with AI narration and want Audible distribution, monitor ACX's policy page directly for any updates. In the meantime, platforms that openly accept AI-narrated audio include Spotify for Authors, Apple Books (via the Digital Narration programme), Google Play Books, Kobo Writing Life, and INaudio (which distributes to 30+ platforms).
Skip the technical setup entirely
TomeVox converts your EPUB, PDF, DOCX, or TXT file into an ACX-ready audiobook — correct sample rate, loudness, noise floor, chapter files, and M4B output included. Preview your first chapter free.
Try TomeVox Free