Audio Quality
We run every file through a battery of independent analyzers - each tuned to a different failure mode - and combine them into a single quality score. Every issue you see is timestamped and rated by how audible it actually is.
What we look for
Not one fuzzy "quality score". We run a battery of specialised checks in parallel, each tuned to a different failure mode, then roll them into a single 0-100 score per file.
Noise floor
How clean the recording is underneath the speech.
Pops & clicks
Sharp transient artifacts that crack through the voice.
Spectral glitches
Frequency discontinuities and model-induced weirdness.
Garbled speech
Mumbled phonemes that don't form real words.
Static & hiss
Sustained background noise that shouldn't be there.
Silence gaps
Unexpected pauses, skipped paragraphs, trailing dead air.
Clipping
Distortion from audio peaks that hit the rails.
End cutoff
Files that terminate mid-word instead of tapering naturally.
Every issue is rated by audibility
We don't just flag "there's an issue" - we tell you whether a listener would actually notice it. Any severe issue, or any critical silence gap, auto-fails the file.
Tuned for what a listener actually notices
A tiny click doesn't ruin a file; a three-second dropout in the middle of a sentence does. Our scoring reflects that - weighted toward the checks that correlate with real listening experience.
Severe issues override
A single severe artifact - garbled speech, sustained static, an abrupt end cutoff - auto-fails the file. An otherwise clean track with one bad moment is still a bad track.
Bandwidth sanity check
Some providers silently downsample under load, shipping audio that sounds thin or tinny. We compare the actual frequency content against what you should be getting and flag the files that fall short.
What you get back per file
Quality score
A 0-100 weighted score from the SNR, spectral, cleanliness, and bandwidth checks.
Timestamped artifact log
Every detected issue with its start time, duration, type, and audibility label.
Issue breakdown
Counts by audibility: severe / distracting / noticeable / subtle - so you can triage fast.
Regeneration flag
A single boolean per file: should you regenerate this one? Derived from score plus severe-issue rules.
What people are saying
"I've been paying for a creator account the last two months, and while I do manage to get satisfying results, it seems to require an ever increasing amount of re-rolls to get acceptable quality."
"Audio quality sounds like I'm standing far away from the mic or something. It was so amazing at first."
"Total requested audio was 4:31, but from 1:21-2:26 and 3:02-3:36 there was only silence. Also huge volume level changes and style shifts."
"Some of my texts get synthesized no problem into one neat file. Yet other books encounter problems. I get a bunch of 5-minute chunks and there seem to be a random amount of chunks missing."