#Is there any mention or guide about using ebook2audiobook to preprocess an epub's text for TTS inpu
Short answer: No.
I can't find any official ElevenLabs documentation or help-center article that mentions or recommends using the third-party "ebook2audiobook" tool for preprocessing EPUB text.
Official guidance is to import EPUBs directly into ElevenLabs:
- Studio supports EPUB (and PDF, DOCX, TXT, HTML/URL) and provides structure-aware import; use Heading 1 for chapter titles so chapters are auto-detected. (help.elevenlabs.io)
- For publishing, supported manuscript formats include EPUB, PDF, TXT, DOCX, and HTML; text may be normalized for audio compatibility. (help.elevenlabs.io)
- ElevenReader/Publishing workflows also state you can import your book as an EPUB, DOCX, or PDF. (elevenlabs.io)
Sources:
Which file formats can I import with Studio? | ElevenLabs
What formats are supported for publication? | ElevenLabs
Create your audiobook with ElevenReader Publishing
But are there no built-in tools, or discussed tools, to convert numbers to words, expand abbreviations, etc.?
ElevenLabs does have built-in and documented options for converting numbers, dates, and similar items to spoken words (text normalization), plus tools to control acronyms/abbreviations:
- Built-in normalization in the UI: When you generate TTS on the website, normalization runs automatically to improve pronunciation of numbers, dates, etc. The best-practices guide also notes normalization is enabled by default across TTS models. (elevenlabs.io)
- API control: Use the apply_text_normalization parameter ("auto", "on", "off") on TTS endpoints (including Convert and streaming/timestamps). This spells out numbers/dates when enabled and can add latency. Model restrictions apply for some v2.5 models (Flash/Turbo). (elevenlabs.io)
- Changelog and Help Center details: Normalization via apply_text_normalization was added in Oct 2024; for v2.5 models it may be limited (e.g., only "auto"/"off" unless on Enterprise). The help article also explains defaults and Studio settings. (elevenlabs.io)
- Project/Studio workflows: Project creation (Studio/Audio Native) also exposes apply_text_normalization and adds an "apply_english" mode. (elevenlabs.io)
- Language-specific normalization: apply_language_text_normalization (currently Japanese only) can be enabled, with higher latency trade-offs. (elevenlabs.io)
- Abbreviations/acronyms and custom pronunciations: Use Pronunciation Dictionaries (.PLS/TXT) with alias or phoneme entries (e.g., map "UN" → "United Nations") across Studio, Dubbing Studio, and the Speech Synthesis API. (elevenlabs.io)
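As a rough illustration of the API control point above, here is a minimal Python sketch that builds a request body carrying apply_text_normalization. Only the parameter name and its three values come from the thread; the endpoint path, the xi-api-key header, the model ID, and the helper name build_tts_request are assumptions to check against the current API reference.

```python
import json
import urllib.request

# Assumption: base URL of the "Create speech" endpoint; verify in the API reference.
API_BASE = "https://api.elevenlabs.io/v1/text-to-speech"
# Values named in the thread for apply_text_normalization.
VALID_MODES = {"auto", "on", "off"}


def build_tts_request(text, voice_id, api_key, normalization="auto",
                      model_id="eleven_multilingual_v2"):
    """Build (but do not send) a TTS request with an explicit normalization mode.

    Hypothetical helper for illustration; model_id is an assumed default.
    """
    if normalization not in VALID_MODES:
        raise ValueError(f"normalization must be one of {sorted(VALID_MODES)}")
    payload = {
        "text": text,
        "model_id": model_id,
        "apply_text_normalization": normalization,
    }
    return urllib.request.Request(
        f"{API_BASE}/{voice_id}",
        data=json.dumps(payload).encode("utf-8"),
        # Assumption: API key is passed in the xi-api-key header.
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


# Sending is left commented out so the sketch runs without credentials:
# request = build_tts_request("Chapter 12", "YOUR_VOICE_ID", "YOUR_API_KEY", "off")
# with urllib.request.urlopen(request) as response:
#     audio_bytes = response.read()
```

Setting the mode to "off" only makes sense if the text has already been expanded before it is sent.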
If you need deterministic expansion (e.g., always expand units/URLs), the best-practices page also recommends preprocessing (e.g., regex/LLM prompts) alongside the built-in normalization. (elevenlabs.io)
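A regex-based preprocessing pass of the kind mentioned here could look roughly like the sketch below. The abbreviation and unit tables, the URL placeholder wording, and the function name preprocess are all made up for illustration; a real book would need its own tables.

```python
import re

# Hypothetical lookup tables; extend them for the book being processed.
ABBREVIATIONS = {"Dr.": "Doctor", "e.g.": "for example", "etc.": "et cetera"}
UNIT_NAMES = {"km": "kilometers", "kg": "kilograms", "cm": "centimeters"}

# Any http(s) URL up to the next whitespace (trailing punctuation is swallowed too).
URL_RE = re.compile(r"https?://\S+")
# A number followed by a known unit symbol, e.g. "5 km" or "2.5kg".
UNIT_RE = re.compile(
    r"(\d+(?:\.\d+)?)\s?(" + "|".join(map(re.escape, UNIT_NAMES)) + r")\b"
)
# Known abbreviations, longest first, not preceded by a word character.
ABBR_RE = re.compile(
    r"(?<!\w)(?:"
    + "|".join(re.escape(a) for a in sorted(ABBREVIATIONS, key=len, reverse=True))
    + r")"
)


def preprocess(text):
    """Expand URLs, units, and abbreviations the same way on every run."""
    text = URL_RE.sub("a web link", text)
    # Note: always uses the plural unit name, so "1 km" becomes "1 kilometers".
    text = UNIT_RE.sub(lambda m: f"{m.group(1)} {UNIT_NAMES[m.group(2)]}", text)
    text = ABBR_RE.sub(lambda m: ABBREVIATIONS[m.group(0)], text)
    return text


print(preprocess("Dr. Smith ran 5 km, see https://example.com/run"))
# prints: Doctor Smith ran 5 kilometers, see a web link
```

Because the output depends only on the tables, the same input always yields the same expansion, which is the point of doing this step outside the model.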
Sources:
Text to Speech (product guide) | ElevenLabs Documentation
Create speech | ElevenLabs Documentation
October 20, 2024 | ElevenLabs Documentation
Create Studio Project | ElevenLabs Documentation
Best practices | ElevenLabs Documentation
Built-in normalization is a disaster for fr-CH (Swiss French) numbers.
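One possible workaround, following the preprocessing suggestion above, is to spell the numbers out yourself and then send the text with apply_text_normalization set to "off". The sketch below is an assumption-heavy starting point, not an ElevenLabs feature: the function names are hypothetical, it only handles whole numbers from 0 to 9999, and it uses "huitante" for 80, which some cantons replace with "quatre-vingts".

```python
import re

UNITS = ["zéro", "un", "deux", "trois", "quatre", "cinq", "six", "sept",
         "huit", "neuf", "dix", "onze", "douze", "treize", "quatorze",
         "quinze", "seize", "dix-sept", "dix-huit", "dix-neuf"]
# Swiss forms for 70 and 90; "huitante" for 80 is a regional assumption.
TENS = {2: "vingt", 3: "trente", 4: "quarante", 5: "cinquante",
        6: "soixante", 7: "septante", 8: "huitante", 9: "nonante"}


def _below_100(n):
    if n < 20:
        return UNITS[n]
    tens, unit = divmod(n, 10)
    if unit == 0:
        return TENS[tens]
    if unit == 1:
        return f"{TENS[tens]} et un"  # vingt et un, septante et un, ...
    return f"{TENS[tens]}-{UNITS[unit]}"


def _below_1000(n):
    hundreds, rest = divmod(n, 100)
    if hundreds == 0:
        return _below_100(rest)
    prefix = "cent" if hundreds == 1 else f"{UNITS[hundreds]} cent"
    if rest == 0:
        # "cent" takes a plural s only when multiplied and nothing follows.
        return prefix + ("s" if hundreds > 1 else "")
    return f"{prefix} {_below_100(rest)}"


def swiss_french_number(n):
    """Spell out an integer from 0 to 9999 in Swiss French."""
    if not 0 <= n <= 9999:
        raise ValueError("only 0..9999 is supported in this sketch")
    thousands, rest = divmod(n, 1000)
    if thousands == 0:
        return _below_1000(rest)
    prefix = "mille" if thousands == 1 else f"{UNITS[thousands]} mille"
    return prefix if rest == 0 else f"{prefix} {_below_1000(rest)}"


# Standalone integers of 1 to 4 digits; decimals, longer numbers, and
# numbers with separators such as 1'000 are left untouched.
NUMBER_RE = re.compile(r"(?<![\d.,'])\b\d{1,4}\b(?![.,']\d)")


def expand_numbers(text):
    return NUMBER_RE.sub(lambda m: swiss_french_number(int(m.group(0))), text)


print(expand_numbers("Page 75, puis 3.5 km et 12345."))
# prints: Page septante-cinq, puis 3.5 km et 12345.
```

Years, ordinals, decimals, and currency amounts would each need their own rules before this is usable on a full book.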