#Dubbing studio: What is the .aff file, and how can I use it with the audio I've exported from my pr
It's .AAF (not .aff). In Dubbing Studio, AAF is a "timeline data" export: it contains the project's clip/track timing and structure, not audio. (help.elevenlabs.io)
How to use it with your exported audio:
- In your Dubbing Studio project, click Export and generate your audio (WAV/MP3). If you need stems, you can export a .zip of audio tracks or a .zip of audio clips. (elevenlabs.io)
- In the same Export menu, also export the AAF (timeline data). The AAF goes with those audio files for downstream editing/assembly based on the project's timing. Make sure you select the correct language when exporting. (elevenlabs.io)
- Dubbing exports also include a per-speaker WAV option (downloaded as a zip) if you prefer separated speaker audio. (help.elevenlabs.io)
If you meant a different ".aff" file, let me know; there isn't an ElevenLabs export by that extension in the Dubbing docs. (help.elevenlabs.io)
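Once both exports are downloaded, it can help to confirm that the timeline file and the audio ended up side by side before handing them to an editor. Here is a minimal Python sketch of that check. The file names are hypothetical and the grouping labels are only illustrative; the extensions (.aaf, .wav, .mp3, .zip) are the only part taken from the answer above.

```python
from pathlib import PurePath

# Extensions mentioned in the answer above; the labels are illustrative.
KINDS = {
    ".aaf": "timeline",
    ".wav": "audio",
    ".mp3": "audio",
    ".zip": "archive",  # zipped tracks, clips, or per-speaker WAVs
}

def group_exports(filenames):
    """Group exported file names by kind, based only on their extension."""
    groups = {"timeline": [], "audio": [], "archive": [], "other": []}
    for name in filenames:
        kind = KINDS.get(PurePath(name).suffix.lower(), "other")
        groups[kind].append(name)
    return groups

def ready_for_assembly(groups):
    """True when a timeline file sits next to at least one audio file or archive."""
    return bool(groups["timeline"]) and bool(groups["audio"] or groups["archive"])

# Hypothetical file names, for illustration only.
files = ["my_project_es.aaf", "my_project_es.wav", "notes.txt"]
groups = group_exports(files)
print(groups["timeline"], ready_for_assembly(groups))
```

The check only looks at file names; it does not open the AAF or verify that the audio matches the timeline's language.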
Sources:
Which file formats are supported by Dubbing? | ElevenLabs
Dubbing Studio | ElevenLabs Documentation
Dubbing studio: difference between models (i.e., multilingual, turbo, etc)
Here's how the ElevenLabs speech models differ for dubbing work. The choice mainly affects quality/emotion, latency, language coverage, max input length, and cost.
- Multilingual v2 (model_id: eleven_multilingual_v2)
  - Highest fidelity and emotional nuance; most stable on long-form. Better number/date normalization out of the box. 29 languages. 10k characters/request. Costs 1 credit/character. Best for production-quality dubs. (elevenlabs.io)
- Turbo v2.5 (model_id: eleven_turbo_v2_5)
  - Low-latency (~250-300 ms) with higher quality than Flash; good balance of speed and fidelity. 32 languages (adds Vietnamese, Hungarian, Norwegian). 40k characters/request. Costs 0.5 credit/character. Good for faster turnarounds when you still want quality. (elevenlabs.io)
- Flash v2.5 (model_id: eleven_flash_v2_5)
  - Ultra-low latency (~75 ms) and lowest cost (0.5 credit/character). 32 languages (same set as Turbo v2.5). 40k characters/request. Slightly less emotional depth; ideal for quick iterations or real-time agents. Note: number/date normalization is weaker by default; Multilingual v2 handles this better (Enterprise can enable normalization for v2.5). (elevenlabs.io)
- Turbo v2 (model_id: eleven_turbo_v2)
  - Older low-latency English-only model; 30k characters/request. Generally superseded by v2.5 for multilingual dubbing. (help.elevenlabs.io)
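To make the cost and length trade-off concrete, here is a rough Python sketch that turns the figures in the list above into an estimate for a script of a given length. The numbers are copied from this answer and should be treated as a snapshot; the `estimate` helper is hypothetical, and the request count assumes the text can be split exactly at the character limit, which real sentence-aware splitting will not always manage.

```python
import math

# Figures copied from the list above; a snapshot, not a price sheet.
# Turbo v2 has no credit rate in the list, so it is left out.
MODELS = {
    "eleven_multilingual_v2": {"credits_per_char": 1.0, "max_chars": 10_000},
    "eleven_turbo_v2_5": {"credits_per_char": 0.5, "max_chars": 40_000},
    "eleven_flash_v2_5": {"credits_per_char": 0.5, "max_chars": 40_000},
}

def estimate(model_id, n_chars):
    """Return (credits, minimum number of requests) for n_chars of text."""
    spec = MODELS[model_id]
    credits = n_chars * spec["credits_per_char"]
    requests = math.ceil(n_chars / spec["max_chars"])
    return credits, requests

for model_id in MODELS:
    credits, requests = estimate(model_id, 25_000)
    print(f"{model_id}: {credits:.0f} credits, {requests} request(s)")
```

For a 25,000-character script this gives 25,000 credits over at least 3 requests on Multilingual v2, versus 12,500 credits in a single request on Turbo v2.5 or Flash v2.5.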
Practical guidance for Dubbing Studio
- Use Multilingual v2 when dub quality and emotional consistency matter most (long-form, premium exports). (elevenlabs.io)
- Use Turbo v2.5 when you want faster generation with strong quality across 32 languages. (elevenlabs.io)
- Use Flash v2.5 for the fastest, lowest-cost drafts or interactive review cycles; consider text normalization if your script has lots of numbers/dates. (elevenlabs.io)
Other notes
- Dubbing supports multi-language output; your per-minute dub cost in the UI is separate from TTS costs. If you add extra languages inside a Dubbing Studio project, translation is billed at 1 credit/character plus the standard TTS model rate (so model choice affects the TTS portion). (elevenlabs.io)
- Default if you don't specify a model via API is Multilingual v2; you can list model IDs with /v1/models. (help.elevenlabs.io)
- Full model comparison (languages, limits, latency) is summarized in the Models docs. (elevenlabs.io)
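Two of the notes above lend themselves to a short Python sketch: the extra-language billing and the /v1/models lookup. This is a sketch under assumptions, not a reference implementation. The helper names are hypothetical, the cost function simplifies by charging translation and TTS on the same character count, and the base URL, the `xi-api-key` header, and the `model_id` field in the response are assumed to match the public API, so check them against the API reference before relying on them.

```python
import json
import urllib.request

TRANSLATION_CREDITS_PER_CHAR = 1.0  # rate quoted in the note above

def added_language_credits(n_chars, tts_credits_per_char):
    """Credits for one extra language: translation plus TTS.

    Simplification: both parts are charged on the same character count.
    """
    return n_chars * (TRANSLATION_CREDITS_PER_CHAR + tts_credits_per_char)

def models_request(api_key):
    """Build (but do not send) a GET request for the /v1/models endpoint."""
    return urllib.request.Request(
        "https://api.elevenlabs.io/v1/models",  # assumed base URL
        headers={"xi-api-key": api_key},        # assumed auth header
        method="GET",
    )

def model_ids(response_body):
    """Extract IDs, assuming a JSON list of objects with a "model_id" key."""
    return [m["model_id"] for m in json.loads(response_body)]

# 2,000 characters added in one extra language:
print(added_language_credits(2_000, 1.0))  # with Multilingual v2 as the TTS model
print(added_language_credits(2_000, 0.5))  # with Turbo v2.5 or Flash v2.5

# To actually query the endpoint (needs a real key and network access):
# with urllib.request.urlopen(models_request("YOUR_API_KEY")) as resp:
#     print(model_ids(resp.read()))
```

Under these assumptions, 2,000 characters cost 4,000 credits with Multilingual v2 and 3,000 credits with a v2.5 model, so the model choice changes only the TTS half of the bill.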
If you share your target languages and priorities
Dubbing studio: The default settings for a character's voice model is "recommended (default)". How does it "recommend" a model?
Short answer: ElevenLabs doesn't document any heuristic behind "Recommended (default)" for a character's voice model in Dubbing Studio. The docs only state that Studio/Dubbing projects default to the Multilingual v2 TTS model unless you change the project's model in Project Settings.
What is documented
- Default project model: "The default model is Multilingual v2 for most new projects... You can also choose newer models, including v3, in Project Settings." (elevenlabs.io)
- Platform-wide default when no model is specified is also Multilingual v2. (help.elevenlabs.io)
- Dubbing Studio lets you pick voices/voice-clone modes per speaker track, but the docs don't describe a per-character "recommendation" algorithm for model choice. (elevenlabs.io)
- If you want to choose a model explicitly, use the model selection guidance (e.g., Multilingual v2 for highest quality content creation; Flash v2.5 for low latency; Turbo v2.5 for a balance). (elevenlabs.io)
Practical guidance
- If you need determinism, set the model explicitly in Project Settings (or per-character if available in your build) instead of relying on "Recommended (default)." (elevenlabs.io)
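The same idea applies if you generate speech through the API rather than the Dubbing Studio UI: name the model in every request instead of leaning on the default. Here is a minimal Python sketch of that. The `tts_payload` helper is hypothetical, and the body shape (`text`, `model_id`) is assumed to match the public text-to-speech endpoint, so verify it against the API reference.

```python
import json

# Documented default, per the answer above.
DEFAULT_MODEL = "eleven_multilingual_v2"

def tts_payload(text, model_id=None):
    """Build a request body that always names its model explicitly."""
    return {"text": text, "model_id": model_id or DEFAULT_MODEL}

print(json.dumps(tts_payload("Hello there."), indent=2))
print(json.dumps(tts_payload("Hello there.", "eleven_flash_v2_5"), indent=2))
```

Pinning the ID in your own code means a future change to the platform default would not silently change how your clips sound.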
If you'd like, I can point you to the exact place in your project settings to switch the model.
Sources:
Studio overview | ElevenLabs Documentation
Why does my voice sound different in the API compared to the website? | ElevenLabs
Dubbing Studio | ElevenLabs Documentation
Models | ElevenLabs Documentation
Dubbing studio: How does the use of single and double quotes influence audio generation? And when should punctuation be used?
example) in the sentence <that's what he "said">, should the period come before or after the double quote?
Short answer
- Quotes: ElevenLabs documentation says you can use quotation marks to add emphasis, which can influence prosody. It does not document any difference between single (' ') and double (" ") quotes. (help.elevenlabs.io)
- Punctuation: Punctuation materially affects delivery. Ellipses add pauses/weight, standard punctuation guides natural rhythm, and capitalization increases emphasis. You can also use punctuation to signal interruptions or trailing thoughts. (elevenlabs.io)
- Dubbing Studio: Any edits you make (including punctuation and quotes) in Speaker Cards affect regenerated audio for that clip. (elevenlabs.io)
About your example
- The docs don't specify whether a period should go inside or outside the closing quote. ElevenLabs' guidance is to use proper, natural formatting for your target language; punctuation is then used to shape delivery. If you're generating via API and your text is short/ambiguous, set language_code so the normalizer applies the right language rules. (elevenlabs.io)
Bottom line
- Use quotes (single or double) when you want emphasis.
- Use punctuation deliberately to control pauses and emphasis.
- Follow the punctuation style of your target language; ElevenLabs does not prescribe period placement relative to quotation marks. (elevenlabs.io)
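Because the effect of quotes and punctuation is only loosely documented, the practical approach is to try a few spellings of the same line in a Speaker Card and listen to each regenerated clip. Here is a small Python sketch that produces such variants. The `delivery_variants` helper is hypothetical, and which variant sounds best is something only listening can settle.

```python
def delivery_variants(line, emphasized_word):
    """Return alternative spellings of one line to compare after regenerating."""
    if emphasized_word not in line:
        raise ValueError(f"{emphasized_word!r} does not occur in the line")
    return {
        "plain": line,
        # Quotation marks around the word, for emphasis.
        "quoted": line.replace(emphasized_word, f'"{emphasized_word}"', 1),
        # Capitalization, for stronger emphasis.
        "capitalized": line.replace(emphasized_word, emphasized_word.upper(), 1),
        # Ellipsis instead of the final mark, for a trailing pause.
        "trailing_pause": line.rstrip(".!?") + "...",
    }

variants = delivery_variants("That's what he said.", "said")
for name, text in variants.items():
    print(f"{name}: {text}")
```

The quoted variant comes out as `That's what he "said".` with the period outside the closing quote. That placement is just what the string replacement produces, not a recommendation; follow your target language's convention.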
Sources:
How to produce emotions? | ElevenLabs
Prompting Eleven v3 (alpha) | ElevenLabs Documentation
Dubbing Studio | ElevenLabs Documentation
Text to Speech (product guide) | ElevenLabs Documentation
Dubbing studio: can I use multiple exclamation marks or question marks to emphasize the sentence?