#I am creating a professional voice clone and there is a option to add lables. What does this do and

1 messages · Page 1 of 1 (latest)

pale capeBOT
#
AI-generated Answer for intelligenceassist_48174

Labels in the Professional Voice Cloning process are primarily for organizational purposes. They help you categorize and manage your voice clones within the Voice Library or the My Voices section. These labels do not affect the outcome or quality of the voice cloning itself.

You can use labels to describe attributes like the voice's intended use, language, accent, or any other details that make it easier to identify and organize your voices later. For example, you might add labels like "English," "Narration," or "Podcast."

Feel free to choose labels that make sense for your specific needs!

grim wadi
#

what do stability and similarity do for a proffesional voice clone?

pale capeBOT
#
AI-generated Answer for intelligenceassist_48174

The Stability and Similarity settings for a Professional Voice Clone control how the AI generates speech based on the cloned voice:

  1. Stability: This setting determines how consistent the voice output is. A higher stability value makes the voice sound more uniform and predictable, while a lower value allows for more variation in tone, pitch, and delivery. Lower stability can make the voice sound more dynamic but may also introduce inconsistencies.

  2. Similarity: This setting controls how closely the generated voice matches the original cloned voice. A higher similarity value ensures the output stays very close to the original voice's tone and characteristics. Lowering this value allows for more flexibility in the voice's style but may make it sound less like the original.

For optimal results with the multilingual v2 model, it is recommended to set 90% similarity, 50% stability, and 0% style. However, you can experiment with these settings to achieve the desired output.

grim wadi
#

How do these models differ from eachother?

Eleven Multilingual v2

Eleven Flash v2.5

Eleven Turbo v2.5

Eleven Turbo v2

Eleven Flash v2

Eleven Multilingual v2 (Speech to Speech)

pale capeBOT
#
AI-generated Answer for intelligenceassist_48174

Here’s a breakdown of the differences between the models you mentioned:

Eleven Multilingual v2

  • Purpose: Best for high-quality content creation like voiceovers, audiobooks, and post-production.
  • Languages: Supports 29 languages.
  • Strengths: High stability, exceptional accuracy, and lifelike, emotionally rich speech.
  • Latency: Higher latency compared to other models, making it less suitable for real-time applications.
  • Use Case: Ideal for projects where quality and emotional depth are the priority.

Eleven Flash v2.5

  • Purpose: Ultra-low-latency model optimized for real-time applications.
  • Languages: Supports 32 languages.
  • Strengths: Generates speech in under 75ms, making it the fastest model available.
  • Trade-offs: Slightly lower quality and emotional depth compared to Turbo models.
  • Use Case: Best for developers needing rapid speech generation across multiple languages.

Eleven Turbo v2.5

  • Purpose: Low-latency model designed for real-time conversational AI.
  • Languages: Supports 32 languages.
  • Strengths: Balances speed and quality, generating audio in around 300ms.
  • Trade-offs: Slightly less stylistic range and stability compared to Multilingual v2.
  • Use Case: Suitable for real-time applications requiring natural speech across multiple languages.

Eleven Turbo v2

  • Purpose: Low-latency model focused exclusively on English.
  • Languages: English-only.
  • Strengths: Similar performance to Turbo v2.5 but optimized for English, with low latency.
  • Use Case: Ideal for speed-critical, English-only applications.

Eleven Flash v2

  • Purpose: Ultra-low-latency English-only model.
  • Languages: English-only.
  • Strengths: Generates speech in under 75ms, making it the fastest English model.
  • Trade-offs: Slightly lower quality and emotional depth compared to Turbo models.
  • Use Case: Best for developers needing rapid English speech generation.

Eleven Multilingual v2 (Speech to Speech)

  • Purpose: Designed for speech-to-speech applications.
  • Languages: Multilingual support.
  • Strengths: Focuses on converting speech input into high-quality speech output in multiple languages.
  • Use Case: Ideal for scenarios requiring speech-to-speech transformations, such as dubbing or translation.

Each model is tailored for specific use cases, balancing factors like quality, latency, and language support. For content creation, Multilingual v2 is the best choice, while Flash and Turbo models are optimized for real-time applications.

grim wadi
#

why can't I share my professional voice clone using the voice model sharing option, when i click in the toggle nothing happens

pale capeBOT
#
AI-generated Answer for intelligenceassist_48174

If you're unable to share your Professional Voice Clone using the voice model sharing option, there could be several reasons for this issue:

  1. Compliance with Local Laws: Sharing may be restricted in certain regions due to local laws. For example, sharing is disabled in states like Texas, Washington, and Illinois. If you're located in one of these areas, this could be the reason.

  2. Voice Clone Type: Only Professional Voice Clones can be shared. Instant Voice Clones or voices created using the Voice Design tool are not eligible for sharing. Ensure that the voice you're trying to share is a Professional Voice Clone.

  3. Subscription Plan: Voice sharing options and metrics are only available if you're on the Creator plan or above. If you're on a lower-tier plan, you may need to upgrade your subscription.

  4. Verification Status: Your Professional Voice Clone must be verified before it can be shared. If the verification process is incomplete or failed, the sharing option will not work.

  5. Technical Issues: If none of the above applies, there might be a technical issue. In this case, try refreshing the page, clearing your browser cache, or using a different browser. If the problem persists, you may need to contact customer support for further assistance.

If you're still unsure about the cause, providing a screenshot of the issue or checking the status of your Professional Voice Clone might help diagnose the problem.

grim wadi
#

where can I find the ID of my professional voice clone, is it in the sharing link or somewhere else?