#Im using my own voice ( recorded through the wonderful ElevenLabs professional voice recorder for m
1 messages · Page 1 of 1 (latest)
To achieve nuances like whispering or changing tone in your speech using ElevenLabs, you can try using descriptive dialogue tags in your script. For example, adding phrases like "he said quietly" can guide the AI to produce the desired effect. Experimenting with different phrases might help you achieve the specific tone you want.
Currently, there isn't a specific ElevenLabs-endorsed GPT model for scriptwriting with tonal guidance. However, you can use external tools to draft your script and then refine it with descriptive tags for the desired nuances before using it in ElevenLabs' text-to-speech.
If you have further questions or need assistance with specific features, feel free to ask!
Where do i find descriptive tags
ElevenLabs does not currently offer a GPT-like tool specifically endorsed scriptwriting with tonal and nuanced adjustments. However, you can achieve variations like whispering or changing tone by adjusting the settings in the Voice Lab or using the advanced controls in the Voiceover Studio. These tools allow you to fine-tune the delivery style of your voice, including pitch, pacing, and emotional tone.
For descriptive tags, ElevenLabs does not provide a specific feature for tagging tonalities or nuances directly. You may need to manually describe the desired tone in your script or experiment with the settings to achieve the effect you want.
If you're looking for scriptwriting assistance, ElevenLabs does not currently offer a built-in tool for generating scripts with tonal instructions. You might need to use external tools for scriptwriting and then import the text into ElevenLabs for voice generation. If you encounter issues with third-party tools, it’s best to reach out to their support teams directly.
Let us know if you need further clarification!
I see in the ElevenLabs posted scripts on the website they use these tonality prompts.. [whispering] with the text highlighted in red..is it necessary to highlight the tonality prompts in red for the text to speech to recognize it ?
To achieve different nuances like whispering or changing tone in your speech using ElevenLabs, you can use descriptive dialogue tags. These tags, such as "[whispering]" or "[shouting]", help guide the AI to produce the desired vocal effect. Highlighting these prompts in red is not necessary for the text-to-speech system to recognize them. The key is to include clear and descriptive tags within your text to guide the AI's delivery.
When I tried it in the free platform on ElevenLabs it actually read the word [whispering] instead of changing its tone..is that because it was the free version?
The free plan of ElevenLabs does not support commercial use, but it does not limit features like tone changes or emotional prompts. If the platform is reading the word "[whispering]" instead of adjusting the tone, it may be due to how the prompt is formatted or interpreted by the model, rather than the plan type.
Currently, ElevenLabs does not require text to be highlighted in red for tonal prompts to work. Instead, achieving specific tones or emotions often depends on crafting sentences effectively. You can refer to the 'Prompting Guide for Emotions' for tips on how to write text that triggers desired effects. Additionally, exploring the Voice Library may help you find voices designed for whispering or other nuanced tones.
If you're looking for more advanced control over emotions or tonal changes, ElevenLabs is actively working on expanding its models to provide better user control, though no specific timeline is available for these updates.
I want to use my own voice recorded in the 22 dollar a month professional cloning feature
To use your own voice with nuanced tones like whispering or changes in inflection, you can achieve this by carefully crafting your text input. Adding descriptive prompts like "[whispering]" or using punctuation (e.g., quotation marks or ellipses) can help guide the delivery. However, the platform does not require highlighting prompts in red for recognition—it's the text itself that influences the tone.
If the system reads the word "[whispering]" instead of applying the tone, it may be due to how the text is formatted or interpreted, not because of the free version. Ensure that your voice clone is optimized for expressiveness, and try experimenting with different phrasing or punctuation to achieve the desired effect.
For advanced control over emotions and style, you might explore prompts or use samples that reflect the speaking style you want during the voice cloning process. Monotone samples may result in less dynamic output, so ensure your recordings are expressive and high-quality.
The Instant Voice Cloning feature is available on paid plans starting from the $22/month Starter plan. You can upload a 1-2 minute recording of your voice to create a clone. For more details, visit the My Voice section.
I want the professional voice cloning feature not the instant..how much is that ?
To access the Professional Voice Cloning feature, you need the Creator plan, which costs $22 per month. For the first month, this plan is offered at a discounted rate of $11. This plan also provides 100,000 characters per month, equivalent to approximately 120 minutes of audio. You can find more details about the subscription plans on the pricing page.
How do I ensure my voice clone is " optimized for expressiveness" as you suggested?
To ensure your voice clone is optimized for expressiveness, focus on the following steps:
-
High-Quality Audio Recording: Use a good microphone to record your voice with minimal background noise. Ensure the audio is clear and captures the nuances of your speech.
-
Varied Speech Samples: Provide recordings that showcase a range of tones, emotions, and styles. For example, include samples of whispering, excitement, calm narration, or other expressive elements you want the clone to replicate.
-
Recording Length: Aim for 1-2 minutes of audio. This length is sufficient for the AI to capture the characteristics of your voice while maintaining high quality.
-
Material Selection: Choose scripts or text that reflect the tone and style you want for your voice clone. Reading from books, articles, or custom scripts can help achieve this.
-
Paid Plan Access: Ensure you are subscribed to at least the Starter plan or higher, as Instant Voice Cloning is only available for paid plans. For professional voice cloning, you may need to explore Enterprise options for advanced features.
By following these steps, your voice clone will be better equipped to handle expressive and nuanced speech.
I thought for professional voice cloning you had me read something lengthy..like several hours long..I thought the great Gatsby was your training example?
To optimize your Professional Voice Clone for expressiveness, here are some key recommendations:
-
Audio Quality: Use high-quality audio recordings with no background noise, music, or excessive reverb. Ensure the voice is clear, consistent, and professional-sounding.
-
Duration: While the minimum requirement is 30 minutes of audio, providing around 3 hours of recordings will yield the best results. This allows the AI to capture more nuances and expressiveness in your voice.
-
Content: Select recordings that reflect a variety of tones, emotions, and speaking styles. The dialogue should align with your preferred speaking style and pace.
-
Consistency: Use recordings from a single speaker with consistent tone and microphone quality. Avoid mixing clips of varying quality.
-
Preparation: If needed, use noise removal tools to enhance audio clarity before uploading.
For Professional Voice Cloning, the $22/month plan includes this feature. You can explore more details about subscriptions on the Subscription Page.
Do i have to record the 3 hours all at once or can I do it in pieces and then send it in together to be professionally cloned?
Do you understand what I mean?
You can record your audio in smaller pieces and combine them before submitting for professional voice cloning. It's important that the total duration of the recordings meets the recommended length—ideally closer to 3 hours for optimal results. Ensure all recordings are consistent in quality, featuring a single speaker, free of background noise, music, excessive pauses, and reverb. Combining shorter recordings into longer ones can help maintain quality control and improve the effectiveness of the voice clone.
Is ElevenLabs professional voice cloning ( the 22.00 creator plan ) effective for creating voice over shorts like TikTok or Instagram short form content?