#Wendigos Voice Cloning

1186 messages · Page 2 of 2 (latest)

winged tapir
#

Yeah I can definitely do that! No ETA on when I'll get around to it tho

dim mulch
# winged tapir Yeah I can definitely do that! No ETA on when I'll get around to it tho

Hello I saw a video on this
https://www.youtube.com/watch?v=dQ841Pd6YvQ
It seems to be a free alternative to Eleven Labs, I think
I have no idea if it's as good, or actually free
Just something to look into 🙂

Here's the Qwen3-TTS Demo app I showed in the video: https://huggingface.co/spaces/Qwen/Qwen3-TTS

It's only a matter of time until there's an open UI for it that beats Eleven Labs, all for free.

Read about the time someone cloned my voice for a video training series, unauthorized: https://www.jeffgeerling.com/blog/2024/elecrow-responded-apolo...

winged tapir
# dim mulch Hello I saw a video on this https://www.youtube.com/watch?v=dQ841Pd6YvQ It seems...

Yes I am aware of Qwen3-TTS and am working on adding support for it in VoiceBox! The challenge is the lack of streaming input/output support with current server implementations https://github.com/vllm-project/vllm-omni/issues/938

GitHub

Motivation Qwen3-TTS was initially supported in PR #895 with offline inference. Online serving was added in PR #968 (merged Jan 27). To make Qwen3-TTS production ready, we need to complete the rema...

#

Once a fully fleshed-out server is released with streaming support it will definitely be added

dim mulch
winged tapir
#

Yep! You'll need a decent GPU and will need to set up your own TTS server locally, but once that's done it'll be free

rough fable
#

This looks very interesting. I figure it's possible to make it work in any language..?

winged tapir
rough fable
#

Nice. I'm considering this for a modpack I'm preparing for a big local YT channel/streamer group. Reading the readme I'm not entirely sure but it sounds like everything can be prepared in advance since all players in the lobby share the same API keys for the services via config, correct?

winged tapir
rough fable
# winged tapir Yes that's correct! If you want to use the realtime features I recommend getting...

hi, I'm currently trying to set this up. I want to use realtime responses and I'm attempting to use ChatGPT for chat (gpt-4.1-mini) and Elevenlabs for STT and TTS. I've set the API keys in all three locations, adjusted the language code for elevenlabs (2 char), predefined a voice and entered its voice id. When I'm in-game mimics won't talk at all. Is it because I'm attempting this in a solo lobby?

#

also is the chat model fine or are others much better?

#

can you elaborate on that config sync part, you mentioned? is it only syncing the api keys?
because the voice id needs to be different for each player, right?

winged tapir
winged tapir
rough fable
winged tapir
winged tapir
#

Interesting, seems like I'm getting the same issue on my end. I wonder if v73 changed something

#

I'll be looking into it this evening!

rough fable
#

ok, I'll spare you my log then.

winged tapir
#

The STT and Chat are working normally but for some reason the TTS isn't being played

rough fable
#

what I'm also seeing, but this might be some incompatibility, is masked are spawning directly inside of my player character. Disabling only wendigos and nothing else resolves this.

#

I also checked if it's lostenemyfix mod that places enemies on a nearby navmesh when they would otherwise spam errors due to not finding a valid one but that wasn't it.

winged tapir
#

Ah that may be a debugging function I was using to teleport masked to me, I may have missed commenting it out somewhere

rough fable
#

Just for clarification: any of the chat service providers requires setting up api billing, correct?
I wanted to prepare gemini to try it, made a google cloud account where it said try for 90 days / 500$ but it won't let me set up a billing profile for that account for some reason. Or rather it won't let me connect the billing profile I created to the account. I'll just give it some time ig

winged tapir
#

ok so I resubscribed to elevenlabs and I can hear the masked now. Do you have an active subscription?

rough fable
#

Yea, I bought the 5$ tier today

winged tapir
winged tapir
rough fable
winged tapir
rough fable
#

ig that one is probably with a gemini api key that may not be working

winged tapir
#

Were you speaking during that session? I don't see any STT detections. Is VoiceMeeter Output (VB-Audio VoiceMeeter VAIO) the correct input device? @rough fable

rough fable
#

That is the correct one yea. I also have some mod that displays the voice activation icon for the normal game voice stuff

#

let me make sure I'm speaking. I'm restarting continuously so maybe I didn't say anything there.

winged tapir
#

In the console after you speak you should see [Wendigos STT]: RECOGNIZED:

#

It looks like the STT is initializing properly, so I suspect it can't hear you for some reason if that isn't coming up

rough fable
#

this looks sus tho..?

[Wendigos Log] Clips count: 0
[Debug  :GeneralImprovements] Updated time display.
[Debug  :GeneralImprovements] Updated time display.
[Debug  :GeneralImprovements] Updated time display.
[Debug  :GeneralImprovements] Updated time display.
[Wendigos Log] Clips count: 0
[Debug  :GeneralImprovements] Updated time display.
Saving changed settings
[AI Manager] Starting speech recognition.
[Wendigos Log] Set to VoiceMeeter Output (VB-Audio VoiceMeeter VAIO)
[Info   :LethalPerformance] Saved 1 save(s)
[Debug  :GeneralImprovements] Updated time display.
Connected to ElevenLabs Scribe (NAudio).
Device 'VoiceMeeter Output (VB-Audio VoiceMeeter VAIO)' not found. Defaulting to device 0.
[Debug  :GeneralImprovements] Updated time display.
[Debug  :GeneralImprovements] Updated time display.
[Debug  :GeneralImprovements] Updated time display.
[Debug  :GeneralImprovements] Updated time display.
[Debug  :GeneralImprovements] Updated time display.
[Wendigos Log] Clips count: 0
#

Doesn't seem to be an issue with VoiceMeeter and the virtual audio stuff either. Tried selecting my usb mic directly.

#

got this again though

Connected to ElevenLabs Scribe (NAudio).
Device 'Microphone (SC440 USB Microphone)' not found. Defaulting to device 0.
winged tapir
#

Interesting, it looks like the speech sdk I'm using isn't recognizing the device identifier for some reason. Can you run the game and use the shortcut SHIFT + V + B to open the voicebox GUI and tell me what input devices are listed there? @rough fable
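
For readers hitting the same "not found. Defaulting to device 0" lines: one possible cause, not confirmed in this thread, is name truncation. The legacy Windows wave-in API reports device names in a fixed 32-character buffer, so long names such as the two above come back cut to 31 characters and no longer equal the full configured name. A minimal sketch of a tolerant lookup, in Python for illustration only (`find_input_device` and `MAX_LEGACY_NAME_LENGTH` are hypothetical names, not part of VoiceBox or NAudio):

```python
from typing import Sequence

# Assumption: the capture API returns names cut to 31 characters.
MAX_LEGACY_NAME_LENGTH = 31


def _comparison_key(name: str) -> str:
    """Normalize a device name so full and truncated forms compare equal."""
    return name.strip().lower()[:MAX_LEGACY_NAME_LENGTH].rstrip()


def find_input_device(configured_name: str, reported_names: Sequence[str]) -> int:
    """Return the index of the matching device, or 0 as the fallback."""
    wanted = _comparison_key(configured_name)
    for index, reported in enumerate(reported_names):
        if wanted and _comparison_key(reported) == wanted:
            return index
    # Same fallback the log describes: use device 0 when nothing matches.
    return 0
```

Comparing only the first 31 characters can confuse two devices that share a long prefix, so an exact match should still be preferred when the API exposes full names.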

rough fable
#

the recording in that gui also works fwiw

winged tapir
winged tapir
#

Actually I just went ahead and published VoiceBox v0.3.5 to Thunderstore! You should be able to update it in about an hour. Try updating and let me know if it's fixed!

rough fable
rough fable
#

Hmm, still can't get it to work and not seeing that [RECOGNIZED] you mentioned. Does this look as expected?

[Wendigos Log] Created GUI Manager
[Wendigos Log] Chat Manager Object is: null
[Wendigos Log] Clearing chared masked dict
[Wendigos Log] STT MANAGER IS: _AIManager (UnityEngine.GameObject)
[AI Manager] Starting speech recognition.
[Wendigos Chat] Creating chat manager object. Disregard "Service config is null" errors.
[ServiceFactory] Chat service config is null. No chat service will be created.
[Wendigos Chat] Created Chat manager.
[Wendigos Log] CLIENT IDS: 0 xAthrNtCydF0CrAmyO2f

When I created the ElevenLabs API key I restricted it to TTS and STT endpoints. Is that fine? Or is the problem that it's not even capturing any clips for me -> the only reoccuring log I see during rounds: [Wendigos Log] Clips count: 0?

winged tapir
#

All that is normal and your Elevenlabs setup is fine. That message displays whenever a masked tries to play an idle clip when there aren't any. Seems like the input device still isn't being detected. Do you get the same "defaulting to device 0" error?

#

Also, are you on Windows or linux?

rough fable
#

No, the device 0 error is gone. I'm on Windows

#

I just tried a less bloated profile. Will send log

winged tapir
#

Interesting, so my fix did resolve the NAudio device name issue at least.

winged tapir
rough fable
winged tapir
#

I don't see anything out of the ordinary at first glance... a few more debugging steps we can try:

  1. Try using your microphone directly instead of voicemeeter
  2. If you're up for it you can try setting up an Azure speech service (free tier) and trying that for STT. There is a guide on the mod page that walks through how to set one up!
rough fable
#

Even tried on my phone and mobile network.. I love microsoft 😂

rough fable
#

ok finally got azure set up. worked immediately.

winged tapir
#

Ok that's super weird then! Once I'm home I'll look into what's up with the Elevenlabs STT.

If you want, you can install the mod asyncloggers which shows logs from async services like the STT backend. That might illuminate what's up with Elevenlabs

rough fable
#

I have that installed. How can I view the logs?

winged tapir
#

One more debugging step before I add a bunch of logging to a custom voicebox dll, can you try Elevenlabs STT with an unrestricted api key and the language code set as eng?

rough fable
#

sure

#

nope, doesn't recognize my speech anymore.

#

ISO 639 is a standardized nomenclature used to classify languages. Each language is assigned a two-letter (set 1) and three-letter lowercase abbreviation (sets 2–5). Part 1 of the standard, ISO 639-1, defines the two-letter codes, and Part 3 (2007), ISO 639-3, defines the three-letter codes, aiming to cover all known natural languages, largely...

#

I was using de before, not deu or ger

winged tapir
#

both should work, I've been using eng so I was double checking

rough fable
#

ok 👍

winged tapir
#

Time to write up some logging haha

#

Alright here's a modified VoiceBox dll with super verbose Elevenlabs STT logging. Place it in {game folder or mod profile folder}/BepInEx/plugins/Tim_Shaw-VoiceBox/ and replace the old VoiceBoxModLib.Core.dll

rough fable
# winged tapir

hmm, I should be getting spammed with new logs ig but I don't see anything different.

#

Here

winged tapir
#

It did show the issue tho! Raw response: {"message_type":"invalid_request","error":"Invalid vad_threshold: '0,4'. Must be a number between 0.1 and 0.9"}

#

now I just have to figure out why on earth that's a comma and not a .

#

ohhhh are you european by chance? There might be some localization going on with the decimal being converted to a string!
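
A minimal sketch of why a regional setting produces `'0,4'`, written in Python for illustration rather than in the mod's own code. `format_localized` only simulates culture-sensitive formatting, and `validate_vad_threshold` is a hypothetical stand-in that mirrors the server's error message; neither is part of VoiceBox or the ElevenLabs API. In .NET, the usual way to avoid this class of bug is to format numbers with `CultureInfo.InvariantCulture` when building a request; whether that is what the fix does is an assumption.

```python
def format_invariant(value: float) -> str:
    """Format with '.' as the decimal separator, whatever the OS locale."""
    return format(value, ".1f")


def format_localized(value: float, decimal_separator: str = ",") -> str:
    """Simulate culture-sensitive formatting on a system that uses ','."""
    return format(value, ".1f").replace(".", decimal_separator)


def validate_vad_threshold(raw: str) -> float:
    """Hypothetical stand-in for the server-side check seen in the log."""
    message = f"Invalid vad_threshold: '{raw}'. Must be a number between 0.1 and 0.9"
    try:
        number = float(raw)  # float() rejects "0,4"
    except ValueError:
        raise ValueError(message) from None
    if not 0.1 <= number <= 0.9:
        raise ValueError(message)
    return number
```

With these helpers, `validate_vad_threshold(format_invariant(0.4))` succeeds, while `validate_vad_threshold(format_localized(0.4))` raises the same kind of error the server returned.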

rough fable
#

ohhh, the classic regional bullshittery haha

#

I am yes

winged tapir
#

That is crazy that that was the issue, how does that even happen lmaoooo

#

Fixing rn and will publish VoiceBox v0.3.6 soon!

rough fable
#

yup, works!

winged tapir
# rough fable yup, works!

Sweet! I'll publish the updated version shortly. Is it cool if I credit you in the changelog/readme for helping fix this issue? and if so what @ should I use to tag you?

rough fable
#

yea sure. @rough fable is fine ig

#

interesting. azure struggled a lot when I was using any English terms mid sentence or even entire English sentences but ElevenLabs just translates it instantly lol

winged tapir
#

Published updates for VoiceBox and Wendigos! Should be available in about an hour

haughty oriole
#

Can we get an update that adds a config to restore the mimics back to their original form? with the mask and the arms out animation?

rough fable
#

Hmm, any idea why sometimes I can't hear their voices?

[ElevenLabs STT] Raw response: {"message_type":"partial_transcript","text":"What's up?"}
[Info   :NaturalSelection]  Missing data container for (Urchin|ID: 365). Creating new data container...
[Debug  :NaturalSelection] (Urchin|ID: 365) Final size: Small
[Info   :NaturalSelection]  Missing data container for (Urchin|ID: 366). Creating new data container...
[Debug  :NaturalSelection] (Urchin|ID: 366) Final size: Small
[ElevenLabs STT] Raw response: {"message_type":"committed_transcript","text":"What's up?"}
[ElevenLabs STT] Raw response: {"message_type":"committed_transcript_with_timestamps","text":"What's up?","language_code":null,"words":[{"text":"What's","start":23.179,"end":23.439,"type":"word","speaker_id":null,"logprob":-0.127381960550944,"characters":[{"text":"W","start":23.179,"end":23.199},{"text":"h","start":23.199,"end":23.219},{"text":"a","start":23.219,"end":23.239},{"text":"t","start":23.239,"end":23.359},{"text":"'","start":23.359,"end":23.359},{"text":"s","start":23.379,"end":23.439}]},{"text":" ","start":23.439,"end":23.519,"type":"spacing","speaker_id":null,"logprob":-0.00238037109375,"characters":[{"text":" ","start":23.439,"end":23.519}]},{"text":"up?","start":23.519,"end":23.699,"type":"word","speaker_id":null,"logprob":-0.10380299886067708,"characters":[{"text":"u","start":23.519,"end":23.639},{"text":"p","start":23.639,"end":23.699},{"text":"?","start":23.699,"end":23.699}]}]}
[Wendigos STT] RECOGNIZED: What's up?
Added clip successfully.
[Wendigos Log] COUNT: 5
[Wendigos Log] Masked dist is: 232,9006
[Wendigos Log] Masked dist is: 233,5887
[Wendigos Log] Masked dist is: 65,12283
[Wendigos Log] Masked dist is: 22,0806
[Wendigos Log] Masked dist is: 8,81206
Yo : VBGbA9UvwZJjAc14FkCX
Streaming is already in progress.
 : VBGbA9UvwZJjAc14FkCX
Streaming is already in progress.
  : VBGbA9UvwZJjAc14FkCX

I think it's whenever it's printing that warning 'Streaming is already in progress'.

winged tapir
#

Each masked is assigned an exclusive user to mimic so only 1 masked will speak.

#

Or is that masked speaking only sometimes?

rough fable
#

Hmm, yes at first he was only occasionally responding. Then I tried a few more rounds and couldn't get any voice out of him despite it printing the responses in console.

#

That was in the debug profile with less masked spawning. Oh, actually I should still have that log too.

#
[23:36:11.8313239] [Info   : Unity Log] [ElevenLabs STT] Raw response: {"message_type":"committed_transcript","text":"Test, Test,"}
[23:36:11.8894070] [Info   : Unity Log] [ElevenLabs STT] Raw response: {"message_type":"committed_transcript_with_timestamps","text":"Test, Test,","language_code":null,"words":[{"text":"Test,","start":25.471,"end":25.971,"type":"word","speaker_id":null,"logprob":-0.560723876953125,"characters":[{"text":"T","start":25.471,"end":25.551},{"text":"e","start":25.551,"end":25.711},{"text":"s","start":25.711,"end":25.791},{"text":"t","start":25.791,"end":25.971},{"text":",","start":25.971,"end":25.971}]},{"text":" ","start":25.971,"end":26.251,"type":"spacing","speaker_id":null,"logprob":-0.0086669921875,"characters":[{"text":" ","start":25.971,"end":26.251}]},{"text":"Test,","start":26.251,"end":26.831,"type":"word","speaker_id":null,"logprob":-1.0394973754882812,"characters":[{"text":"T","start":26.251,"end":26.352},{"text":"e","start":26.352,"end":26.431},{"text":"s","start":26.431,"end":26.651},{"text":"t","start":26.651,"end":26.831},{"text":",","start":26.831,"end":26.831}]}]}
[23:36:11.8924078] [Info   :   Console] [Wendigos STT] RECOGNIZED: Test, Test,
[23:36:11.9001272] [Info   :   Console] Added clip successfully.
[23:36:11.9014081] [Info   :   Console] [Wendigos Log] COUNT: 2
[23:36:11.9014081] [Info   :   Console] [Wendigos Log] Masked dist is: 2,908579
[23:36:11.9014081] [Info   :   Console] [Wendigos Log] Masked dist is: 2,314281
[23:36:12.0412957] [Info   : Unity Log] Received set ship lights RPC. Lights on?: False
[23:36:12.4092602] [Info   :   Console] Hier. : xAthrNtCydF0CrAmyO2f
[23:36:12.4092602] [Warning: Unity Log] Streaming is already in progress.
[23:36:12.4092602] [Info   :   Console]  Hört mich? : xAthrNtCydF0CrAmyO2f
[23:36:12.4103907] [Warning: Unity Log] Streaming is already in progress.
[23:36:12.4103907] [Info   :   Console]   : xAthrNtCydF0CrAmyO2f
#

So here both masked were right next to me. I should have heard something I assume.

winged tapir
rough fable
winged tapir
rough fable
#

Tried it on my smaller mod profile. There was one mimic.
I was constantly talking. At the very start the mimic was playing a few random clips (original recordings), some of which abruptly cut out.

Other than that I couldn't hear any of the ai chat responses that were printed to console/log.

#

Now, what's weird to me is that the ElevenLabs TTS was working before when I was using Azure for STT, right? Let me check that again.

#

Yeah, no.. Exact same thing with Azure STT. Maybe the changes that fixed the regional serialization broke the TTS stuff in some way?

winged tapir
#

I'm not sure what's going on because on my end everything is working fine. I even did a few rounds and spawned masked, and the masked that is mimicking me always responds

#

@rough fable here is an updated VoiceBox build with verbose logging on the Elevenlabs TTS side

winged tapir
rough fable
#

Ohhhh, I'm just an idiot. Debug logs made me realize I only updated one of the ElevenLabs API keys in this profile. Sorry for the confusion :/

#

It's all good now. Thanks again!

winged tapir
#

Great to hear it's working!

winged tapir
rough fable
#

When I quit the game to main menu and load back in, in the new round I can't hear mask voices and get this:

#
[Wendigos STT] RECOGNIZED: Ich geh schon mal rein.
Added clip successfully.
[Wendigos Log] COUNT: 1
[Wendigos Log] Masked dist is: 6,949234
Noise heard relative loudness: 0,05536064
Noise heard relative loudness: 0,04317421
System.AggregateException: One or more errors occurred. (A task was canceled.) ---> System.Threading.Tasks.TaskCanceledException: A task was canceled.
   --- End of inner exception stack trace ---
  at System.Threading.Tasks.Task.ThrowIfExceptional (System.Boolean includeTaskCanceledExceptions) [0x00011] in <1071a2cb0cb3433aae80a793c277a048>:IL_0011 
  at System.Threading.Tasks.Task.Wait (System.Int32 millisecondsTimeout, System.Threading.CancellationToken cancellationToken) [0x00043] in <1071a2cb0cb3433aae80a793c277a048>:IL_0043 
  at System.Threading.Tasks.Task.Wait () [0x00000] in <1071a2cb0cb3433aae80a793c277a048>:IL_0000 
  at TimShaw.VoiceBox.Core.ElevenLabsTTSServiceManager.InitWebsocket (System.Net.WebSockets.ClientWebSocket webSocket, TimShaw.VoiceBox.Components.StreamingAudioDecoder audioDecoder, System.Threading.CancellationToken token) [0x000a9] in <489e94a5e069496b9e62857fcfcf408c>:IL_00A9 
  at TimShaw.VoiceBox.Components.AudioStreamer.InitStreaming (TimShaw.VoiceBox.Core.ITextToSpeechService service, System.Threading.CancellationToken token) [0x00096] in <489e94a5e069496b9e62857fcfcf408c>:IL_0096 
  at TimShaw.VoiceBox.Components.TTSManager.RequestAudioAndStream (System.String promptChunk, System.Boolean isFinalSegment, TimShaw.VoiceBox.Components.AudioStreamer audioStreamer) [0x0000f] in <489e94a5e069496b9e62857fcfcf408c>:IL_000F 
  at Wendigos.ElevenLabs.StreamAudioChunk (System.String promptChunk, System.String voiceID, System.Boolean isFinalSegment, TimShaw.VoiceBox.Components.AudioStreamer audioStreamer) [0x0002e] in <3166c4302a5c4344ba6304c4122e45e3>:IL_002E 
---> (Inner Exception #0) System.Threading.Tasks.TaskCanceledException: A task was canceled.<---
haughty oriole
winged tapir
cold slate
#

when playing recorded clips, when / how likely is the mod to try to pick an "appropriate" response?

#

like if a masked hears you address it, will it always try to respond appropriately, or could it say something unrelated?
and how much context does it have? just your question, or its past responses to you?

cold slate
#

also

#

I know local voice cloning was abandoned, but any chance we could use a local llm for smart clip selection?

winged tapir
winged tapir
winged tapir
#

Also, I'm working on adding Qwen3-TTS support for local voice cloning but I haven't found a suitable local server that supports streaming input/output

cold slate
cold slate
cold slate
cold slate
#

and great for my privacy-minded friends lol

rough fable
#

Been goofing around with this and had some interesting conversations already lol
the German TTS is a little robotic sometimes. Is there a noticeable difference between voices created on the lowest ElevenLabs tier vs the creator tier? Do you know, Tim?

#

Also tried some prompt engineering to improve the responses but somehow it always ends up with the AI mocking me and calling me names xD

#

Now, if I could request one feature, well.. more of an enhancement, that'd be a config setting that maps voice IDs to steam IDs and syncs them automatically to clients. Like steamID1:voiceID1, steamID2:voiceID2. I figure since you already have the sync with clients it might be an easy addition and very nice QoL unless I'm missing something..?
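
A minimal sketch of how the requested `steamID1:voiceID1, steamID2:voiceID2` setting could be parsed, in Python for illustration only. The function name `parse_voice_map` and the rule that malformed entries raise an error are assumptions; the mod has no such setting in this thread.

```python
from typing import Dict


def parse_voice_map(raw: str) -> Dict[str, str]:
    """Parse 'steamID:voiceID' pairs separated by commas into a dict."""
    mapping: Dict[str, str] = {}
    for entry in raw.split(","):
        entry = entry.strip()
        if not entry:
            continue  # tolerate trailing commas and empty config values
        steam_id, separator, voice_id = entry.partition(":")
        steam_id, voice_id = steam_id.strip(), voice_id.strip()
        if not separator or not steam_id or not voice_id:
            raise ValueError(f"Malformed voice mapping entry: {entry!r}")
        mapping[steam_id] = voice_id
    return mapping
```

A host could parse the string once and sync the resulting mapping to clients, so each player only needs to be looked up by Steam ID.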

rough fable
#

I guess this works fine with Zombies or other mods that spawn masked in non-native ways..?

rough fable
#

Can you tell me what API endpoints are required in ElevenLabs for experimental?

haughty oriole
#

@winged tapir does it work with v80?

haughty oriole
#

No it broke

zinc bramble
#

Hi, whenever I try loading into a moon, I get this in the BepInEx window

#

and i can't load in

#

tbh, this is way too much hassle since i'm looking at the voicebox tutorial.

winged tapir
winged tapir
winged tapir
# zinc bramble

I can't determine much from just this log message. Can you post the entire log?

haughty oriole
#

I deleted it, sorry.

#

Maybe it fixed itself with the new April 1st update

winged tapir
#

Seems to be working on my end for the moment at least

winged tapir
haughty oriole
winged tapir
#

looks like they uploaded a fix recently, should hopefully be working once the update is published on Thunderstore

haughty oriole
#

@winged tapir Here is the log to diagnose the issue with v81

rough fable
#

Hmm, may not be related at all but I've had the game hard crash on me twice over the last week or so and both times it was immediately after an ElevenLabs error like this

ElevenLabs Error [resource_exhausted]
[ElevenLabs STT] WebSocket closed by server. Status: NormalClosure, Description: resource_exhausted
[Wendigos STT] Speech to Text service cancelled: Reason=Error Error: ElevenLabs Error [resource_exhausted]
Attempting to reconnect... [1/3]
[AI Manager] Starting speech recognition.
[Info   :BellMonster] Playing bellStep sound (Chasing)
Connected to ElevenLabs Scribe (NAudio).
Crash!!!
#
========== OUTPUTTING STACK TRACE ==================

0x00007FFA59134A16 (ntdll) RtlWaitOnAddress
0x00007FFA590FFCB4 (ntdll) RtlEnterCriticalSection
0x00007FFA590FFAE2 (ntdll) RtlEnterCriticalSection
0x00007FFA1A2CFA57 (winmmbase) mmTaskCreate
0x00007FFA1A2D17C9 (winmmbase) waveInPrepareHeader
0x000001DEF7BCC258 (Mono JIT Code) (wrapper managed-to-native) NAudio.Wave.WaveInterop:waveInPrepareHeader (intptr,NAudio.Wave.WaveHeader,int)
0x000001DEF7BCCB7B (Mono JIT Code) NAudio.Wave.WaveInBuffer:Reuse ()
0x000001DEF7BCC8D3 (Mono JIT Code) NAudio.Wave.WaveInEvent:DoRecording ()
0x000001DEF7BCC703 (Mono JIT Code) NAudio.Wave.WaveInEvent:RecordThread ()
0x000001DEF7BCC68B (Mono JIT Code) NAudio.Wave.WaveInEvent:<StartRecording>b__29_0 (object)
0x000001DEF7BCC629 (Mono JIT Code) System.Threading.QueueUserWorkItemCallback:WaitCallback_Context (object)
0x000001DB9AADBEEE (Mono JIT Code) System.Threading.ExecutionContext:RunInternal (System.Threading.ExecutionContext,System.Threading.ContextCallback,object,bool)
0x000001DB9AADB96B (Mono JIT Code) System.Threading.ExecutionContext:Run (System.Threading.ExecutionContext,System.Threading.ContextCallback,object,bool)
0x000001DD35FA3893 (Mono JIT Code) System.Threading.QueueUserWorkItemCallback:System.Threading.IThreadPoolWorkItem.ExecuteWorkItem ()

...
#

if you have any use for the full dmp or log lmk

rough fable
#

happened again

Update used by player client rpc has been called for interact trigger: DriverSeatTrigger
Triggering animated object trigger bool: setting to False
StopIgnition is disabled! Netcode for GameObjects does not support disabled NetworkBehaviours! The InteractTrigger component was skipped during ownership assignment!
StopIgnition is disabled! Netcode for GameObjects does not support disabled NetworkBehaviours! The InteractTrigger component was skipped during ownership assignment!
[Info   :GeneralImprovements] Applied 2 queued monitor changes.
[Debug  :GeneralImprovements] Updated time display.
[Debug  :GeneralImprovements] Updated time display.
Starting ignition!!!
ElevenLabs Error [resource_exhausted]
[ElevenLabs STT] WebSocket closed by server. Status: NormalClosure, Description: resource_exhausted
[Wendigos STT] Speech to Text service cancelled: Reason=Error Error: ElevenLabs Error [resource_exhausted]
Attempting to reconnect... [2/3]
[AI Manager] Starting speech recognition.
[Debug  :GeneralImprovements] Updated time display.
Connected to ElevenLabs Scribe (NAudio).
Crash!!!
#

same stack trace

#

I have a feeling it always happens when this resource exhausted error occurs at the same time some audio source in-game is trying to play sound. this time it was the exact moment I attempted to start the cruiser. last time it was the exact moment the bellmonster attempted to play its aggro sound.
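
The log shows a bounded reconnect ("Attempting to reconnect... [1/3]") followed by a crash inside the audio capture thread. A minimal sketch of one defensive pattern, in Python for illustration only: stop the previous capture before each reconnect attempt and back off between attempts. That the crash comes from an old recorder still running is an assumption, not something established in this thread, and `reconnect_with_retries`, `connect` and `stop_capture` are hypothetical names.

```python
import time
from typing import Callable


def reconnect_with_retries(
    connect: Callable[[], bool],
    stop_capture: Callable[[], None],
    max_attempts: int = 3,
    base_delay_seconds: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Try to reconnect up to max_attempts times with exponential backoff."""
    for attempt in range(1, max_attempts + 1):
        # Release the old recorder first so two captures never overlap.
        stop_capture()
        if connect():
            return True
        if attempt < max_attempts:
            # Wait 0.5 s, 1 s, 2 s, ... before the next attempt.
            sleep(base_delay_seconds * 2 ** (attempt - 1))
    return False
```

Passing `sleep` as a parameter keeps the function testable without real delays.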

hot hinge
#

This mod seems like a really cool concept. Is it taxing for the host's pc?

winged tapir
winged tapir
rough fable
winged tapir
haughty oriole
graceful stone
#

Hey! Really cool concept for a mod, but I am having issues getting it to work on my end. I know it is quite experimental, but I paid for an ElevenLabs API key and now I am invested.

I have my error below. I believe I am passing the API keys correctly through the config and everything. I am trying to use the voice cloning through ElevenLabs, and it does actually have a cloned version of my voice (super cool, very spooky), but it seems from the logs that whenever it tries to pull from that voice, it fails and defaults to playing random voice clips it pulls (that works).

```
Parameter name: key
  at System.Collections.Generic.Dictionary`2[TKey,TValue].FindEntry (TKey key) [0x00008] in <1071a2cb0cb3433aae80a793c277a048>:IL_0008
  at System.Collections.Generic.Dictionary`2[TKey,TValue].get_Item (TKey key) [0x00000] in <1071a2cb0cb3433aae80a793c277a048>:IL_0000
  at Wendigos.STT.SendToChatAndStreamAudioResponse (MaskedPlayerEnemy closest_masked, System.String playerName, System.String player_speech) [0x0008a] in <dfeb82a886cc482f86134eef5ca43b65>:IL_008A
  at Wendigos.STT+<>c__DisplayClass20_0.<InitCallbacks>b__2 () [0x00161] in <dfeb82a886cc482f86134eef5ca43b65>:IL_0161```

Also the Azure STT connection occasionally fails and crashes my game, but that doesn't happen too often. I am also down to test just anything to get this working.

Is this an error with the mod? Let me know if you need more logs!
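
A note on reading the trace above: its first line was cut off in the paste, but "Parameter name: key" raised from `Dictionary.get_Item` via `FindEntry` is what .NET reports when a dictionary is indexed with a null key. That would fit a player name or ID that was never filled in, though this is an inference and not confirmed here. A minimal sketch of a defensive lookup, in Python for illustration only (`resolve_voice_id` and its arguments are hypothetical, not the mod's actual code):

```python
from typing import Dict, Optional


def resolve_voice_id(voice_ids: Dict[str, str], player_key: Optional[str]) -> Optional[str]:
    """Return the cloned voice ID for a player, or None if it cannot be resolved."""
    if player_key is None:
        # Missing key: let the caller fall back to recorded clips.
        return None
    # .get() also returns None when the player has no cloned voice yet.
    return voice_ids.get(player_key)
```

Returning `None` lets the caller choose the fallback explicitly instead of relying on an exception handler to do it.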
rough fable
#

I'm curious, are you using auto cloning? I haven't seen that error you sent on my end and it sounds like that's the only part between our two setups that could be different.

leaden lark
#

fixed the issue but can't tell you how. Just did a verify files and reimported modpack

leaden lark
#

Different issue though - If I say "Can you hear me?" to a masked it repeats "Can you hear me?" like six times and replies at the same time "Yeah I can hear you". I tried turning max voice clips to the minimum (1) but it didn't make a difference; wondering if I fucked up somewhere. Example here, I went 'oh' and it got really hung up on that but still replied to other stuff. https://streamable.com/n1j26p

graceful stone
#

i will try fix later today i suppose

rough fable
#

to clarify: I'm not using auto cloning
but yeah, i don't even know if it's related at all. just a guess

graceful stone
#

it's not really crashing, just the program not working and defaulting to spamming audio clips

wraith tartan
#

I don't know why, but I just can't get it to work.
Tried (a paid) Elevenlabs API for STT first, but that seems broken. Then switched to a free Azure subscription, I see it sent a few clips and then just hit me with a:
[Connection was closed by the remote host. Error code: 1007. Error details: Quota exceeded. Cid: SessionId: e10e36ee2a8f49fe85f013575773ddbc
Although when I check Azure, the quota doesn't seem exceeded and it tells me I only made 5 requests in the last 24hrs.

Does anyone else have an idea as to what's happening?
Appreciate it!

EDIT: I used a paid Azure subscription and that worked. Seems like some quota thing still caused it, or I did something wrong. Anyway it's fixed, sounds great!

wraith tartan
#

~~I had it working for a while, but ran into request issues. But for some reason I now keep getting: ~~

[Warning: Unity Log] Audio source failed to initialize audio spatializer. An audio spatializer is specified in the audio project settings, but the associated plugin was not found or initialized properly. Please make sure that the selected spatializer is compatible with the target.

~~I'm fairly certain it has to do with the Wendigos plugin, as it starts spamming my logs whenever I talk to a nearby masked.~~ Any clues as to what I might've changed accidentally?

Okay the top part resolved itself, but didn't seem to cause it.

So the problem seems to be in the usage of API keys.
It's weird to me since it prints these messages in the log:

[Info   :   Console] [Wendigos STT]: Creating AI manager object for STT service. Disregard "Service config is null" errors.
[Warning: Unity Log] [ServiceFactory] Chat service config is null. No chat service will be created.
[Warning: Unity Log] [ServiceFactory] STT service config is null. No STT service will be created.
[Warning: Unity Log] [ServiceFactory] TTS service config is null. No TTS service will be created.
[Warning: Unity Log] Chat service API key not found.
[Warning: Unity Log] TTS service API key not found.
[Info   :   Console] [Wendigos STT] STT Service Created.

Although, these API keys work (ChatGPT for Chat and Elevenlabs for TTS and STT).
Later in the logs it is confirmed that at least the chat service works (it gives me a reply to run through TTS), although it prints it in stream mode. I don't think this is an issue, since it gave a reply to my question one single time, a few days ago, with that same setup.

[Info   :   Console] [Wendigos STT] RECOGNIZED: Okay, so what's going on now?
[Info   :   Console] Added clip successfully.
[Info   :   Console] [Wendigos Log] COUNT: 1
[Info   :   Console] [Wendigos Log] Masked dist is: 6.560073
[Warning: Unity Log] PlayOneShot was called with a null AudioClip.
[Debug  :  Imperium] [PROFILE] Objects refresh time : 11
[Debug  :  Imperium] [PROFILE] Total objects refresh time : 11
[Info   :   Console]  : cxG7uDG1BbuP6ObKuREX
[Info   :   Console] Not : cxG7uDG1BbuP6ObKuREX
[Info   :   Console]  much : cxG7uDG1BbuP6ObKuREX
[Info   :   Console] , : cxG7uDG1BbuP6ObKuREX
[Info   :   Console]  just : cxG7uDG1BbuP6ObKuREX
[Info   :   Console]  trying : cxG7uDG1BbuP6ObKuREX
[Info   :   Console]  to : cxG7uDG1BbuP6ObKuREX
[Info   :   Console]  survive : cxG7uDG1BbuP6ObKuREX
[Info   :   Console] ! : cxG7uDG1BbuP6ObKuREX
[Info   :   Console]  What : cxG7uDG1BbuP6ObKuREX
[Info   :   Console]  about : cxG7uDG1BbuP6ObKuREX
[Info   :   Console]  you : cxG7uDG1BbuP6ObKuREX
[Info   :   Console] ? : cxG7uDG1BbuP6ObKuREX
[Info   :   Console]  : cxG7uDG1BbuP6ObKuREX
[Info   :   Console]  : cxG7uDG1BbuP6ObKuREX
[Info   :   Console]   : cxG7uDG1BbuP6ObKuREX
[Debug  :  Imperium] [PROFILE] Objects refresh time : 11
[Debug  :  Imperium] [PROFILE] Total objects refresh time : 11
[Debug  :  Imperium] [PROFILE] Objects refresh time : 11
[Debug  :  Imperium] [PROFILE] Total objects refresh time : 11

Elevenlabs usage/history only shows status codes 200 and 1000 (which are okay according to their docs)...
Any clue as to what's going on? And what I can do to make it work (again) 😄
Sorry for spamming the discord. 🙂

haughty oriole
#

@winged tapir
