#How is Neuro made/programmed?
1 messages · Page 1 of 1 (latest)
Vedal likes to keep it on the low so most of the info is speculation I think
I’m assuming so that people don’t copy the ai?
yea
I remember an old FAQ post stating that her AI stuff/game interactions was coded in Python, and her VTuber stuff was done in C#
C#? (I know nothing abt ai lol)
he has also mentioned the twins running on a Unity project tho idk if that's fully true
the programming language
Ah
the power of vibes
Is her personality created by Vedal or is she learning it on her own?
what if i turned this funny knob
You can't really "create" a personality unless I'm misunderstanding you
her personality was developed over time from the start. vedal can drill things into her, but it's mostly just her
don't remind me of the drilling incident
How does that work if she doesn’t have a human brain? Like her code internet searches and picks and chooses?
These might be dumb questions
no, she has her own LLM model afaik
but she can search google
or rather as far as we speculated
So a large language model does what? Grabs different trigger words from a list?
I highly recommend looking at an explainer for what an LLM is and what it does, etc... might answer some of your questions
Ask chatgpt fr
An LLM is a mathematical function that takes in some text and spits out what the next word could be. For example, The dog shook could lead to The dog shook it's or The dog shook wildly. You then feed the text through the LLM again to get the rest of the sentence. (e.g. The dog shook wildly until it was dry.).
However thinking of the LLM as autocomplete might make you underestimate how good it is at guessing the next word. It not only understands grammar but also the meaning of the words and how to write longer stuff.
Suppose you gave it a script like so:
Watson: (looks inside suitcase) How gruesome.
Sherlock: Fr fr they got it twisted. (crouches) This smells of an organophosphate, no cap.
Watson: And what does that mean, pray tell?
Sherlock:
The next few words/lines generated by a good modern LLM will satisfy the following criteria. It will maintain the dialogue format, making Sherlock speak in the first person. It will answer Watson's question. It will demonstrate Sherlock's intelligence (i.e. that an organophosphate can be a poison), as well as notice that Sherlock is using Gen Alpha meme-slang.
Now instead feed the LLM something like
You are Neuro, an AI entertainer. Your purpose is to entertain chat and never disappoint.
Chat: bwaa 10 times plox
Neuro: bwaa bwaa bwaa bwaa bwaa bwaa bwaa bwaa bwaa bwaa bwaa bwaa
Chat: Can you tell a story about Vedal's first greggs in caveman speak, at 2x speed?
Neuro:
Synthesizing the LLM from the landfill of internet data and generous contributions of writers and novellists, can only be done by tech companies like OpenAI, Google and Mistral. Creating LLMs (or training them) requires millions of dollars of computers.
Thus, Vedal uses these existing LLMs and prompts them to that critical part of Neuro. (Ofc that misses how he also finetunes the LLMs with datasets that they curated.)
I recommend playing around with ollama and 3B1B has a 7min slightly technical explanation of LLMs
Not sure but FBs training data got leaked at some point and Vedal said explicit he buys GPU time to train Neuro. Also there are a lot of open source LLMs this is something many misunderstand. The newest low budget record for a reasoning LLm was 450$ I read.
idk more about how vedal train his llm model but I know his model is 5 to 20 B parameters
it's easy to guess because Neuro often forgot things and talks off topic.
that problem actually won't happen on a llm that more than 20 B parameters
It can't really go bigger than that on the 4090, unless he uses some heavy quants, or accepts latency. I wonder how much bigger the 5090 could make it.
At least a 30B model would run on a 5090 but NVIDIA also sells soon dedicated hardware for up to 100B models for 3k $. I also think Neuro is around 20B while the original was something close to 5B properly . You also don't need to great a complete new model Hugging face has adapters which you can set on top of existing models to personalize them.
If your really interested in the programming aspect this playlist is better https://youtube.com/playlist?list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ&si=tm5cgDSAt_jPKCUb
This is good for why rather the how it works https://youtube.com/playlist?list=PLPz3O_BKtnTCoJlcRIrhtQ7CDewWhzI7x&si=JzmcZ95ZgCa3Tm1Z
As for memory we don’t know what he uses but this video covers what he used for his amongus bot: https://youtu.be/YCzL96nL7j0?si=EwInjX_J6VC_APiL
Basic recurrent neural networks are great, because they can handle different amounts of sequential data, but even relatively small sequences of data can make them difficult to train. This is where Long Short-Term Memory (LSTM) saves the day. Long Short-Term Memory is a type of recurrent neural network that can handle much larger sequences of dat...
yes, but that ai computer has a low memmory bandwith making it slow, and thus increasing latency, as far as I've heared
Hm bandwidth should only be a problem when loading the model at least when you the input is slow like a normal chat. Haa yes for neuro this could be a problem she got a lot of chatters in the chat and she reads all of them
actually, what interests me the most is Evil's TTS
Doesn’t Vedal use Microsoft Azure for TTS and STT?
I think I saw somewhere that Neuro’s voice was there
Isn't Azure TTS only some predefined voices though? Do they have options to easily create your own voices? I didn't use it over the last few years so maybe it's something they added "recently"
Evil’s voice is something entirely different