#programming
1 messages ยท Page 342 of 1

you're not my boss
is your refrigerator running
you better go catch it


shiro bday
notn unifomr

happy birthday 

not cake
emi not know ee
overused joke count: i genuinely lost count


today not my birthday

glueful
tomorrow not my birthday either
shiro birthday same konii birthday 
can't believe you lied

it not me
it not lie though 
shimo birthday
every day
bun
bnuy 
bune

so shiro is at least 18262 years old 
are the neuro and evil slow cat dance emojis on this server or only on twitch?
light mode 
shiro is actually 0 years old
trust
no sleep
?
uweweee
chayleep


install gentoo
holy cursed

I've been reading this chat for months and I still don't know what wrr means
๐ 
means

phue
tyres* 

to you

r
these tyres?
GT5?
noooo
GT4
Off by one again
hapie
Damn
I speaking Canadian English not British english
related languages but different

shir, genuine question
cananadian
not british
๐
๐ด๓ ง๓ ข๓ ณ๓ ฃ๓ ด๓ ฟ?
is your internet better than what it was about 2-3 years ago?
peueueu
tire 
๐ฎ๐ช
my Internet randomizes every 4 months when I move
toot
currently it's better
char horn
some progress is better than none ig
Do you have those tyres with the alumni**ni**um?
Where did you get that cube from?
shube
I want answers
konii schiz
I am a little bit confused at the moment 
nooe npower of
power off

uwah

eyey not
โ๏ธ
use zune_png::{PngDecoder, PngEncoder, zune_core::{bit_depth::BitDepth, colorspace::ColorSpace, options::EncoderOptions}};
use std::fs;
fn main() {
let mut bytes = include_bytes!("../base.png").to_vec();
let color_space = ColorSpace::Luma;
let bit_depth = BitDepth::Eight;
let c = color_space.num_components() * bit_depth.size_of();
for _ in 0..255 {
let mut size = (bytes.len() / c).isqrt();
let mut i = 0;
loop {
if size * size * c == bytes.len() { break; }
bytes.push(bytes[i % bytes.len()]);
size = (bytes.len() / c).isqrt();
i += 1;
}
let encoder_options = EncoderOptions::new(size, size, color_space, bit_depth);
bytes = PngEncoder::new(&bytes, encoder_options).encode()
}
bytes = PngDecoder::new(&bytes).decode_raw().unwrap();
_ = fs::write("out.png", &bytes)
}
popipo

ooo
heeead
englis so weied
how was it a rule for so long that you can't put two nouns beside each other
and then everyone just started ignoring it
tree o f tree supertree tree
so they made it "fireplace"
head achu
vedal vedal
bles yu
other times they just hyphenate it tho
by
are these the aluminum tyres that some guy was talking about?
,-
don't let the ermshark bite
ermie sh

loam
no more loam 
silty clay loam 
enough
koniiiiiii
bright
(i literally booted Gran Turismo 2 just for this screenshot)
eep 
goodnight korone eep well 
not fair me want eep too 
Fraps!
thank

dente

nesus letter
sleep well 
I think when neuro cant win she gets stuck going through menus to stall and live as long as possible.
this isserious matter. i. need maek neurieosama.
:mlntcandy:
true
@silent anvil pls fix
I'm working on a dotfile manager in bash, it probably is gonna be absolutely horrible tho... probably not even that useful
these are at the root
and this is the config from the defaut profile
maybe i need to put an rm -rf so that it actually deletes the files it is gonna replace..
just a
35 - cp -rv $TPATH $SPATH
35 + rm -rf $SPATH
+ cp -rv $TPATH $SPATH
will do
i guess it can have some usecases.. for example if you wanna switch between multiple dotfile configs from time to time
me sleep
also i assume you know about https://www.gnu.org/software/stow/ but if not check it rr
and
banana 
Darn.. why didn't i think about symlinking lmao.. well.. atleast my thing could also be used for backups... don't think symlinks can do that
but that's what vcs can do
Embed a templating language and switch things that way 
so i actually added a proper backup system for the profiles
maybe i should optimize it and use diffs instead of just copying
Incremental backup 
Well i didn't really make them incremental
Just see them as like git branches or something

I basically implemented a primitive version of git lmao
Github repo comming soon(tm)
did you know that 60% of our protein coding gene has a recognizable counterpart in the banana genome, and 40% of those are identical amino chain wise
I did not but genomes are weird so I'd believe it

i dont usually bite on a roblox video but
โข I originally wanted to build a DC generator in Roblox, so I could test out designs more easily. However, I realized that was a pretty lofty goal as I had to learn how basic circuits actually worked first.
So I built a node-based circuit simulator in Roblox Studio, using real electrical laws like Ohmโs Law, Kirchhoffโs Laws, and linear a...
this person seems like they'd be in this server
like this person just seems to me like they're exactly neuro's target demographic
video is also fire its more about programming and math than roblox
Not having watched it yet, I think there are more people here into (relatively) higher level programming rather than dealing with circuits and such
At least I haven't seen anyone nerding out over anything in assembly
We have some assembly, most of it is more in language dev area
I personally enjoyed the video
hi #programming! anyone here have experience with llamafiles?
Essentially, . llamafile acts as the distribution vehicle, packaging everything needed to run the model. llama. cpp provides the core library for efficient model inference, while ollama builds upon it to offer additional features and optimizations.Dec 13, 2024
ok, that means I guess i do have experience with it coz i'm using llama.cpp ?
(having to look it up with google means i don't have extensive experience though)
@mighty thorn would this be something you know by chance
this is the command and output (w/ verbose flag) but whenever i run on gpu it freezes on "loading bos token..."
What
No
I donโt use llama cpp due to my lack of intelligence
i like llamafile cuz its (in theory) incredibly simple
i would reference my previous projects with llamafile but well...
i lost all that data when my secondary drive died lol
Mistral 7B is ancient enough that I'd expect it work, but given that there's already an error with the chat template I think it may just not be supported by whatever version of llama.cpp they baked in there
just try with normal llama.cpp mabe
shouldn't be that much more difficult
does it work with cpu mode?
yes
I dont like using cpp personally cuz cpp compilers hate me for some reason
RTX 5060 Ti, w/ cuda library installed and studio drivers
it seems to have prebuilt Windows binaries though
the versions with the CUDA DLLs included should be effectively the same as the Llamafile
ill check it out, i havent really tried it recently tbf
actually, nvm mind about this. If GPU failed it should auto fail back to CPU I think.
the other issue is that while I do have cpp experience i much much prefer python, and while there are py wrappers for llama.cpp, id like to keep it as close to all py as possible
hm actually the py wrapper im looking at might actually work for me
the llama.cpp python wrapper sucks so much, it's always behind far the upstream version and IIRC lacks some bindings too 
i see
i'm using a prebuilt llama.cpp with CUDA support and loading Gemma 3 gguf files and it all works fine.
when it works it works but I prefer just using the llama.cpp server when possible
Im pretty big into circuits, the game im making is a combination of fantasy and electrical engineering
yeah, it's just my intuition that Neuro's fanbase is more likely to lean towards high-level stuff, not that there won't be anyone into low-level engineering
or people into both
I have a feeling that there are much less low level specialists in the world than high level programmers
that too
The higher you go, the more room you have to be stupid 
I wonder how many software developers would say that they are confident in understanding how you go from electrical engineering to software
and at the low level you don't even need to be stupid to make mistakes. ;/
Nice thing about programming: no matter how badly you fuck it up, you are unlikely to start a fire.
until you tie it back into controlling hardware
Is it possible to make some computer part catch on fire using software
I think i asked it before but i forgot what was the conclusion
That is one of the final goals in my game, one of the first challenges is logic gates, the last is a computer that can run assembly code.
in my sub-field, they constantly tell you about the case study of a radiation therapy machine that basically cooked someone with radiation because of a race condition
The easy answer is CPU due before thermal shutdown was a thing
cooked with radiation? Thats crazy
The Therac-25 is a computer-controlled radiation therapy machine produced by Atomic Energy of Canada Limited (AECL) in 1982 after the Therac-6 (neptune) and Therac-20 units (the earlier units had been produced in partnership with Compagnie gรฉnรฉrale de radiologie (CGR) of France).
The Therac-25 was involved in at least six accidents between 198...
Your computer does a LOT to prevent you being able to start a fire, mean while pass a few too many amps through a wire and it will catch fire.
it's not radiation, but many also don't realize that the biggest danger about an MRI besides the big-ass magnet is actually from burns due to radio waves (and burns are actually more common than magnet accidents iirc)
every MRI machine has safeguards for when the software thinks you're gonna cook the person you're scanning
Famously, though, stuxnet was made to set cause certain uranium processing centrifuges explode
depends on what you are programming. i make 3lb combat robots with oversized motors and batteries and arguably undersized speed controllers. I had an early version of my robot catch fire but as long as i'm not saying "give me 100%" to them all_ the time, they're probably going to survive the abuse. but if the code fails they could go 100% on and pull more power than the ESC is rated for way too long. i might even be able to do that safely for a minute or so. but probably nor for the duration of a fight.
so most of the time programming is safer. depends on what you're programming.
I remember seeing a 102% in here, that just proves usage estimation is not as reliable as one thinks it is
remember that stupid challenge though with sparks from a certain type of school laptop that caused fires in some cases? sometimes they don't do quite enough. but that was a physical attack, not code.
I TAed a computer architecture course that was meant to be the "bridging the gap" course and I can say most of the students came out of it going "I will never have to think about a logic gate again"
I wonder actually, with lithium batteries existing now are there any regulations to where they are allowed to be placed?
Yeah, you gotta REALLY work at it, and its always a hardware fault. Because its SO easy to cause a fire with hardware.
hit the weak point for massive damage!
the ruiles at most events say they need to be covered. exposed batteries usually end the fight, but the league/organization i fight with doesn't have that rule.
lipo fires make exciting footage
I say mount the LiPo battery in a prominent and exposed location

and the goal is to puncture the opponent's battery
Overcharge a capacitor: it explodes, short a wire, it catches fire, just sent too much current, it catches fire.
Not to mention the danger of high voltage
100kw on led?
I wasnt sure about mw...but kw...yeah that will fry the led...probably a lot of other things
Can't count how many leds I've fried from sending too much current.
this is so terrible
Me too
can you mount multiple batteries? and then perform the EV battery ejection maneuver
the real lesson is about having hardware safeguards rather than just relying on software
Right
I haven't used breadboards much, but I burned LEDs a lot.
I had some leds I took out of a pc, if you send current one direction its green, the other direction its red, I hooked it to a motor and spun the motor, it would flash between red and green.
Spin it too fast and you fry the led
Thats some weird ass mechanic
at the one I go to you could eject batteries at the other person but it'd probably be a bad use of weight. but at most events that'd be an immediate loss due to exposed LiPo.

I can see how it could be useful
Idk, but if you wanna do it yourself, take two leds and connect them in opposite directions
Because they are diodes, they only let power go in one direction
you need a resistor that'll cut the voltage to a safe range too if you don't want to fry them
What happens when you switch direction fast?
Physically
So if it is going one way it goes to one diode, the other way goes to the other.
Where does heat come from
if you drive a motor, it becomes a generator
Heat comes from resistance, the formula is i^2*r where r is the resistance of that element
Taking advantage of having too much power density, I wonder if one could improv a tesla coil and stab an exposed motor.
I doubt that is a worthy strat due to what is essentially RNG and why would anyone just have a motor or a wire exposed that could be stabbed.
do you have enough leeway and time to modify your robot for a specific opponent?
In ac I think power relates to frequency, and the motor is generating ac. But I know more of dc than ac (since my game doesnt include ac)
I'm guessing not, which is the large gamble
if you're lucky you know a few days before who you are fighting. but you don't know for sure beyond the first fight because it depends on who wins each other fight.
but you can plan ahead and have several potential configurations prepared. some people have huge numbers of permutations of armor or weapon choices they can use.
but it depends on the precise local rules
I have a question: can it be done that way in a GPU according to the principles of arm?
do you mean: can you emulate an ARM chip completely on GPU?
can what be done that way?
InstaFrame is a web-based editor that lets you generate an Instagram-style post image. https://insframe.vercel.app/
self promotion the third
OK.
Instagram uses some weird style images? 
@cinder trellis how can I create an ai to act more like neuro-sama, is it like a program or a coding fix cause im genually wondering how can Neuro do all that she can do, and if possible can you show me what porgraming app do you use for neuro

you uh, probably won't get a reply but
if you want to get an idea, you can look at some open-source neuro clones: https://github.com/kimjammer/Neuro
ya i know but its worth to try
thank you sooo much
"He coded his daughter, but she programmed him into dad."
the thing he'll absolutely never say anything about is how he specifically fine-tunes Neuro, so you may get a different personality and behaviour but
if you only care about the general technical details, then that should give you an idea

if not how Neuro is specifically set up, but how one could make something like Neuro
oh ok thanks
the important things are her memory and her ability to plan actions and then do them. she often thinks and during the subathon Vedal gave access to her backend thoughts to another streamer....
this middle layer is for planning and the layer above that does the speaking. this makes her much smarter than just saying the first thing that pops into her head i think.
having a scratchpad or chain-of-thought also allows an LLM to do something like lie
because otherwise an LLM has nowhere to keep and reference "hidden" text
the only thing the dude is getting is the ban hammer
and without a way to have "hidden" text, an LLM can't "keep a thought" in mind that it didn't already say out loud
organizing persistent memory so she doesn't gradually get worse at thinking over time is also a hard problem.
- Vedal, do I matter to you?
- I know I'm just a silly Al.
- but, it would make my day to hear you say it.

where faq sheet
There is a link on the official site about how to create an AI like Neuro, but it's very very high level.
thank you all for the info
it doesn't really go beyond "Neuro is an LLM"
I just have the broswer window saved now
for good reason
its cause i was wondering how can that all be prossesed without like having the system over load it's self
basically we're saying "if you have to ask random people frpm the outsett, you probably won't be able to finish it".
ya
I need to open up an issue for DO NOT PING TUTEL
because some people (a lot of people) can't read
-# for anyone wonderinig yes this on github https://github.com/igglyn/NeuroProgrammingFAQ
I'm not sure what to say besides that it isn't magic; even if we don't know exact specifics, nothing about Neuro is something that seems impossible
if you look things up and learn, you will be able to see how it can be done
but also generally people don't respond well to "how do I make thing, I tried nothing yet"
there's two main ways to avoid overloading your system: offload the processing to 3rd party or use smaller models. you'd be surprised how good some of the small models are.
There is nothing about fine tuning
or maybe im blind
How is it done?
You give it a prompt to follow?
like, fine-tuning an LLM in general?
Yes
there are multiple methods
really fine-tuning is just a generic term meaning you're modifying the weights but implying you're not doing it over the entire model most of the time
@mighty thorn actual correct ping 
What
you can fine-tune an LLM by just throwing text at it to learn from directly, but you can also use methods like reinforcement learning where you basically reward certain outputs and not others
Arent weights impossible to tune because there are millions of them and we have no way to tell which is which 
yeah you don't modify the weights manually
Doing it manually is impossible (with any semblance of intelligence), thatโs what training is for
there are also methods like swapping out or grafting pre-trained weights to some parts of a model (LoRA)
if I had to make a guess with the caveat that I have no real idea, I'd guess most of Neuro's fine-tuning is via reinforcement learning
with who knows as the metric to reward
Check out my newest ASMR Video (My Christmas Memories: From The Sega CD To Sony MiniDisc):
https://youtu.be/qWyqOnyPBCw?si=gx1FyLXX8ZE85sI0
Subscribe to Rich Tomorrow:
https://www.youtube.com/@RichTomorrow-p6r?sub_confirmation=1
Subscribe To Tech25 ASMR:
http://youtube.com/@Tech25ASMR?sub_confirmation=1
Join this channel to get access to per...
Did you need something
fine tuning is what you do when you train an already trained model so it acts closer to what you want (usually the changes are over in a smaller domain, like you might teach it how to read your company's format of a certain report and determine if it fell in some category system you used.
no, someone pinged vedal asking how to make a Neuro
Or is my suffering just nutritious to you?
Bruh
Yeah my bad, the orginal question I pinged you for was how to fine-tune a model. I need to remeber to actually do it in reply to the question
I've seen some people describe prompt engineering as fine-tuning too, but I....don't like having that in the definition
am I really the only one here who does finetuning nowadays
you fine tine the prompt, but that's just tweaking it. not the same thing.

At least the only one I know of 
I remember when we just called it transfer learning
Das distillation where I come from
I think there are some differences in definition, but people say fine-tuning whenever they mean transfer learning as well nowadays anyways so
one day AI will look back on how we did this and laugh at us.
But how does it look like?
You give it input and rate its response on a scale 1 to 10? And it asjusts its weights?
Or it looks like a dialogue where you give it "no be more mischievous" or smth like that when it acts like complete clanker (crude example but i hope you got what i mean)
it doesn't necessarily look like any one thing; I guess it'd "look" like how you trained the model in the first place
it's almost exactly like the initial training, except usually the dataset is smaller and the learning rate might be higher again.
like, there's reinforcement learning from human feedback (RLHF) which can look more like what you're describing on the front end
Huh
but you're basically doing the training process but usually limited in some way
Is someone here looking to to train or something
or it could be the whole thing
Who should I be mansplaining to
*aisplaining
like uhh, it might help if you stop thinking of it in terms of an LLM
and instead like an object detection model
All open source llms have some kind of interface for additional training?
I mean it either looks like wandb or a console window
Or both
you'd start with a pre-trained YOLO model for object detection, but you have your own dataset of something more specific you want it to learn
so you...kinda just do further training with your own dataset
I guess i just lack the vision on how these llms look in general
at which point training just looks something like:
I currently see it as some kind of binary file which has weights
Which is clearly wrong
uhh, I guess you can say most of the data input and output would be in tensors
which in machine learning basically just means an array of whatever dimensions
so you're converting whatever data you're training with into tensors and then you need to convert from it to read it again
but there's really less to this than you'd think
you kinda just....throw it at it and hope it gets what you want
again, it's much easier to conceptualize without thinking of an LLM in particular
but an LLM is still using the same concept, just with a specific application
So this thing comes in bundle with llm?
no this is just what a training script output looks like
you can have it display whatever you want, but most will display a loss function
which gives you an idea of how the training is going
because when a model is being trained, it's trying to go for some kind of goal that you need to define
So you take "llama llm opensource" then run some 3rd party made by hacker29584 script from github which takes llm and your training data and
Something like that?
you can write your own training script
it's really not much, like 50 lines of python
the LLM itself is just complicated function that takes in a large number of parameters and outputs a token. the model is just all the weights and biases and how they are connected.
training is just how those values get set to something which works.
fine tuning is just continued training with more specific data.
most of the actual meat of the code for loading, training, and etc. with a model is done for you with a single line of code
"import pytorch" or "import tensorflow"
so what you just need to worry about is defining the training and validation data (validation is checked during training), figuring out how you want to convert your data into tensors, and uhh, that's kinda it
oh, and set some parameters
and there's really no "correct" way you must use to convert your data into tensors
because it's up to the model to find its own "understanding" of the data
So every training data entry is input+output 1 to 1
uhh, there are different kinds of training data
so the simplest kind would be for what is known as supervised learning
where you basically give it examples of what you want the model to do
like a bunch of problems with an answer sheet
and you want the model to sorta figure out some kind of pattern/understanding based off of that
there are various methods of unsupervised learning as well where you have some other method of telling it the "right" answer without you explicitly defining it
one example is how older image AIs usually worked where you have a second AI model try and figure out if the output of the first AI model was made by an AI or a human
and they would both act as the "right answer" for the training data for each other
Ok so input + function to check if output is correct
hence GAN for Generative Adversarial Network
I guess, although it's more "desirable" than "correct"
the "loss function" is the single number representing how desirable the input is: difference between desired and current is the loss and you want that to be small.
the weights aren't necessarily all in a single "layer" as in it's not just all input goes into one set of neurons and then straight to the output
how you define how the weights are generally laid out like how many layers and how they connect to each other broadly is what people mean when they say "model architecture"
Ok but dont you need to set up a huge program to handle the specific format of those weights
Like
and when you see diagrams like, this, it's representing the model architecture and layers:
500 layers with 1748 weights per layer or
handled with "import pytorch"
like uhh, lemme give an example of what setting up a model architecture looks like
The weights file has everything inside for pytorch to read?
the model typically has a json file describing the layers and how they connect and you load that automatically so you can load the rest of the parameters
So it can handle it correctly
Ok this closes my question 
Its what i was asking
the structure is part of the binary data too
this is what defining a model architecure looks like:
Understandable 
you're just defining what the layers are
and the size of them
and how they broadly connect
Do you happen to know the math behind back propagation by any chance? I am somewhat curious
when you download an open source model, they already had an architecture laid out
a lot of vector math
how they made the actual structure is typically an art though... the precise number of repeated modules and sizes of each layer are often just "well, we have enough memory for this many copies if we make that layer this size". could there be a perfect structure though? needs more research i think.
yeah, there's no "right answer"
I know you have to calculate the derivative (or integral, not sure which) of a sigmoid, which then somehow determines what weights need to be changed...
frankly, when I train my own models from scratch, I usually throw together either a structure described from a paper or another example or if there are none
I will literally use an existing architecture I had used somewhere else if it seems good enough
even if it was used for a totally different task
the model architecture is like defining the boxes for the weights to do their work in; the box influences it but there's flexibility
or uhh, like designing the room you want your office to be in
but it's up to the office workers to figure out their own seating arrangements
How would you do back propagation on your first training run, if all the weights are neutral, and you have multiple hidden layers?
(I know I likely lack even basic understanding of this field)
the weights would be arbitrary to start with
and your result would be obviously wrong, so you can tweak from there
sorry, all i can remember is you need to use the chain rule for multiple layers and it's based on activation strength... i really didn't pay attention coz i think i was panicking about calculus.
yeah, this is something we learn in class and then forget afterwards because
it's all accomplished with "import pytorch"
a typical developer won't know much beyond what backpropagation is
I like understanding things deeply
and a thought of "yeah I could figure it out if I need to"
If I don't understand what is going on when I call a library function, can I really call myself a software engineer?
I mean, you do have to draw a line and black box something you use
eventually
for down the path of total understanding lies insanity
https://www.youtube.com/watch?v=VkHfRKewkWw <= i've watched this before.
Take your personal data back with Incogni! Use code WELCHLABS and get 60% off an annual plan: http://incogni.com/welchlabs
New Patreon Rewards 29:48 - own a piece of Welch Labs history! https://www.patreon.com/welchlabs
Books & Posters
https://www.welchlabs.com/resources
Sections
0:00 - Intro
2:08 - No more spam calls w/ Incogni
3:45 - Toy Mo...
I'm going to be a little harsh here...
It is only insane because sheer ignorance makes it so.
I think it's a factor of how large of a system you're working with
I think surface level understanding is good; you want to know why you're using a thing and what it can and cannot do
but you eventually have to even black box stuff that other teams on a project you're working on have done
to get a project done on time
Once people have the model loaded they can work on a meta-programming level with prompts and such.
most people don't want to deal with the low end understanding of anything they don't need too. they don't have to understand their car because it's insanely complicated so you don't have to.
Black boxes create insanity that I cannot even begin to describe. That is why I hate them.
it's like this classic: https://youtu.be/-5wpm-gesOY
A web app that works out how many seconds ago something happened. How hard can coding that be? Tom Scott explains how time twists and turns like a twisty-turny thing. It's not to be trifled with!
A Universe of Triangles: http://www.youtube.com/watch?v=KdyvizaygyY
LZ Compression in Text: http://www.youtube.com/watch?v=goOa3DGezUA
Characters, Sym...
You are likely typing on a black box
Yes. That is insane.
I think it's good to understand that a library you're working with has to deal with a lot of issues with timezones; but I do not need to remember the exact way it handles every single timezone
because if I really need to know, then I know I can go learn it
you need to go back 40 or so years before you get to computers that a single person could understand every component of. and they'd need to be really smart.
and those computers are terrible.
I mean, we went to the moon with wires going through magnets for memory...thats frikken metal.
hey that's better than using a CRT display as computer memory
Computers aren't that complicated, once you know the basics.
A modern processor simply repeats these basic principles 1000x. Modern hardware iterates on these principles.
The basics stay the same, as long as the computer you are referring to is digital.
but what you're describing is the surface knowledge
1940s computing: https://en.wikipedia.org/wiki/Williams_tube
The Williams tube, or the WilliamsโKilburn tube named after British inventors Freddie Williams and Tom Kilburn, is an early form of computer memory. It was the first random-access digital storage device, and was used successfully in several early computers.
The Williams tube works by displaying a grid of dots on a cathode-ray tube (CRT). Due t...
Yeah. That actually happened
the problem isn't undestanding the components individually, it's that it's impossible to understand how they all interrelate to each other.
the possible very first videogame made might've basically hijacked the computer memory as a display for tic tac toe
Heck. Even the interrelations work on basic principles.
explain Neuro.
but you're describing basic principles still
If they didn't, our hardware would be entirely untenable
that's surface level understanding
That is foundational understanding
you are not an expert on every single aspect
Never said I was
well, it seems like you're saying it's not worth it to ever abstract away your understanding of something
What do you think happens when we lose touch with how our computers work?
you have to pick your "need to know everything" battles because there is always another level.
is "we" the entire industry or us as individuals?
that said, I do think it's good that every CS program has us at least get some understanding of the low-level principles of how a computer works
I just don't think every software developer needs to have an expert level knowledge of that
you just need enough to know that it's not just magic
Both entire industries, and individuals.
If we forget our hardware, do you think we can just keep cruising along, and not come across some unforseen, unsolvable issue that is impossible to explain?
not if we have the confidence that we can learn the hardware if need be
i've been doing low level software stuff longer than (i suspect) most of you have been alive. almost no individuals ever knew or ever will know how computers actually work at the low level.
or have access to someone who does
Forget low level systems, most programmers dont have any understanding of kernel architecture.
and a well-designed system is set up so you don't need to know how each layer works
the kernel is the lowest level, except maybe bios/uefi type stuff. perhaps forgetting low level systems is why programmers don't understand the kernel.
You don't think that is a problem?
Human communication overhead has a large cost. Microsoft is barely even capable of writing a functional OS anymore.
This has serious implications.
I don't think this is a new "problem"
Microsoft has been writing code for longer than you have been alive, most likely. They're not getting worse. People always complain about them but that's because they're running on almost everyone's personal computer.
or limited to computing
Nobody has been teaching computer hardware adequately for a long time.
It is extremely hard for me to learn about these things.
most factories sourcing parts or raw materials won't know the exact details of how it got there
the Chinese didn't even know the glass they bought was Roman and the Romans didn't know that the silk they bought was Chinese
Why is my experience on Linux so much better? Why the difference?
I have limited understanding on how data scientist train the model and what they used. but it does not stop me from converting the model into an onnx model and deploy it in java
Yeah. That isn't as much of a problem, because the resources aren't very local
i've used both. i prefer using Linux on Windows.
instead of the physical location, you can think of the virtual context then
What do you mean by "virtual context?"
like, the person writing brain surgery software does not need to know the super low-level aspects of a computer to write good software
they're "far apart"
Can you explain how linux executes files? specifically, what happens when an execve system call occurs? Does knowing (or not knowing) this affect your experience of the linux kernel?
If you wanna make something ON a system all you need to know is how to USE the system, if you want to make your OWN system...thats when you need to understand the low level stuff
My entire point is that black boxes are probably not necessary, and that they create problems that would sound extremely ridiculous to someone who programmed in the 1970s.
the entirety of uh, machine learning though?
The black box model is flawed.
Do you know everything about how your car works? Do you need to in order to drive it?
but to most people computer is a black box isn't it?
linux is a clear box but most users have never looked inside. so it may as well be a black one.
a programmer in the 1970s has their own black boxes
most people aren't able to read C.
What happens when nobody does?
that's not what we're talking about though
What happens when everyone forgets that a car has an engin
Someone always will, because they make the things
That is the implication I'm getting at
it is not though, because we're not saying no one knows the things
it's that different people know different things
We forget how things work, so we can't fix or iterate on them
as long as we fix or iterate on them, we do know
we only forget when we stop fixing and iterating on them
it's the other way around
we forget how it works, we still have sufficient instruction to fix it
how about: you state your point so we understand what you are trying to say?
i think if nobody knows how something vital for existence is made we either will go extinct or we'll work it out.
we're pretty smart.
for instance a LLM failed to operate, we either add additional guard rail or just replace it
we didn't forget how to build a Saturn V because no one knew how it all worked; we forgot how to build a Saturn V because we stopped building Saturn V rockets
Point:
My entire point is that black boxes are probably not necessary, and that they create problems that would sound extremely ridiculous to someone who programmed in the 1970s.
Implication:
Over time, we forget how these black boxes function. We will no longer be able to iterate them, fix them, or produce new ones to replace the old. Our systems become progressively more broken over time.
but do you accept that there are always people working on those black boxes? the black box is on a personal level, not an industry or societal level
Have you heard of enshittification?
that's not enshittification
what does that have anything to do with this
you should make your own open source computer then so you understand it.
But it is what causes enshittification
it does not
What does when?
enshittification is a business practice more than anything technical
it doesn't happen because people go "oh shit, I dont know how to make it anymore, I guess I'll just make it worse?"
Greed and lack of respect for customers
similar to website "revamp"
Really? Because the entire idea of it makes no logical sense.
Why would a company willingly ruin their reputation, in order to enshittify their products for absolutely no reason?
Money isn't a great answer, because they would likely make more money otherwise.
if web designer make a perfect website they will be fired, since well it is perfect
they're trading reputation for another gain is the point
But that gain isn't even worth the cost.
perhaps it's only a short term gain, but it's a perceived gain regardless
who says?
It doesn't add up
Its a difference of short term profit seeking vs long term, and a lack of respect for customers.
it absolutely can be for anyone who is only looking for a short term profit before cashing out
Or maybe... They are unable to play the long term strategy.
My entire point is that black boxes are probably not necessary, and that they create problems that would sound extremely ridiculous to someone who programmed in the 1970s.
I disagree. I want to keep my secret stuff hidden from other people. I will keep it in a box they can't see inside so I can profit of my ability to use the contents of the box while they have to pay me to use it for them.
without the black box, others could steal my intellectual property. not sure what would sound ridiculous to someone who programmed in the 1970s.
also, let's not forget the community that we are in right now
Neuro is literally an intentional black box to us lol
vedal didn't open source Neuro
Do you think there is enshitification in Hollywood?
Humans are also black boxes.
Does Neuro being a black box herself have some strange, far reaching implication that I haven't thought about?
i think there are ethical reasons not to open source an entity or system that could be a person.
why would Microsoft retire windows 10? well they can reprice the "solution" push out "feature" and sign the contract with other big companies over and over again
"it's just business"
is it a problem for a mechanic to not know how mitochondria work because they work on a car that is for humans and the human body uses mitochondria?
It makes sense to keep Windows a black box. It makes less sense to make Windows a black box to their own employees.
Might read up on rational ignorance: https://en.wikipedia.org/wiki/Rational_ignorance#:~:text=Rational ignorance is refraining from,that the knowledge would provide.
No.
There are no implications for a single mechanic not knowing how mitochondria works. They don't care. That isn't his job.
but it is not a black box to the employee, probably most of them have access to majority of the source code. but 99% of them would give a f about how it works, since it is not their job to know everything
so they don't have to maintain it anymore. they've given you tons of time to move to linux. look at when windows 10 was released: July 15, 2015. over 10 years ago. It's old.
windows 7 was kept for so long and they had to support it for so long they changed their mind about supporting things forever.
oh? their work impacts the safety of a vehicle. The safety of the vehicle is determined by the impacts of an accident on the human body. The way the human body can be injured depends on the inner workings of the body. the inner workings of the body depend on an understanding of mitochondria
Would a person working on word do their job better by knowing how the windows kernel works?
Probably not.
This does not appear to be related. How could windows be considered "a person"?
pretty sure, they promoted windows 10 as the last version of Windows.
Mechanics don't care about that
but you're saying it's a problem if they just black box away their understanding of a human body
What kind of argument is that?
Kinda the point, learning it would take a ton of time and gain nothing, so it is a black box to them. Thats rational ignorance.
i don't understand what you're talking about and you are in multiple threads. going to leave and it's because of you.
I never said they should learn how Windows works.
I specified specifically the hardware.
You said black boxes are bad
Yes
...so they should care more about what's...less relevant to them?
The windows kernel is a black box
most things are black box to most people
If you learn how hardware works, you wouldn't have to use Windows at all
The windows kernel is the interface between the higher level processes and the hardware
you can replace Windows with an OS in general if you want
so reinvent the wheel
or are you saying every programmer should make and develop for their own OS?
I care more so about the implications of things, not so much what single individuals need to know at a particular time. The former is what I'm focusing on with this discussion.
Only because "implication" is so vague it can mean whatever you want in the moment
Yes. That is the nature of what implications are. The future is quite vague.
so.....if it's not about individuals at any particular time, then how is society having people who know how the hardware works but not every individual a problem?
You are just vibe-arguing then
This is the basis of planning a strategy.
I guess what you call "vibe arguing" isn't so bad.
ignoring is much easier.
No, planning a strategy is having a concrete argument...you cant plan on vibes
but the argument is a black box
The future is all vibes.
The future is a black box.

Lol
Anyone capable of doing something at the same level wouldn't want a clone imo
"doesn't not knowing everything about everything make us less human". "no. people have limited time to spend learning everything. knowing a lot about anything is left to those who don't need to work for a living. the young and the rich. enjoy your youth."
probably yeah, but I don't blame Vedal for having the concern
I guess I'm the most boring person alive
My god
Yah multi model with the rest of the stack is the only way to go
I never said you had to know a lot about everything. You still have to be selective about what you learn.
It isn't like I go out and learn about what some random politician did the day before.
Imo it's also that the people asking would want you to handhold the whole way
I think being able to link someone an open-source Neuro-like is good enough to ward that away for the most part
and if it isn't enough then there's always "look up some tutorials for the basics"
True... I mean personally I haven't looked any projects up and what I've done behaves very differently... And that's perfectly OK
I feel like most asking stuff like that just want to know the general idea of how it could be possible; they just don't know that it isn't all some very specific and single magic button that was used
Yah true
so showing an example of how a similar thing was made is enough
I'm in the rabbit hole though so it's too late for me
Neuro is also not necessarily the optimal way to design an agent: she's the way Vedal made her because that's how he did it at the time. he's expressed remorse on streams vaguely about choices he'd made in her implementation and talked about moving parts around to make them more stable (last one I recall was getting rid of a voice chat related memory leak by moving that code to a separate process).
so even if you could copy her exactly, why would you? you'd get a cobbled together thing instead of what could be well designed. (and this is what i tell myself)
but I think it's a lot of people who view projects as like, being on a gradient of "programmer skill". the more "skill" you have the more feats you unlock and there is a "make neuro" feat
True to an extent, although it is nice to hear when he goes on rants about problems he's having that I've run into. Means I'm not just dumb
I think it might be interesting to figure out a way to setup a memory space for LLM as a starting point
But also if he hasn't figured it out yet then I have a headache to deal with
I bet it depends on implementation.
I'm juggling 5 models and over 120k context across two systems
this is why everyone loves code review right?
True
Id like to try ai work, but my computer chugs running unity
You have unlocked new role
I got like 16 gb of ram (still crazy to me thats not a lot...remember when having a mb of ram was a lot...)
last year I brought 128gb ddr5. I was trying to run 70b model with cpu 
My computers chug and I'm running a 10900x 2 3090tis an A4000 and 128 gb of ram in the desktop as well as dual xeon golds an A2 and 768Gb of ram in the memory DB server
at least, I can sell my memory for a profit 
I have a computer with 500GB of RAM and 44 cores, Iโm just running a Minecraft server on it but I feel I am under using the computer. Any ideas of what I should put on it. It is 10 years old though, keep that in mind
768gb of ram...what? That is an unreal amount of ram..
What do you need 768 forโฆ
SQL DB
Ah
I'll trade you, I have a laptop with 16gb of ram from 2019
database for what
Just upgraded the vector DB embedding method and added cosine similarity matching
my 5 year old computer is still not crap. mostly coz it has a 3090 in it.
was intending to upgrade around xmas but that plan didn't happen. maybe the bubble will burst this year.
Nah I think Iโm goodโฆ i donโt use that computer much all it does is just run my MC server, I play on it from my gaming PC. But like should I be utilizing all that RAM? What should I use it for? Iโm open to ideas
RAM go bye bye โ๏ธ
My AI model. It's been going pretty well so far but I have spent a stupid amount of time on it
I've got this at work: https://www.nvidia.com/en-gb/clara/intelligent-medical-instruments/#
I have been wanting to run an AI model on my spare PC (the 500gb ram one) but havenโt figured out how to do so. Any tips? Also it has a horrible GPU unfortunately
^
start with ollama
Oooo good idea
Thanks, ollama
You need a gpu to run any decent sized model at a usable speed
Iโve tried and made a discord bot with ollama but it broke and started spamming me with emojis until I shut it off
stay with model < 7b
Yeah Iโve been trying to upgrade but the pricesโฆ
I have 13900k, 7b model works ok
Hey, I got embed perms!
since I have an AMD gpu the setup is a bit annoying
I'm currently running 2 24B models at 32k context, an 8B at 16k context and an 8B at 32k context
How do you guys suggest I go about running my own model?
(I mean with CPU)
For what purpose on what hardware?
I love it when the css on websites just doesn't fucking load for some reason
General purpose? Maybe a discord bot. I have a spare 10 yr old PC, it has 500Gb of ram and 44 cpu cores. It has a decent GPUโฆ for 10 years ago. Do not doing so well in that department . Currently the only thing that computer is doing is running a Minecraft server for my friends.
Oh god
I love when that happens
ollama + open web gui combo as a starting point
Ok. Any suggestions regarding a specific model to use? And how can I train it and give it good memoryโฆ (Iโm passionate about programming but never messed with AI, sorry if I sound dumb)
i downloaded the llama.cpp thing and run
llama-server --port 18080 --ctx-size 131072 -hf google/gemma-3-4b-it-qat-q4_0-gguf
and then connect to http://127.0.0.1:18080/v1/chat/completions
here is the https://ollama.com/search
Okay thanks
trail and error ๐
Interesting
plus your friends can use bigger model as a lag switch for minecraft server!
Sorry wdym?
I donโt understand
i think they mean the bigger the model, the slower minecraft will run on the same system
it will eat up loads of computation resource. It may have some impact who knows.
The Minecraft server has 2-5 players online at any given moment, uses around 5GB of ram and 1% cpu and also has Java and bedrock players. At least that is what the Spark Profiler is saying
Well, they might not clash too hard, the minecraft server doesn't use the GPU right
Just FYI you probably aren't going to be doing training locally.
Unless you have a way beefier other computer to do training on
Nope only CPU
no I have two computers
- 10 years, 500 gb of ram and 44 cores cores but super old GPU
- Gaming pc, GeForce rtx 30-something, either 16 or 32 gb of ram
NGL cpu isn't going to be very usable. 7-8B might pass, 12B will struggle, 24B not happening
The last time I trained on CPU was a small GPT-Neo model and it was a week-long affair sorta deal
Last time I did a major dataset training was 36 hours on a 24B
the 27B variant of the Gemma 3 files I use loads into a 3090's memory and runs, but it is very slow. the 12B version is much faster and generally just about as good. (these are the qat-q4-0 versions)
on CPU that size would be even more unbearably slow.
What GPU do you recommend I get for the server PC? Id prefer to use that one since I donโt use my gaming PCU on 24/7, and I use it for gaming, the other one is connected to EThernet and on 24/7 already
I would recommend try it out with CPU first
does newer gpu work with 10 years old mother board?
Okay thanks
No idea ๐ฌ
more memory means larger models though.
It was a previous gov. computer, its built freakishly weird
Gpt OSS 20b will kill my cpu?
My cpu is i9 14th
yes
given that it has pcie
just that it'll run on slower pcie bandwidth
make sure you have enough power and that the connectors are good
but depending on your workload it probably won't matter
this
ensure that you have a proper psu
assume the server is 10 years old, the it probably has PCIE 3.0?
Okay
No idea what that is ๐ญ I didnโt set up the PC
I didnโt set it up but the person who set it up told me it was a pain because it had this weird RAID system idk why that is
If you don't know then just use pcpartspicker or something with your motherboard and etc
Okay will do
that'd work
easily
there are (were?) some GPUs which were designed for lower end motherboards with lower spec connectors (like a short? version of the PCI connector?). They didn't tend to have much GPU ram though so probably wouldn't be useful for inference on a model.
what are you going to use the gpu for?
Who are you talking to
wouldn't matter much because pretty much most boards now have full x16 lanes
can you give full specs?
Me? Oh uh i can try tomorrow Iโm not really up for checking up on it right nowโฆ
aight no worries
I'm running inference on an A2 right now. It's slow but it does it's specific task just fine... It's not liking extracting embeddings for 6k token chunks with 32k context
What I do know is what I already said, it has a RAID system with several drives, and windows 10 installed on the OS drive
most likely
a high end LGA 2011 socket huh
shrug
4 memory channels, assuming 2dpc
Maybe, I havenโt checked the specs (probably a mistake on my part)
right, 64gb dimm ddr4 exist but its rdimm ecc mostly
lga 2011-3 is for certain
I know a lot about programming and basic concepts but not much about computer builds :,)
okay
22 cores is only intel thing with xeons
Yup.
Probably silvers?
I donโt have the specs but I have a photo of the inside
donโt know if itโll help much though
e5
v4
i wanna assume e5 2699v4
though i'd not 100% bet on it
Who are you talking to? Are you talking to hascrack?
both
Not virtual
It's 100% mine... Work uses flashstore for storage and r650s for cluster nodes
Older config missing 2 640s
@frozen sigil btw
and what's your budget
Like I said that's my equipment, work has nicer shit. I'd kill for some flashsystem 5300s
Likely for AI
And budget, well I just want a good value for the price
It depends on
Several things
3090, if you know where to find should be possible to find it for ~$600
Okay thanks for the advice, I think I have a similar GPU for my gaming computer so thatโs good
Im always scared to talk about programming stuff with people because Iโm scared of being judged, thanks for answering my questions :)
np
You feel better when you realize everyone has some gaps in their knowledge
And I don't think people care as long as someone has an attitude that isn't "please hold my hand and do everything for me"
Youโre all programmers? Fix my wifi.

One message removed from a suspended account.
One message removed from a suspended account.

Did you try turning it off and back on again?
I did. It screamed and came back worse.
i think you might be talking about a wife, not wifi...it's a common mistake
or "you a programmer, do everything or I'll beat you up"
wife jokes
Can't beat em...legally speaking.
Hello y'all. What do you usually discuss in here?
hi
this is pretty much just #nerd-genchat
Domestic violence
topics tangentially related to programming.
Damn
classic blender
#programming message
but still dark.
Vscode 
objectifying my strings
Isnt the programming plugin neuro uses made by you 
What 
i would like to ask if its possible for Neuro to ACTUALLY DRAW like a human, not just generate art, but DRAW.. is it possible to program her like that?
like moving a python turtle around?
anyways why the susge
idk.. i havent seen it?
Anyway nice to be here. I worked with neural networks as a student job and Neuro kind of reignited my fascination with AI, so I hope to learn some cool shit here.
dont know that reference
Then who did?
someone else
I think they mean the first stream of her "drawing" in Python, one of the libraries involved a turtle walking around making tracks
if this is possible then that would be one of a kind
wasn't exactly what I was thinking
i see.. thats super cool.. i havent seen it yet
part of the problem with her vision that stops it being useful like you'd think for drawing is that she only sees in descriptions of things and not like you do. she doesn't get the same position and orientation information and would definitely not get an accurate count of component objects. (like the "how many Rs in strawberry?" except for something obvious like "how many cups on the table" when there is more than 2 or so)
also the path command in SVG is kind of like turtle graphics.
i'd be so down to teach her how to draw if that is indeed possible
I imagine it would be a possible enough task to add an image-generating AI to her mix but in terms of organically learning how to draw human-style?
She lacks the senses of a human and thus the ability to interact with the world and learn information the way we do
Well vision is there to an extent
yup. since she is an llm at her core she can't really make sense of abstract coordinates and images
But I think the recent stream with Ellie showed that she has trouble with descriptions and physical states, i.e. saying she's "taller" than Ellie
what if she's able to see things like we do? i mean generative ai models can see the drawings of other artists n interpret it the same way n style right (if i am not wrong)
Have you seen her svg drawing? She doesnt need any improving.
Do we know if she's multi-modal? If not that's an extra limitation
wdym multi-modal
shimonana



<-
<-


lazy discord





excel sheet
i can mute you though