#programming
1 messages Β· Page 1 of 1 (latest)
https://rxresu.me/ I like this for realitively straight forward resume building at lest
A free and open-source resume builder that simplifies the process of creating, updating, and sharing your resume.
Yeah its crap but I do at least care about some of the work some of the places I'm applying for, renewable energy, medical devices, financial regulation speak to me at least
Also the fact I got over half of the members
true
employee management for a medium sized entertainment studio
I did have smoene reach out to me on linked in but they wanted a Senior DBA which firs tof all no interest, and also I'm barely more than a juniior DBA
ExtraordinarilySoVerySilly
I did legit play that role, not wrong at all
If you want to spin more crap you can talk about your time on discord as "community engagement" and "tutoring in LLM and AI usage"
I get contacted by web3, NFT and fintech companies all the time because my focus is Rust and apparently they all use Rust lol. It's so annoying.
the money range is insane, but my morals tell me not to
I am noting all of these for later
Meanwhile me: I managed converting a bit of audio to a different bit of audio
And a few other audio things
i would 100% bag that money ngl. sorry not sorry i need a 2nd 3090
Pay scales here are like 70-90 for eng, 90-150 for mining work, 250k+ for quant trdaing
for entry level roles
Game Jam - Slave artist
- Drew 18 full illustrations in 72 hours under pressure
- Tuned vocoder vocals with industry standard Dreamtronic software for the creation of a full original song
- Tuned machine learning models for realistic text to speech voice synthesis with Pytorch
- Achieved first place in the audio category among hundreds of other teams
yeah I just can't. It would eat me up knowing that products I help develop are used to financially ruin others
Good on you then
Everyone has their price, I just hope mine is out of their range
my price is anything above 7.7$/hr
Next game jam though, I'm gonna be using a custom-trained vocal synthesis model to help create a new original song
That's gonna be fun
Not linkedinified enough
Not hired
Isn't the singing built on vocaloid technology
I have no clue what that means
means more people go to the art slavery
Too humble, not buzzwordy enough not going to get hired
Making it seem like the achievement of all time
No it is not
Vocaloid is not even AI
We'll need to grab artists (that are not Shadow)
Whar
Well, certainly not me, I can't art
yep, relieve some of the rpesure
I can't art well enough to main
I got a bit of time to practice but who knows if I will end up doing so
im probably gonna be too busy doing 3D modeling and engine stuff
Next time we are not doing that many cutscenes
I really hope we'll be able to get NeuroSynth up to par with Neuro RVC in time for the game jam, it'd be such a great showcase of the new tech
"- Developed an instrumental piece of Software challenging state of the art singing voice synthesis tools in precision, utilizing novel machine learning algorithms to support an indie studio achieve nth place in a global game development hackathon with a storytelling-focused game"
this is how to linkedinify
That seems way too scewed for my liking
I usually like to be as direct as possible
That is the point, need to shove in more buzzwords
Problem identification
@olive sable Surely you know what this is referring to.
NeuroSynth do be sillyin'
Optimization
optemizig Problem identification?
Figuring out what to optimize requires knowing what needs optimizing, hence Problem Identification
Still BETA-2, the new dataset is still in development
new andor is out for anyone following
my Jellyfin is already showing it to me... but alas I must work 
i wonder if using an arm laptop would be a good idea
i think its like 50/50, steam and ida wont run but the rest should be a slightly better experience because all of my devices except my laptop run arm
arm chips are also generally more power efficient, it'll probably be a dream thin client
I HATE AI SLOP
i dont think arm is ready yet for general use, so many quirks and random things not working properly
Aren't most phones using arm
only arm processors i would trust even slightly is the apple M series, but that's only if you want to be in the apple ecosystem
ye, but it's mature enough where everything on a phone is design around the arm processors, it's different for desktop applications
Isn't that more because of x86 dominance
if arm had greater market share you'd find more things working on it
Not using 256 πππ
Early adopter cost and all that
ye, but i dont think they give a huge benefit over x86
at least currently
though AMD has talked about designing ARM chips if the market goes that way and it becomes more viable, they arent married to x86
The energy saving are huge aren't they?
The smaller instruction set makes things mostly simpler and more efficient
Is what I heard
And in the current world, that's definitely a big w
Though I do agree it still needs a lot of work compared to the behemoth that is x86
yeppers
wtf is this data bro 
you gave me almost a million files
it unzipped into a tokens folder inside the existing tokens folder, so give me 7 min to fix it
i can understand why it took so long now, each file is only 6kb so you cant fully utulize ssd speed and everything
i stopped paying attention to any online articles besides web forums/social media discussions
its such a waste of time
the rest should be a slightly better experience because all of my devices except my laptop run arm
it's like saying "all of my devices run linux except my pc so linux should be a better experience"
ALL phones are using arm
it's either snapdragon or mediatek
Or Apple silicon stuff
on my way to make a x86 phone jsut to spite you
True
ps1 is risc v iirc
some use exynos
That's funny
samsung in europe for some reason.............
isn't x86 kind of risc-esque inside
nope, just risc, no v
there are none
well, there are, but they cost like a Boeing wing
there's rpi, but 5 needs active cooling
theres the microsoft surfaces but those turned out to be more expensive than I was expecting
π₯¬
Cooked
I warned you πππ
Its a lot
what's the point of a pc which can't heat my room
its supposed to be 948.157 files ye?
cuz idk if the exrtraction crashed halfway while i was gone
no
use spacing
to think i though of you as "smart" yesterday.
you dissapoint me
i was just hoping yall qould know an amount of physical objects cant be a float
but ye we use , for the decimals here
noy .
but i have qwerty and program so i use . most of the time anyways
its really annoying for writing essays tho
948'157 :3
Easily accessible for qwerty
And undoubtedly european
I think

Added benefit is that modern c++ allows this notation as well :D
Ooooooh that I didn't know, very cool
Surprising considering how much of a pain apostophes are to lexe and parse but that's cool
nope its not, it should be over 2 mil
my windows does not like extracting these
time to install 7zip
Sounds close enough π
Actually lemme do the math brb
im not gonna half-as sit
ill let 7zip extract it cuz i think the windows one keeps crashign halfway through
???
theres nothing fundamentally different
Ever considered appending the files to make it more efficient?
Isn't the biggest change just the instruction set? I'm not familiar with differences between architectures
I simply just forgot lool
2 million SMOL files is stupidly inefficient compared to just 1 single big file
yep
why did you even do it like this then?
Its not intentional
Well yeah which inherently means that the processors have wayyy less functions it can do
Just less abstraction, but you can still do just about anything
and it is a better experience, so i dont get your point
Like what
true thats why android supports x86
Oh it does? Neat
android on threadripper when?
Be the change you want to see in the world
Cuz no way in hell ur counting the "Intel Atom"
i need 16K RTX clash of clans
i think they are no longer being made
There is no reason to anyway
waydroid
yeah it would make sense for embedded stuff maybe but i dont think it makes much sense for retail phones
100 watt phone π
you'd be surprised how many IoT devices run android
if its not using a kilwatt, then what even is the point?
Probably wouldent
Compared to arm?
Yes ofc
100Β°C
you joke but my prev laptop ran that hot 24/7
Yeah same π
i once spilled a bit of tea under it and it boiled instantly 
ppc is very nice too but kinda irrelevant nowadays
The OpenPOWER Foundation is a collaboration around Power ISA-based products initiated by IBM and announced as the "OpenPOWER Consortium" on August 6, 2013. IBM's focus is to open up technology surrounding their Power Architecture offerings, such as processor specifications, firmware, and software with a liberal license, and will be using a colla...
to clarify, it simply means anyone can create a riscv chip with arbitrary extensions and without licensing fees, not that the chip design itself is open
i mean i guses theres https://openrisc.io/
theres also mips (and loongarch i guess)
RISC != RISC-V
RISC means "reduced instruction set computer" (not so reduced anymore, its complicated), as opposed to "complex instruction set computer"
i once wrote a whole essay about the ps1's proccesor, its been a while since then tho
ps1 is a mips
while RISC-V is just a RISC ISA
MIPS is a RISC ISA as well, or rather there are many different MIPS ISAs
the most exotic ISA that i ever disassembled code for was probably PPC, but i don't remember where i stumbled on that code
aight, i got the full zip extracted now without crashing
3 milion
let the gpu torture begin i guess
Why would extracting a zip crash
@prime ridge i need a discord_llm_config.json
Unless it's a zipbomb 
it kinda acted like one

32B param model finetune trained with public access globally distributed reinforcement learning just dropped
With public training code
And a public RL dataset
Have a fun couple months
Do you even have the patience to not use your PC for anything except training and maybe light browsing for potentially a few months?
@prime ridge
Assuming it's really batch size 1 and uses gradient accumulation to compensate for that, it's gonna take literal ages due to low utilization and high compute requirement
he said up the batch size till it crashes
vscode is already crashing from just opening the project oflder tho
Is there even an option for gradient accumulation?
Without gradient accumulation, the batch size is gonna suck and the model is gonna suck
Apparently most modern LLMs are trained with like 10s of thousands of batch size
I don't know if that's the same thing as gradient accumulation
i dont know either
i think im fine with just not doing that part of the command
i read through the code and there is a fallback
not suprisingly, rip ssd
i have no clue
Look at what the code is doing then
gradient checkpointing is for calculating some gradients on demand during backprop, gradient accumulation is for accumulating gradients and updating them fewer times
Sounds like different things
very different yeah

I assume gradiend checkpointing doesn't do the same thing as gradient accumulation where it increases the effective batch size by a multiple
no you just directly trade memory usage for extra compute usage
which i guess means bigger batches indirectly
Well, certainly seems less scalable than gradient accumulation, and also less powerful
he said put the batch size higher till it crashes, so i have it at 100 rn
Uh, check your VRAM usage in case it tries to leak into system RAM
Probably not training yet then
Might be dataset-preprocessing, which can take ages
It's probably preprocessing
When I was pretraining one time the preprocessing step used a total of like 300GB of memory, including the pagefile

100 batch size is probably way too much though
If it's currently at 512M parameters and my 12GB card maxed out at 24 batch size with 150M parameters or so, it's not gonna be even close
"warmup_steps": 200 ?????
ok
200 training steps to slowly ramp learning rate
ah
Gradient checkpointing is just remat
does the preproccesing get saved or does it do this each time?
I don't know, if the code sucks it redoes it every time
uhhh
That part is up to the competency of the programmer
Sam
yes?
Do u have access to the config to change the model params
yes
Aka where ndim etc is defined
What is current config
U should be able to optimize the model size
what hidden dim
i have no clue what any of this means
Hm, 8 gradient accumulation steps
theres model dim and hidden dim
Seems a little SMOL though
Nah its good
Well, depends how much the base batch size will be
Ah
i got told, put the batch size as high as possible
Ye but only till ur hardware can handle
checkpoints 

nott my code
is this code oss
oss?
Hi #programming
open source
hello
Hiii

googley moogley
this is average ex googler salary right?
idk
in return i will let you pick me up from the floor
POV
Who would not give *me* a trillion gazillion dollars

welp, i opened the side menu in vscode to see if anything changed int he project folder, and it crashed
THIS WAS FREE TRIAL BTW
thats what happens when the dataset is 3 milion 6kb files i guess
parquet my beloved
Most insane compression
All trillion scale datasets are stored in this
fuck it, im adding prints to the file so i know what its doing
parquet
parquet

ah yes
apparnetly its stuck at
# Setup training
total_steps = self.setup_training(tokens_dir)
My beloved
This is a get stickbugged level evolution of a word
Yes
Thanks
Thanks for reminding me of what i want
And that I'm underqualified for
need to be like that boss of mine with 3 Masters and 1 PhD

aight, i found where its stuck inside that function.
for file_path in self.file_paths:
with open(file_path, 'r', encoding='utf-8') as f:
data = json.load(f)
self.total_samples += 1
this one is slow as fuck
i mean, with 3 milion files i can see why
I love this meme so much

would it be foolish to want to get some sort of programming job in the future

programming job? clearly wishful thinking

tbh I don't know anything about the industry lol, just that the idea of programming seems fun so far
soooo
high competition

Try it out and see for yourself, if you're having fun with it in the long term, maybe it's for you
i feel the need to ask, but this code isnt actualy doing anything is it?
this could all have just been a len()
its been doing this for 15min rn
Only you can decide if programming would be good for you
I mean it probably would be, it's just whether it's worth pursuing a career in
but if the competition is high then I have zero chance lmao
Is data even used?
nope
LLM code 
That's not how that works
It's always worth it if you're willing to put in the work
Sure the competition is high, as with every "interesting" job imo
Just go for something that you're interested in, if your heart's in it, it'll be easier to put in the hours
Rather than going for something that doesn't interest you but is "worth getting a job into"
Bruh
I mean like statistically I'm in a very deep minority, I probably can't get a bachelors in computer science or anything because my math isn't the greatest (at least for now), meanwhile there are people who dedicate their lives to it lol
gwuh
You don't need math (kinda)
good maths aren't needed
right now translating is my main career choice, just programming would be cool to combine with it
eh to get a degree it requires math, and most jobs require degrees
if you have the basic idea of what calculus is that's enough for CS ed
You could enhance your translating using programming, that's a good idea
what's calculus again
remember that most jobs at faang don't actually give a shit about your degrees and qualifications. As long as you can prove that you know what you're doing and that you work well in a team, then they will likely accept you.
for example, can you explain why you did something in a certain way, and explain why (at least in your opinion) it was the best way
I'm not an expert but it's not as simple as "you have a degree you enter and if you don't goodbye" anymore, at least not where I live
noway, i get an actual UI now
derivatives/integrals
Though I'll admit it's still very much the norm
just when I looked at Seek most of the job offers had requirements of a bachelors in computer science
Let me translate what the GPU is saying
check whether those are the minimum or the recommended. Indeed only lets you set one so they usually put in their recommended. You may have to go to the companies website to find this out

wha'ts indeed
im guessing lower batch size?
Try 8
for example, most jobs at AWS recommend that you have a degree, but you'll see that their actual minimum is just 1 years experience in a professional environment, a degree is just their recommended
also I don't think I've ever touched derivatives and integrals in my entire life
8 sounds like not that much lol
a job searching service, there are others, it is just one example
thats like.. something they're supposed to teach you in school
Then start touching derivatives and integrals
platonically*
it might've been taught in year 12 but I kinda dropped to an easier math class by then >->
I graduated through a system that didn't care about math or english like normal schools so it makes sense
most countries start to teach it when you are at or around 14-15, and only if you choose math as a subject
so like my main field of study right now is Japanese and translation
damn

same lmao
i had to remember when they retaught us in uni
oh wait is it sin cos tan
no
nah, thats trig
damn
useful nevertheless, but slightly different
is there supposed to be nothing in ram only in vram?
If you know what Ξ£ is, u know more than a ton of swiss people
not even kidding unfortunately
of course i know! it's a hit song by betsy and...
I don't know that, wasn't it like the sum of something
I've had people in my soft. eng. course that had to use a calculator to work out 12+53
Yes
i mean, i've had moments like that 
opens python
types 1+3
its doing stuff now
its not a big deal to just catch up on stuff you dont know
ok so like is it worth trying to get a job in programming or not I feel like I'm getting mixed signals (also doesn't help that my math skills are subpar)
only you can answer that
927 hours 
bro said 20, he forgot the 900
oh yeah actually learning the math won't be hard, it's figuring out what to learn that's an issue
we all like programming in this channel
Lowe it till no shared memory is used
but what we talk about here aren't our jobs
if it's in high demand and difficult to actually get a job in then I don't see a point, objectively/lh
oh, okay
(unless something interesting happened at our jobs of course)
you're unlikely to get objective answers to that question in #programming
there may just be a slight bias 
I "like" programming
fair enough
difficult doesn't mean impossible, the rest is luck, effort and connections
with a double serving of connections
yeah connections is funny (I have none)
Connections are OP asf
we have connections in here so idk
i have an internet connection
I live in Australia so uh
rip
I sometimes have an internet connection
I no connections with anyone or any identity of asset in this discord. I am not here under my own knowledge, more than likely added by a third party. I have no sympathy for the things found in this discord, and affiliated peoples
Only if Mercury is in retrograde
that's a lot of big words
its going
I'm stupid what does that mean
llm shit

tbh i have no clue either, this is not my code
real
hello i can be your online friend yes
(shadow, is your headset charged?)
batch size 14 seems to be the sweet spot
oh nope, justy slightly went over
i guess 13 then
I can't wait for when I understand all these big words
i dont know most of them, but im learning
tbh I just wanna make an ai lmao
94 hours, so 4 days
there's no way in hell I could make a good one with the specs my pc has though
8gb vram and 32gb normal ram 
im doing this ai training rn with someone else's code cuz they didnt have a good enough gpu
ah
ngl the code isnt too optemized but im not cpu limited rn so its fine
only had to do small changes
I think I'm everything limited lol
i got 64gb normal ram and 24gb vram
As long as you're not motivation limited, anythings possible 
I have 64gb as well, but not for AI.
I have it because DCS eats all the ram in your pc
Oh
Beautiful
No its cooked

btw, did you use ai to write the code, cuz one part of it was a bit whack
Its using wrong model size
oh
I am completely normal and mentally stable.
is this using ratatui?
Lemme fix the json
haha lack of money and income go brr
I make zero money and am currently in a little debt lmfao
same
its 512 rn
was it not supposed to be?
"model_dim": 512
its meant to be 1024 (I know absolutely nothing about AI)

model dim means intermediate FFN projection size
what you actually need to do is to keep track of total # of parameters
I just gotta hope my mum decides I've done enough chores to give me anything
Its a 30m model rn πππ thats cookee
Should be 512m parameters
Yep, way too much batch size
Loading the entire dataset
Get it out of shared memory, should go way faster
but you werent using the data you got out of it
Yeah make sure it doesnt leak into shares mem
Validation ik its cooked
it is
what?
validation for what?
it just wastes 2 hours beforehand
cuz there are 3 milion files, and its python
U can remove it. I forgot about that
Coding in one big file has its disadvantages π΅βπ«
i didnt actually wait 2 hours, i gave up after 30min and replaced it with a len()
you might have more single core idk.
i got a 5950x
Hopefully ur batch size will be at least 10 π¬
its 13 and doing fine
On new model size?
30M parameter model at 13 batch size and already using all the VRAM......
's training code was better
yes
Idk how
It could do 150M at 24 batch size on half the memory
Like, what? How can you be so much worse than
's garbage code
i dont have a config json, im just changing hte default config in the file
Oh right I deleted it cuz it was wrong
Im not at my computer rn so I cant send it
Thatll work tho
Sorry for this shitty code
just tell me what to change and ill do it
nah its fine
Should be in dms
It's truly garbage if it's so much worse than
's very bad code
Just model depth and ffn size
Your code: 24GB. 30M params, 13 batch size
: 12GB, 150M params, 24 batch size
Though now that I think about it, it's possible it's the CTX
The CTX scales horribly inefficiently
I don't remember, was it expontential if you don't optimize it at all?
No bro
Not 30m
Its not exponential its quadratic
And I didnt implement any additional optimizations
You said Sam was running 30M params there
It might be a ui glitch
Cuz I might have accidentally hardcoded 30m param
Maybe
I hope not
But during debugging there is a chance
Did you by any chance use an LLM to write the code?
"Model size: ~" + f"{self.config['model_dim'] * self.config['depth'] * 8 / 1000:.1f}M params"
in the code
For a lot of it
Yes
So it's definitely trying to calculate it
Which would mean it's either calculating incorrect values or training 30M
Damn how tf is on 30m π
should be around 245
512 ish
Well, whatever you're doing, it seems like you're encountering some sort of silliness
Fs π
1280Γ24x8 / 1000 no?
about 245
Guess who can rebase things again tomorrow. Wohoo.... 
its not getting squared in the code tho?
squaring would be self.config['model_dim'] ** 2
In the main section
Its reading my set values
Pretty sure
Did u download the json file?
nope
Ok so I think its in the main section
Reading code on mobile is fucked
Gimme a min
Hmmm
πππ
Its a ui glitch
So we chillin
Probably...
Just make sure u are rerunning the program
It wont auto update or anything
And make sure u clicked save on thr file
can someone tell me what it means by "faster" when the line of code is literally done in a fraction of a milisecond
optimized code is often measured in nanosecond improvements.
at least in my code, idk
also, idk how this works in python but in Rust a list and set are on the heap, while tuples are on the stack
yes. but if you have really really hot code, it adds up
you might not really notice a difference when the amount of data is very small but that can change drastically when we are talking about vast mounts of data
some optimizations we do appear like arcane magic to outsiders
fixed it.
but like i said before you arent squaring it here so you just get 245
idk why it even needs to be squared tbh
look at the big O notation complexity of operations on List, Set and Tuple to get a better idea
While maybe not translating quite directly to Python, the collections documentation of the Rust standard library is great for learning.
Just count params manually
Also 30M model should be doable on a 3090
My 3090 handles 300M param pretraining ez
it seems like mine really doesnt like these settings
im still going over vram at batch size 6
Bro, tbh I'm listening to this from moment you sent this... Why the f I like it lmao
batch size 2 is still going over vram 
Something wrong is happening somewhere
IT WAS TEST EVEN 
Whar
idk what im doing shadow

i can provide the hardware to train on, but the code is borked
and i dont know llm code
Keep going tho... I like it
Me sillyin'
its obvious this was made with an llm too, i feel like there's no saving it unless an actual profesional looks at it
Incomplete and still heavily WIP

it sounds so good
LLM code: takes longer to debug than it would have taken to just write yourself
ye, kinda
It's just a silly thing I've been doing
Here's one test where I used both NeuroSynth and Kasane TETO
π₯ oooh yeah
Was mainly me testing Kasane TETO, but decided to mix it properly and it turned out really well
It would be so over if somehow my Bluetooth malfunctioned and leaked into the office space the songs I'm listening lmao
NeuroSynth, is that your tool?
Sorry if I look like an ignorant or dumb
NeuroSynth is the model I use
Current latest is BETA-2
It's one I'm developing with Wispers
Neuro vocal synthesizer model
Is that RVC model?
No, vocal synthesizer
Got it, yup
Gonna replace the old SynthV + RVC chain with a full-chain vocal synthesizer to remove the dependency on an expensive piece of software
I love meetings by higher ups when they inform of a bunch of changes and i'm uninformed by basically all of them
I'm like
Ok nice but i got no idea what ur talking about
so good for u
real
@sage crag just got a book with stuff I'll be looking at the next whole year in math and they said if i can complete this book i should have no problems in math for the whole year
The book in question:
yeah here it's grown ass adultsπ
Also this is the only thing left between me and university
If i complete the next year i can do the uni exam
if my headphones disconnected while i was listening to music everyone within 50m vicinity would suffer levels of psychic damage dangerous to their lives
holy vocab
(except that never happens since i never have no-headphones volume at nonzero levels)
wdym vocab this is basic english
Konii am i cooked 
honestly i bet if you learned it in school you'd forget it by now anyway
these topics are worth revisiting even if you learned them already
in general maths is so big you're bound to forget some parts, you're mostly learning so you can easily revisit it when you encounter it again
this lmao
i personally don't remember how to solve quadratic equations
i'd be able to derive it if you give me 10 minutes (surely
) but i dont remember it off the top of my head
maybe 5m
@sage crag https://www.swissinfo.ch/eng/society/pisa-survey-swiss-students-perform-well-in-maths/49032102

"But 25% of Swiss pupils failed to reach the minimum level in reading."
quarter of them are unable to read lmao
Unable to read AND math
let's just say i didn't do some of my homework 
Switzerland 
i will never become an actually useful ML research scientist at this rate
easy enough yeah
can't even read
Wdym
surely discord is reading 
i mean i personally think it is about experiences
Ive got the code to work
tbh math in ML is more like general usage of equations instead of twisting it and make it hard to comprehend
It trained and it worked
Its not that hard
Not if u wanna do revolutionary shit in foundational ml
def relu(x):
if x > 0:
return x
else:
return 0

Attention is not hard
π

The vram usage is out of control
Thats not because of the code
oh yeah ofc attention is but like new ones yk
Whats hard abt attention it's literally just dotproducts
Thats normal
like underlying logic of RoPE and such
and topk
Its not
they are alright
Batch size of 1 is even over 24gb
?? what is hard abt this
I can try to add more optimizations tho
Back prop
but ts is not okay for me:
Adjusting attention matrices ia hard
Ok sure but the algo is the same backprop
Unless ur talking abt optimizers then sure that's harder
With attention is wayyyyy harder than weights
But its still a matrix
Let me complain damn
Lile im a hs student π
Multi variable is hard enough
There's no diff between the layers used in mnist and attention
vanilla attention is π
but that doesnt make attention hard
There is juat a lot
That makes the optimization part hard
True
we don't derive by hand all those gradients
In academia you do
???
When optimizing
Only if ur trying to invent something entirely new
Understanding how gradients are calculated is important
Or formal studying material
Moreover knowing how they work is still kinda complex
Queries,. Keys, and values is kinda confusing at first
yeah if u know 0 abt it

Ok if attention isnt hard what is
If u take a look at it for a while it becomes clear
Thats true for anything tho
oop 
Second order optimizers
certain loss functions
Loss functions πππππππ
Ngl prolly gunna need to roll my own optimizer
i mean it is extremely rare to even have SGDs fall into local minimum
Adamw uses so much memory
bc there are billion parameters per model
Ye but relatively stable
Scales extremely well
Exactly
in russian, students call calculus "mathan" (short for "mathematical analysis" which is the russian name for calculus) or differential equations "diffeqs" (or rather "diffurs" for "differentsialnoye uravneniye"), are there stupid abbreviations in maths in english or are english students serious people?
To gazillion params
english only uses calculus -> calc no?
what is decision here
Then use optimizer that doesn't use all those extra params
Yeah ima do that
Deciding what to do with ur life
the japanese abbreviation culture is even more hardcore than russian one
oh
True
correct name ig
Cant wait till i understand optimizers properly 
I'll be reading a text and have 0 idea what's going on
Idk if anybody does atp π
ML is pretty easy usually
i was about to be happy game semantics got mentioned but then realized game theory is a different thing
Getting good data is half the challange
bros saying "oh yeah we used 5 steps of the newton Schultz iteration to solve x problem" in optimizer papers and I'm like "damn good for u or bad for u idk what this is"

idk about abbreviations in russian empire but in ussr they went CRAZY with abbreviations and naturally it's still there in the language to this day lol
https://calvinmccarter.wordpress.com/2021/11/18/the-newton-schulz-iteration-for-matrix-inversion/ like wow thanks for the easy explaination
I 100% understand whats going on here 
Not a single representation is comprehensible
speaking of random math stuff, i like dual numbers https://en.wikipedia.org/wiki/Dual_number
they can be used for differentiation

Anything past calc 3 is purely meant for torture
abstract algebra and category theory are fun you should totally learn them
they can't keep getting away with creating random impossible numbers and pretending like they exist
how is it that every time I look at math there's always a new symbol for it I've never seen before
i have 0 idea what sin cos tan is, am i cooked
it's just triangle stuff don't worry, geometry or something
sinus cosinus tangent
(these are basically "ok let's see what things mathematicians made up and maybe generalize to see what's common and what's different between them")
Ye but what they are used for
for messing you up
isn't it to find the angles on a triangle
For propaganda
for just about everything in geometry
S tier rage bait
as soon as I graduated I just forgot all math ever sorry
they still do they're just lying to sound cool
Caca
every minecraft player uses trig surely 
who tf is leaving out random angles for no reason anyways
trigonometry pentagonometry
that sounds nightmarish
honestly though other than minecraft i can't remember the last time i used trig...
finding the fortress with 2 pearls
checked my .python_history and rust/*/src, no signs of sin or cos thre
trigonometry =
tri (three)
gon ("to say" in japanese)
o (oh)
metry (the act of measurement)
the act of measuring Three saying oh

but it isn't trigonometry, you're simply trying to find where two lines meet
a.k.a. a math equation

and that's done using trig
why so mischievous 
I did A level further maths w/ decision
so unfortunately yes
forgot most of it now though
man math is hard I just wanna make code do stuffs
yeah but maths hurts my head
unlike learning to code, learning to math is just painful and annoying
well code I just type words and it do funny trick
pov ai slop chatgdpr coding
you don't know programming enough
with math it's like "you see this letter? Do the most complex equation possible to get the number 2 :)"
I do not
-# I literally just started and I'm scared
A lot of CS can be represented as some math equation
why do math when I can just make the code do it for me B)
me when Nvidia fp4
programming is just applied maths, computation is just doing proofs
when can I make my computer do integration
now

What is what times what?????
WINE WHAT SAYE OU TODING
Why
math is fine for me, I have never had problems with math. But I think programming is much more comprehensible than math, at least for me.
debugging 
but
the bugs......
true
My file has no bugs
it is 0 bytes long
Just don't program
There, problem solved
fudge
@sage crag https://www.swissinfo.ch/eng/education/why-the-swiss-school-system-is-so-successful/89129497
this was just a small bit of all the winedbgs it spawned btw, suffice to say i'm reverting back to the old version 
I can't deal with it
A file? in my filesystem? ludicrous
free eduction 
...eduction.... mmmmmmmh
@ruby timber do you also think swiss schools are so good
We learn so much
click the squares (insert masterpiece)
I think the mandatory part of schooling is pretty good and also terrible in certain aspects, as with all education systems-
BETTER THAN NOTHING
Do you also love learning the things konii learned when she was like 10 but 10 years later
If it's useful to daily life, nope
just got back from fighting a cockroach after taking pain medication but will do π«‘
here taxes are automatic (unless you're self employed) (i've never done my taxes so i wouldn't know)
yumy
somone in a lower year did the same with copper sulfate
I think where I live you can get someone to do your taxes for you
not for free
Anywhere
I did learn in my last year of middle school some politic stuff about how the country works, that was pretty neat
like votations and stuff
all I remember about titrations was people complaing about the math
no taxes though
(I also had no idea how to do the math but I just shuffled numbers around until I got the answer)
well yeah not for free, but at least you don't gotta do them
You will survive one day just cause of that and the rest will perish
our one was pink I think
One place where i saw the progress is insane is
In chemistry and stuff for some reason
Like
We go from 0 knowledge to
wait in what context
i skipped all physics and chemistry in highschool π₯
Yet basic ass math is ommitted
also biology and geography
I did like 3 subjects in the end of high school and 2 of them were forced upon me
honestly i skipped just about everything i could it's a miracle i passed
yeah because school is bs
i did go to some classes but i never went to a single PE class but luckily never got a failing grade
What if i just quit and become a DHL driver
Or just move to the alps and become a cow milker or something
Would I get a π₯ medal if I posted this?
https://deepmind.google/discover/blog/alphaevolve-a-gemini-powered-coding-agent-for-designing-advanced-algorithms/
Borg my beloved
so like does AI actually provide any jobs or is it only replacing them
because I saw a video earlier talking about how it's the same as like the invention of the automobile, a computer, the internet, etc.
WHAT
32.5%????
Yeah. Kinda wild.
so it just creates a more competitive environment for finding jobs for everyone π
does ai require you to know programming?
very little
sigh
the thing is, LLM writes bad code and you have to be an expert to tell when its good enough and when its bad to the point of not even being fit for the job
The problem is when the llm hallucinates and give a bogus code and you canβt tell whatβs wrong
its not even about being bogus, the code can be bad while still doing what's asked




