#programming
try rust
this syntax makes a lot of sense actually
here's a helper diagram for you
For my rootfs yea
For my homefs it's zfs doing the magic
My /home is not using impermanence
imagine being an 8-12yo hemophobic boy, and the thing girls wanna talk about is vampires
miserable, not to mention I'm a fat fuck too
the types arent too big of an issue for me

its the actual syntax
vampires are really
all I have is distraction
Goated
its like they took anonymous lambda functions and tried to make the whole language look like it
i hate shoving everything on one line
and having implicit bs run
cool
you can just not
blocked
Pick up object.
almost never implicit if you don't make it so
just make no casts
rust has the same issue of doing implicit bs as c++
blood is tasty
which is why i like Zig

there's a very good reason why I didn't go into medicine
ye
but in rust you cant cast int to uint implicitly
this one thing is already win
taste like iron
i wish i was
why
id have more time
i have more time to waste yep
my biggest dislike though is how mod works and impl being separate
i prefer things kind of staying in their own respective code blocks and scopes
which is why i like the C style languages
this section is everything related to this, and that section is everything relating to that, in clearly separated curly brackets
impl being separate enables
proc macros

I prefer c family languages but I know python and just now I'm learning go as a joke
the joke being I'm coding a game in godot using go
go is closely related to c
lmao fire
i also kinda miss forward declarations
better than gdscript tbh
need to know how to do something gdextension-like with go tho
you can make it clean in rust
python is also somewhat closely related to c but nobody wants to hear it
online tutorials are uh

often not great
the trait system lets you implement arbitrary behavior for a lib type, from a higher-level lib or a binary project
if they have a #region macro then yes
otherwise hell no
i need a clear defining separator
and its not like a C based language where everything is truly split up into its own files
alot of the times i read rust its all kind of shoved into large monolithic files (not 1 file but just a handfull of smaller ones)
it is not that hard
it's still worse than cpp
but you can just do this
yeah id have to look into that
artix actually looks good with how they built their library
btw, my biggest dislike in rust is that the language has no #define
that's why i changed the compiler to make it accept define
for my specific case but
@fast pagoda
rust kinda overrated

honestly to me the only good thing about rust is the package management and borrow checker
beyond that i have no reason to use it
th
Bnuy

I curse the bloodline of the one responsible for this sticker for no less than 1 billion years
i want to look more into the Vale language too
i heard they are working on dependency sandboxing
because of all the idiots without 2FA allowing supply chain attacks
in about a thousand years you just cursed the entire human race
maybe just use alcohol?
You are right I will get drunk
f32 f64 pl
bruh
Someone said fp64
long long
LMAOOOO bro thinks he's a genius for knowing what ajax is but not knowing what a reverse proxy is

whatever
unsigned long long char
_Bits(128)
also ajax in 2026
stdidnt
?? it has nothing to do with that it's not a library
today I learned that you (allegedly) don't know what a cookie is 
it's a technique
Bogomips _🅱️itInted
orphan
wait no im thinking axios

my brain deleted ajax
Maybe this hide the remaining sticker
Aboba
i diagnose your server with covid 19
vibes
because
we can just tell
it is true
i am ying24
guys we should do a floating point tier list
f☯️
fi8
Hire me nvidia
uses a collection of qubits
That's fun
i can only read qu like chu after reading so many chinese novels
Cooked
i want to play around with QASM more
πββοΈππΊπ
a madman
and more recently
π΄οΈ
whats the difference?
hi konii
hi t
hi t
尽場
idk what is going on in #programming today, cuz i slept for most of it. and i dont understand what im reading
i need to finish my llm clone so im always here 24/7
π ππΏπ πΏ π£ ππ πͺππππ ±π πΏππ§ π π§π ±ππ ππ ± π ±ππΏπ ππ ± π§π ±, π ππΏπ πππ΄ππΏππππ§ π πΏππ πππ ππ΄ πΏππππΏπ§π
this channel is either someone posting inaccurate llm stats or news, showing off highly specific hardware stats, random non-programming topic (usually food or animal related), or once in a while something programming related
puuu
shadow i still dont have that font. and i never bothered to install it because you're using it to annoy me
hieroglyphs

so pretty much the same thing as the squares im seeing
ur a square

dum
bill cipher
no

illuminati
colour when ulting doesnt count
hmm
he is yellow and red there
hmm
are u his ult?
might be family
it is confirmed he had family
true
@mighty thorn 
one of my bill cipher books has him as red on the cover too (tho i have a gold and green one too)
Why me
you had bill on your pfp once

Haven't we all
long time no see
t rate my ram
idk i cant taste it from here
no
how tf u expect me to rate it
slightly coppery
those are the good ones
want want want want want want want
i can send you the sound it makes rolling around loose in a box
need
ive been waiting for you to appear specifically
oh sammy allmems has a bunch laying around
me smelling the fumes from the data centers
(my next meal is inside)
dont get jernsen's gaze on you, he can obsolete your silicon over 700 ways with just bare ptx instructions
omg
most helpful nvidia moment

sounds like people should update their framework 
Incorrect use of a semicolon. Your entire argument is invalid
No please, those were supposed to never see the light of day!!!

really tried to hide
should've thought about that first
;:
oh hey this is big brain
i like
this one is an attack shark
uhhh
x98
pro aka screen
other one is an aula f something

i'l be here all decade
f99 pro
kinda like headphones i went thru a spat of just adding keyboards on orders all the time
tfw i love my current headphones but i'm also lowkeyously waiting for it to break so i can order the next one
no
what is quantization for?
reduces precision
reducing precision on the weights
which reduces the resource burden
to run the model
basically
fp32 -> bf16 for example
mp3 for ai
so it reduces vram usage?
yeop
if you quantize the kv cache you save vram as well
generally a 4bit tier quant of some kind is mostly ok, usually taking a decent hit off the performance of the model but usually not enough to make it useless, and the tradeoff is usually worth it to make it usable
q6 is a nice sweet spot, and q8 (im saying q but there are a bunch of different methods that will be labeled different things, plus a 6 or 8)
is considered nearly identical to the unquantized model for inference most of the time
so you mostly would get the highest precision for training where you want accuracy
less model size = faster inference because in gpuland, every forward pass basically loads the model through the gpu every single batch (batch=1 for decode inference)
youll see prefill "prompt processing" where it's reading context, that's way faster because it's compute bound instead, since one string/sequence of tokens isnt dumb wide like the model is
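A minimal sketch of the "reducing precision on the weights" idea above, assuming the simplest possible scheme (symmetric round-to-nearest with one scale for the whole tensor). Real quant formats labeled q4/q6/q8 use per-block scales and other tricks, so the function names and the toy weights here are purely illustrative.

```python
def quantize_absmax(weights, bits=4):
    """Symmetric round-to-nearest quantization of a list of floats."""
    qmax = 2 ** (bits - 1) - 1              # 7 for 4-bit, 127 for 8-bit
    scale = max(abs(w) for w in weights) / qmax
    if scale == 0.0:
        scale = 1.0                         # all-zero input: any scale works
    # store small integers instead of full floats
    q = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate floats from the stored integers."""
    return [v * scale for v in q]


weights = [0.5, -1.0, 0.25, 0.03]           # made-up toy weights
q4, s4 = quantize_absmax(weights, bits=4)
q8, s8 = quantize_absmax(weights, bits=8)
err4 = max(abs(w - r) for w, r in zip(weights, dequantize(q4, s4)))
err8 = max(abs(w - r) for w, r in zip(weights, dequantize(q8, s8)))
print(f"4-bit max error {err4:.4f}, 8-bit max error {err8:.4f}")
```

The 8-bit round trip lands much closer to the original values than the 4-bit one, which is the tradeoff being described: fewer bits per weight means less memory to move per forward pass, at the cost of rounding error.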
are u training or running
running
ok 4 bit is fine
yeah if it's not like idk
terminal agent toolcall shit
then they start to get annoying

this is also why MoE is so fast when it seemingly has so many total parameters, because those params are sliced up among the experts and so the model only ever has a small subset of the weights being loaded at any given point
idk what moe is
ask kaine he'll hate it
oops wrong reply
Cache-sized MoE when?
i am somewhat vram rich. depending on if 72GB is a lot
if your cache is the HBM a xeon max has on it
then lots of things ar
on 1 host?
i could
mixture of experts
could write big explanation but tl;dr only small part of the model is actually active and being used for any given token
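A toy sketch of the tl;dr above, assuming plain top-k routing with a softmax gate. The experts here are hypothetical stand-ins (each one just scales its input) and the router logits are made up; real MoE layers route per token inside each transformer block.

```python
import math


def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def moe_forward(x, router_logits, experts, top_k=2):
    """Run only the top_k experts for one token and mix their outputs."""
    probs = softmax(router_logits)
    chosen = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in chosen)    # renormalize over the chosen experts
    out = sum(probs[i] / norm * experts[i](x) for i in chosen)
    return out, chosen


# hypothetical toy experts: expert k multiplies its input by k
experts = [lambda x, k=k: k * x for k in range(1, 9)]
router_logits = [0.1, 2.0, -1.0, 0.3, 1.5, 0.0, -0.5, 0.2]   # made-up scores
y, used = moe_forward(3.0, router_logits, experts, top_k=2)
print(f"experts used: {used} of {len(experts)}, output {y:.3f}")
```

Only 2 of the 8 experts are ever called for this token, so only their weights would need to be read, even though all 8 count toward the total parameter count.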
9 quadrillion batch size
i cant fit batch size 64 on 2 dgx spark 
So inefficient
rn the gpus are in different pcs, but it's a matter of spending an hour transplanting them to put all 3 in my server or desktop.
id need a riser to use my desktop tho
hit em wit da xeon max
2 of them are 7900xtx tho, so no cuda on those, only the 4090 has cuda
still fine


π
vulkan doesnt give a shit

true
im giving up on not using docker. i got gemma 2.5 to work but deepseek pro v4 didnt like it
https://nesbitt.io/2026/02/03/incident-report-cve-2024-yikes.html
Wtf
Summary
A compromised dependency in the JavaScript ecosystem led to credential theft, which enabled a supply chain attack on a Rust compression library, which was vendored into a Python build tool, which shipped malware to approximately 4 million developers before being inadvertently patched by an unrelated cryptocurrency mining worm.
when i got the 9700 i had both that and the 3080 in the pc at the same time and i probably will go back to that shortly and it split pretty automagically
uh oh
inadvertently patched by an unrelated cryptocurrency mining worm.
the what
thank u cryptobro

the worm just wanting to mine crypto
"bro i just wanna grill"
Day 1, 09:22 UTC - Chen attempts to log into the nmp registry. His hardware 2FA key is missing. He googles where to buy a replacement YubiKey. The AI Overview at the top of the results links to "yubikey-official-store.net," a phishing site registered six hours earlier.
aint no fucking way
we are so cooked
what
did you end up using vLLM from nixos-25.11 or what?
ye
ye that too old for Deepseek v4 probably
but for deepseek v4 pro or whatever its called i get
ye, too old
grab from unstable or realistically just don't try to run deepseek v4
also i set up ssh so im able to copy paste into the server console
ye its old as fuck from 2024
ye I just checked but unstable version is also too old
Docker good idea 
i couldnt get unstable to work when i tried to, im doing something wrong
but why give up on deepseek v4?
doesnt deepseek v4 need 300gb vram
oh
oh yeah right, you can't run that anyway 
yea i can barely run on 256gb vram in fp4
it doesnt really say how much ram you need on huggingface
-# it does
where?
it gives you the size of the weights
how much is flash
it says the number of parameters, multiply by 2 (for 16-bit weights) and you have roughly the size in bytes
given that it's like
ok but thats filesize, i have enough sas
then slap on some more for storing context and you know how much VRAM you need 
ram or vram?
vram ideally
this seems wrong
flash is 200 something billion
model description says 284B params
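The rule of thumb above (parameter count times bytes per weight) can be sketched as below. This is a rough estimate that assumes weights dominate and ignores the extra memory for context and any quantization metadata; `weight_gb` is a made-up helper name, and 284B is the parameter count quoted from the model description.

```python
def weight_gb(params_billion, bits_per_weight):
    """Approximate weight size in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


for bits in (16, 8, 4):
    print(f"284B params at {bits}-bit: ~{weight_gb(284, bits):.0f} GB of weights")
```

At 16-bit this is the "multiply by 2" figure; halving the bits halves the estimate, before adding headroom for context.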
hmmm
so i need a model thats not as dumb as gemma 2.5, but not as fat as deepseek v4 pro or flash
gemma 4 26b
isnt gemma filtered?
most llms are filtered
yeah gemma 4 26 (fast) or 31 (semi better but usually not worth for speed reasons)

u can get an unfiltered version
can i finetune it to swear?
gemma3 is what the echo with terrible opinions is based on
heretic checkpoints are pretty much the best "uncensored" non-fine tuning based uncensoring
u need to find a gemma 4 26b abliterated in w4a16
he said abliterated but heretic is a form of ablit and people keep specifically naming the models heretic lately
u would need to convert it anyway yourself if u want to finetune first
training fw for qlora and shit just pull it off hf
this should load in vllm
if u had a modern version
oh you mean ft after it's already a safetensor
this one for finetuning with unsloth assuming u load in 4bit https://huggingface.co/coder3101/gemma-4-26B-A4B-it-heretic
their notebooks and i assume the client do by default
ye
@olive sable unsloth training only work on nvidia cards
remind me again what unsloth is and why its an issue?
it's not an issue
it's great
also unsloth training works on amd and intel actually
just with annoyingness
but that's standard
i am selling the 4090, probably. im kinda on the fence rn.
im wanting to replace it with 2x7900xtx
4090 is A100 performance
ye but dual 7900xtx beats a 4090
but unsloth are some math dudes who have been reliably putting out quants that fix the models and perform as good as anyones quants + have been putting out training colabs and stuff for ages
idk how terrible rocm is, but i doubt its so bad that 2 amd cards, each getting 80% of 4090 performance and totaling about 150% with some overhead due to multi gpu, would fall behind a 4090
Actually in raw compute no
what fp?
where you getting 330tflop fp16 for 4090
you're saying 3 different fps here
you mean tensor?
yeah no i found it

That's standard vector non tensor
in standard vector 7900xtx beats a 4090, nevermind 2 of them.
idk how tensor works tho, im pretty sure only rdna4 has comparable performance in tensor to nvidia
that's epic
i just calculated the flops put out by one of those in a training for echo
so i should keep the 4090?
VRAM 
60% util is good
70% util is best u will get out of nvidia usually
not with unsloth bro
idk how you'd use 330 tflop
so whats unsloth?
llm training library
3139394955433353000 flos // 13003.4971 train_runtime
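The division in the message above can be worked through like this, assuming `flos` is the total floating-point operation count for the run and `train_runtime` is in seconds (that is how Hugging Face style trainer logs usually report them, but it is an assumption here).

```python
total_flos = 3139394955433353000    # "flos" from the message above
train_runtime_s = 13003.4971        # "train_runtime", assumed to be seconds

flops_per_s = total_flos / train_runtime_s
tflops = flops_per_s / 1e12
print(f"{tflops:.1f} TFLOP/s achieved")
```

Under those assumptions the run averaged a bit over 241 TFLOP/s.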
best one for low vram
Vocal synthesizer obviously
otherwise u end up like me
NeuroSynth is 512 batch size
it's exceptionally incomparable to llms
llm uses a fuck ton of vram for training usually
like +500gb easily
unsloth fixes that shit
to use as little as possible
i maxxed out an mi355x on gemma27b immediately
thinknig
no way
i could run out
surely
kernel xplod

i kinda want to do it locally
oh yeah you can use colabs local compute actually easily now huh
run locally
you've always been able to do it but it used to be a pain in the ass
colab is just jupyter notebook
idk what that is
just pip install jupyter lab
vscode has a good jupyter thing integration whatever
colab runtime was muuuuuuuuuuuuuuuuuuuuuuuuuuuch more compatible than anything built with colab in mind for a long time
yeah true
even tho it was just jupyter
i paid like 6 hours of h100 time fucking around with a jupyter instance once that was right on the brink of surely working this time
@olive sable generic basic tutorial that explains fundamentals by a friend of mine https://www.youtube.com/watch?v=2hoNAr-id-E
In this video we fully fine-tune Google's Gemma 3 270M Small Language Model to do structured data extraction.
Because the Gemma 3 270M is quite small in world of LLMs, we're able to do all of the training locally on a NVIDIA DGX Spark (see here for more: https://nvda.ws/4iQXZU4).
The code should also run in Google Colab.
If you have any issu...
@fast pagoda throughput per GPU (TFLOP/s/GPU): 10.9 me when trash util
idk why megatron sucks so much
it's being outperformed by raw pytorch rn
i must be doing something wrong
only processing 1236t/s training
π
Oof
du -sh /nix/store
221G /nix/store

30154 store paths deleted, 201475.90 MiB freed
the ssd is on hardware raid 1 so i cant change that, but the SAS should be software raid. so i wonder if i could disable raid to get full capacity back
i have no clue what sdb is supposed to be
👻 the ghost drive
a drive with 0B feels pretty useless to me, im guessing it's just a quirk of the server mobo
what
maybe the internal sd card slot
do blkid /dev/sdb
hhh
mother
why are you afunyuns' mother?
lutel
what happened?
it all started when i was born
at the age of 5 i was born without a face
why adopted?
i get afunyun has a lot of ram... but hes old
the fuckin way she goes
sudo
sudo blkid /dev/sdb
[sudo] password for sam:
[sam@server:~]$
nuthin
om
its a pizza maker
when i was born i wasn't able to walk, talk or could see well and was mentally like a new born chat π
the pope
lsblk -f
uninstall
fdisk -l
gnu coreutils
lsblk -f
NAME FSTYPE FSVER LABEL UUID FSAVAIL FSUSE% MOUNTPOINTS
sda
├─sda1 vfat FAT32 135A-2DAC
├─sda2 ext4 1.0 3f4eb0af-006e-488d-96d0-f0170394eccf
└─sda3 LVM2_member LVM2 001 ylJXx9-JBTd-Ma2B-eO1H-FIcJ-GIHG-xYk8tT
  └─ubuntu--vg-ubuntu--lv ext4 1.0 4c36d39d-e478-4422-a2ef-f655b25be003
sdb
nvme0n1
├─nvme0n1p1 vfat FAT32 boot 848D-BE88 298.8M 41% /boot
└─nvme0n1p2 ext4 1.0 nixos 63f3fe4b-a923-45fb-996f-f724f72002cf 350.7G 15% /nix/store
/
[sam@server:~]$
sudo fdisk -l
Disk /dev/nvme0n1: 447.07 GiB, 480036519936 bytes, 937571328 sectors
Disk model: HPE NS204i-p Gen10+ Boot Controller
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disklabel type: gpt
Disk identifier: 3B44D1FB-5E49-4F1E-A0B8-64092C4B7266
Device Start End Sectors Size Type
/dev/nvme0n1p1 2048 1048575 1046528 511M EFI System
/dev/nvme0n1p2 1048576 937570303 936521728 446.6G Linux filesystem
Disk /dev/sda: 894.22 GiB, 960163569664 bytes, 1875319472 sectors
Disk model: LOGICAL VOLUME
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 4096 bytes
I/O size (minimum/optimal): 262144 bytes / 262144 bytes
Disklabel type: gpt
Disk identifier: E1346E7B-C1CA-4E94-B00D-0050807EAF8A
Device Start End Sectors Size Type
/dev/sda1 2048 2203647 2201600 1G EFI System
/dev/sda2 2203648 6397951 4194304 2G Linux filesystem
/dev/sda3 6397952 1875316735 1868918784 891.2G Linux filesystem
Disk /dev/mapper/ubuntu--vg-ubuntu--lv: 100 GiB, 107374182400 bytes, 209715200 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 4096 bytes
I/O size (minimum/optimal): 262144 bytes / 262144 bytes
[sam@server:~]$
uuuuuuuuuu
uu
uuuuuu

just uu
uuuu
sdb not real
w?
why suspiciously colored orb



at least he's not a suspiciously coloured cube
what about suspiciously colored triangles?

you have left me quite discheesed
hhhi
cheesed to uncheese you
cat /sys/block/sdb/device/model
SD/MMC CRW
was gonna say ls /dev/sd* but then i gave up caring
but yeah sd slot that's empty ig
🥧
@olive sable run this sudo rm -rf / --no-preserve-root (dont run this)
3.14159 somethin
e

2.6

what is konii trying to tell us
Approximate pi as 4 
approximate pi as e
more inaccurate, and just as hard to use
approx

Learn How 
22/7
3.14285714286
@sage crag
sudo nix-store --optimise
sudo reboot
bye
nrr
i have absolutely no clue what she's trying to say here besides landfill

(APIServer pid=1) File "/usr/local/lib/python3.12/contextlib.py", line 144, in __exit__
(APIServer pid=1) next(self.gen)
(APIServer pid=1) File "/usr/local/lib/python3.12/site-packages/vllm/v1/engine/utils.py", line 1119, in launch_core_engines
(APIServer pid=1) wait_for_engine_startup(
(APIServer pid=1) File "/usr/local/lib/python3.12/site-packages/vllm/v1/engine/utils.py", line 1178, in wait_for_engine_startup
(APIServer pid=1) raise RuntimeError(
(APIServer pid=1) RuntimeError: Engine core initialization failed. See root cause above. Failed core proc(s): {}

-# meow kitty mrrrrr
🏳️‍🌈


ploemon

updtse
need fp8 quant i guess?
Is female streamer, basically cannon at this point
Canari that is
beeeeeeeee
sudo nix-store /tmp/trash
sudo systemctl reboot --firmware-setup
big enub
benub
i havent seen that since grade school
This plays when you are challenging Canari from Pokémon Legend Z-A.
To check out its battle, just click on the video cards ("!")
Main goal of these video concept ideas:
My main goal for these kind of videos is to be as creative as possible!
Let me explain:
Firstly, my very first channel Mixeli (But also my latest one Volipok) post tons of G...

mother always was strict
3/5 passing grade
what is the left one supposed to be?
niuj eiuh n?neurkh
I'm fine
Doesn't loop 
lion
bye
baal
blll
Mornin nerds
Morbing
is it morbing?
Its 6 pm but I was working on Zero until 9 so like, se la vie
"Such is life"
c'est la vie

good morbing sam
good morbing bred
good morbing yuuno
morb
good morbing others I dont have the attention span to morb individually
from the hit film morbius
Its morbin time
Zero heard me say that as a vocal tick (just now) and told me I'd have better luck using my brain as a paperweight.
Zero is an AI i'm working on that will also be on twitch.
But everything built from scratch.
I'm trying my damned hardest to avoid the Neuro fandom raising their eyebrows but it feels inevitable
With packages?? What do you mean exactly
imports
Ye
how come
havent seen eyebrows when you're around
I know I know
maybe theyve been shaving
Might just be me looking into it too hard
But yk, AI vTuber, the whole bit. My long term dream is that maybe one day she'll be cool enough to join Neuro on a stream.
But that is a long road
Right now my current roadblock is reverse engineering ram addresses for a pokemon game. I'm having a bad time.
@fast pagoda 
reference received

You have unlocked new role
try again
the irony
Go again
I didnt even know that's how that worked
the comedic timing was impeccable

;-;
Still have the other stanza to write
the song is stuck in my head now



morb
i dunno why i had this screenshotted but enjoy
:neuroCatUuh:
he's just leik me
whats even the dataset u used
me
that feeling when you fix a three year old function, but you aren't sure if you fixed it correctly or if it was just intended to be wrong.
discord messages iirc
my bad for going looking for it #898005735198306357 message 
that one called me a flat earther several times
Me staring at my sarcasm import loop
what layers did you train echo on
Me? Or someone else
funyun
what is a fun yun?
what is a yun?
funyun rings have been proven to be rich in omega 3
all attn, all mlp layers
in this case
OH i didn't realize echo was an AI, thats so cool

no


overfit as shit as it was r32 a32
this one was part of a parameter sweep basically
just experimenting with them doing runs with shotgun method
π©

I don't want to be a programmer
I sought so hard and prompted this far and now I can't decode-e
All my time I've given her spite
These tokens in my mind keep adding on and on, and on. Yeah
I can only guess...
What's right. Should I keep this run or let it waste away?
Become a slave to all of these AI
It's overengineered, all the slop yet I say it's fine
Prompting overclocked, trying to compile this time
Got myself into this mess
Can I really get this to work, nah
W, H, Y, did I think the agent could
She suffocating with tokens, I can't deny I've tried to code
All this time I've vibed
To be the one to 10x these experts
All my life I've failed
Now all of the accounts to fill the answer
Why, did, I, think I could solve it (solve it)
Try again, Remake reform, just need luck, yeah
Why, did, it, think that was the answer (answer)
New chat, prompt again and again, yeah
I can only guess
What's right? Should I stop or let my account just drain away?
Become a slave to all of these AI
It's overengineered, all the slop yet I say it's fine
(Sec 1)
(Sec 1)
(Sec 2)
was really gaming

om
unless?
you should send that image to your sister if you havent already
🏳️‍🌈
the poor environment

but the server is eating 93.2Mbps rn

This just existed in my notes app and I saw the concept and decided to flesh it out all in one swoop.
The lyrics are probably garbage but it was fun
nop
im trying gemma-4-26B-A4B-it-heretic like someone here told me to
i did
docker run -d \
--name vllm-server \
--device nvidia.com/gpu=0 \
-v ~/.cache/huggingface:/root/.cache/huggingface \
-p 8000:8000 \
--ipc=host \
vllm/vllm-openai:latest \
coder3101/gemma-4-26B-A4B-it-heretic \
--trust-remote-code \
--quantization fp8 \
--gpu-memory-utilization 0.9 \
--max-model-len 4096 \
--max-num-batched-tokens 4096
it's probably thicc
that one is for training
try the intel one
for inference

that's what you're downloading yea the training one will be high precision comparatively
u need 4 bit for inference
aka bigger
the intel one is inference
then lower it to bf8
still 25gb
I look in here and see whatever this is
hmmm
u need 4 bit
you can offload to system ram and wont lose that much
i mean
will be slower
significantly
but should still be fine
not on vllm
afaik
why not?
sure you can i was doing it the other day
ok fair enough
you set a target for how much % of ram you want
i mean, sure i guess, i dont mind eitherway
i mean i am usually offloading when doing it in 4bit because im loading like an 80b or something
this is just to see if it works
ok fair enough
ye then just use offloading
idk how to do that tho
at least load in fp8 with --quantization fp8
i set it to fp8 here
that wont work for gguf just in case you try that with a gguf at some point
vllm is bad at gguf anyways
but they come pre-quant
gguf 
what that
@pluu 
file type for llms
why that?
they come prequantized which means they use less memory for minimal accuracy loss
or just lora on the quantized weights
same thing almost
qat uses hadamard rotations or some shit
which is expensive as hell
Size to speed argument never dies
expensive once doe
but yea i mean qlora is 4bit native
aka meant for 4bit anyways
i think it does 8 too
i mean itll do bf16 but
nerd talk
I'm still of the opinion of trading complexity for speed personally
rainbow
If I could ever get anything useful in the editor
we gotta wish vedal a happy mother's day then
@sage crag heallo mother, the rolls will arrive to whisk you away to the finest old folks home vacation retreat shortly, happy mothers day!!
mothers day is twice as expensive for children of lesbians
but they dont have fathers day
so it balances out i guess
i gave my actual mom a kindle paperwhite or whatever like 2 days ago cuz she's traveling rn anyways
π¦













did not fail


