#programming
1 messages Β· Page 431 of 1
red is just a saturated pink.
flat uni theory
both colors are on the spectrum. don't worry.
pink is not a color
why not?
you did mean the autism spectrum right?
False. PURPLE is not a colour.
Purple is a NOT-GREEN.
literally the electromagnetic spectrum but if the colors were people, they'd be pretty focused
na pink is just diluted red
But at least it exists.
pink is red + white (and white is all the (light) colors mixed together)
Unlike purple.
Leon is a very good example for potential, I've also seen one image comparison of a mage from Hogwarts legacy that's way better than what they used in their video examples
purple is also just pink
it doesnt exist because its not in the bible
yellow doesn't exist really for people. it's just green and red together.
you dont exist because you're not in the bible
checkmate liberal
actually i do, my given name is from the bible, checkmate
wouldnt you like to know weather boy
not going to dox anyone, but joshua is the most common name in the US.
or possibly michael. it depends on the year
(for boys anyway... )
the name just picks whatever it feels like being, every year
but the name doesn't refer to you. checkmate.
not from the us, check your mate 
don't have to be. the name i n the bible doesn't actually refer to you. so your existence isn't proven.
i think most of us aren't from the usa
it refers to me specifically actually, im the only surviving person from the bible
considering the time in the us right now id be suprised
bro is noah
plenty of neets in the usa who dont sleep
oh. the legendary person who heard "there are those among you who will live to see me come again" speech.
they have no name,
also its like morning in the usa
yeah. it's early.
everyones awake already
does your average neurocord chatter wake before 10am
depends if they've been the sleep yet

or me, every one of you is a robot!
i'm currently detached from my real time zone due to things i'm working on.
dead internet theory but everyone is neur
man's in a pocket dimension rn saving time
the real hustler
bouncy lines
davinci is my favorite enough free video editor.
wrr
kdenlive 
losing my mind
I think your the first person to ever say that haha
Doing some sort of wizardry?
its useful when you have bright light (the sun) on your monitor
otherwise I get why resolve uses a dark theme
Ahhh that might actually be useful π
I've had this gripe with other blackmagic tools before
couldn't see the "on" button in the atem audio mixer

I'll keep it off, thank you very much.
How is this kid not traumatized? Lmaooo
like for reference
the software looks like this
and I was on an old lcd screen in the middle of a field
What are you working on? dangg
that's just an example screenshot from blackmagic lol
that is audio silly

some idiot (me) forgot that microphones need constant attention otherwise you have a lot of work to do in post
NOT podcast
podcast 
there can only be one dave
just saw dlss5 and i'm astonished how detached these companies have become from reality
they're so proud of the destruction of artistic intent it's wild
it's so incredibly bad i don't even have words to express it
chat, should I upgrade my router?
ive been using the same router since 2011 so...
should I?
cant believe ts outlived my TV lmaooo
also depends on your internet connection I guess
wpa2 in 2k26
even on wifi ts lowkey cursed
if you have practically no local traffic then you don't need a fast local network 
so if you can saturate your internet connection it's good enoughβ’
literally sitting less than the width of my hand from the computer.
but it's also plugged in at Gbit.
yay comcast equipment
try Cloudflare speed test mabe https://speed.cloudflare.com/
it'll also show packet loss and latency
uhhh is dis gud?
yeah that seems pretty good
latency is high but I think that's just because of the speed test works
but no packet loss, even on WiFi 

packet loss 
lol it really just says >1Gbit if you go above that
moved phone by 30cm
seems to be pretty much the same result

the cli speedtests suck
how can it say my internet is unsuitable when this is the fastest my internet has ever been

theyre daring you to be beter


speedtest++ is fairly accurate
speedtest-cli uploads liek 2 mb so it's super random values
mobile data NOT faster
they have fibre to the box and fibre to the property, i have vdsl

froge
rice very neutral
im about to rip someone's head off at verizon
i tried putting mitarashi dango glaze (made of soy sauce and sugar) on bread
it tasted like
soy sauce
but on dango it tasted like
sweet soy sauce


I can see my mobile data tower 
I figured out why the result was bad 
mobile ISP contract renewed
would post picture but even google ai can doxx me from the building

armed and ready dt pass b rank
sometimes I google stuff and get hyper specific results even in incognito so i question its accuracy
but
do not want to find out
verizon decided there's no way anyone could ever want to so you can't disable 5g band to just use 4g
what do you call 64-bit x86
6
14
2
x86_64
1122595147524341830
eliv
and the 5g tower they have near me has some sort of known issue
for months
so i just have no data half the time
do you not get that option in your phone settings?
i forget the us does that
m
we have some preinstalled apps in russia
in our case its mostly just the government wanting to prop up local businesses
candy crush?
oh yeah king software very niche local dev
when i called verizon about the simply non functional nature of their 5g tower here
they were like
yeah known issue
we'll give you a discount
but
they did give me a discount
$10/month
in order to do so
they removeed the $20/month off discount i already had
so they increased my bill by $10/month as a treat

and the service still doesn't work a month later 
wish i could just tell people "workin on it" for months at work
you can. for about 3 months usually.
i have the same question with my ram
8gb idle
like what is windows doing
how much total?
16
its eating half my ram
my fedora takes up only 1
and its bloated
are they just genuinely throwing updates at the wall with an llm and seeing what passes production tests
i mean
do a task that uses a bunch of memory and then kill that task
see how much memory you're actually using
I've actually done that test on a laptop with 8GB RAM
The result was that it was still using like 1.8GB after
On Win10
After I had debloated it a bunch
(the test was running 2000 instances of minesweeper and then killing them)
its showing ghost values for some reason
when I go through theres not a lot of apps using memory but the overall usage is 8
did I get a virus lmfao
pagefile on ssd
what
are we just making shit up these days
oh wait no I do in fact have pagefile: yes
idk where my pagefile even is
apparently I have a 3GB pagefile
somewhere
the image says there is a pagefile on C:
Windows doesn't tell you about all of its memory using components
not that it got used
crazy how the process list doesn't show memory not being used by processes
Windows do be silly like that
Linux does the exact same thing 
What a cute cat
im going to build my own zilog z80 cpu running cp/m and ooga booga like the old times
free of any bloat as I will rewrite the os in rust

running rust compiler on z80 cpu
tall people have bigger eyes, so more light enters their eyes, so they can see better in the dark

but the image is less sharp
so short people see in 4K
the reason tall people can see better in the dark is because their eyes can pick up longer wavelengths.
like infrared.
that shouldnt matter since the cones in the eyes pick up very litle to no infrared
if the person was larger in every way (including at the atomic level) it would work
larger antennae can pick up longer wavelengths.
the optin in the cones in your eye would respond to lower frequency light coz they're bigger now
makes as much sense as saying tall people can see better in the dark.
humans only see 380nm to 750nm
thats cuz IR doesn't have enough energy to trigger the chemical reation in our cones.
i was being silly. short people don't have higher resolution eyes either.
height doesnt upscale the size of your cones

cell size remain uniform across height
i think they're doing a hypothetical
and larger eyes would increase sharpness too
i guess if the electrons were further from the nucleus, but without increasing the size of either so the force keeping them together doesn't scale. then it would be able to see IR
it does
if you're over 3 meters tall you can see microwaves

depends on the density of photoreceptors rather than the surface area
lens size does have an impact on visual clarity though
only 1 in N people can see this object
and having more space for the focus to be adjusted allows better range of distances, though you'd probably be long sighted compared to a regular sized human
the larger the aperture, the higher the resolving power.
due to the minimum adjustment being larger
the reason im going on this dumb tangent is cuz i was thinking of lenses ye
i remember reading that Legolas in lord of the rings would need eyes like a meter across to be able to see as far as he did in the story
magic lenses floating in front of them that focus it all down to their eyes
that'd work
coz magic
like binoculars
but magic
elfs (elves) may have completely different internal anatomy to humans
maybe they have four stomachs like cows
elfs (elves) 
Or theyβre more magic than physical and thus donβt give a shit about the limitations
made up word linguistics debate
elfs elves
say one and someone will cry
https://www.youtube.com/watch?v=Rk2izv-c_ts <= was this one
There's a famous scene in the Lord of the Rings where the elf Legolas claims to be able to count the exact number of horsemen 5 leagues away, and on top of that can tell that their leader is very tall. But even with the most perfect eyes possible, would anyone be able to see that far?
Created by Henry Reich
Thanks to Nima Doroud for contributio...
elves might have a mirror at the back of their head, and the eyes extend into the body, allowing them to be 1m long


what about tolkien?
so the solution is "Elves don't exist, so they can do anything they want."
Tolkien strongly preferred the plural elves (and dwarves/dwarvish) over "elfs," correcting editors to maintain this usage.
huh
the elves themselves spoke Elvish and not English, so didn't give their opinion.
the only thing i can really blame about the lord of the rings trilogy is.
they used ballistas to attack a city, but that city use trebuchets to defend
it should have been the other way around
my main fault was the trilogy takes longer to watch than the flight from Los Angeles to Auckland. definitely too long.
https://en.wikipedia.org/wiki/Elvish_languages_of_Middle-earth
there are a few
i bingewatched the whole thing a month or so ago
and the pacing is actually pretty good
just a long story
i was actually pretty sick of all the media about it (and am originally from NZ so it was x 10 over there) and have only watched the 1st one.
i've read the hobbit. i think it took less time to read than the movie.
like, i read it in a weekend while i was visiting a friend
without being rude
i didnt read any of the books.
i was supposed to for a book report tho. but i ended up winging it
the hobbit is a children's book

its still a good book, but its very short
i know ;] it's quite good though. i only read it coz i saw it was so small
i like books short

like i said, i was visiting someone
the hobbit is 95000 words apparently
was it really?
novels are usually anywhere in the range of 70000 to 120000 words
that seems long for what i remember
i read the first chapter, and then made the assignment with help form chatgpt
to get an idea on how the story goes, i wrote most of it myself
how many partial degrees do you think chatgpt has earned since they switch it on?
a couple million
thats more than 95K
damn
oh
no i was reading the wrong one
specifically the hobit is 95k ye
wow. i must've been ploughing though it. then again, i have no idea how many pages or words books have these days as i use a kindle
i lose focus after about 4000 words
according to this chart
average reading fluency
but 350 seems a bit low for an avid reader
unsure, my reading speed is around the 400wpm range though, which is why i brought up the 4 hours
if i'm focused i can read as fast as my eyes move.
eye movement is naturally the bottleneck for reading speed
there's a trick where you move your finger down the page and you can read faster, but that only works for about 2 pages for me before i start zoning out
moving eyes between chunks reduces eye strain and improves speed
and i like to read dialog "out loud" in my head
rather than continuously moving the eyes
i read at whatever speed is comfortable to absorb all the information
places with lots of dialog are slower usually... depending on how they speak. if they're being direct and to the point it's probably faster than if they're being all clever.
unless im bored and then i start skimming between major events
but that's unusual
when i'm reading quickly the danger is i get three pages on and realize i have just been thinking about lunch for the last 3 pages
blinking is the next bottleneck
this random reading test says i have 391wpm
i unfocus very easily though
really the danger is when I read a particularly clever line and I'm just repeating it to myself and marvelling while moving my eyes over the words.
me
me: squirrel
non-skimmed
but easy article
you could guess these questions easily even if you skimmed the text
i wouldnt be surprised if your peak reading speed was higher
though i suspect most of the difference comes from better focus
i think the trick would be to write the article using completely ficticious groups with counterintuitive properties.
like the logic problem "If all things that are smoked are good for you, and cigarettes are smoked, are cigarettes good for you?" ... the answer is "Yes." (even though the premises is false, if it was true, then cigarettes would be good for you. i just asked about that set of facts in that specific (not true) state.
cigarettes are not good for you, but "IF all things that were smoked were good for you" THEN it would be true. even though that is not true.
but language doesnt exist in a vacuum
the logic was what is being questioned, not the facts. the reading test also was a test of background knowledge
bred get moldy if not vacuum seal ye
in space nobody can hear you wrrr
language doesn't exist in a vacuum because sound needs a medium to travel through.
i guess sign language could though. so language does exist in a vacuum. (deliberately misinterpreting the meaning of "exist in a vacuum" for humorous purposes)
the roomba uses a protocol to define what mode it should be in and how it should behave in various situations. this is a lanaguage and the protocol specifies what is a valid "sentence". so language does exist in a vacuum.
i exist in a false vacuum
iirc if you touch them you can hear them
transferring vibrations from one to another
thats one way to put it
today is a sad day. a Computerfile video was titled "Writing a guitar sound in C". They lied. It was writing a guitar processing effect in C. And (at least the initial one) was just an overdrive effect.
writing a gutiar sound would've been much more interesting... simulating the strings.
what is the difference?
i kinda gave up on computerphile because 90% of their videos became about ai in recent years
it's just boring seeing the same topic over and over again
for an effect you still need a guitar.
it's saying "for this array, perform this function on all the samples"
which isn't very interesting until you get to resonant filters
if you were simulating the strings maybe, but in practice even if they did the guitar sound actually, it's just sine waves
aint nobody simulating the actual strings
could tie back to their FFT videos maybe
yeah. i've played with an open sourced/public domain VST guiitar synthesizer before to make it so it would do electric guitar sounds instead of acoustic. it used simulated strings.
i wrote some fun sounding (but extremely cheesy) guitar solos that even did some effects like slides and some harmonics with it, but it took a hell of a lot of processor. now you could probably push most of that to gpu
i wonder, daws are notoriously cpu heavy because of the dependent calculations you need for the whole simulation
not sure if a gpu would do a good job, it's almost never been used afaik
if you're only doing the generation of one instrument with it it would probably be fine, especially if you know the latency... assuming you're not using it live
for live, get a real guitarist ;]
duh :V
but like in general, i dunno if a gpu would help much, audio sim isn't really a parallel task
it being a vector op doesn't mean much, what about the dependencies?
basically they chop the string into sections and simulate each section. the only dependencies other than the position of the guitar pick are the two sections on either side
hmm, maybe i'm imagining what the string sim does wrong, got a visual example for it?
page doesn't seem to say much, example needs a whole ass account
there is a version.
so it basically makes an array of delay lines and uses those to hold the string position, bouncing the signal off the end of the string and/or where it is pressed. because it allows you do choose where you strum the strings and how much and what angle you put the pick on, you can make realistic sounds with it
most of what I did was removing parts coz i wanted a rougher sound. ;/
when it comes to simulating real sound this comes to mind as the perfect example https://www.engine-sim.parts/
Wishlist Engine Simulator on Steam!
https://store.steampowered.com/app/2381500/Engine_Simulator/
Quite possibly the dumbest project I've ever undertaken, this was an odyssey but well worth it in the end. It's hard to fully explain the depth of the rabbit hole that I was led down and I ended up having to study things like fluid dynamics, physics...
yeah same kind of "physically based rendering" but for sound idea
the pipes like exhausts in this simulation are delay lines like the strings except obviously they have different rules for how they work
noway screeps update

es2026 screeps? 
from 10 to 24 is a big ass jump
Awww yeahhh! Ya boy just got an employment offer!
I've been partially / fully unemployed for just under 2 years now
jobs are overrated.
i mean, except for the money part.
and if you're lucky, the access to cool toys part.
ok ngl some jobs kick ass
3/5 better than the alternative
how do yall survive without employment
I dont know much about adulthood
but isnt money kinda required
if you live anywhere other than America - Government benefits
If you live in America - idk bro good luck I guess?
ah american here that makes sense
we are born into the world with more debt than we will ever have wealth
Staying with my parents here, thank goodness they are so kind
i dont have to pay rent, food, electricity, nor any other bills
also gov benefits i guess
altho most of it goes into cool tech
working on my presentation while listening to neuro21 radio station and money machine comes up
my focus was never higher for those 3 minutes
I sold my home and moved in with my parents lol

Okay guys im very confused. Why is FP16 so much slower on this (or probably other pascal cards aswell) P40?
180Gflops fp16 vs 11tflops fp32? Why not just run the models in fp32 then? Vram useage id guess, but cant we at runtime trade some computation per layer to cast the values between fp16 and 32? Perhaps cast the next layer while the current one runs?
https://www.techpowerup.com/gpu-specs/tesla-p40.c2878
That is tech called AMP, and is normally used to go from f32 to f16
Well f16 is supposedly slower here.. can amp do the opposite? And how much does it matter for Q4/Q5 gguf models?
No native FP16 presumably
I guess so too but why not just use fp32 then?
In the case of my 3090 it has 1:1 FP32 and FP16 because it emulates FP16 or something
Memory
Surely casting between fp16 for the inactive ones and fp32 for the active and next one cant be that bad?
Get out your notes on Neurosynth
This must be the stupidest thing to have been put in a Systemd service
I hate Python environment management
I mean, doesnt gguf cast int(whatever) to fp16/fp32 anyway?
When it comes to compute itself, usually if you want to do high throughput you want to stick to the same dtype as what is in memory
In inference you can get away with the compute being worse due to having to do casts because the mem bandwidth is the limiting factor anyway
So.. it sounds like that casting between fp32/fp16 sounds like the thing to do then
Or ig Int4/5 to fp32 instead of 16 or whatever
If you really want to do inference with FP16 weights, yes
If you're training though, better get an FP16 capable card
Just interested in cheap ways of running models
The P40 is certainly a cheap VRAM thing
And kinda seems insane to me that pascal seems so capable but just not used at all due to bad fp16
Last time I ran a model on one of my p40s it was about 2-3 years ago
I find something about this extremely cursed
The ./ Seems bad to me.. idk tho
Just fell out of interest and now I don't have a setup to use them in
The service sets the working directory before that script runs
Ah
I mean.. 48gb for 500β¬? Hows that not interesting?
48GB with so little compute it's useless and I'd rather use that 500β¬ on a 3090
I can barely get a 3090 for 750β¬
I got my 3090 for 500 β¬
U lucky
Nah I just asked the triangle to find one
The triangle?

Oh ig
I ended up getting 96gb 700 euro give or take.
Thats two triangles tho
price of a single 3090
Yea
prices have risen since the ram crisis. there was a good week or so i could get them for 400
so I just traded in compute for memory
Im mostly interested in models like qwen3.5 35b a3b.. so compute aint a biggie
I have my 3060m for training, smol models sure but I still consider larger models to be a skill issue.
The story behind this:
For whatever stupid reason, today Streamlink decided it would be a good idea to bork
So then I had to go and first figure out what the issue could be, then I ended up recompiling my stream capturer wrapper, and even then it still didn't work
Then I got it working in my regular terminal but not as the Systemd service
So then I found out that the stupid Streamlink in the service was the wrong version
So then I went to try and upgrade it, but Python is stupid and I could not figure out where in the name of Neuro-sama did I install it originally
So I ended up having to make a virtual environment dedicated to the program specifically
The 3090 could handle that fine
At least it happened today and not tomorrow
That would have been baaaaadddddd
Well, for 750β¬ vs 2x250β¬ .. the p40 is considerably cheaper
Mabe with your prices
And would handle more context in theory
With high context comes high compute need
I mean yea i dont see any decent prices in germany
Ah of course, Germany, the prices there are messed up
extremely cheap rent
True, TTFT increases, but im happy with like 100k tokens and willing to wait like half a minute or something
RTX 5000 for 785 popped up locally
apparently the P40 does have native fp16, it's just horrible because there aren't many fp16 cores on the die
in the Pascal generation only the GP100 (in the Tesla P100) has the newer fp32 cores that support fp16 by running at half width
for the other chips there are still some fp16 cores to have any fp16 support at all, but only a few to avoid wasting space
Yea. Its awful.. i was happy to get a mined on rx6700xt about 2-3 years for 205β¬ .. still going strong.
Meanwhile me running the higher quality 27B model at like 37 t/s with more CTX than that
superbox stop flexing 
Lemme just..
i got scammed out of 428 bucks cuz you wanted another 3090
I'm just doing my normal usual sillying
Hem
Hopefully you can get those back
thanks for clarifying 
I may also be stupid as well who knows
Can't even get a stupid Systemd service to work without a cursed thing
This is sorted cheapest first..
i can get them from like 700 on a good day
worth it
But, according to them the card is hella loud and runs insanely hot (apparently some crappy palit model)
if you dont want it ill buy it from you
Repasted and repadded
Re-paste and it's good
It is repasted
Hem
Ill be allowed to play with it but im not too keen on those 16gb vram
16GB is kinda fine
I used to train vocal synthesizers on 12GB
Especially in combination with my 12gb rx6700xt and my (poor) 700W psu lmao

That is indeed a good deal for a 4080
If you buy that 4080 can I buy your old 3090?
4080 has a decently higher gaming performance than 3090
no
Aw
i need a 2nd gpu for the theatre room pc
Yea, and normally id take it immediately but its not like i need it, and since its a friends deal idk if id feel fine selling it and stuff
ia lready have a pc with a cpu, but no gpu
4080 is 33% faster than 3090 according to this performance chart

Hem
I see
Though an Nvidia GPU will mean it'll idle at at least 20W no matter what
I mean tbh probably would be fine selling it.. but at the same time, id like to run bigger llms.. how does vulkan deal with an amd card and a nvidia card at the same time?
If you were in Germany you'd care, with them having the world's highest electricity prices
i program in vulkan
so
not great
(I know from my friend Xoda)
My server is tuned to sip 28W and i dont run it 24/7 LMAO
also, why you using vulkan for llms?
Rx6700xt no likee rocm
Nvidia drivers are funny, no matter what you do there will be a default power consumption of 20W/GPU
And also, if i have the rx6700xt + rtx4080 in one system, don't i have to use vulkan?
the graphics api?
Yea
for llms?
Vulkan is also used for compute
On my rtx2060 laptop vulkan vs cuda is like 2t/s difference (at a scale of 30t/s)
you're just running the same code as a shader
llama.cpp for example has Vulkan compute capabilities
why can i not shrink it like this 
RE::TESContainer* container2 = ((RE::TESContainer*)((RE::TESObjectCONT*)((RE::TESForm*)((RE::TESObjectREFR*)((RE::TESForm*)a_ref))->data.objectReference)));
RE::TESContainer* container3 = (RE::TESContainer*)(a_ref->data.objectReference);
visual studio does 1st line and it works
i thougth these casts are excessive and did 2nd line
but its different
if vulkan is used for ai the same way it is used for gamedev. then it will probably run fine with both, but you need to have it work on different things, or it will need a copy of the data on each gpu
Presumably if you want to use both GPUs
Or mabe if llama.cpp lets you compile both GGML_CUDA_ON and whatever AMD needs...
Who knows
I dont particularly want tensor.. what was it called? Im fine with layered offloading, since aslong as it stays in vram, its comfortably fast (30t/s) with my rx6700xt and id assume the rtx4080 will have 0 trouble keeping up
It will definitely be speed limited by the slower GPU
I also was looking at the intel gpus, but idk.. >1000β¬ for 48gb vram of vulkan only seems iffy
Yea
The Intel consumer series isn't that bad price wise though?
12GB GPUs you can stack a bunch of for cheap
Yeah, consumer only sadly really goes up to 12-16gb tho..
Tho im curious what intels are you talking about?
The B580 is really good value wise
Which is why im not too stoked about it
Even if its hella value
If one doesn't want a 4080, one sells the 4080 to someone who wants the 4080
id gladly trade you a RX 7900 XT for a 4080
If only they werent prone to die, thats a hella offer.. if shipping doesn't kill the deal
That seems like a great trade in this situation
24GB for 16GB
AMD so it works with your existing GPU too
Is 4 tbβs enough storage.
If Belgium to Finland is cheap, Belgium to Germany gotta be even cheaper with no lake in the way
the XT is only 20, but i can probably get my hands on a XTX too
No
Wa
Since when?
to germany i would just save costs and meet in the middle
Dang need more
since always
Huh
I did not know that
Well, if that could work out, that would be hella cool.. ill keep it on mind and keep you updated

electricity in germany is expensive coz they closed all their nuclear coz they panicked after Fukushima iirc
Now next issue: will my 700W bequiet system power 9 blow up if i hook both cards up lmao
More or less, tho i think Chernobyl was the turning point or something..
Is this programming π
Everything is code at this point
im ngl, the rx 7900 xtx seems to be a lot better than the 4080 in everything besides RT
this is a terrible trade
355 + 230W is a bit tight
except the 7900 xtx is cheaper
Im not even gonna lie, i expected that, but i haven't kept up to date at all
Seems like in this performance chart I'm looking at it's exactly equal
Plus a ryzen 7 3700x xD
Yeaaah you may want to invest in a PSU upgrade
I mean the 7000 series did target the 4080
I mean rn im running a 5 2600x.. but that aint better
I dont mind undervolting and wattage limiting but yea
Its.. tight
So basically with a 7900 XTX you'd be trading a 4080 for a 4080 level GPU that works more efficiently and better with Linux and has more VRAM
Yup, giving up cuda, gaining vram, "gaining" actual rocm
And "good" raytracing.. which i couldn't care for at all
I specifically need CUDA because NeuroSynth runs only on either CPU or CUDA on Linux
On Windows it'd run on more but I don't touch slop OS any more
Only bummer is no dlss but.. oh well, i cant even max out my rx6700xt in games
Have you tried zluda and the other thingy?
I haven't yet and i wonder how it performs
I do not have any AMD GPUs that I could test on
Ah fair enough
If I had AMD GPUs I could probably write a patch for NeuroSynth to run on that too
storage isnt an issue
Holy a Quadro 48GB??
Its very tempting to throw out my money at a DGX but i know better.. and i need it for a driving license lmao
i know a guy with one, it sucks
Oh?
according to them
It seems like a nice machine to run 70b models on
yeah. don't get a driving license ;P
If you mean that tiny thingy with 128GB of slow unified memory, it's not gonna be very nice
except for going places, what use is a car anyway?
Yeah not 4k⬠nice
Job, to buy more car
To do more job or something
Idk
I'd rather take 4K⬠worth of 3090s than that thing
Yea
If that thing was like 1k⬠it would be a different story but like
supposedly thermal throttles so only reaches half of its tflops. the gpu is ass, and the software support is abysmal
arm stuff
Just casually loads up a 200B model on that
Well, 96 or so for me but sure
Nothing a bit of modding and a dremel dont fix ^^
I wouldn't hesitate modding it lmao
Or alternatively NeuroSynth with 4096 batch size
oh yeah nvidia's runs on the hotter side, though putting it on the side apparently helps a lot
In 8-way parallelism
laptop cooler
I mean yea.. the cooler slot is wide, has input and exit next to each other iirc.. so putting it on its side and letting it rise at the back makes sense
well yeah what did you expect
That thing looks like a lunch box, not a high power computer
and where is big fan going to go
Just waterblock it
Honestly.. how bad would this be
good luck
Well the cooling would be like way bigger than the computer itself so you'd basically just be redesigning the whole thing
What's the specs on that?
Hm, I see
beefed up 2080ti
Looks like mem is 2/3rds 3090 speed
And the compute is not that far behind
half the tflops of a 3090 too supposedly
They start at 700β¬ here lmao
Oh you're looking at FP32 I guess
3090 does have nearly double the FP32
But in FP16 they're like 3TFLOPS apart
Does lamacpp and such care about fp16 or fp32 when running quantizisied models?
I'd assume it uses FP16 if the card supports it
it supports both iirc
And no way to force fp32 with on the fly casting i assume (is it really that costly?)
fp16 should be the standart mode???
Titan RTX doesn't seem that bad
look
some parts of the model are still fp16 or fp32 even when quantized (as far as i understand it)
its 475, 500 with shipping, and i need to recoup some of the money i lost on that 3090
so ill do 550 for it
Hmm
preferably more if you can spare some change 
"used in a research environment" supposedly
I assume it's the base model cooler?
ye
Are you capable of visiting its location to validate that it's real?
Or is the seller reputable looking?
yes, but it is quite far
shipping shouldnt be an issue, but ill sjut be less lenient about the payment
Hmmmmmmmm
the seller hasd been on this website for 18 years, so they're decently reputable 
Yeah I think I can trust the seller this time at least
oh actually, the highest bidder is at 500
That I definitely can't afford
I was about to say that aint bad, but then it was "bad"
You could probably sell the RAM for like 2K
Thats good for moe
Wouldn't be out of the ordinary
- 2 x Processor Xeon E5-2683 V4 (FCLGA2011-3)
- Total 32 cores / 64 threads
- Memory 128GB DDR4 ECC RDIMM 2133Mhz
- ASUS GeForce GTX 1080 8GB (for the video outputs)
- 2 x Nvidia Tesla P40 24GB ECC (Total 48GB)
- Total 56GB NVRAM (slightly less by ECC of the P40s)
- Total 10240 CUDA Cores (for comparison Nvidia GeForce RTX 3090 there has 10496 but much less VRAM)
- Tailor-made cooling system for the Tesla cards, high air pressure fan with shrout, but less noise pollution than 40mm server fans. Under full load, the P40 cards remain neatly below 70Β°, the GTX at 88Β°.
- 250GB Sata SSD Windows 11
- 480GB Sata SSD Debian Linux
- Currently not dual boot, you need to physically swap the SSD.
- SilverStone SST-FS202B for two 2.5" SAS/SATA drives.
- LM Studio, Ollama etc.
- Power supply PSU 1450 Watts (ample sufficient)
That storage is so tiny
the storage is upgradeable
For storage id have tons of semi bad hdds lmao
id jsut pop a 16tb hdd in it and be good to go
"just pop a 16tb in it" this hurts..
why?
The 16tb hdd is like
250 euro
HOW
are there any 16 TB HDDs in the 2.5" form factor
I kinda wonder
What's your HDD pricing like these days?
I have a lack of HDDs
Doubt it
probably not
the big ones are still close to new
iggly
1.2K
not a bad deal
I have 1k in my account 
adding transparent restarts for my agent. i shouldn't have to tell them I'm shutting them down coz it'll be transparent when it starts up coz they'll just reload their context.
128GB DDR4 ECC RDIMM 2133Mhz

Well anyway
I am uncertain if I would have the capability to afford the RTX Titan
It would be kinda nice if I could, not sure
5k here but at MOST 1k available to spend.. driving license
ye, the biggest one I can find quickly is 5 TB
so you're not getting to 16 TB with those 2 2.5" bays 
ye that ram alone is already 1.2K
thats fine, ill put it sideways
or jsut put it somewhere inside
Keep an eye on that RTX Titan I guess
I'll see if I suddenly get extra money from somewhere
the inside of these tends to be quite full already 
but I guess worst case you could just... remove the drive bays 
Usb connected sata adapter: hehe
(For the love of god, do not)
Although.. a pcie slot is free.. you could do usbc..
i love how this has a 8 bit display thats coupled ot he outside of the gpu and not the actual gpu temp
they put the gtx1080 inbetween the p40s 
big fan
it'll be fineeeeee 
this might be not horribly noisy
any buyers in chat? 1.275 euro + triangle fee
#ad
all jokes aside, not a bad deal, but i cant afford to
What can one do with 6,5 p106-100 btw
I have em, dont use em..
(essentially mining 1060 6gb)
6gb of vram, not much i guess
6.5?
Yea one's fan had to believe for it.. its fanless

so 7 x 6gb
Well
42gb of vram
8x i do have a 1060 6gb
48gb of vram
I also have a 1650.. and a 1050 (latter one im not selling/giving up/using for sure, first gpu, its my baby)
that sounds horrifying but i guess if you already have the hardware
you could use it for ai if you want. more vram than that 4080. just a lot slower

is it faster than CPU?
1 token per 1-3 business days?
48GB of horribly inefficient to use VRAM
depends on what cpu
if you have AVX512?
then probably not if i had to guess
Also, also, poor pcie bus lol
Nah it'll be a bit faster than that
It'll go at the speed of one of those GPUs
Thats not bad..
A 1050ti can do 30t/s in qwen3.5 2b...
Ah, so it has to wait for the last gpu to start the next token?
hmmm.. i have a 1650 sitting in a box
Yeah it has to wait for the final GPU in the chain to generate the token before it can begin predicting the next one
that could do my TTS
That would mean.. 8x slowdown
No it would mean 1x speed
No speedup
You can load a much bigger model, but it won't go any faster than if it was run on a single one of those GPUs
Why so? If every gpu needs like, idk, 20ms for their own step?
it's the communicating intermediate steps between the GPUs i think
Thats fine if they are as fast as a model that fits into a singular vram?
Wouldn't it be 160ms per token then, slowing it 8x?
if you keep the model on one GPU it'd be fast.
If you load a big model, it'll be spread out so that each GPU has different layers
Each layer still operates at the same speed, so it's essentially as if the GPUs were a single GPU with the core and bandwidth of a single one but the VRAM capacity of all of them
You only pay in latency between GPUs
Yes but since you have to wait for each layer to finish before you can do the next layer, and have to wait till the last layer is done before feeding the first layer again...
the trick is to use multiple models and spread them across the GPUs in such a way as you tend to be using all of the GPUs efficiently
That's how inference works in a single GPU operation too
Just that it happens on a single device
You pay the calculation time between the first and last gpu, no?
This is true, however most single models only give you MoE which sucks
Yeah, but since you parse only a few layers at a time, it would still be slower than a model that fits into just one gpu, no?
Definitely out of my budget unfortunately
U cant streamline/pipeline it, no?
Aka feed the first gpu before the last gpu has finished the current token
There's no pipelining in inference though, you need to compute the previous layer before you can start the next one
this is only true for batch size 1 btw 
if you have a batch of multiple requests you can parallelize
I'm currently cycling between inference tasks, checking if each one is running and if so stepping it and going to the next one. two GPUs would make that code much easier one dedicated GPU to the main task and everything else on the 2nd
You can't have multiple tokens in-flight
(not the ada variant) its the other one
That is a true exception
Yea so each of the computing time of the previous token per gpu gets added.. its a slowdown
Turing?
they didnt mention
lovely, this patcher loss is so low that I need to be able to see more decimal places on the loss to make sure it isn't walling.
just "not ada"
TU102
it's a slowdown yes, because each gpu ends up sending their output to the next gpu, which you don't need to do on a single gpu 
I don't get what you're saying
there are ways of working around it to an extent by doing batching and async shenanigans so the hardware doesn't stall and do nothing
That's latency, idle time instead of compute time
same end result, it's slower 
the slowdown part is the inter-device communication overhead as well as the difference in speed of the device
Ig my miscommunication was a bigger model has more layers which means more compute time, while you assumed running the same amount of layers on a single gpu?
While i meant less layers on a single gpu
Ooooooo
Yes of course a bigger model means slower
also it's kind of obvious that if you put 1/2 the layers on one GPU and 1/2 on the other it would be slower than putting them all on the faster GPU.
Yea.. would be cool if you could start feeding the first gpu before the last gpu finished but thats not possible
With a bigger model you need to transfer more data from VRAM
Yeah unfortunately not because the tokens are a sequence
The previous token is needed to predict the next token
can the bubble pop so i don't have to do ML stuff 
not yet
The bubble can wait a little longer so I can have a bit more money to work with to then grab some cheap GPU
Ig multiple prompts works for well many people but yea not exactly what im going for.. not many people will want to run an 48gb ai at 5t/s (if im lucky) soo.. not worth investing into a mainboard for.. i might get my hands on the mining rig theyre out tho, idk
i'm pretty sure the AI coding genie won't go back into the lamp though.
tesla m40 24gb for 200
i feel like the m40 isnt worth it tho
me staring at the loss terminal output as loss is starting to go under 5 decimal places

If you think about it, claude code and such are just slotmachines..
that's what %g is for. scientific notation.
(or %e)
%.20f my beloved
I'm just at %.10f atm
βββββββββΌββββββββββββββββββββββββββββββββββββββββββββββββ€
βg β Prints a signed value in either f or e form, β
β β based on the given value and precision---an β
β β exponent less than -4 or greater than the β
β β precision selects the e form. Trailing zeros β
β β and the decimal point are printed only if β
β β necessary; takes a double. β
βββββββββΌββββββββββββββββββββββββββββββββββββββββββββββββ€
doesn't that sound more sensible?
it's nice knowing if it's 1e-9 or 1e-10 at a glance i think.
Fair, but math
funni numbers go brrr
Exactly
log scales are boring
i don't wanna use them but then you can't see anything 
(suffering from success)
0.00000000000000000000 isn't the number you thnk it is though
Whats life for if not seeing a ungodly amounts of 0's with every print
Im making too many typos.. i should sleep
I'd consider it once I'm that low
ungodly numbers of digits should only be used for exploits in CTFs
"print pi, the right values will be in there eventually, the grep will catch that"
tbh you should just print in binary
@nocturne olive
if adding a 2nd gpu is possibel, what stops you from getting a shitton of 8gb cards?
there is a similar method of finding primes using a regex.
sam have you considered there's this thing called physical space















