#programming | Neuro-sama Headquarters | Page 436

cobalt sigil Mar 22, 2026, 2:14 AM

#

I remember

true hemlock Mar 22, 2026, 2:15 AM

#

InabaBird

cobalt sigil Mar 22, 2026, 2:15 AM

#

I think it's like 2-3 weeks ago ppl all over Chinese use that AI to like order maituan (like door dash)
since they are like doing a promo, if you order the food via the ai, you can get it for free

#

which kinda makes the server workload pretty high

true hemlock Mar 22, 2026, 2:16 AM

#

COCKA

olive sable Mar 22, 2026, 2:16 AM

#

meow

cobalt sigil Mar 22, 2026, 2:17 AM

#

_"The AI battle during the Spring Festival is heating up! Early on the morning of February 6, the "Spring Festival 3 Billion Yuan Free Drink Giveaway" campaign officially launched on the Qianwen app, kicking off a "bubble tea offensive" and inviting people across the country to order free bubble tea with a single AI-generated phrase. During the event, each participant can claim 21 no-minimum-spend 25-yuan free drink vouchers, totaling 525 yuan.

A representative from the Qianwen app stated, “Through this Spring Festival giveaway campaign, we hope to invite people across the country to experience a brand-new lifestyle in the AI era and integrate AI into their real-life consumption habits.”_ translated via deepL, source
-# https://www.hk01.com/財經快訊/60319797/千問app啟30億免單-ai一句話免費點奶茶-每人可領525元免單卡

香港01

千問APP啟30億免單︰AI一句話免費點奶茶　每人可領...

春節AI大戰殺瘋了！2月6日一早，千問APP「春節30億大免單」正式上線，發動奶茶攻勢，邀請全國人民用AI一句話免費點奶茶。活動期間，每個人可以領取

true hemlock Mar 22, 2026, 2:18 AM

#

hollow matrix Mar 22, 2026, 2:26 AM

#

neurOMEGALUL

sick owl Mar 22, 2026, 2:28 AM

#

On further testing the 27B Qwen 3.5 drops off severely below Q6

#

Even Q5 is prone to infinite loops

jagged turtle Mar 22, 2026, 2:28 AM

#

hollow matrix <:neurOMEGALUL:1097297318119743638>

still makes me lmao

hollow matrix Mar 22, 2026, 2:29 AM

#

jagged turtle still makes me lmao

You just love to see it neurOMEGALUL

sick owl Mar 22, 2026, 2:34 AM

#

sick owl On further testing the 27B Qwen 3.5 drops off severely below Q6

If I had to guess I'd assume the unique architecture they've gone with is just extra sensitive to accuracy loss

fast pagoda Mar 22, 2026, 2:39 AM

#

sick owl On further testing the 27B Qwen 3.5 drops off severely below Q6

q4_k_m does it with thinking on but off it's fine

#

makes sense as errors pile up during the thinking portion

sick owl Mar 22, 2026, 2:40 AM

#

I'd argue the performance is probably still severely degraded with thinking off

#

Its just not as obvious

fast pagoda Mar 22, 2026, 2:40 AM

#

it's worse definitely

#

it's just not

#

infinity loop bad

#

q6 is the first usable quant it seems

sick owl Mar 22, 2026, 2:41 AM

#

That aligns with my testing yeah

true hemlock Mar 22, 2026, 2:47 AM

#

didn't the frog tested 35B A3B

#

idk

mighty thorn Mar 22, 2026, 2:50 AM

#

i have successfully finetuned an LLM without backpropogation

true hemlock Mar 22, 2026, 2:50 AM

#

have anyone tested 27B vs 35B A3B

#

moe slop idk

fast pagoda Mar 22, 2026, 3:05 AM

#

35b a3b is ass

#

well

#

it's not ass

#

it's just worse than 27

#

damn

#

he mad

quartz scaffold Mar 22, 2026, 3:15 AM

#

Sorry missed send

tender river Mar 22, 2026, 3:18 AM

#

it's okay

#

you're forgiven

#

i'm feeling very kind today

#

https://media.discordapp.net/stickers/1432138279679033354.png?format=webp&quality=lossless&width=223&height=223

olive sable Mar 22, 2026, 3:27 AM

#

neuroNOWAYING

#

anyways

#

volumetric display REV2

#

evilNya

#

we need that extra of empty space at the bottom of that box to put the power cable and hdmi into the projector

jagged turtle Mar 22, 2026, 3:28 AM

#

I got an email from microsoft

#

#

evilWheeze

#

actions speak louder than words neurowheeze

olive sable Mar 22, 2026, 3:30 AM

#

olive sable volumetric display REV2

this still doesnt have the motor assmebly, but ill do that later

jagged turtle Mar 22, 2026, 3:30 AM

#

here's what they're "doing in response" for those of you who arent in the windows insiders program

#

hey, at least they're removing copilot (supposedly)

tender river Mar 22, 2026, 3:37 AM

#

some instances of copilot neuroSensei

amber fractal Mar 22, 2026, 3:39 AM

#

jagged turtle here's what they're "doing in response" for those of you who arent in the window...

nowaying able to skip updates

naive burrow Mar 22, 2026, 3:48 AM

#

Honestly, to a millionaire like Vedal, this is lunch money.
Crazy fucking robot body soon? Pause
https://fxtwitter.com/asimovinc/status/2029141385069379825?s=20

Asimov (@asimovinc)

You can build your own humanoid at home. Asimov – Here be Dragons is now available for presale. $499 deposit, $15,000 target price. asimov.inc/diy-kit

**💬 52 🔁 164 ❤️ 1.9K 👁️ 430.0K **

jagged turtle Mar 22, 2026, 3:51 AM

#

amber fractal nowaying able to skip updates

*only for OOBE it seems

last lantern Mar 22, 2026, 4:11 AM

#

hello im long time neuro/vedal watcher but never joined discord, now that slay the spire 2 has come out i was looking at automated slay the spire policies again and I revisited some of the neuro/evil vods for the game

#

i was just curious whether anyone happened to know what kind of setup vedal used to have neuro play the game? I skimmed through the history and there is some discussion on the API and what kind of information neuro gets, but with regards to the policy is it known whether it's an LLM-primary approach maybe with a lot of instructions, scripted (heuristic) or algorithmic (e.g. tree search based), or ML/RL based approach?

#

i remember seeing vedal program some of the among us stuff on stream so i wonder if he ever showed the slay the spire development on stream

#

i notice generally potions are saved until the boss and some significant portion of runs avoid elites completely, so i thought there might be some scripted part at least

#

sorry for the text wall and appreciate any pointers :)

glass flower Mar 22, 2026, 4:16 AM

#

i would generally assume it just works like any other game integration (expect minecraft and osu!) where neuro is just getting information and working with actions where she can choose things.

jagged turtle Mar 22, 2026, 4:20 AM

#

even that's how mc works nowadays I think

#

but also iirc slay the spire was something kinda custom?

glass flower Mar 22, 2026, 4:21 AM

#

i would assume it was custom at the start but wouldn't be surprised if it got updated later on to use the sdk. tho idk the code for it isn't public like the other games so maybe it still is custom

young plover Mar 22, 2026, 4:25 AM

#

jagged turtle even that's how mc works nowadays I think

There's definitely a separate bot playing minecraft while the twin give high-level directions.
Evil has argued with it the last couple times she played.
They can't keep up with real-time games.

glass flower Mar 22, 2026, 4:25 AM

#

this reminds me i really should finish my brotato integration... when i switched to linux i kinda abandoned it LUL

jagged turtle Mar 22, 2026, 4:25 AM

#

young plover There's definitely a separate bot playing minecraft while the twin give high-lev...

yes, what I meant is they are still getting information and working with actions to choose how the bot should respond

last lantern Mar 22, 2026, 4:25 AM

#

interesting, ty guys

#

I guess maybe not enough info to say exactly but has a chance to be custom?

jagged turtle Mar 22, 2026, 4:26 AM

#

most likely custom yea

glass flower Mar 22, 2026, 4:26 AM

#

well just the api would be custom on how it talks to neuro. but it would still work the same as something like incryption

jagged turtle Mar 22, 2026, 4:27 AM

#

^

glass flower Mar 22, 2026, 4:27 AM

#

and the inscryption mod is public atleast: https://github.com/VedalAI/neuro-inscryption

#

its just neuro choosing actions. no RL/ML, no heuristics nothing. just a llm choosing funny choices

last lantern Mar 22, 2026, 4:28 AM

#

huh that's good to know

glass flower Mar 22, 2026, 4:29 AM

#

but obviously neuro won't be draggin the cards and such. she will just say "play x card on x enemy" and then the SDK will handle on how to actually play and select the enemy

last lantern Mar 22, 2026, 4:29 AM

#

there were some other LLM evals on slay the spire and it feels neuro/evil play differently but i suspect vedal gave a different set of info

#

yeah makes sense

glass flower Mar 22, 2026, 4:29 AM

#

neuro is also finetuned. so she won't act like other llm's generally

last lantern Mar 22, 2026, 4:30 AM

#

im actually quite curious is it known what vedal used as the base model

jagged turtle Mar 22, 2026, 4:30 AM

#

no

#

speculative at most

glass flower Mar 22, 2026, 4:30 AM

#

just guesses mostly.. wouldn't be surprised if it also changed over the years

last lantern Mar 22, 2026, 4:30 AM

#

i feel neuro talks quite naturally sometimes but maybe this quality is uncharacteristic for smaller local models

#

icic

last lantern Mar 22, 2026, 4:30 AM

#

glass flower neuro is also finetuned. so she won't act like other llm's generally

and yeah makes sense as well

glass flower Mar 22, 2026, 4:31 AM

#

my personal guess is she is lama based. or was at the start. but if she still is or switched to deepseek/qwen no idea

#

or maybe the mad man actually trained his own basemodel

jagged turtle Mar 22, 2026, 4:31 AM

#

the finetuning doesn't exactly make it easier

jagged turtle Mar 22, 2026, 4:31 AM

#

glass flower or maybe the mad man actually trained his own basemodel

unlikely, but given his resources now I wouldn't be surprised if he's working on it

glass flower Mar 22, 2026, 4:32 AM

#

i mean... it is possible to train your own model. and if you use cloud runners i don't really see why not.

jagged turtle Mar 22, 2026, 4:32 AM

#

I think he runs his models locally

glass flower Mar 22, 2026, 4:32 AM

#

tho you could also just use a base model that exists and just finetune it like crazy so thats probably cheaper

jagged turtle Mar 22, 2026, 4:33 AM

#

yeah that's the leading theory

glass flower Mar 22, 2026, 4:33 AM

#

YES he is running them locally

jagged turtle Mar 22, 2026, 4:33 AM

#

also I feel bad for the editor here

glass flower Mar 22, 2026, 4:33 AM

#

LULE

jagged turtle Mar 22, 2026, 4:34 AM

#

it gets better later on

glass flower Mar 22, 2026, 4:34 AM

#

but also... huhh neuro played blue prince?

jagged turtle Mar 22, 2026, 4:34 AM

#

no but there's a WIP integration for it

glass flower Mar 22, 2026, 4:34 AM

#

NeurOhISee

jagged turtle Mar 22, 2026, 4:37 AM

#

glass flower Mar 22, 2026, 4:42 AM

#

WHATT i also just noticed the gary rework is finished. now i really don't have a reason to not work on my integration

mighty thorn Mar 22, 2026, 4:47 AM

#

mighty thorn i have successfully finetuned an LLM without backpropogation

i take it back

jagged turtle Mar 22, 2026, 4:49 AM

#

glass flower <a:WHATT:1188056178786316308> i also just noticed the gary rework is finished. n...

neuroNOWAYING WHAT

blissful geyser Mar 22, 2026, 4:58 AM

#

I coded a webDAV server and interface for my Android phone, it automatically starts using Termux:Boot when the phone is powered on.

#

visible on any wifi network its connected to

#

its still on even if i exit and force stop termux

fast pagoda Mar 22, 2026, 5:01 AM

#

mighty thorn i take it back

that sound highly secure

#

oops wrong reply

mighty thorn Mar 22, 2026, 5:01 AM

#

fast pagoda oops wrong reply

He hates me

patent shard Mar 22, 2026, 5:08 AM

#

would be cool to listen to what it'd be like with an official acapella and reverb removed
cause using separation software has a looooot of artifacts
I know there was one song released with acapella.. but still had reverb
there's something about hearing the raw vocals

glass flower Mar 22, 2026, 5:10 AM

#

fast pagoda that sound highly secure

LULE was about to say the same.... thats sounds like a fun side project to break into 4evil

fast pagoda Mar 22, 2026, 5:14 AM

#

https://www.neowin.net/news/microsoft-kb5079473-breaks-internet-access-to-windows-11-teams-edge-onedrive-copilot/

#

sounds like a good feature update

#

broke a bunch of malware

jagged turtle Mar 22, 2026, 5:16 AM

#

fast pagoda sounds like a good feature update

no it was a security patch

fast pagoda Mar 22, 2026, 5:18 AM

#

por que no los dos

fast pagoda Mar 22, 2026, 5:19 AM

#

mighty thorn He hates me

to your message i was going to say

perhaps we're all wrong and that's the magic loss number

mighty thorn Mar 22, 2026, 5:20 AM

#

fast pagoda to your message i was going to say perhaps we're all wrong and that's the magic...

There are two optimal values for loss:
0.00
297965.6253
all others mean you haven’t completed training

fast pagoda Mar 22, 2026, 5:21 AM

#

the model grokked agi and it fucked up the loss calculation since it gained omnipotence

#

looks a lot like the gradients exploded

#

they exploded alright

mighty thorn Mar 22, 2026, 5:31 AM

#

fast pagoda they exploded alright

No it grokked into consciousness

amber fractal Mar 22, 2026, 6:09 AM

#

mighty thorn There are two optimal values for loss: 0.00 297965.6253 all others mean you have...

Minamhm I like the second one

blissful geyser Mar 22, 2026, 6:23 AM

#

can someone check if my site has cert or if im just being played with
borgortube.duckdns.org

#

its supposed to say "Lets Encrypt"

jagged turtle Mar 22, 2026, 6:29 AM

#

blissful geyser can someone check if my site has cert or if im just being played with borgortube...

no cert

#

did you start the server on port 443?

fast pagoda Mar 22, 2026, 6:38 AM

#

yea nginx is listening on port 80 only

flint dome Mar 22, 2026, 7:00 AM

#

i want to get a shitty llm to run on my laptop and i was thinking of two options

#

deepseek r1 distill qwen 7b and distill llama8b

#

which do i get

glass flower Mar 22, 2026, 7:17 AM

#

tink why not just qwen 3.5 9b?

blissful geyser Mar 22, 2026, 7:24 AM

#

jagged turtle did you start the server on port 443?

its on port 80

#

im using nginx

fast pagoda Mar 22, 2026, 7:28 AM

#

port 80 is plain http

blissful geyser Mar 22, 2026, 7:30 AM

#

oh

blissful geyser Mar 22, 2026, 7:31 AM

#

fast pagoda port 80 is plain http

can yall test playback on the site

#

thank you

#

just search anything and play it

#

ill make adjustments later

jagged turtle Mar 22, 2026, 7:41 AM

#

blissful geyser can yall test playback on the site

it's still not on port 443 btw

fast pagoda Mar 22, 2026, 7:41 AM

#

it works but it is also constantly hammering localhost:8001

#

trying to open a websocket

blissful geyser Mar 22, 2026, 7:42 AM

#

fast pagoda it works but it is also constantly hammering localhost:8001

holy shi

#

i didnt notice that

olive sable Mar 22, 2026, 7:44 AM

#

glass jetty Mar 22, 2026, 7:47 AM

#

fast pagoda Mar 22, 2026, 8:03 AM

#

blissful geyser holy shi

it's your mpvbridge looking for deno

#

through*

#

ig to open up[ mpv playback w/ playlist

proven merlin Mar 22, 2026, 8:29 AM

#

So, what should i use? I have RTX2060, RX6700XT, some "1060 6GB" cards.. and i mainly run Windows.. i mean, running llamacpp server, and then yet another server for chat-ui cant be the way to go, no?.. at the same time it does allow to be very modular, at the cost of being janky af to boot up..

fast pagoda Mar 22, 2026, 8:38 AM

#

proven merlin So, what should i use? I have RTX2060, RX6700XT, some "1060 6GB" cards.. and i m...

my bot runs its discord client on my mini pc and then the model is on whatever

#

usually my main pc but ive been moving it around

#

usually the servers will chill and load jit if actually called

cobalt sigil Mar 22, 2026, 8:39 AM

#

guys I kinda regret touching termux

silent island Mar 22, 2026, 8:42 AM

#

Dawwggg i rly am living in a broke ass environment huh? neurOMEGALUL

fast pagoda Mar 22, 2026, 8:42 AM

#

looks reliable

silent island Mar 22, 2026, 8:45 AM

#

fast pagoda looks reliable

Reliable my ass 😂

proven merlin Mar 22, 2026, 9:01 AM

#

fast pagoda my bot runs its discord client on my mini pc and then the model is on whatever

Not really an option for me, ig i could do xmpp but yea.. not private

fast pagoda Mar 22, 2026, 9:01 AM

#

@glass jetty neuroPoint oauth

proven merlin Mar 22, 2026, 9:02 AM

#

proven merlin Not really an option for me, ig i could do xmpp but yea.. not private

I might just try to get an ie3 compatible frontend running..

#

var myElement = document.all["myId"];

Apparently ie3 does support this much js soo.. with some php and all it should be possible

#

Wouldn't be my first rodeo with getting decent ui from ie3, but my first being somewhat dynamic

fast pagoda Mar 22, 2026, 9:04 AM

#

what the helly is ie3

#

internet explorer 3

#

NODDERS

proven merlin Mar 22, 2026, 9:09 AM

#

fast pagoda internet explorer 3

Yes

#

Don't judge me, i maintain legacy systems because yes

jagged turtle Mar 22, 2026, 9:14 AM

#

🤮

#

god

#

I get that you need to maintain legacy systems

#

but ie3 is ancient

blissful geyser Mar 22, 2026, 9:18 AM

#

bro thats a fossil

nocturne olive Mar 22, 2026, 9:21 AM

#

Nah that's part of the cosmic microwave background

fast pagoda Mar 22, 2026, 9:33 AM

#

i was not yet 2 years old when that browser came otu

#

not sure if there's any ARPANET compatible discord clients

proven merlin Mar 22, 2026, 9:52 AM

#

jagged turtle but ie3 is *ancient*

Yea, default browser of Windows 95.

#

95 can do tcp ip

proven merlin Mar 22, 2026, 9:57 AM

#

proven merlin ```js var myElement = document.all["myId"]; ``` Apparently ie3 does support this...

It gets worse. To allow adding new messages, i have to fire off get requests or something via js (ie3 compatible js no less) and render that in a hidden iframe

#

I mean it should work but.. yeaa

#

Purely conceptually, ig a chat open should immediately redirect to a chat running with an id, to keep track of new chats and stuff

#

Is that even doable php-side?

#

Should i even do this with php or rather a python server..

#

Python would make everything else easier ig

fast pagoda Mar 22, 2026, 10:04 AM

#

https://github.com/ttalvitie/browservice

https://github.com/rn10950/RetroZilla/releases/tag/2.3-release

https://github.com/ThomasDickey/lynx-snapshots

plz

sage crag Mar 22, 2026, 10:32 AM

#

https://tenor.com/view/funny-animals-happy-easter-disaster-april-fools-day-fluffy-gif-11463156

Tenor

fast pagoda Mar 22, 2026, 10:32 AM

#

LOP

#

maiden geyser Mar 22, 2026, 10:38 AM

#

as for

sage crag Mar 22, 2026, 10:40 AM

#

fast pagoda LOP

m

#

lop

fast pagoda Mar 22, 2026, 10:40 AM

#

megalop

sage crag Mar 22, 2026, 10:40 AM

#

(LOP)

proven merlin Mar 22, 2026, 11:18 AM

#

fast pagoda https://github.com/ttalvitie/browservice https://github.com/rn10950/RetroZilla/...

Those proxy services kinda dont work well on a 50mhz i486 with 4-20mb ram

#

And IF i can get it working with just old tools, why wouldn't i

quick condor Mar 22, 2026, 11:26 AM

#

I honestly still don't understand why people glaze Llama.cpp... I can't think of any reason it's better than alternatives

nocturne olive Mar 22, 2026, 11:27 AM

#

Well have you even used it? The VRAM usage efficiency even for big models is great

fast pagoda Mar 22, 2026, 11:31 AM

#

proven merlin Those proxy services kinda dont work well on a 50mhz i486 with 4-20mb ram

i only included that because it exists but that would be my last choice, the other 2 are more of what the suggestion was

why wouldn't i
i dunno, since you were moaning about it i figured you dont like ie3 but felt forced to use it or something lol, if that's the aspect of it you enjoy then go off king i just read your messages with a more exasperated tone than intended i guess

quick condor Mar 22, 2026, 11:35 AM

#

nocturne olive Well have you even used it? The VRAM usage efficiency even for big models is gre...

I run vllm

fast pagoda Mar 22, 2026, 11:46 AM

#

sglang is better and cooler and has nicer clothes + a cooler ride with a supercharger and also more people asked it out to prom

quick condor Mar 22, 2026, 11:47 AM

#

As for vram usage a W4A16 GS64 24b model is similar in quantity to q6_k yet it's about half the size, I'm able to fit a 24b+ 131764 f16 context in 48gb

quick condor Mar 22, 2026, 11:55 AM

#

fast pagoda sglang is better and cooler and has nicer clothes + a cooler ride with a superch...

I had a cobalt SS for about a week and that thing was evidence that a supercharger doesn't make something cool. The thing was hilariously disgusting

fast pagoda Mar 22, 2026, 11:55 AM

#

i dunno sounds pretty cool to me in a night ranger kinda way

quick condor Mar 22, 2026, 11:58 AM

#

It's a 4 cylinder making a full 200 horsepower and very little room for improvement... But I'm probably biased as I am comparing it to a dual pulley audi 3.0 tfsi

nocturne olive Mar 22, 2026, 12:02 PM

#

quick condor As for vram usage a W4A16 GS64 24b model is similar in quantity to q6_k yet it's...

48GB for 24B? Inefficient
I do 24GB for 27B

olive sable Mar 22, 2026, 12:12 PM

#

enub

#

Cheapest 3090 rn is 600

#

This popped up tho, pc that fails to boot and goes to bluescreen and such, for 900

#

I think ill buy it, if nobody bids higher

nocturne olive Mar 22, 2026, 12:19 PM

#

How interesting

#

If the 3090 works can I have it for cheap?

olive sable Mar 22, 2026, 12:20 PM

#

Depends on your definition of cheap

#

Im not running a charity

nocturne olive Mar 22, 2026, 12:20 PM

#

Well of course not
I was think something around 500€?

olive sable Mar 22, 2026, 12:21 PM

#

Sure

#

I do wonder what part of it would be broken

#

My guess would be ram?

nocturne olive Mar 22, 2026, 12:21 PM

#

It's always possible it's not broken, but the CPU mem controller has somehow degraded a little and can't keep mem speed

#

Who knows

#

Either way, update me on if you get it and have a 3090 for me to buy

olive sable Mar 22, 2026, 12:23 PM

#

It has "DPC_WATCHDOG_VIOLATION"

nocturne olive Mar 22, 2026, 12:23 PM

#

Apparently could be related to SATA or GPU driver

#

Wouldn't be surprised if it was a faulty Nvidia driver

olive sable Mar 22, 2026, 12:25 PM

#

Oh

#

There are 2 types

#

"CLOCK_WATCHDOG_TIMEOUT" also exists

#

They didnt specify which one, so could even be cpu issue

#

Eitherway, ill bid up to 1K

fast pagoda Mar 22, 2026, 12:28 PM

#

quick condor As for vram usage a W4A16 GS64 24b model is similar in quantity to q6_k yet it's...

devstral small 2 2512: llama.cpp on a 32GB card with vulkan, stock q6_k gguf, no custom anything, 131k context, multimodal, amd card LULE, uses ~40GB total across vram and ram, and this q6 is 19gb raw size shruge

nocturne olive Mar 22, 2026, 12:28 PM

#

olive sable "CLOCK_WATCHDOG_TIMEOUT" also exists

Apparently that one is related to interrupt processing
Associations may be driver conflicts, overclocking (unlikely in this case) or hardware issues

olive sable Mar 22, 2026, 12:28 PM

#

NeurOhISee

#

Im geussing the 3090 should be fine

nocturne olive Mar 22, 2026, 12:28 PM

#

It seems like the issues are more pointing towards other components yeah

#

If even the GPU and RAM work you could probably get a profit off of it

#

Or just keep the RAM

olive sable Mar 22, 2026, 12:30 PM

#

Dor 1K i could just get this one

#

#

i9 11900kf

#

No errors

nocturne olive Mar 22, 2026, 12:30 PM

#

It's up to you which you get
If you sell the 3090 for a similar price in either case, it's not gonna affect me

fast pagoda Mar 22, 2026, 12:30 PM

#

take them to the cleaners, the ssds if theyre at least 980/990 + 3090 + 32gb of ddr4 (not ddr5 prices but that's still gonna be at least $200 in ram)

#

deal

olive sable Mar 22, 2026, 12:31 PM

#

So get the one without errors?

nocturne olive Mar 22, 2026, 12:31 PM

#

Without errors is probably worth it if it's indeed only 1000€ for all that

mighty thorn Mar 22, 2026, 12:31 PM

#

olive sable It has "DPC_WATCHDOG_VIOLATION"

I’ve had that several times and mine still works

#

Mostl

olive sable Mar 22, 2026, 12:32 PM

#

neurOMEGALUL

#

No, this pc has it during boot

#

So its not usable

nocturne olive Mar 22, 2026, 12:32 PM

#

Either way, whatever you end up getting, just let me know if you have a good 3090 to sell

olive sable Mar 22, 2026, 12:35 PM

#

Aight i bought it

#

Superbox, im holding you to 500 + shipping

split steppe Mar 22, 2026, 12:38 PM

#

and superbox was never seen again

nocturne olive Mar 22, 2026, 12:39 PM

#

olive sable Superbox, im holding you to 500 + shipping

I will get those once I put some money onto my bank account

#

I just need to figure out how to actually do that because I have never actually transferred cash to my bank account on my own

split steppe Mar 22, 2026, 12:40 PM

#

your bank should have an ATM that takes cash, there's one almost guaranteed at the bank itself, but sometimes they have atleast one in most cities if they dont have a full branch

nocturne olive Mar 22, 2026, 12:41 PM

#

Well I'm not exactly in a city, and driving to the city is very scary

#

I'll figure it out

split steppe Mar 22, 2026, 12:42 PM

#

can find em on google maps also, if the bank doesn't have its own map

short socket Mar 22, 2026, 12:42 PM

#

Hi! Can I rant about Samsung here? I just can't stand their warranty policy neuroAware

olive sable Mar 22, 2026, 12:42 PM

#

Sure

nocturne olive Mar 22, 2026, 12:42 PM

#

The tehc channel does indeed allow talking about tech related things

olive sable Mar 22, 2026, 12:42 PM

#

olive sable Aight i bought it

WICKED

nocturne olive Mar 22, 2026, 12:42 PM

#

What an interesting looking 3090

#

Wonder which model it is

olive sable Mar 22, 2026, 12:43 PM

#

Asus one

nocturne olive Mar 22, 2026, 12:43 PM

#

I see

#

Hm, I wonder if that thing has a riser on it
I'm gonna need a riser in order to be able to actually attach it

#

It's vertical mounted so the chance is not 0

olive sable Mar 22, 2026, 12:44 PM

#

#

Appears to be a riser ye

#

Im keeping that tho

nocturne olive Mar 22, 2026, 12:44 PM

#

Oh ok

#

I guess I'll have to track down one on my own then

olive sable Mar 22, 2026, 12:44 PM

#

neurOMEGALUL pay extra for the riser ReallyInnocent

split steppe Mar 22, 2026, 12:44 PM

#

https://tenor.com/kN9kZuyNpn7.gif

Tenor

short socket Mar 22, 2026, 12:45 PM

#

I got an SSD from them back in 2021 when I lived in Ukraine and I moved to a different country. The SSD died and for them to replace it they told me to send it back to Kyiv. neuroCry

#

Man, why can't an international company have an international warranty

nocturne olive Mar 22, 2026, 12:45 PM

#

olive sable

Oh goodness it has 3 power connectors
I'm gonna have to do some rewiring for that

#

Eh whatever I'll figure it out

olive sable Mar 22, 2026, 12:45 PM

#

You dont need to use all 3

nocturne olive Mar 22, 2026, 12:45 PM

#

O?

#

It works with just 2?

olive sable Mar 22, 2026, 12:46 PM

#

Thats mainly there for OC

olive sable Mar 22, 2026, 12:46 PM

#

short socket I got an SSD from them back in 2021 when I lived in Ukraine and I moved to a dif...

catdespair

nocturne olive Mar 22, 2026, 12:46 PM

#

I see

split steppe Mar 22, 2026, 12:46 PM

#

nocturne olive It works with just 2?

you can lose perf ceiling with less power

nocturne olive Mar 22, 2026, 12:46 PM

#

Not like I care about that, it's just gonna run machine learning

#

I can afford to lose a couple minutes from a 1 day training run

split steppe Mar 22, 2026, 12:47 PM

#

there should be a spec chart somewhere that says how much you lose

olive sable Mar 22, 2026, 12:47 PM

#

Maybe i should keep this 3090 with more power connectors, and sell mine with only 2 to superbox ReallyInnocent

split steppe Mar 22, 2026, 12:47 PM

#

:V

olive sable Mar 22, 2026, 12:47 PM

#

You only need it for the vram anyways

nocturne olive Mar 22, 2026, 12:47 PM

#

As long as it's got a 3090 core and VRAM I'm fine with it, if it's not so big it literally won't fit in my PC

#

Looks like the Asus one doesn't have a raised TDP, so it should work just fine even with just 2 connected, as it gets 300W from 2x8-pin and 75W from PCIe

fast pagoda Mar 22, 2026, 12:49 PM

#

oh dang 3090s some had 12vhpwr and some didnt huh

nocturne olive Mar 22, 2026, 12:50 PM

#

12VHPWR? It didn't exist when the 3090 was made

#

The 12 pin connector on some 3090 models is a proprietary Nvidia connector

fast pagoda Mar 22, 2026, 12:51 PM

#

im saying 12vhpwr but i actually mean all 3 of that proprietary original, the fflaming one, and 12v2x6

olive sable Mar 22, 2026, 12:51 PM

#

Ye ive decided, im keeping the ROG one and selling my inno3d ichill x3 3090

nocturne olive Mar 22, 2026, 12:51 PM

#

Oh I see

olive sable Mar 22, 2026, 12:51 PM

#

Cuz the rog one has 3 dp and 2 hdmi

nocturne olive Mar 22, 2026, 12:51 PM

#

Fine by me either way

#

I literally have no use for display outputs

olive sable Mar 22, 2026, 12:52 PM

#

I currently have to unplug the hdmi a lot

#

So a 2nd one would be cool

nocturne olive Mar 22, 2026, 12:53 PM

#

Should work pretty much well for my purposes

olive sable Mar 22, 2026, 12:55 PM

#

Expect it to take like a week or 2-3 to get here, and then ill test it

nocturne olive Mar 22, 2026, 12:56 PM

#

I see

fast pagoda Mar 22, 2026, 12:59 PM

#

Brick

split steppe Mar 22, 2026, 1:09 PM

#

mm yes a cooler with a board

fast pagoda Mar 22, 2026, 1:13 PM

#

it's a brick for building a garden wall

#

(a v100)

#

cheapie

fast pagoda Mar 22, 2026, 1:34 PM

#

when release

#

mf been in beta for 8 years

sage crag Mar 22, 2026, 1:35 PM

#

fast pagoda mf been in beta for 8 years

its called legal assurance

#

enub

obsidian mantle Mar 22, 2026, 1:37 PM

#

if (force_choice(get_barter_items(), "You are bartering. " + get_gold_text() + ". Choose item to " + get_barter_type_text() + ". " + get_items_we_cant_buy_text(),

#

why is get_items_we_cant_buy_text executed before get_barter_items vedalSTARE

#

if its on the right from it

#

In C++, the order of evaluation for function arguments is
unspecified (up to C++17) or indeterminately sequenced (C++17 and later)

#

NeuroRage

#

oh wait is it different thing

#

can they not just do it left to right like normal people

sage crag Mar 22, 2026, 1:40 PM

#

python does left-to-right evaluation with operator precedence

fast pagoda Mar 22, 2026, 1:41 PM

#

pemdas DenkiClueless

sage crag Mar 22, 2026, 1:41 PM

#

https://www.w3schools.com/python/python_operators_precedence.asp
relevant info

W3Schools offers free online tutorials, references and exercises in all the major languages of the web. Covering popular subjects like HTML, CSS, JavaScript, Python, SQL, Java, and many, many more.

opaque sigil Mar 22, 2026, 1:41 PM

#

obsidian mantle can they not just do it left to right like normal people

leaving order unspecified allows some optimisations

obsidian mantle Mar 22, 2026, 1:41 PM

#

thonk true

fast pagoda Mar 22, 2026, 1:41 PM

#

sage crag its called legal assurance

im gona soo them

sage crag Mar 22, 2026, 1:41 PM

#

opaque sigil leaving order unspecified allows some optimisations

python functions arent pure

#

neuroCatUuh

obsidian mantle Mar 22, 2026, 1:42 PM

#

im just calling that function for dummy result so it properly refreshes

fast pagoda Mar 22, 2026, 1:42 PM

#

python town bicyccle

opaque sigil Mar 22, 2026, 1:42 PM

#

sage crag python functions arent pure

who said they are enub

fast pagoda Mar 22, 2026, 1:44 PM

#

sage crag Mar 22, 2026, 1:44 PM

#

opaque sigil who said they are <:enub:1163912230585237655>

order has to be specified when value impure, otherwise randomiser

tender river Mar 22, 2026, 1:44 PM

#

sage crag python functions arent pure

toast probably didnt mean completely unspecified neuroAYAYA

#

still

sage crag Mar 22, 2026, 1:44 PM

#

as far as my understanding

tender river Mar 22, 2026, 1:44 PM

#

must eval arguments before function

sage crag Mar 22, 2026, 1:44 PM

#

enub

tender river Mar 22, 2026, 1:45 PM

#

but ltr or rtl unspecified

#

scheme does this

sage crag Mar 22, 2026, 1:45 PM

#

ye but python ltr specified

opaque sigil Mar 22, 2026, 1:45 PM

#

NeuroConfused

tender river Mar 22, 2026, 1:45 PM

#

as fitting for a slow language NeuroTease

opaque sigil Mar 22, 2026, 1:45 PM

#

i was specifically talking about c++

sage crag Mar 22, 2026, 1:46 PM

#

sage crag ye but python ltr specified

well, depends on

fast pagoda Mar 22, 2026, 1:46 PM

#

python is fast!!!
-# when you use it to wrap c and fortran

sage crag Mar 22, 2026, 1:46 PM

#

operator

#

some of them are right to left

#

and some of them arent

#

enub

#

the important thing is that binary plus (addition) is ltr

#

though unary plus is rtl

#

enub

tender river Mar 22, 2026, 1:47 PM

#

vedalUUH

#

its ltr also

sage crag Mar 22, 2026, 1:47 PM

#

neuroCatUuh

rigid snow Mar 22, 2026, 1:47 PM

#

down-to-up

fast pagoda Mar 22, 2026, 1:47 PM

#

buss it

sage crag Mar 22, 2026, 1:55 PM

#

rigid snow down-to-up

leftern
rightern
uptern
downtern

obsidian mantle Mar 22, 2026, 1:56 PM

#

where does it matter for addition to be non ltr or ltr

rigid snow Mar 22, 2026, 1:56 PM

#

sage crag leftern rightern uptern downtern

yeah...

obsidian mantle Mar 22, 2026, 1:56 PM

#

thonk vectors?

sage crag Mar 22, 2026, 1:56 PM

#

obsidian mantle where does it matter for addition to be non ltr or ltr

if order of operations doesnt matter, you can reschedule them for more optimal codegen

tender river Mar 22, 2026, 1:59 PM

#

sage crag leftern rightern uptern downtern

eastern evaluation order (left to right)

sage crag Mar 22, 2026, 1:59 PM

#

northern evaluation order

#

(top to bottom)

#

for example

#

x =
1 +
2

#

enub

tender river Mar 22, 2026, 2:01 PM

#

sage crag northern evaluation order

NeurOhISee oh right then ltr would be western

tender river Mar 22, 2026, 2:01 PM

#

sage crag ``` x = 1 + 2 ```

f(x y
  z w)

sage crag Mar 22, 2026, 2:01 PM

#

what operator is z

#

when applied to x

tender river Mar 22, 2026, 2:01 PM

#

z is

#

argument

sage crag Mar 22, 2026, 2:01 PM

#

enub

#

false

#

clearly

#

northern

tender river Mar 22, 2026, 2:02 PM

#

NeurOhISee

#

wind blow argument

#

into function

sage crag Mar 22, 2026, 2:02 PM

#

neuroCatUuh

quick condor Mar 22, 2026, 2:12 PM

#

nocturne olive 48GB for 24B? Inefficient I do 24GB for 27B

131768 context (128k)
compute bound not me bandwidth bound

nocturne olive Mar 22, 2026, 2:12 PM

#

I've got about 96K CTX on it

quick condor Mar 22, 2026, 2:13 PM

#

fast pagoda devstral small 2 2512: llama.cpp on a 32GB card with vulkan, stock q6_k gguf, no...

That's 100% q8 context

fast pagoda Mar 22, 2026, 2:13 PM

#

quick condor That's 100% q8 context

it's not

#

i set it to fp16 purposely

#

llama_kv_cache: size = 20600.00 MiB (131840 cells, 40 layers, 4/1 seqs), K (f16): 10300.00 MiB, V (f16): 10300.00 MiB
sched_reserve: reserving ...

quick condor Mar 22, 2026, 2:15 PM

#

Math isn't matching. Context size at f16 is a static size. Not quant related

fast pagoda Mar 22, 2026, 2:18 PM

#

unified kv cache probably confusing you

#

idk

#

math maths for me

#

KV cache per token per layer: 2 (K+V) × 8 (kv heads) × 128 (head dim) × 2 (fp16 bytes) = 4096 bytes = 4KB
per token across all layers: 4KB × 40 layers = 160KB
131,840 tokens × 160KB = ~20.1GB

mistral3.attention.key_length = 128 (head dim K)
mistral3.attention.value_length = 128 (head dim V)
mistral3.block_count = 40 (layers)```

quick condor Mar 22, 2026, 2:20 PM

#

Oh wait it's llama4 kv. Devstral2 is special. I was closer to 160k kv

fast pagoda Mar 22, 2026, 2:21 PM

#

i can fit 300k context in 48gb

#

with this model

quick condor Mar 22, 2026, 2:22 PM

#

Yah no

fast pagoda Mar 22, 2026, 2:22 PM

#

jk more like 200k

#

but yeah im only at about 40 with 131k

#

so

lean bluff Mar 22, 2026, 2:28 PM

#

did you know that they have a 150 qubit quantum computer already? It was news to me at least https://meetiqm.com/products/iqm-radiance/

fast pagoda Mar 22, 2026, 2:28 PM

#

someone put out an open source quantum computer like

#

last week

#

well, idk if it was open source, but you can buy one it's diy

#

https://qilimanjaro.tech/eduqit/

Qilimanjaro

Claudia

EduQit - Qilimanjaro

EduQit Build your own Quantum Computer. Talk to our Sales Team Deploying Quantum Capability in Education and Research Quantum computing education is often limited to theory, simulators, or remote cloud access. While these tools are essential, they do not provide exposure to real system operation. EduQit enables education and research institution...

lean bluff Mar 22, 2026, 2:29 PM

#

wait what?

fast pagoda Mar 22, 2026, 2:29 PM

#

me right there

obsidian mantle Mar 22, 2026, 2:29 PM

#

my own quantum computer at home? neuroNOWAYING

fast pagoda Mar 22, 2026, 2:30 PM

#

obsidian mantle Mar 22, 2026, 2:30 PM

#

what do i need

fast pagoda Mar 22, 2026, 2:30 PM

#

obsidian mantle Mar 22, 2026, 2:30 PM

#

a bit of antimatter

#

or how does it work

lean bluff Mar 22, 2026, 2:30 PM

#

I want one so bad

fast pagoda Mar 22, 2026, 2:30 PM

#

it's just that ez

obsidian mantle Mar 22, 2026, 2:30 PM

#

neurOMEGALUL

fast pagoda Mar 22, 2026, 2:30 PM

#

lean bluff Mar 22, 2026, 2:30 PM

#

they look so cool too

obsidian mantle Mar 22, 2026, 2:30 PM

#

wtf

fast pagoda Mar 22, 2026, 2:30 PM

#

this sparks joy

lean bluff Mar 22, 2026, 2:31 PM

#

how many qubits can you get with that though

fast pagoda Mar 22, 2026, 2:31 PM

#

at least 1

#

it's

#

5

lean bluff Mar 22, 2026, 2:32 PM

#

oh

obsidian mantle Mar 22, 2026, 2:32 PM

#

why cant they put estimate price so i dont talk to their team just to know its 5 billion usd

fast pagoda Mar 22, 2026, 2:32 PM

#

we aint error correcting with this one boys

#

i read it costs about ~$1MM

#

oh no MM is right

#

im so smart

#

about a milly

#

lil wayne wrote his hit song a milli about this computer

lean bluff Mar 22, 2026, 2:34 PM

#

fast pagoda i read it costs about ~$1MM

I'm getting it tomorrow then

maiden geyser Mar 22, 2026, 2:35 PM

#

obsidian mantle why cant they put estimate price so i dont talk to their team just to know its 5...

EduQit is designed for universities, research institutions, and public-sector organisations
5kkk sounds like a highball for this profile

fast pagoda Mar 22, 2026, 2:35 PM

#

it's a million-ish

#

that was a real number

quick condor Mar 22, 2026, 2:40 PM

#

fast pagoda but yeah im only at about 40 with 131k

First, Devstral2 maxes at 256k before hitting max native support. Second, 161k is 38Gb vram for just kv at f14, with a 19gb q6_k you are 9gb over that at 161k vs the 10gb W4A16 GS64. Third the 128k in 48Gb was mistral 3.2 not Devstral2. Devstral2 uses a hybrid ministral 3 mistral 3 architecture with a sliding attention scale and other changes, it's not apples to apples.

Regardless 300k will not fit in 48gb

maiden geyser Mar 22, 2026, 2:40 PM

#

lean bluff I'm getting it tomorrow then

is there an unofficial korchynskyi

fast pagoda Mar 22, 2026, 2:44 PM

#

quick condor First, Devstral2 maxes at 256k before hitting max native support. Second, 161k i...

Devstral2 maxes at 256k before hitting max native support.
not in every checkpoint, the llama ones max at 393216

161k is not 38gb for me when i literally load it up with 161k so idk what's wrong with it but seems vllm is missing some optimizations shrug

given that the model is almost 10gb larger i'd expect to be using more vram with that context

#

just using lmstudio to show the gui cuz it's easier

quick condor Mar 22, 2026, 2:50 PM

#

fast pagoda Mar 22, 2026, 2:50 PM

#

models have different context sizes in different forms man that's always been a thing

quick condor Mar 22, 2026, 2:50 PM

#

Math says 300k is 45.6Gb for just kv

#

You aren't using actual Devstral2 if it's a Llama based model

fast pagoda Mar 22, 2026, 2:51 PM

#

it's not llama based

quick condor Mar 22, 2026, 2:51 PM

#

It uses Llama attention not a Llama base

fast pagoda Mar 22, 2026, 2:51 PM

#

it's a checkpoint directly from mistral

quick condor Mar 22, 2026, 2:52 PM

#

You are 100% not preallocating

fast pagoda Mar 22, 2026, 2:52 PM

#

i mean i am so

#

it shows how it's calculating it as it reserves it #programming message

#

doesnt look like that says llama to me

#

#

that's based on max positional embedding length, seems they dont think it's good to use that so it probably does realistically max around 256k, i wouldn't trust a mistral model at such a long context anyways but that's just what the model reports so that's what it lets me set

quick condor Mar 22, 2026, 3:04 PM

#

I'm talking full prefill that's runtime residency

rigid snow Mar 22, 2026, 3:12 PM

#

fghdsghsghs

quick condor Mar 22, 2026, 3:14 PM

#

Besides that mistral literally states "vLLM is the most efficient and reliable way to deploy Devstral locally" so I have a feeling that something is wrong. My guess is no full prefill, kv dtype, or it's silently spilling into ram

fast pagoda Mar 22, 2026, 3:14 PM

#

q8 model with 161k is still not 48gb :^)

#

it's going into ram, but the total size is not >48gb, my gpu is only 32gb.

#

load_tensors: offloading 39 repeating layers to GPU
load_tensors: offloaded 41/41 layers to GPU
load_tensors:   CPU_Mapped model buffer size =   680.00 MiB
load_tensors:      Vulkan0 model buffer size = 23206.58 MiB
2026-03-22 10:11:54 [DEBUG]
 common_init_result: added </s> logit bias = -inf
2026-03-22 10:11:54 [DEBUG]
 llama_context: constructing llama_context
llama_context: setting new yarn_attn_factor = 1.0000 (mscale == 1.0, mscale_all_dim = 1.0)
llama_context: n_seq_max     = 4
llama_context: n_ctx         = 161024
llama_context: n_ctx_seq     = 161024
llama_context: n_batch       = 1024
llama_context: n_ubatch      = 1024
llama_context: causal_attn   = 1
llama_context: flash_attn    = enabled
llama_context: kv_unified    = true
llama_context: freq_base     = 100000000.0
llama_context: freq_scale    = 0.0208333
llama_context: n_ctx_seq (161024) < n_ctx_train (393216) -- the full capacity of the model will not be utilized
2026-03-22 10:11:54 [DEBUG]
 llama_context: Vulkan_Host  output buffer size =     2.00 MiB
2026-03-22 10:11:56 [DEBUG]
 llama_kv_cache:    Vulkan0 KV buffer size = 25160.00 MiB
2026-03-22 10:11:57 [DEBUG]
 llama_kv_cache: size = 25160.00 MiB (161024 cells,  40 layers,  4/1 seqs), K (f16): 12580.00 MiB, V (f16): 12580.00 MiB
sched_reserve: reserving ...
2026-03-22 10:11:57 [DEBUG]
 sched_reserve: resolving fused Gated Delta Net support:
2026-03-22 10:11:57 [DEBUG]
 sched_reserve: fused Gated Delta Net (autoregressive) enabled
2026-03-22 10:11:57 [DEBUG]
 sched_reserve: fused Gated Delta Net (chunked) enabled```

#

161024

#

vllm's efficiency (and sglangs to an extent) is mostly about serving multiple instances of a model at once

#

it says llama all over the logs because it's....llama...cpp................... it uses that no matter what it's loading

#

i just pasted the entire text of pride and prejudice into the chat

#

which is about 159k tokens

#

it's at 102% of 161k

#

ram allocation has not changed

quick condor Mar 22, 2026, 3:30 PM

#

You are spilling into ram at, 161k at q8 q6 is only 5gb smaller. 300k absolutely will not fit

fast pagoda Mar 22, 2026, 3:30 PM

#

i already said im spilling into ram obviously

#

gpu has 32gb of ram

#

but

#

it's not over 48gb

#

im talking 161k here, not 300k

#

lm studio is using about 42gb of ram + vram

#

chewing on this pride and prejudice

rigid snow Mar 22, 2026, 3:32 PM

#

has there ever been an open source actual alternative to notion

quick condor Mar 22, 2026, 3:32 PM

#

So your using 74gb not 48

fast pagoda Mar 22, 2026, 3:32 PM

#

mp

#

what

#

no

#

total system vram + ram usage isnt even that high right now

quick condor Mar 22, 2026, 3:33 PM

#

fast pagoda lm studio is using about 42gb of ram + vram

What is that

fast pagoda Mar 22, 2026, 3:33 PM

#

the model + kv

quick condor Mar 22, 2026, 3:33 PM

#

42+32gb vram =74Gb

fast pagoda Mar 22, 2026, 3:33 PM

#

42 is 32+ 10

#

32vram, 10 ram

#

over the entire system with vivaldi open at like 70 tabs im only using a grand total of 31gb of ram right now WITH the offload in there

quick condor Mar 22, 2026, 3:34 PM

#

Something is getting compressed, there is an open issue that Llama. Cpp ignores overrides

fast pagoda Mar 22, 2026, 3:34 PM

#

it's not, i promise

#

why dont you go try it

#

and report back

#

instead of trying to gaslight me into seeing a different set of numbers on my screen as i am doing exactly what you're refusing to believe and showing you

quick condor Mar 22, 2026, 3:38 PM

#

Vulkan0 model buffer size = 23206.58 MiB
Vulkan0 KV buffer size = 25160.00 MiB

#

That's not 42

fast pagoda Mar 22, 2026, 3:38 PM

#

48gb

quick condor Mar 22, 2026, 3:39 PM

#

Is it 42 or 48

rigid snow Mar 22, 2026, 3:39 PM

#

mlntcandy what am i reading

fast pagoda Mar 22, 2026, 3:39 PM

#

that's with the q8 model

#

so both?

#

hascrack was very excited to share that vllm being the superior inference engine can fit a model in the 10-20gb range AND 131k context in under 48gb

#

and now literally cannot even when llama.cpp also can do that

rigid snow Mar 22, 2026, 3:41 PM

#

that i s weird

nocturne olive Mar 22, 2026, 3:41 PM

#

A non-llama.cpp user just literally can't accept that llama.cpp is more memory efficient

rigid snow Mar 22, 2026, 3:41 PM

#

i mean historically vllm cleared

#

no idea about now

fast pagoda Mar 22, 2026, 3:42 PM

#

honestly i would've figured the same as well

#

i didnt even set out to argue it, i was just curoius to see since he gave a specific example of a model & context

#

it turns out his model was actually smaller than this one even

nocturne olive Mar 22, 2026, 3:42 PM

#

nocturne olive A non-llama.cpp user just literally can't accept that llama.cpp is more memory e...

Like what else can make a 27B model run in under 22GB of VRAM with high CTX?

rigid snow Mar 22, 2026, 3:43 PM

#

a lower precision quant

#

neuroTroll

fast pagoda Mar 22, 2026, 3:43 PM

#

he stated his was about the equivalent to a q6 but half the size

#

so i just loaded up a q6

#

so by all means it should be larger in memory all else equal if llama is the same measure of efficient

quick condor Mar 22, 2026, 3:46 PM

#

What token per second with offload (generation not prefill)

fast pagoda Mar 22, 2026, 3:46 PM

#

liike 30

quick condor Mar 22, 2026, 3:46 PM

#

OK yah I'm at 100

#

TP overhead eats the spare vram

nocturne olive Mar 22, 2026, 3:47 PM

#

But you also have some 3090Tis and whatever?

fast pagoda Mar 22, 2026, 3:47 PM

#

this card is just slow at inference

#

it has shitty vram bandwidth

nocturne olive Mar 22, 2026, 3:47 PM

#

Low bandwidth I assume

#

Yah

fast pagoda Mar 22, 2026, 3:47 PM

#

even with the model fully in memory i never hit 100tok/s

#

on a model this size really

nocturne olive Mar 22, 2026, 3:48 PM

#

How much is the mem bandwidth on that thing?

fast pagoda Mar 22, 2026, 3:48 PM

#

640 GB/s lol

#

it's gddr6

#

it's so bad

nocturne olive Mar 22, 2026, 3:49 PM

#

Yeah with a 20B range model you can expect around 30 t/s level speeds

quick condor Mar 22, 2026, 3:50 PM

#

Yah I'm at about 2Tbps but have TP overhead

#

I hadn't pulled logs because I hate myself and want to play league and logs are on other os

fast pagoda Mar 22, 2026, 3:51 PM

#

it didnt know of the popular ya mama framework pensibe

quick condor Mar 22, 2026, 3:52 PM

#

Lmao

fast pagoda Mar 22, 2026, 3:53 PM

#

i did have plans to set up a battle royale of these things and just slap them on an h100 or something so there's no possible overhead issues

#

and see how they did myself

#

havent done it yet tho

quick condor Mar 22, 2026, 3:53 PM

#

I'd have to say if it didn't say Devstral 2 I'd swear that was chat gpt

fast pagoda Mar 22, 2026, 3:54 PM

#

i use llama most of the time because it works well enough to serve models<~50b or so with good speed and no effort

#

on this card

nocturne olive Mar 22, 2026, 3:54 PM

#

Ah I see
vLLM is designed for batch inference -> sending multiple requests at once and processing them in parallel

#

Outside of that llama.cpp is better

fast pagoda Mar 22, 2026, 3:54 PM

#

yeah that's why i was saying, vllm and like sglang etc are definitely way more efficient in that arena

#

for like a single user the difference shouldnt be that dramatic

#

although i dunno, given the difference between llama.cpp ROCm vs vulkan

#

it can be dramatic

#

for the record ROCm (HIP) version is way shittier

quick condor Mar 22, 2026, 3:55 PM

#

Mistral models are easily tunable and punch above their size in specific areas. They aren't the best general model (although I'm itching to try the new MOE one) but for specific tasks they are great

fast pagoda Mar 22, 2026, 3:56 PM

#

let me try to load this thang with rocm/HIP llama

#

it'll probably explode

#

if i guess

quick condor Mar 22, 2026, 3:57 PM

#

I'd say try Exllamav3 but I'm pretty sure that's primarily nvidia

fast pagoda Mar 22, 2026, 3:57 PM

#

fuk

#

#

forgor

#

HIPIFY!!!!!!!!!

#

rocm moment

#

build cuda version

#

then ghettoblast it

#

ahahhaahahahha

#

i forgot this main reason i never use it

#

it can't have any overflow

quick condor Mar 22, 2026, 4:00 PM

#

I still think the most satisfying thing I've done is build flash attention 2 on my server. Took less than 10 minutes

fast pagoda Mar 22, 2026, 4:00 PM

#

it just explodes

glass jetty Mar 22, 2026, 4:01 PM

#

fast pagoda it can't have any overflow

overflow? when you can have overflow what?

fast pagoda Mar 22, 2026, 4:01 PM

#

the other day i managed to build rocm7.2 in about 18 minutes

#

that was fun

#

i was expecting it to take forever

quick condor Mar 22, 2026, 4:01 PM

#

Flash attention 2 was building on 72 cores and eating over 500gb of ram

fast pagoda Mar 22, 2026, 4:02 PM

#

glass jetty overflow? when you can have overflow what?

the model weights, if they are larger than the gpu (weights + KV cache) vram, ideally it just sends the extra to system ram

#

it's slower but you at least can load it and use it

#

slower by a lot

glass jetty Mar 22, 2026, 4:02 PM

#

for runtime?

fast pagoda Mar 22, 2026, 4:02 PM

#

so instead of like 50-100tok/s for example you might get like 20

#

yeah inference

#

or less

glass jetty Mar 22, 2026, 4:03 PM

#

for runtime it is reasonable actually neuroErm

fast pagoda Mar 22, 2026, 4:03 PM

#

i think when i loaded kimi i managed to see an absolutely BLAZINGLY fast

#

1-2 tok/s

quick condor Mar 22, 2026, 4:03 PM

#

See I have throughput brain rot to the same level as Vedal has latency brain rot. I don't care how nice it is, I wolnt sacrifice my throughput

fast pagoda Mar 22, 2026, 4:04 PM

#

the thing is i never really use local models for much but busywork that benefits from throughput but it's not li ke im sitting there waiting for it to finish usually

#

cuz i just pointed it at my documents folder and said "halp im lazy, organize??" and that doesnt take very long even at 30 tok/s

#

or make it watch a training run

#

which requires no speed at all

#

cuz that's just waking up every like 20+ minutes to check

quick condor Mar 22, 2026, 4:06 PM

#

See I use Devstral2 to brute force code review for stupid errors. A task that would destroy claude code or codex's usage

fast pagoda Mar 22, 2026, 4:06 PM

#

i pay way too much a month for that so i dontr have that issue but also am wasting money on it

quick condor Mar 22, 2026, 4:08 PM

#

I have 2 codex plus subs and a claude $100 sub. Still come close every week

fast pagoda Mar 22, 2026, 4:08 PM

#

rigid snow has there ever been an open source actual alternative to notion

i was going to reply obsidian to this but im positive you'd know of obsidian and i dont actually know what the difference between them is other than that notion likely is way larger scope

rigid snow Mar 22, 2026, 4:08 PM

#

obsidian is neither

fast pagoda Mar 22, 2026, 4:08 PM

#

i thought it was open source

rigid snow Mar 22, 2026, 4:08 PM

#

open source nor an actual alternative

fast pagoda Mar 22, 2026, 4:08 PM

#

Dentge

#

what the hell does notion do lmao

rigid snow Mar 22, 2026, 4:09 PM

#

appflowy seems like one

rigid snow Mar 22, 2026, 4:09 PM

#

fast pagoda what the hell does notion do lmao

databases

fast pagoda Mar 22, 2026, 4:09 PM

#

loqseq?

#

https://logseq.com they have the thinker on their homepage so theyre kinda based

logseq

A privacy-first, open-source knowledge base

A privacy-first, open-source platform for knowledge management and collaboration.

rigid snow Mar 22, 2026, 4:10 PM

#

for my purposes i'm still gonna stick with notion

#

but good to have some in mind for the future

fast pagoda Mar 22, 2026, 4:11 PM

#

https://affine.pro https://anytype.io

AFFiNE

AFFiNE - All In One KnowledgeOS

The universal editor that lets you work, play, present or create just about anything.

Anytype — A safe haven for digital collaboration

Create notes, tasks, databases, and chats that only you can access. Your data stays on your device — fully owned, secure, and private. Free to start.

#

saw these too

#

anytype seems cool

rigid snow Mar 22, 2026, 4:11 PM

#

rigid snow for my purposes i'm still gonna stick with notion

one inconvenient thing about notion is that they can nuke my account at some point if i log in without a vpn

#

they for some reason decided that's the best way to do sanctions compliance

rigid snow Mar 22, 2026, 4:12 PM

#

rigid snow one inconvenient thing about notion is that they can nuke my account at some poi...

literally happened to like multiple people i know

fast pagoda Mar 22, 2026, 4:12 PM

#

for some reason i didnt realize you're in russia

#

i see ussr domain name now though kekw

rigid snow Mar 22, 2026, 4:13 PM

#

you don't have to be in russia to buy .su btw

#

nor a subject of

fast pagoda Mar 22, 2026, 4:13 PM

#

OhISee

#

hey no judgement here, i'll never judge a person by the actions of their idiotic government cuz if i held myself to the same standard it'd be so joever

rigid snow Mar 22, 2026, 4:14 PM

#

sad that icann is gonna phase it out tho

rigid snow Mar 22, 2026, 4:14 PM

#

fast pagoda hey no judgement here, i'll never judge a person by the actions of their idiotic...

judgement where

#

i didn't see any

#

NeuroConfused

fast pagoda Mar 22, 2026, 4:14 PM

#

good

tender river Mar 22, 2026, 4:14 PM

#

vedalWow

#

mlntcandy

#

cybercriminal

rigid snow Mar 22, 2026, 4:15 PM

#

yes

fast pagoda Mar 22, 2026, 4:15 PM

#

i always feel like as a burger that i am predisposed to appear that way by accident so i gotta pre apologize

#

not hackers AND scammers

#

now i want a .su

#

to post phonk on

rigid snow Mar 22, 2026, 4:16 PM

#

ICANT

#

very .su

flint dome Mar 22, 2026, 4:16 PM

#

glass flower <:tink:1160885779543900231> why not just qwen 3.5 9b?

2050 mobile so only 4gb vram evilWheeze would have to offload rest

fast pagoda Mar 22, 2026, 4:17 PM

#

there was a real period of time i was hype thinking the global nightmare of tensions for no reason was definitely ending soon

#

there are a lot of places i'd liek to visit

#

not that it's impossible

#

but

glass jetty Mar 22, 2026, 4:17 PM

#

tender river <:vedalWow:1343810742989623296>

you know, im not surprised
i have .ru and .online domain without registration by рег.ру

fast pagoda Mar 22, 2026, 4:17 PM

#

i am lazy

rigid snow Mar 22, 2026, 4:18 PM

#

glass jetty you know, im not surprised i have .ru and .online domain without registration by...

i thought you have to submit proof of citizenship for .ru

glass jetty Mar 22, 2026, 4:18 PM

#

no

rigid snow Mar 22, 2026, 4:18 PM

#

as for like the majority of cctlds

glass jetty Mar 22, 2026, 4:18 PM

#

you just write shit on third party website as ids

rigid snow Mar 22, 2026, 4:18 PM

#

you definitely had to

glass jetty Mar 22, 2026, 4:18 PM

#

and transfer to reg.ru

fast pagoda Mar 22, 2026, 4:18 PM

#

back in the day i had a couple co ccs that were just free

#

and now theyre like 19082347198273918 dollar

#

well

#

i dont even think you can register them now

#

idk what happened

#

oh lord

rigid snow Mar 22, 2026, 4:23 PM

#

fast pagoda Mar 22, 2026, 4:23 PM

#

LMFAO

rigid snow Mar 22, 2026, 4:23 PM

#

neuro3D 🍔

glass jetty Mar 22, 2026, 4:24 PM

#

glass jetty and transfer to reg.ru

well i could, but it isnt completely fake, real phone and trace-able payment system
but as fact you can.

fast pagoda Mar 22, 2026, 4:24 PM

#

i set up sonos speakers to act as a raop sink in pipewire so i just hav it as an output device

which means if i max its volume and dont turn down the volume of the source, it maxes their volume inherently

#

and i just went deaf

#

because i wanted to hear phonk after mentioning it

rigid snow Mar 22, 2026, 4:25 PM

#

tender river <:vedalWow:1343810742989623296>

funny thing is idk if i should say this i know an actual previous cybercriminal now because he used to own one of my .su domains and reached out

fast pagoda Mar 22, 2026, 4:25 PM

#

connections

rigid snow Mar 22, 2026, 4:25 PM

#

bro had feds raid his house

fast pagoda Mar 22, 2026, 4:25 PM

#

not the feds

#

party van and all

#

wonder if that's what happened to quark doge

sage crag Mar 22, 2026, 4:26 PM

#

spspspsp

fast pagoda Mar 22, 2026, 4:26 PM

#

his satire was too powerful

sage crag Mar 22, 2026, 4:27 PM

#

https://tenor.com/view/fox-bite-attack-snow-gif-14065812

Tenor

fast pagoda Mar 22, 2026, 4:27 PM

#

sage crag spspspsp

https://tenor.com/view/bunny-kiss-bunny-cute-bunny-kiss-first-person-kiss-gif-11858029512826685577

Tenor

sage crag Mar 22, 2026, 4:27 PM

#

enub

glass jetty Mar 22, 2026, 4:27 PM

#

fast pagoda Mar 22, 2026, 4:27 PM

#

brrrrrrrrrrrrzzzzzr

rigid snow Mar 22, 2026, 4:27 PM

#

neuroLookUp

sage crag Mar 22, 2026, 4:30 PM

#

rigid snow <:neuroLookUp:1109915690971963392>

enub

#

https://tenor.com/view/funny-animals-happy-easter-disaster-april-fools-day-fluffy-gif-11463156

Tenor

rigid snow Mar 22, 2026, 4:31 PM

#

bun

#

bun omg

#

bun bun

fast pagoda Mar 22, 2026, 4:32 PM

#

The soundtrack of peace

#

Really helps sleep

rigid snow Mar 22, 2026, 4:33 PM

#

everything is ai now

#

they have "agents" in notion now

#

whatever the fuck

#

that is

fast pagoda Mar 22, 2026, 4:33 PM

#

we've injected a cve into your cve tracker

#

you are welcome

#

welcome to the agentic age

#

where instead of doing work yourself

#

you must babysit an idiot

#

or perhaps groups of idiots

rigid snow Mar 22, 2026, 4:34 PM

#

they had ai features for a long time, like since 3.5 non turbo even

#

but agents

#

in a note taking app

fast pagoda Mar 22, 2026, 4:35 PM

#

i can see how agents would be useful but i cannot get over their non-determinisim

#

i cant sleep at night knowing some ball of impressionable math is controlling literally anything of importance

#

without me watching it

rigid snow Mar 22, 2026, 4:36 PM

#

:ohidontsee:

fast pagoda Mar 22, 2026, 4:36 PM

#

Nowaying you can get le answer that you wouldve just searched for and goten anyways

#

the funny thing with this stuff is that i feel their idea is that it will make people who cannot into computer

#

be able to into computer a bit

#

but we had copilot and like all the o365 shit deployed at work

#

and boy howdy

#

do they avoid using it for anything of note

#

whole hell of a lot of slop posters and images flying around tho

#

you can literally show these people how to ask the oracle for something if they want, ive attempted it

#

and they

#

do not understand

#

somehow

#

but it makes sense because these are the same people that simply cannot fathom reading an error on their screen and that it might give a clue as to what they need to do

#

so of course when you give them a spooky ghost in a chat they wont know how it can be of any use and even if they vaguely know it can do anything they are too nervous to do it

#

maybe that's me with agents atm

#

idk

#

am i a boomer

#

possibly

#

one day ill finish helmstack and then i'll be one with the agents

fast pagoda Mar 22, 2026, 4:43 PM

#

rigid snow everything is ai now

how does this compare to like linear

#

i have used that

#

and dash? i tried that a long time ago cuz it was supposedly a project management thing w/ ai but i didnt see the benefit

rigid snow Mar 22, 2026, 4:44 PM

#

notion is less purpose built than linear, linear is like for work, notion is more for anything

#

on a very high level

glass jetty Mar 22, 2026, 4:44 PM

#

engines are dishwasher safe

fast pagoda Mar 22, 2026, 4:44 PM

#

yeah, linear is definitely like product/project planning focus

#

one thing i used for a bit to try that i thought was cool but eventually said f that was called pieces

#

and it was basically microsoft recall but voluntary

#

it did be gathering knowledge tho

fast pagoda Mar 22, 2026, 4:45 PM

#

glass jetty engines are dishwasher safe

agree

#

just like the humble motherboard

#

https://pieces.app

Pieces — Long term memory for your whole workstream

Pieces is your AI companion that captures live context from browsers to IDEs and collaboration tools, manages snippets and supports multiple llms - all while processing data locally for maximum control.

#

this is cool but i got so paranoid so fast

#

fuckin lol

fast pagoda Mar 22, 2026, 4:51 PM

#

fast pagoda it didnt know of the popular ya mama framework <a:pensibe:678483024295493640>

decided to ask echo this

#

#

quantize ya mama Dentge

#

memory efficiency: very low

#

ICANT

rigid snow Mar 22, 2026, 4:56 PM

#

might want to install a gpu anti sag bracket before deploying

fast pagoda Mar 22, 2026, 4:56 PM

#

https://tenor.com/view/damn-shookt-shocked-gif-5580082

Tenor

Daaaaamn

▶ Play video

leaden crest Mar 22, 2026, 5:15 PM

#

fast pagoda

shouldve put “ease of use” as very high for ya mama

fast pagoda Mar 22, 2026, 5:16 PM

#

kek

#

@real sierra finally mentioned again organically, congratulations you're still relevant to the boy
and nice tesla

leaden crest Mar 22, 2026, 5:18 PM

#

now get it to say happy birthday organically

fast pagoda Mar 22, 2026, 5:18 PM

#

i have no idea how to shoehorn that in

leaden crest Mar 22, 2026, 5:18 PM

#

what B is echo I forget

#

like parameter count

fast pagoda Mar 22, 2026, 5:19 PM

#

i tried

#

it varies

#

but this is a 27b

#

gemma3 base

#

The 4b gemma3 version is fucking hilarious

#

But it's like slightly too stupid to do anything useful, not enough capacity to both really dial in the intended style

#

And be smart

#

At least with my current dataset

leaden crest Mar 22, 2026, 5:23 PM

#

neuroLookUp

fast pagoda Mar 22, 2026, 5:23 PM

#

I'm gonna try 9b

#

Er

#

12

rigid snow Mar 22, 2026, 5:23 PM

#

almost pressed enter on rm -rf /* when i actually wanted rm -rf ./* what does this say about my cognitive ability

fast pagoda Mar 22, 2026, 5:24 PM

#

It's highly optimized

#

Maybe a bit overfit on that task though unfortunately @rigid snow

#

But yeah so next round is 9b to see how that splits the difference, then I've gotta enter the qwen 3.5 zone but I'm kinda assuming rdna4 training will be a nightmare with that architecture until stuff is matured more

sage crag Mar 22, 2026, 5:26 PM

#

enub

fast pagoda Mar 22, 2026, 5:26 PM

#

Gemma4 needs to hurry up

rigid snow Mar 22, 2026, 5:26 PM

#

i hate commitment i do rm -rf ./* all the time i'm addicted to it

opaque sigil Mar 22, 2026, 5:26 PM

#

drop the ./

rigid snow Mar 22, 2026, 5:26 PM

#

give me more rm -rf ./*

rigid snow Mar 22, 2026, 5:27 PM

#

opaque sigil drop the ./

no i like being explicit that i want THIS folder what if it decides it wants another

#

what if it hates me

fast pagoda Mar 22, 2026, 5:27 PM

#

opaque sigil drop the ./

Just drop the . And *

rigid snow Mar 22, 2026, 5:27 PM

#

/ is prettier than *

#

so leave just / ye

opaque sigil Mar 22, 2026, 5:28 PM

#

rigid snow no i like being explicit that i want THIS folder what if it decides it wants ano...

this is exactly how you end up removing / neuroHypers

fast pagoda Mar 22, 2026, 5:28 PM

#

Exactly so you just need to adopt the sudo rm -rf / and you'll find salvation

rigid snow Mar 22, 2026, 5:28 PM

#

opaque sigil this is exactly how you end up removing / <:neuroHypers:1135051812722982922>

i like .

#

THIS folder

#

directory oops

fast pagoda Mar 22, 2026, 5:28 PM

#

git add /

opaque sigil Mar 22, 2026, 5:29 PM

#

enub

rigid snow Mar 22, 2026, 5:29 PM

#

"folder" normie language

#

mlntcandy

fast pagoda Mar 22, 2026, 5:29 PM

#

Dir only

#

Not only you call it directory

#

But

#

Shorthand for directory

rigid snow Mar 22, 2026, 5:29 PM

#

dir is short for directory chat

fast pagoda Mar 22, 2026, 5:30 PM

#

Signals that you are literally Linux torbalds

rigid snow Mar 22, 2026, 5:30 PM

#

fast pagoda Mar 22, 2026, 5:30 PM

#

Linux tarballs

#

neuroNOWAYING

opaque sigil Mar 22, 2026, 5:30 PM

#

my point is that * is identical to ./* except there's no way to accidently mistype and yeet something else FOCUS

rigid snow Mar 22, 2026, 5:31 PM

#

look i like gambling ok

opaque sigil Mar 22, 2026, 5:31 PM

#

fair

fast pagoda Mar 22, 2026, 5:31 PM

#

Live life on the edge compute node

opaque sigil Mar 22, 2026, 5:31 PM

#

make sure you do everything as root too

fast pagoda Mar 22, 2026, 5:31 PM

#

Yes no password or reprompt

#

Slows down your flow

rigid snow Mar 22, 2026, 5:32 PM

#

%sudo ALL=(ALL:ALL) NOPASSWD: ALL literally just pasted this from my visudo

fast pagoda Mar 22, 2026, 5:32 PM

#

GIGACHAD

sage crag Mar 22, 2026, 5:47 PM

#

my cp

#

pc

#

neuroCatUuh

#

bad typo

#

my pc has no password

#

for login or root

fast pagoda Mar 22, 2026, 5:49 PM

#

i have pondered doing that on every reinstall

#

but then i get paranoid

#

it sounds hella convenient tho

rigid snow Mar 22, 2026, 5:52 PM

#

i did it mostly so that slop machines can sudo and install whatever

#

^lazy

fast pagoda Mar 22, 2026, 5:53 PM

#

--dangerously-skip-permissions my beloved

#

claude code doesnt have a terminal tho so it's still limited

#

unless it thinks to write a script and execute it to get around it

tender river Mar 22, 2026, 5:57 PM

#

i have

#

custom $SUDO_ASKPASS

#

it just prompts me to confirm

#

pulling the password from keepass

#

peak security neuroSensei

real sierra Mar 22, 2026, 6:00 PM

#

fast pagoda <@196063748354473984> finally mentioned again organically, congratulations you'r...

NAHH

fast pagoda Mar 22, 2026, 6:03 PM

#

if you ever think you don't matter

#

just know

#

you have a fan in him

sage crag Mar 22, 2026, 6:05 PM

#

real sierra <a:NAHH:1229194866110632046>

shr

#

shr

#

5 days

#

enub

fickle rain Mar 22, 2026, 6:07 PM

#

rigid snow

Technically they do hallucinate sometimes (or do work wrong) and Linus yells at them for that

#

Much like me and slop engine

glass jetty Mar 22, 2026, 6:29 PM

#

https://www.youtube.com/watch?v=ldxFjLJ3rVY
he just talk about some concepts to simple algorithm
while there literally no way to make any img look like this
sure you can create simple function that convert original to mapped
but to achieve this you should have ideally mapped 'inner' image in first place

anyway its so cool

#

not a technical difficulties

fast pagoda Mar 22, 2026, 6:32 PM

#

manim is cool

#

i hadnt seen this upload yet

#

hell yeah

fickle rain Mar 22, 2026, 7:09 PM

#

https://projectzero.google/2026/02/gphfh-deep-dive.html
This seems awfully useful

A Deep Dive into the GetProcessHandleFromHwnd API

In my previous blog post I mentioned the GetProcessHandleFromHwnd API. This was an API I didn’t k...

#

Actually not that useful, as you still have to have a handle open to the target to do anything useful

olive sable Mar 22, 2026, 7:27 PM

#

proven merlin Mar 22, 2026, 7:48 PM

#

fast pagoda i only included that because it exists but that would be my last choice, the oth...

I mean, i dont like it, but i prefer it over modern browsers, but yea.. the exasperated tone could habe been due to a lack of time and too many projects that i wanna do.. classic issue

proven merlin Mar 22, 2026, 7:50 PM

#

nocturne olive It's always possible it's not broken, but the CPU mem controller has somehow deg...

Yeah fun thing, we had many ryzen 9 5950x die "recently".. aka sporadic issues with boot and bsods but occassionally theyd run for days on end perfectly find

nocturne olive Mar 22, 2026, 7:50 PM

#

Oh I see

proven merlin Mar 22, 2026, 7:51 PM

#

Changing the ram sometimes fixes it, espescially running just one stick, but like

#

Yea Something is fucky

leaden crest Mar 22, 2026, 7:52 PM

#

wanted to play with some llms designed for role playing

olive sable Mar 22, 2026, 7:53 PM

#

the 5950x is technicall better than the 11900kf. but a dead cpu isnt worth anything

leaden crest Mar 22, 2026, 7:53 PM

#

anyone have an opinion on llama stheno 8B vs mistral nemo mag mell 12B

fast pagoda Mar 22, 2026, 7:56 PM

#

proven merlin I mean, i dont like it, but i prefer it over modern browsers, but yea.. the exas...

lol no big deal i feel i was too harsh in saying you were moaning about it, but i didnt want to edit it since it felt disingenuous, ive been there, my apologies tbh

fast pagoda Mar 22, 2026, 7:57 PM

#

leaden crest anyone have an opinion on llama stheno 8B vs mistral nemo mag mell 12B

are these creative writing models

#

i have no idea what im doing to evaluate them
which is funny considering echo is basically a form of one

amber fractal Mar 22, 2026, 7:58 PM

#

That requires you to be creative and see if it matches your energy.

olive sable Mar 22, 2026, 7:59 PM

#

since im selling the 3090 to superbox, i wonder what i should put in it's place hmm

mighty thorn Mar 22, 2026, 7:59 PM

#

olive sable since im selling the 3090 to superbox, i wonder what i should put in it's place ...

Can I steal it

olive sable Mar 22, 2026, 7:59 PM

#

if you can outbid superbox for it, sure

#

superbox is at 500 + shipping

#

ReallyInnocent

leaden crest Mar 22, 2026, 8:00 PM

#

fast pagoda are these creative writing models

creative writing yeah

#

my google searches claim they’re built to be less cringe than actual rp models

olive sable Mar 22, 2026, 8:01 PM

#

olive sable since im selling the 3090 to superbox, i wonder what i should put in it's place ...

ideally something faster and cheaper.
the 3090 is only expensive rn cuz of the 24gb of vram,
3080ti are only 300 bucks

leaden crest Mar 22, 2026, 8:01 PM

#

that you might see on like girlfriend ai sites

mighty thorn Mar 22, 2026, 8:01 PM

#

olive sable ideally something faster and cheaper. the 3090 is only expensive rn cuz of the 2...

Faster AND cheaper

olive sable Mar 22, 2026, 8:01 PM

#

ye

fast pagoda Mar 22, 2026, 8:01 PM

#

amber fractal That requires you to be creative and see if it matches your energy.

it's really easy to see if it matches my energy when the target is my energy

mighty thorn Mar 22, 2026, 8:01 PM

#

olive sable superbox is at 500 + shipping

I can offer you $25

olive sable Mar 22, 2026, 8:01 PM

#

in other words, no 24gb of vram

mighty thorn Mar 22, 2026, 8:01 PM

#

Not more

#

Just total

amber fractal Mar 22, 2026, 8:02 PM

#

Let me see if I have any Virdience lore I am comfortable using as a something to stress test the creative model

fast pagoda Mar 22, 2026, 8:02 PM

#

olive sable ideally something faster and cheaper. the 3090 is only expensive rn cuz of the 2...

mi100

#

oh you said cheaper

olive sable Mar 22, 2026, 8:02 PM

#

no

fast pagoda Mar 22, 2026, 8:02 PM

#

umm

olive sable Mar 22, 2026, 8:02 PM

#

its for gaming

fast pagoda Mar 22, 2026, 8:02 PM

#

oh

#

9070xt

olive sable Mar 22, 2026, 8:02 PM

#

i need only like 16gb of vram, if even that

proven merlin Mar 22, 2026, 8:02 PM

#

fast pagoda lol no big deal i feel i was too harsh in saying you were moaning about it, but ...

Nah youre all good, no apology needed.

fast pagoda Mar 22, 2026, 8:03 PM

#

https://tenor.com/view/bornskywalker-dap-me-up-woody-woody-handshake-woody-toy-story-gif-26021440

Tenor

amber fractal Mar 22, 2026, 8:04 PM

#

@leaden crest Throw this into the creative model, I'll be throwing it into echo

The Extra Dimensional Physics exist within two higher dimensions, currently unnamed; I like to refer to them as 4d and 5d. We will be discussing these as a 3d cross section along with the axis of time. We'll be dealing with this space in 3 different magnitudes, Magic, Time-Travel, and Realms.

Magic, in this case Mana itself and Souls are considered extra dimensional. The Soul gives mana to the vessel, be it human, crystal, or spirit. The activation of magic shifts the soon to be coined world line.
-# The exact nature of mana isn't decided on yet, could be generated from the soul itself or could be harvested from Unspace, which honestly fits for mana.

-# I'm sorry in advance for abridging this.
The basis of time travel is many world lines exist and when you travel through time, you shift what line you exist within. This offsets the world line, changing your position in this higher level space.

Realms being differing planes of existence, each with their own universe inside. They move thanks to offsets either Magic or splitting of worldlines. When they collide, portals are generated and can either merge or they can be rebounded; this depends on the strength of the realms, if they don't have enough strength to hold themselves together they'll be forced to merge; rewriting history and matter from both halves and continue as one realm.

Unspace as mentioned earlier is the space between world lines and realms, nothing is supposed to exist out there; any foreign objects will be removed from existance. However it does exist, and it can be used. If the matter of Unspace is brought out of such it'll instantly dislocate anything it contacts with into Unspace.

olive sable Mar 22, 2026, 8:04 PM

#

its for 4K gaming in newer triple A titles. so thats why i want ideally faster than 3090

amber fractal Mar 22, 2026, 8:04 PM

#

neuroLookUp

leaden crest Mar 22, 2026, 8:05 PM

#

amber fractal <@855840325359370281> Throw this into the creative model, I'll be throwing it in...

I though echo was funyun’s

#

alt identified?

amber fractal Mar 22, 2026, 8:05 PM

#

It is

#

I have access to the echo server

olive sable Mar 22, 2026, 8:05 PM

#

i also have acces to echo lol

leaden crest Mar 22, 2026, 8:06 PM

#

so someone could turn funyun’s sweet child into a femboy at any time

#

neuroD

leaden crest Mar 22, 2026, 8:07 PM

#

amber fractal <@855840325359370281> Throw this into the creative model, I'll be throwing it in...

wait you want me to prompt it or add it to the lora

olive sable Mar 22, 2026, 8:08 PM

#

leaden crest so someone could turn funyun’s sweet child into a femboy at any time

amber fractal Mar 22, 2026, 8:08 PM

#

leaden crest wait you want me to prompt it or add it to the lora

Prompt it to see how it'd extend it

fast pagoda Mar 22, 2026, 8:08 PM

#

Dentge

leaden crest Mar 22, 2026, 8:08 PM

#

how tf did I get in there neuroCatUuh neurOMEGALUL

olive sable Mar 22, 2026, 8:08 PM

#

i just atted your user id

#

<855840325359370281>

amber fractal Mar 22, 2026, 8:09 PM

#

little known trick

leaden crest Mar 22, 2026, 8:09 PM

#

amber fractal Prompt it to see how it'd extend it

alr bet I will when I try it later

#

is it supposed to do something

opaque sigil Mar 22, 2026, 8:09 PM

#

olive sable <855840325359370281>

fun fact, you can put a \ in front of a ping to get the raw string

leaden crest Mar 22, 2026, 8:10 PM

#

olive sable

wait is echo responding to the personality tests in my discord bio

fast pagoda Mar 22, 2026, 8:10 PM

#

leaden crest wait is echo responding to the personality tests in my discord bio

he got multiple messages at once and many things slid off his smooth brain

nocturne olive Mar 22, 2026, 8:10 PM

#

opaque sigil fun fact, you can put a \ in front of a ping to get the raw string

Still pings though, so be careful

olive sable Mar 22, 2026, 8:10 PM

#

since when do blower style 3090s exist? neurOMEGALUL
"turbo edition" the only thing going turbo will be the noise coming from your pc

leaden crest Mar 22, 2026, 8:10 PM

#

recommended neuroCatUuh echo watches streameroid labs

nocturne olive Mar 22, 2026, 8:11 PM

#

olive sable since when do blower style 3090s exist? <:neurOMEGALUL:1097297318119743638> "tu...

Honestly? That would go kinda great in a system like mine

leaden crest Mar 22, 2026, 8:11 PM

#

his inferencing power is diabolical rn

#

2 for 2 on my attributes

olive sable Mar 22, 2026, 8:11 PM

#

nocturne olive Honestly? That would go kinda great in a system like mine

its 800 bucks so outside your budget

nocturne olive Mar 22, 2026, 8:12 PM

#

Aw

#

Oh well

#

I guess at least since I know it exists I can think about tracking one down later

#

Blowing all the hot air out the back of the case would work amazingly to stop putting hot air into the 4070Ti

olive sable Mar 22, 2026, 8:13 PM

#

cheapest rx 9070xt i could find is 575 euro hmm
so cheaper than normal 3090s, but more than id be getting from superbox

#

i dont think i even need to look at the price sof 5090, 5080 or 4090s

opaque sigil Mar 22, 2026, 8:14 PM

#

nocturne olive Still pings though, so be careful

oh you're right it does, good to know NeurOhISee

split steppe Mar 22, 2026, 8:14 PM

#

i'll save up and probably get a framework desktop board instead of wasting time with gpus

nocturne olive Mar 22, 2026, 8:15 PM

#

opaque sigil oh you're right it does, good to know <:NeurOhISee:1136209034966483084>

Yeah I think someone pinged mods like that in some server

olive sable Mar 22, 2026, 8:15 PM

#

i like framework and all that, but i just don't think the framework desktop is a good deal for my purposes

split steppe Mar 22, 2026, 8:15 PM

#

if you're training then it's for sure not the best, but i don't think you can beat the price for inference

#

the only closest contender would be macs

olive sable Mar 22, 2026, 8:16 PM

#

its for 4K gaming

#

so i need gpu

split steppe Mar 22, 2026, 8:16 PM

#

gaming yuh

#

for me i meant inference

olive sable Mar 22, 2026, 8:16 PM

#

i think the "best deal" i ahve for interference, is not spending any money at all and using my current pc

amber fractal Mar 22, 2026, 8:16 PM

#

Are you sure it isn't a for a projector enub

olive sable Mar 22, 2026, 8:16 PM

#

yes im sure

faint sandal Mar 22, 2026, 8:16 PM

#

hungry

olive sable Mar 22, 2026, 8:16 PM

#

i can use my desktop for that

split steppe Mar 22, 2026, 8:17 PM

#

i use a rx7800xt for gaming, it's been pretty good

olive sable Mar 22, 2026, 8:17 PM

#

im not attaching a whole gaming pc to the volumetric display, im pre-rendering that shit

proven merlin Mar 22, 2026, 8:18 PM

#

amber fractal little known trick

It gets worse with the guild id :misschievous:

olive sable Mar 22, 2026, 8:18 PM

#

this volumetric display project is already 1.5K deep, i dont want it to go up to 2.5K

proven merlin Mar 22, 2026, 8:18 PM

#

nocturne olive Still pings though, so be careful

Since when? Afaik it used to only ping visually, not via notification bubble and sound-

olive sable Mar 22, 2026, 8:19 PM

#

@proven merlin

#

test

nocturne olive Mar 22, 2026, 8:19 PM

#

proven merlin Since when? Afaik it used to only ping visually, not via notification bubble and...

Someone pinged mods like that in some server

proven merlin Mar 22, 2026, 8:19 PM

#

olive sable \<@598481177338445834>

Yea no ping, just a highlight

#

Also tf my mobile app cooked

#

Didnt parse it for whatever reason.. i miss v185

proven merlin Mar 22, 2026, 8:20 PM

#

nocturne olive Someone pinged mods like that in some server

Yea with guild id u do at everyone.. its stupid

split steppe Mar 22, 2026, 8:20 PM

#

that's neat

proven merlin Mar 22, 2026, 8:21 PM

#

Yes, it is, if they still dont actually ping

amber fractal Mar 22, 2026, 8:21 PM

#

leaden crest is it supposed to do something

A small a bit of yapping

proven merlin Mar 22, 2026, 8:21 PM

#

But apparantly they changed it according to some here?

fast pagoda Mar 22, 2026, 8:22 PM

#

in isolation he's just like "ok"

proven merlin Mar 22, 2026, 8:23 PM

#

Hm whats the consensus on Q5 vs Q4 btw? Is the Hardware nativeness worth the loss?

split steppe Mar 22, 2026, 8:23 PM

#

someone has probably measured that for you

#

really depends on the model and quantizer strat i think

opaque sigil Mar 22, 2026, 8:24 PM

#

proven merlin But apparantly they changed it according to some here?

just checked again to make sure, it does in fact ping neuroSadge

#

the user ping that is

proven merlin Mar 22, 2026, 8:24 PM

#

I mean yes, q4 is like ~3-5% loss while Q5 seems to be >~1% according to many graphs, but is the speed up on turing and beyond in yalls Opinions worth it

proven merlin Mar 22, 2026, 8:25 PM

#

opaque sigil just checked again to make sure, it does in fact ping <:neuroSadge:1059236278345...

https://en.wikipedia.org/wiki/Enshittification

Enshittification

Enshittification, also known as platform decay, is a process in which two-sided online products and services decline in quality over time. Initially, vendors create high-quality offerings to attract users, then they degrade those offerings to better serve business customers, and finally degrade their services to both users and business customers...

split steppe Mar 22, 2026, 8:28 PM

#

proven merlin I mean yes, q4 is like ~3-5% loss while Q5 seems to be >~1% according to many gr...

i lean towards squeezing maximum quality and sanity out of the lowest speed i can tolerate, but i use it mainly for research tasks, so i am fine if it's fairly slow. currently using a Q8 at 8tps

fast pagoda Mar 22, 2026, 8:29 PM

#

amber fractal A small a bit of yapping

had to encourage him and then the client didnt like the length of his response or something and it never actually sent but this is what he wrote in the logs:

High Concept
World Ruleset: The universe follows fundamental laws that sentient beings can bend through power or knowledge. Magic is not a violation of these laws but a force that reshapes reality alongside them.

Cosmology: A multidimensional structure where 3D space and time are interwoven with higher planes. These dimensions serve as the foundation for magic and supernatural phenomena across various realms.

Sentience and Consciousness: Consciousness is not restricted to biology; it can emerge from magical entities, spirits, or inanimate objects. The mind-body connection is fluid, allowing for astral projection, possession, and collective consciousness.

Magic System
Source of Mana: Mana is an ethereal energy flowing from higher dimensions. It is a dynamic force that responds to the intentions, emotions, and collective will of sentient beings.

Types of Magic: * Elemental: Control of fire, water, earth, and air.

Temporal & Spatial: Manipulation of time and warping of space.

Necromancy: Interaction with death and spirits.

Enchantment: Imbuing objects with magical properties.

Magical Creatures: A diverse population ranging from benevolent angels and fairies to malevolent demons. These beings often act as guardians or follow specific primal instincts and cosmic codes.

Worldbuilding Details
Cosmology Layers: The multiverse consists of interconnected planes accessible via portals or rifts. Each layer features unique physical laws and magical properties.

Realms and Worlds: A vast array of environments ranging from paradises to hellscapes. These worlds are shaped by varying levels of magic and technology, leading to complex inter-realm trade and conflict.

Factions and Organizations: Various groups compete for power, knowledge, or ideological goals. Their agendas range from preserving cosmic balance to tyrannical expansion.

Rules of Interaction
Magic vs. Technology: Both coexist and can create powerful synergies, though they operate on different principles. Technology is predictable and quantifiable, while magic is volatile and harder to control.

Physical Laws: While generally consistent, laws like gravity and time can vary locally or be warped by magical influence in specific dimensions.

Sentient Beings: Individuals possess a wide spectrum of magical, technical, or physical gifts. Growth is limited only by willpower, though the abuse of power typically results in "dire consequences."```

split steppe Mar 22, 2026, 8:29 PM

#

qwen does the car wash correctly, i wonder if it got that trained in 🤔

fast pagoda Mar 22, 2026, 8:29 PM

#

he really said hop on voice i gotta hear more

split steppe Mar 22, 2026, 8:30 PM

#

i'm fairly annoyed at qwen's reasoning process though, but i don't have the resources or knowledge to retrain it

fast pagoda Mar 22, 2026, 8:31 PM

#

#

k

#

i \dont know how to classify this response lmfao

#

saved on try 2

split steppe Mar 22, 2026, 8:34 PM

#

proven merlin I mean yes, q4 is like ~3-5% loss while Q5 seems to be >~1% according to many gr...

also, the trend is nonlinear, there will be a wall in either memory bandwidth, capacity or compute depending on your hardware, and the quality exponentially degrades the smaller your quant, so just test them all and see what gives you best of both, at a speed you can tolerate for the tasks you're using it for

olive sable Mar 22, 2026, 8:34 PM

#

cheapest 4080 super is 750, 4080 is 700
so compared to the rx 9070 xt for 575 these jsut arent worth it.
and the 4070ti is worse than the rx 9070 xt while also being more expensive.
so 9070 xt might be the way to go

fast pagoda Mar 22, 2026, 8:34 PM

#

#

his rudeness is saving him

split steppe Mar 22, 2026, 8:35 PM

#

the duality of ai

fast pagoda Mar 22, 2026, 8:36 PM

#

hadnt considered this

#

aw he failed

#

#

car washes are too expensive for him

split steppe Mar 22, 2026, 8:37 PM

#

what kind of carwash is 6k

fast pagoda Mar 22, 2026, 8:37 PM

#

average california car wash

nocturne olive Mar 22, 2026, 8:37 PM

#

olive sable cheapest 4080 super is 750, 4080 is 700 so compared to the rx 9070 xt for 575 th...

4080 == 4080 super by the way, but the super had a 200 lower MSRP

split steppe Mar 22, 2026, 8:37 PM

#

he washing a mclaren or something

fast pagoda Mar 22, 2026, 8:38 PM

#

facts and logic

young plover Mar 22, 2026, 8:38 PM

#

split steppe what kind of carwash is 6k

one where you accidentally leave the window open

fast pagoda Mar 22, 2026, 8:38 PM

#

young plover one where you accidentally leave the window open

that's probably the one where the water comes out and splashes on your toesies

amber fractal Mar 22, 2026, 8:38 PM

#

fast pagoda he really said hop on voice i gotta hear more

Considering it is one of my projects, checks out

split steppe Mar 22, 2026, 8:38 PM

#

fast pagoda facts and logic

truth

rigid snow Mar 22, 2026, 8:39 PM

#

fast pagoda facts and logic

how is ts better than frontier llms like 2 months ago

fast pagoda Mar 22, 2026, 8:39 PM

#

he's looped around by being too stupid

#

trips and falls into the right answer

split steppe Mar 22, 2026, 8:40 PM

#

you gotta encourage him whenn he does that

#

reinforcement learning duh

fast pagoda Mar 22, 2026, 8:40 PM

#

good responses go in the db

#

and then are added

rigid snow Mar 22, 2026, 8:40 PM

#

waow rlhf

fast pagoda Mar 22, 2026, 8:40 PM

#

amazing technology

#

the smaller versions tend to just go "??" if given anything too out there

#

split steppe Mar 22, 2026, 8:42 PM

#

speak slower, 5head

#

am trying to force mine to think a specific way thru a system prompt

fast pagoda Mar 22, 2026, 8:44 PM

#

i generally dont give him one unless i undertrained it

#

that's not usually the issue lmfao

#

no eating till it's done

sick owl Mar 22, 2026, 8:50 PM

#

Windows finally forced an update on me neuroDespair

fast pagoda Mar 22, 2026, 8:50 PM

#

did this update disable internet to all their malware by accident

sick owl Mar 22, 2026, 8:51 PM

#

I have a DNS sink so I wouldn't know

fast pagoda Mar 22, 2026, 8:51 PM

#

i have a dnr near my sink

split steppe Mar 22, 2026, 8:51 PM

#

i fucked it again

fast pagoda Mar 22, 2026, 8:51 PM

#

saves time on logistics

sick owl Mar 22, 2026, 8:52 PM

#

split steppe i fucked it again

Haven't had that issue with the 27B