#programming | Neuro-sama Headquarters | Page 121

sick owl Aug 5, 2025, 7:05 PM

#

Good point

opaque wharf Aug 5, 2025, 7:05 PM

#

Would be hard with material in the way. But I think that's just another BSDF

rigid snow Aug 5, 2025, 7:05 PM

#

near useless tech tbh

opaque wharf Aug 5, 2025, 7:06 PM

#

rigid snow near useless tech tbh

Well, for simulation it is the best you can approach it

rigid snow Aug 5, 2025, 7:06 PM

#

maybe not audio necessarily then but rather emf simulations or something?

opaque wharf Aug 5, 2025, 7:07 PM

#

Hmm, I feel like EMF simulation will require a different kind of ASIC. But who knows. Spectral rendering is quite new

rigid snow Aug 5, 2025, 7:07 PM

#

because i've never heard of practical path traced audio applications

opaque wharf Aug 5, 2025, 7:07 PM

#

Same ICANT

lilac flame Aug 5, 2025, 7:09 PM

#

more immersive audio in video games ReallyInnocent

rigid snow Aug 5, 2025, 7:09 PM

#

that's not what i meant by practical and that requires realtime on consumer hardware

opaque sigil Aug 5, 2025, 7:10 PM

#

just one more asic bro

rigid snow Aug 5, 2025, 7:10 PM

#

lilac flame more immersive audio in video games <:ReallyInnocent:1229173141088112640>

also steam audio exists and is foss and best in class

#

and cpu only

opaque wharf Aug 5, 2025, 7:11 PM

#

opaque sigil just one more asic bro

I'd rather company that trains AI have ASIC to do it than using GPU neuroPogHD

rough bloom Aug 5, 2025, 7:11 PM

#

looks very cool but is definitely not for consumers, at least for this first iteration of it (and likely not for at least the next decade)
it wouldn't need 400 GbE QSFP if it wasn't meant for larger deployments in datacenters

opaque sigil Aug 5, 2025, 7:12 PM

#

tbf a h100 is already basically an asic

lilac flame Aug 5, 2025, 7:12 PM

#

rigid snow also steam audio exists and is foss and best in class

oh thats cool i havent heard of that actually

opaque wharf Aug 5, 2025, 7:13 PM

#

Petition to change H100 classification from a GPU to whatever dedicated ML processor is neuroPogHD

rough bloom Aug 5, 2025, 7:13 PM

#

GTPU

opaque wharf Aug 5, 2025, 7:14 PM

#

Can it perform the usual graphic operation tho? Like rendering pixel on a screen?

rough bloom Aug 5, 2025, 7:14 PM

#

N neurOMEGALUL

lilac flame Aug 5, 2025, 7:14 PM

#

doesnt gpu origiinally mean an asic but for graphics

opaque wharf Aug 5, 2025, 7:14 PM

#

It stands for Graphic Processing Unit yes

lilac flame Aug 5, 2025, 7:15 PM

#

now we have gpus without the gpu

olive sable Aug 5, 2025, 7:15 PM

#

ehhh

#

kinda

opaque wharf Aug 5, 2025, 7:15 PM

#

More like TPU

#

Tensor Processing Unit

olive sable Aug 5, 2025, 7:16 PM

#

the whole part of the gpu that made it a necesity to add it to a sepereate device are the parallel cores, and we kept those

#

they just removed the display stuff

rough bloom Aug 5, 2025, 7:16 PM

#

rough bloom N<:neurOMEGALUL:1097297318119743638>

the H100 doesn't support any of the graphics APIs and doesn't have display outputs, so no graphics unless you use CUDA
it does have a few ROPs for some reason but they shouldn't be used for anything

opaque wharf Aug 5, 2025, 7:16 PM

#

There's also the term NPU (Neural Processing Unit)

opaque sigil Aug 5, 2025, 7:16 PM

#

there's a lot of specific texture and shader related hardware FOCUS

lilac flame Aug 5, 2025, 7:16 PM

#

yeah i meant more the term “graphics processing unit” in particular

opaque sigil Aug 5, 2025, 7:17 PM

#

opaque wharf Tensor Processing Unit

wouldn't surprise me if google patented this

opaque wharf Aug 5, 2025, 7:17 PM

#

rough bloom the H100 doesn't support any of the graphics APIs and doesn't have display outpu...

One of you guys from #programming would use that poor few ROPs to run AAA games I swear

olive sable Aug 5, 2025, 7:17 PM

#

you can procces graphics without showing them

#

its just kinda weird to do it like that unless you're rendering

rough bloom Aug 5, 2025, 7:17 PM

#

olive sable you can procces graphics without showing them

not the H100
it literally does not support it

olive sable Aug 5, 2025, 7:17 PM

#

no?

rough bloom Aug 5, 2025, 7:18 PM

#

no

olive sable Aug 5, 2025, 7:18 PM

#

thats a shame

opaque sigil Aug 5, 2025, 7:18 PM

#

there's no support for any graphics apis

warped narwhal Aug 5, 2025, 7:18 PM

#

aren't rops just the part that does rasterisation? if so you could emulate it in software no problem.

olive sable Aug 5, 2025, 7:18 PM

#

i feel like removing graphics api support doesnt even save that much money, its just to make people not use it for gaming???

opaque sigil Aug 5, 2025, 7:19 PM

#

it saves space you can use to put more tensor cores

olive sable Aug 5, 2025, 7:19 PM

#

ah

#

ok ok

rough bloom Aug 5, 2025, 7:19 PM

#

warped narwhal aren't rops just the part that does rasterisation? if so you could emulate it in...

yeah, you'd basically be writing a software renderer in CUDA LULE
I don't think you can use the actual ROPs without a graphics API but I may be wrong on that

opaque wharf Aug 5, 2025, 7:19 PM

#

olive sable i feel like removing graphics api support doesnt even save that much money, its ...

The silicone literally doesn't have the needed stuff to do it

olive sable Aug 5, 2025, 7:19 PM

#

well yes, but you could always add it

opaque wharf Aug 5, 2025, 7:19 PM

#

catdespair

opaque sigil Aug 5, 2025, 7:19 PM

#

there are some nvidia cards where they disabled the graphics api in the driver but the hardware is there but iirc the h100 for example literally does not have the hardware for it

warped narwhal Aug 5, 2025, 7:20 PM

#

graphics apis have minimum specs for capabilities, so if we can space by not adding all the fixed function stuff and rendering output, then it is either more space for cores, or you can have a smaller die and a higher yield per silicon cookie

olive sable Aug 5, 2025, 7:20 PM

#

opaque wharf <:catdespair:1087521982817509426>

its not that expensive if you're doing the lithography of the rest of the chip anyways

opaque wharf Aug 5, 2025, 7:20 PM

#

Silicone space is premium. Like VERY premium

olive sable Aug 5, 2025, 7:20 PM

#

but ye you dont need it

warped narwhal Aug 5, 2025, 7:21 PM

#

if you could add dx12 to your gpu, but then you can only fit 10 onto a wafer instead of 15, then is it really worth it? esp. when 99% of your customers will not use the api?

olive sable Aug 5, 2025, 7:21 PM

#

h100's are designed for clusters anyways so you dont need 100 display outs you wont be using

warped narwhal Aug 5, 2025, 7:21 PM

#

you basically lose 5 $10k sales for an unused feature

cosmic sphinx Aug 5, 2025, 7:35 PM

#

all matches for today have ended
pretty much all expected results

rigid snow Aug 5, 2025, 7:40 PM

#

opus 4 not winning a single game is surprising, did they forget to turn on reasoning or what

cosmic sphinx Aug 5, 2025, 7:42 PM

#

rigid snow opus 4 not winning a single game is surprising, did they forget to turn on reaso...

no, they did
claude models just suck at chess ibr

#

better at coding in some way

opaque sigil Aug 5, 2025, 7:42 PM

#

what bothers me the most

cosmic sphinx Aug 5, 2025, 7:42 PM

#

but I honestly dont see even a point in using claude in 2025 with their prices

opaque sigil Aug 5, 2025, 7:42 PM

#

why first to 4

cosmic sphinx Aug 5, 2025, 7:43 PM

#

opaque sigil why first to 4

its bo4

rigid snow Aug 5, 2025, 7:43 PM

#

bo7

cosmic sphinx Aug 5, 2025, 7:43 PM

#

or was supposed to be

opaque sigil Aug 5, 2025, 7:44 PM

#

bo4 makes no sense neuroCry

rigid snow Aug 5, 2025, 7:44 PM

#

first to 4 is bo7, why are they saying bo4

cosmic sphinx Aug 5, 2025, 7:45 PM

#

I think they dont understand how best-of and first-to work :LULE:

#

but the games went first to 4, not 2
wtf fr

rigid snow Aug 5, 2025, 7:48 PM

#

vibe-organized tourney

#

this format makes no sense

desert wave Aug 5, 2025, 7:56 PM

#

bo3 but sometimes bo5

hoary lion Aug 5, 2025, 8:04 PM

#

rough bloom GTPU

tpu neuroHypers

#

TRC program is also giving off tpuv6

#

V6!!

#

just one generation away from their frontier

#

and probably what served gemini for a quite long time

#

neuroPogHD

#

poggers

olive sable Aug 5, 2025, 8:11 PM

#

aight i have split the 1 megafile into 3 smaller ones

#

ill need to reformat some stuff still tho

#

rn i define the needed persistent variables in the main.cpp class and i pass them as a pointer to the other file, but if a variable is mainly used in that other file wouldnt it make more sense to define it there?

#

ofcourse that would mean it'd be global but still

opaque wharf Aug 5, 2025, 8:21 PM

#

What are you trying to achieve exactly?

olive sable Aug 5, 2025, 8:21 PM

#

triangle

#

NeuroClueless

opaque wharf Aug 5, 2025, 8:22 PM

#

I mean, what variable do you need and where?

olive sable Aug 5, 2025, 8:25 PM

#

for example, i create VkDebugUtilsMessengerEXT debugMessenger in the main.cpp class, but i only use it in the debug manager file

opaque sigil Aug 5, 2025, 8:26 PM

#

why tf is the version for the extra/haskell-data-default at version 0.7.1.1-356
what are they doing that they need so many revisions for the pkgbuild neuroCry

#

looks like most of the haskell packages are at a couple dozen or 100+ hmm

olive sable Aug 5, 2025, 8:27 PM

#

thats the day of the year the build was released Minamhm

#

on the 356th day of the year they released 0.7.1.1

#

but ye i have no clue

tender river Aug 5, 2025, 8:28 PM

#

opaque sigil why tf is the version for the `extra/haskell-data-default` at version `0.7.1.1-3...

mm ghc or transitive dependency updates probably

opaque sigil Aug 5, 2025, 8:29 PM

#

i guess that would make sense hmm

opaque sigil Aug 5, 2025, 8:29 PM

#

olive sable for example, i create `VkDebugUtilsMessengerEXT debugMessenger` in the main.cpp ...

you could factor them out into individual modules that hold the relevant variables and then use that

tender river Aug 5, 2025, 8:30 PM

#

tender river mm ghc or transitive dependency updates probably

since a version bump is basically a manual nonenforced indicator "the build output changed please rebuild/redownload"

dense marsh Aug 5, 2025, 8:30 PM

#

mornibg peeps

olive sable Aug 5, 2025, 8:31 PM

#

opaque sigil you could factor them out into individual modules that hold the relevant variabl...

like a struct or something?

opaque sigil Aug 5, 2025, 8:32 PM

#

that'd work yeah

stark needle Aug 5, 2025, 8:32 PM

#

Was it already said that the new gpt models dropped

#

They suck

#

Trained to benchmaxx

olive sable Aug 5, 2025, 8:33 PM

#

bwaa

#

is deepseek up to anything cool recently?

hoary lion Aug 5, 2025, 8:33 PM

#

neuroBwaa

olive sable Aug 5, 2025, 8:33 PM

#

bwaadow

hoary lion Aug 5, 2025, 8:33 PM

#

olive sable is deepseek up to anything cool recently?

no r2 means no cool stuff yet evilBwaa

olive sable Aug 5, 2025, 8:33 PM

#

AquaCry

#

ai dev has certainly halted

hoary lion Aug 5, 2025, 8:35 PM

#

deepseek is not the only team tho

#

but still

#

stark needle Aug 5, 2025, 8:36 PM

#

Chat this model is so bad I can't

#

I have it running on a h200 with the lilac training mixture

olive sable Aug 5, 2025, 8:36 PM

#

google shut the fuck up. ive said no to this 6 months ago

stark needle Aug 5, 2025, 8:36 PM

#

And loss wise

#

Gemma 3 4b outperforms by a mile

#

In mean token accuracy

#

It's so bad at vtuber which means

#

It was trained only on stem

#

Aka to answer questions no one asked

glass flower Aug 5, 2025, 8:38 PM

#

stark needle It was trained only on stem

LULE i mean.. they did say they primarily trained it on coding

#

tho i have no idea how you are suppose to run the 20b version.. its so slow on my 4080 and qwen3 30b outperforms it for me

stark needle Aug 5, 2025, 8:38 PM

#

It's slow cause

#

It needs custom megablocks kernel

hoary lion Aug 5, 2025, 8:39 PM

#

custom kernels ugh

#

we all hate them

glass flower Aug 5, 2025, 8:39 PM

#

it also runs like 60% on the cpu for me.

hoary lion Aug 5, 2025, 8:39 PM

#

stark needle Gemma 3 4b outperforms by a mile

vs 20b?

stark needle Aug 5, 2025, 8:40 PM

#

hoary lion vs 20b?

Ye

#

The gpt 20b is especially bad

hoary lion Aug 5, 2025, 8:40 PM

#

i haven't touched any of them yet

nocturne olive Aug 5, 2025, 8:41 PM

#

I guess I can determine those things to be pointless to even experiment with

glass flower Aug 5, 2025, 8:42 PM

#

YEP tho the 120b seems to atleast be decent

#

from the few people that i heard talk about it

#

stick with qwen3 for now. YES

stark needle Aug 5, 2025, 8:43 PM

#

Qwen 3 megabased

glass flower Aug 5, 2025, 8:43 PM

#

pepetears i wish there was a qwen3-coder:8b version

stark needle Aug 5, 2025, 8:43 PM

#

Maybe i can pull off a good finetune for gpt oss

rigid snow Aug 5, 2025, 8:43 PM

#

stark needle The gpt 20b is especially bad

#

this is the stuff i expect a 1b model do

tender river Aug 5, 2025, 8:44 PM

#

openai is a small indie company please understand

glass flower Aug 5, 2025, 8:44 PM

#

that has probably something to do with their "safe" training

stark needle Aug 5, 2025, 8:44 PM

#

tender river openai is a small indie company please understand

true

stark needle Aug 5, 2025, 8:45 PM

#

glass flower that has probably something to do with their "safe" training

They overfit the model on their rules actually

#

There were some posts showing

#

"based on my internal rules"

#

Type shit

#

Where it would list them verbatim within the reasoning traces

#

lmfao

opaque wharf Aug 5, 2025, 8:45 PM

#

stark needle Where it would list them verbatim within the reasoning traces

Small indie company indeed

glass flower Aug 5, 2025, 8:45 PM

#

scrajj honestly i don't get it... gpt hasn't been top in a long time and now they are just shooting themself in the foot

stark needle Aug 5, 2025, 8:46 PM

#

Cause they need to release gpt5

hoary lion Aug 5, 2025, 8:46 PM

#

glass flower that has probably something to do with their "safe" training

this probably

glass flower Aug 5, 2025, 8:46 PM

#

coding: claude or gemini 2.5 pro.
everything else... one of the chinese models LULE

stark needle Aug 5, 2025, 8:46 PM

#

Sam altman circlejerk on twitter trying to maximize reaction

#

It's so bad

opaque wharf Aug 5, 2025, 8:46 PM

#

stark needle Cause they need to release gpt5

"Look, our latest model performs better than the previous by a wide margin!"

stark needle Aug 5, 2025, 8:46 PM

#

Yea

#

They first "accidentally leaked" on hf

#

That's so fake

hoary lion Aug 5, 2025, 8:47 PM

#

that hype tbh sucked

rigid snow Aug 5, 2025, 8:47 PM

#

yeah created like 30 repos, very accidenta;

stark needle Aug 5, 2025, 8:47 PM

#

Lmao

glass flower Aug 5, 2025, 8:47 PM

#

tink btw are these 2 the horizon models? probably not right

#

like the anon models that are available on openrouter

stark needle Aug 5, 2025, 8:48 PM

#

Hopefully not

#

That's likely gpt 4.99999

glass flower Aug 5, 2025, 8:48 PM

#

NeurOhISee no it doesn't seem like it. the horizon models are still up

glass flower Aug 5, 2025, 8:48 PM

#

stark needle That's likely gpt 4.99999

tink idk.. i feel like maybe its a new deepseek?

#

KEKW from what i saw its pretty good at coding. so it can't be a gpt model

hoary lion Aug 5, 2025, 8:49 PM

#

I would forever not forgive altman for not releasing gpt 5 today

rigid snow Aug 5, 2025, 8:49 PM

#

hoary lion Aug 5, 2025, 8:50 PM

#

?? lol

rigid snow Aug 5, 2025, 8:50 PM

#

people have now started abusing the app name header feature on openrouter to spread anthropic propaganda

stark needle Aug 5, 2025, 8:50 PM

#

Wanna see how funny this is

rigid snow Aug 5, 2025, 8:50 PM

#

insane

hoary lion Aug 5, 2025, 8:50 PM

#

i mean it;s true tho

dusky jackal Aug 5, 2025, 8:50 PM

#

rigid snow

LMFAO BASED SCHIZO RANT

I was kind of excited for the open weight models, but this is even better than I thought! neurOMEGALUL

rigid snow Aug 5, 2025, 8:51 PM

#

wait they link to a domain

stark needle Aug 5, 2025, 8:51 PM

#

Chinese Model vs gpt oss

rigid snow Aug 5, 2025, 8:51 PM

#

rigid snow wait they link to a domain

that redirects to some random person's github

#

lol

stark needle Aug 5, 2025, 8:51 PM

#

2.4 loss vs <2 loss

dusky jackal Aug 5, 2025, 8:51 PM

#

stark needle It's so bad at vtuber which means

Oh, nvm then.

stark needle Aug 5, 2025, 8:52 PM

#

dusky jackal Oh, nvm then.

Even for finetuning it sucks

rigid snow Aug 5, 2025, 8:52 PM

#

cause moe

stark needle Aug 5, 2025, 8:52 PM

#

I hope it fixes itself magically

rigid snow Aug 5, 2025, 8:52 PM

#

right

stark needle Aug 5, 2025, 8:52 PM

#

rigid snow cause moe

SystemL 21b is also a moe

#

It's 20b vs 21b

opaque wharf Aug 5, 2025, 8:52 PM

#

Moe moe kyun

stark needle Aug 5, 2025, 8:52 PM

#

Both 3b active params

rigid snow Aug 5, 2025, 8:53 PM

#

stark needle SystemL 21b is also a moe

i mean regading finetuning

opaque wharf Aug 5, 2025, 8:53 PM

#

opaque wharf Moe moe kyun

Surprised no one named a model this yet

stark needle Aug 5, 2025, 8:53 PM

#

rigid snow i mean regading finetuning

No the chinese moe trains just fine

#

Fyi

#

The stuff in the hf repo

hoary lion Aug 5, 2025, 8:53 PM

#

shad

stark needle Aug 5, 2025, 8:54 PM

#

Is for some stuff 1:1 copied from t5

hoary lion Aug 5, 2025, 8:54 PM

#

i think the activation is too small

stark needle Aug 5, 2025, 8:54 PM

#

Lmfao

hoary lion Aug 5, 2025, 8:54 PM

#

3.6B A

#

too smol

#

what did we exepct

stark needle Aug 5, 2025, 8:54 PM

#

No qwen 30b is also 3b

#

Active

#

It's just they copied from fucking t5 for some reason

dusky jackal Aug 5, 2025, 8:55 PM

#

stark needle Even for finetuning it sucks

Kind of off the rails, but personally I don’t really believe AI VTubers are really seperate from their base model unless they’re fine-tuned. A prompted mistral 7b is just a mistral 7b to me.

stark needle Aug 5, 2025, 8:55 PM

#

dusky jackal Kind of off the rails, but personally I don’t really believe AI VTubers are real...

Yea that's why i have a training corpus for lilac

dusky jackal Aug 5, 2025, 8:55 PM

#

stark needle Yea that's why i have a training corpus for lilac

Based.

stark needle Aug 5, 2025, 8:56 PM

#

Why tf is mean token accuracy stuck at 55%

#

Actually

hoary lion Aug 5, 2025, 8:56 PM

#

lmao

stark needle Aug 5, 2025, 8:56 PM

#

I'll let it be for the night

#

And see

#

How much money I'll waste

#

The trash architecture was expected tho

#

They aint google

#

Google actually drops huge arch improvements

#

E.g. gemma 3n is based

hoary lion Aug 5, 2025, 8:58 PM

#

pretty sure

#

they debuff them before release

#

cause "GPT 5"

dusky jackal Aug 5, 2025, 8:58 PM

#

stark needle Even for finetuning it sucks

I’m guessing it’s too strict, yeah?

hoary lion Aug 5, 2025, 8:58 PM

#

suckers

stark needle Aug 5, 2025, 8:59 PM

#

dusky jackal I’m guessing it’s too strict, yeah?

No it was just trained on math and code so it's having rough time absorbing the vtuber data

dusky jackal Aug 5, 2025, 8:59 PM

#

stark needle No it was just trained on math and code so it's having rough time absorbing the ...

Ahh got it.

#

That’s a shame.

stark needle Aug 5, 2025, 8:59 PM

#

Yea

#

Tbh

#

I'll just release this finetune oss

#

Lmao

#

Yall can do whatever with this then

hoary lion Aug 5, 2025, 9:00 PM

#

sauce for uhh

#

lilac

#

neuroHypers

#

i actually don't know who is lilac still

dusky jackal Aug 5, 2025, 9:01 PM

#

stark needle Yall can do whatever with this then

Guessing Lilac won’t be using that then neurOMEGALUL

stark needle Aug 5, 2025, 9:01 PM

#

hoary lion i actually don't know who is lilac still

lilac

olive sable Aug 5, 2025, 9:02 PM

#

enub

hoary lion Aug 5, 2025, 9:02 PM

#

nub

#

fumo always have that stupid gaze i like

stark needle Aug 5, 2025, 9:03 PM

#

I love this plushie so much

#

If i had the money I'd buy 20

hoary lion Aug 5, 2025, 9:04 PM

#

fumo is life ahh

stark needle Aug 5, 2025, 9:04 PM

#

3600 bucks tho

#

😭

#

For 20

olive sable Aug 5, 2025, 9:04 PM

#

thats crazy

#

i can make most parts of a plush myself, except for the stitching on the eyes

stark needle Aug 5, 2025, 9:05 PM

#

For that u need an embroidery machine

hoary lion Aug 5, 2025, 9:05 PM

#

a what machine??

#

that sounds ominous for no reason

olive sable Aug 5, 2025, 9:06 PM

#

i have an embroidery machine at home, its just form 1920

stark needle Aug 5, 2025, 9:06 PM

#

opaque wharf Aug 5, 2025, 9:06 PM

#

stark needle

Do it the olden ways

#

BY HAND

olive sable Aug 5, 2025, 9:07 PM

#

olive sable i have an embroidery machine at home, its just form 1920

its permanently attatched to a table and you need to manually spin it with your feet KEKW

#

its not feasable to make anything useful with it

opaque wharf Aug 5, 2025, 9:08 PM

#

olive sable its not feasable to make anything useful with it

It is feasible. Just need a lot of practice like most things enub

olive sable Aug 5, 2025, 9:08 PM

#

its one of these. probably not the exactr same model but the same brand

hoary lion Aug 5, 2025, 9:08 PM

#

this is ancient wow

opaque wharf Aug 5, 2025, 9:09 PM

#

Ahhh, singer. My grandma has one of those.

#

And still very dexterous too when using the machine

olive sable Aug 5, 2025, 9:09 PM

#

ive only made an actual plush once, and it was a pain in the ass

opaque wharf Aug 5, 2025, 9:09 PM

#

To sew stuff

olive sable Aug 5, 2025, 9:10 PM

#

manual stitching and i stole the stuffing from an old pillow

#

it was pretty shabby but it was free

rigid snow Aug 5, 2025, 9:11 PM

#

#sewing

opaque wharf Aug 5, 2025, 9:11 PM

#

I forgot how easy it is to get sewing and other textile product here since it is one of our country largest export

rigid snow Aug 5, 2025, 9:12 PM

#

rigid snow #sewing

i'm so glad the #screeps arc is gone at least temporarily, no interesting topics were brought up

hoary lion Aug 5, 2025, 9:13 PM

#

was it necessary to mention that forgotten word

#

now screepers would arise

rigid snow Aug 5, 2025, 9:13 PM

#

i'd bonk them back into their holes they'll crawl out of

tender river Aug 5, 2025, 9:14 PM

#

shiro slander neuroSadge

olive sable Aug 5, 2025, 9:14 PM

#

rigid snow i'm so glad the #screeps arc is gone at least temporarily, no interesting topics...

#

so anyways back to sewing

glass flower Aug 5, 2025, 9:14 PM

#

rigid snow i'm so glad the #screeps arc is gone at least temporarily, no interesting topics...

soo.... any good wall building algorithms you guys developed for your screeps? i currently just letting mine do their thing and upgrade my controller

rigid snow Aug 5, 2025, 9:15 PM

#

you buy bricks and lay them down

#

good technique

olive sable Aug 5, 2025, 9:15 PM

#

i just place the walls manually

hoary lion Aug 5, 2025, 9:15 PM

#

why tf perplexity brought this up

#

no I HATE GPT

stark needle Aug 5, 2025, 9:16 PM

#

olive sable its one of these. probably not the exactr same model but the same brand

Holy shit also have one of those 1:1 at my dads house

glass flower Aug 5, 2025, 9:16 PM

#

hoary lion why tf perplexity brought this up

WHATT gpt-5

olive sable Aug 5, 2025, 9:16 PM

#

stark needle Holy shit also have one of those 1:1 at my dads house

neuroNOWAYING

#

in the middle ages flanders used to be the best textile makers and the richest county in Europe.

rigid snow Aug 5, 2025, 9:16 PM

#

hoary lion why tf perplexity brought this up

openai engineer in this chat neuroNOWAYING

olive sable Aug 5, 2025, 9:16 PM

#

hoary lion Aug 5, 2025, 9:17 PM

#

neuroNOWAYING

hoary lion Aug 5, 2025, 9:17 PM

#

olive sable

you broke the nowaying chain

glass flower Aug 5, 2025, 9:17 PM

#

mods he is leaking gpt-5

stark needle Aug 5, 2025, 9:17 PM

#

HES LEAKING AGI

#

What

#

No way

hoary lion Aug 5, 2025, 9:17 PM

#

fun fact is that gpt 5 actually got leaked on perplexity

#

neuroDeadge

#

pretty sure someone got fired

glass flower Aug 5, 2025, 9:17 PM

#

OMEGADANCEBUTFAST

stark needle Aug 5, 2025, 9:17 PM

#

Perplexity got the hidden url

rigid snow Aug 5, 2025, 9:18 PM

#

does perplexity ignore robots.txt or what

#

if so fuck them

stark needle Aug 5, 2025, 9:18 PM

#

Yes

#

Cloudflare blocks them now

#

Lmao

rigid snow Aug 5, 2025, 9:18 PM

#

good

opaque wharf Aug 5, 2025, 9:18 PM

#

rigid snow does perplexity ignore robots.txt or what

Almost everyone nowadays

olive sable Aug 5, 2025, 9:18 PM

#

~~gpt5~~ gpt4 + fusion card + gpt4

#

NODDERS

rigid snow Aug 5, 2025, 9:18 PM

#

opaque wharf Almost everyone nowadays

who's everyone

stark needle Aug 5, 2025, 9:18 PM

#

rigid snow good

They were rotating ips and headers and everything

glass flower Aug 5, 2025, 9:19 PM

#

deadass

rigid snow Aug 5, 2025, 9:19 PM

#

stark needle They were rotating ips and headers and everything

i can understand that

opaque wharf Aug 5, 2025, 9:19 PM

#

rigid snow who's everyone

Any AI company

rigid snow Aug 5, 2025, 9:19 PM

#

yes but perplexity is a search engine

stark needle Aug 5, 2025, 9:19 PM

#

Even outside their officially documented ip ranges

opaque wharf Aug 5, 2025, 9:20 PM

#

rigid snow yes but perplexity is a search engine

With AI

glass flower Aug 5, 2025, 9:20 PM

#

opaque wharf With AI

so google?

stark needle Aug 5, 2025, 9:20 PM

#

Perplexity is chatgpt wrapper

rigid snow Aug 5, 2025, 9:21 PM

#

opaque wharf With AI

i don't think ai companies roll their own web search while ignoring robots.txt

opaque wharf Aug 5, 2025, 9:21 PM

#

glass flower so google?

Yes, but I don't know if they use the same crawler for search index and their AI stuff

stark needle Aug 5, 2025, 9:21 PM

#

They dont

#

They have a extended bot

#

For ai

rigid snow Aug 5, 2025, 9:21 PM

#

rigid snow i don't think ai companies roll their own web search while ignoring robots.txt

scraping for data and for indexing are a bit different

opaque wharf Aug 5, 2025, 9:22 PM

#

Yes, and the AI division is what ignoring robots.txt

stark needle Aug 5, 2025, 9:22 PM

#

Me when ai companies say "any public data is open to take"

#

BOOM

rigid snow Aug 5, 2025, 9:22 PM

#

that doesn't make sense

rigid snow Aug 5, 2025, 9:23 PM

#

rigid snow that doesn't make sense

because how would gpt-5 leak then? 3 mentions of it in pretraining wouldn't suddenly make the model recall it

#

that means that their search ignores robots.txt

stark needle Aug 5, 2025, 9:25 PM

#

It does

#

It explicitly does

#

https://blog.cloudflare.com/perplexity-is-using-stealth-undeclared-crawlers-to-evade-website-no-crawl-directives/

The Cloudflare Blog

Perplexity is using stealth, undeclared crawlers to evade website n...

Perplexity is repeatedly modifying their user agent and changing IPs and ASNs to hide their crawling activity, in direct conflict with explicit no-crawl preferences expressed by websites.

#

Lmfao

#

Cloudflare blog

rigid snow Aug 5, 2025, 9:26 PM

#

absolutely insane

stark needle Aug 5, 2025, 9:26 PM

#

sometimes failing to even fetch — robots.txt files

#

sussycat

opaque wharf Aug 5, 2025, 9:27 PM

#

Now the question is, do they use the same crawler for their AI and their search engine? Because google as you said, uses a different bot

glass flower Aug 5, 2025, 9:28 PM

#

isn't perplexity just their AI?

opaque wharf Aug 5, 2025, 9:28 PM

#

Hence my statement, AI companies ignoring robot.txt

stark needle Aug 5, 2025, 9:29 PM

#

Perplexity is chatgpt wrapper

#

https://huggingface.co/openai/gpt-oss-20b/discussions/14

openai/gpt-oss-20b · This model is unbelievably ignorant.

opaque wharf Aug 5, 2025, 9:41 PM

#

In short, they want open source to remain the near exclusive domain of autistic coding nerds.

#

evilWheeze

hoary lion Aug 5, 2025, 9:41 PM

#

its simple

#

they dont want 4o competitor

#

so openai made it clueless on real world info

#

glueless

cosmic sphinx Aug 5, 2025, 9:46 PM

#

m8 find me a goddamn phone that can run a 20B model

olive sable Aug 5, 2025, 9:47 PM

#

sam altman's phone, similar to a gaming laptop, uses 5W on batrtery and 1500W when plugged in

#

Minamhm

cosmic sphinx Aug 5, 2025, 9:48 PM

#

yh bro has 3 built in H100's with a threadripper CPU in his phone

#

custom package

olive sable Aug 5, 2025, 9:48 PM

#

bro has sub-zero cooling, in kelvin

sick owl Aug 5, 2025, 9:49 PM

#

GPT oss 20B with 128k tokens of context on my GPU

#

Holy shit

sick owl Aug 5, 2025, 9:49 PM

#

cosmic sphinx m8 find me a goddamn phone that can run a **20B** model

The new Pixels are getting 16 gigs of ram IIRC

hoary lion Aug 5, 2025, 9:50 PM

#

where is the remaining 4gb sir

#

quant?

tender river Aug 5, 2025, 9:50 PM

#

you too can run gpt-oss on your phone with a ChatGPT™ Plus® plan! buy today at https://chatgpt.com/plus (not sponsored)

ChatGPT

A conversational AI system that listens, learns, and challenges

sick owl Aug 5, 2025, 9:50 PM

#

hoary lion where is the remaining 4gb sir

Its 20B parameters not 20GB lmao

cosmic sphinx Aug 5, 2025, 9:50 PM

#

sick owl GPT oss 20B with 128k tokens of context on my GPU

i aint runnin this on my pc 🥀

#

tried downloading the fucking qwen-image yesterday and BSOD'd my PC by accident

sick owl Aug 5, 2025, 9:50 PM

#

cosmic sphinx i aint runnin this on my pc 🥀

Its MoE, you can run it EZ with some offloading

cosmic sphinx Aug 5, 2025, 9:51 PM

#

sick owl Its MoE, you can run it EZ with some offloading

on a 4080?

sick owl Aug 5, 2025, 9:51 PM

#

cosmic sphinx on a 4080?

Easily

#

You have 16 gigs to play with on a 4080

cosmic sphinx Aug 5, 2025, 9:51 PM

#

ima try getting lm studio back in and cook smth ig

sick owl Aug 5, 2025, 9:51 PM

#

Hell you won't have to offload

#

Just drop context down to like 90k instead of 128

#

You've only gotta save 1GB of VRAM so your OS can do its thing

#

Hell drop down to 64k context and you can run it while doing other shit with zero issues

glass flower Aug 5, 2025, 9:54 PM

#

huhExplode i can barely run the gpt 20b with 8k context

#

how are you running it with 128k

sick owl Aug 5, 2025, 9:55 PM

#

glass flower <a:huhExplode:1367159841486929980> i can barely run the gpt 20b with 8k context

You're using ollama aren't you

glass flower Aug 5, 2025, 9:55 PM

#

mybad

sick owl Aug 5, 2025, 9:55 PM

#

Lmao, yeah ollamas memory management is fucked

#

I'm using llama.cpp with flash attention

fast pagoda Aug 5, 2025, 9:55 PM

#

be me
distrohopping
"man i wish there were a way to just kinda store my exact configuration across installs of my system, that way even if i haev to wipe it for some reason it's just configured exactly the same way"
realize that's what nixos is for
it's over

glass flower Aug 5, 2025, 9:56 PM

#

i haven't figured out a good alternative that allows me to quickly use models.... annytfSigh i really should get to it and get rid of ollama. i hate where its heading with their GUI that silently installs when you update

#

i should make my own thin-ollama client LULE

#

i just need something to spin up models and then shut them down after some time

fast pagoda Aug 5, 2025, 9:57 PM

#

vLLM

glass flower Aug 5, 2025, 9:57 PM

#

but doesn't that always run the model?

#

i thought its more for server setup

fast pagoda Aug 5, 2025, 9:59 PM

#

huh it serves it like ollama does, i didnt know it did anything differently, you can just do like vllm serve `Qwen/Qwen2.5-1.5B and itstarts serving that on an endpoint

#

are you saying like you want it to idle and unload the model if idle?

glass flower Aug 5, 2025, 9:59 PM

#

YES

#

since i only need the model to load when i use it with vscode

#

and then it should go away if its not in use

fast pagoda Aug 5, 2025, 10:00 PM

#

good question

#

seems like it would be something it would support

opaque wharf Aug 5, 2025, 10:01 PM

#

fast pagoda >be me >distrohopping >"man i wish there were a way to just kinda store my exact...

For mentioning the possibility of using NixOS alone, you have summoned the NixOS council here

fast pagoda Aug 5, 2025, 10:01 PM

#

lmao

#

i have nixOS on my ventoy

#

havent had the courage yet

opaque wharf Aug 5, 2025, 10:01 PM

#

glass flower since i only need the model to load when i use it with vscode

LM Studio

cosmic sphinx Aug 5, 2025, 10:01 PM

#

@sick owl where do I switch my models download directory I forgor

#

o found it

fast pagoda Aug 5, 2025, 10:02 PM

#

lmstudio uses llama.cpp

#

just like ollama

opaque wharf Aug 5, 2025, 10:02 PM

#

fast pagoda havent had the courage yet

Good luck with that lol. I am content with my Arch setup

fast pagoda Aug 5, 2025, 10:02 PM

#

im on cachyOS right now

#

for my main pc

#

i just keep distrohopping on my laptop

glass flower Aug 5, 2025, 10:03 PM

#

fast pagoda lmstudio uses llama.cpp

scrajj but does it do memory management better or worse?

opaque wharf Aug 5, 2025, 10:03 PM

#

glass flower <a:scrajj:1167103365801578577> but does it do memory management better or worse?

You have a GUI at least to control a lot of the parameter

cosmic sphinx Aug 5, 2025, 10:03 PM

#

only 12 gb of files wauw

#

this is gonna be ez

fast pagoda Aug 5, 2025, 10:04 PM

#

my ventoy drive rn lol

glass flower Aug 5, 2025, 10:04 PM

#

opaque wharf You have a GUI at least to control a lot of the parameter

LUL a gui is exactly the reason i want to move away from ollama. so thats not a positive

fast pagoda Aug 5, 2025, 10:04 PM

#

opaque wharf You have a GUI at least to control a lot of the parameter

lm studio's UI is the reason i am moving away from it

#

it has some stuff but it's frustratingly limited

cosmic sphinx Aug 5, 2025, 10:04 PM

#

fast pagoda lm studio's UI is the reason i am moving away from it

better than running into a risk of bsod'ing my own pc bc I cant handle memory shortages catSUS

opaque wharf Aug 5, 2025, 10:04 PM

#

I see lol. I thought you would be interested in the auto unload model

fast pagoda Aug 5, 2025, 10:05 PM

#

yeah lm studio do be having that i forgot

#

that's what i used to use for echo but i havent set him up on this pc yet

glass flower Aug 5, 2025, 10:05 PM

#

opaque wharf I see lol. I thought you would be interested in the auto unload model

surely... its not diffcult to get that functionallity with vllm

opaque wharf Aug 5, 2025, 10:06 PM

#

And also this config for inference

fast pagoda Aug 5, 2025, 10:06 PM

#

https://github.com/mostlygeek/llama-swap

apparently you can use this with any backend including vllm

GitHub

GitHub - mostlygeek/llama-swap: Model swapping for llama.cpp (or an...

Model swapping for llama.cpp (or any local OpenAPI compatible server) - mostlygeek/llama-swap

#

and it supports auto offload

#

Automatic unloading of models after timeout by setting a ttl in features

opaque wharf Aug 5, 2025, 10:07 PM

#

glass flower surely... its not diffcult to get that functionallity with vllm

I honestly have no idea lol. I use LLM the same way you do, that is only loading it when needed

#

That's why I don't need a daemon running all the time

#

Just open another app as needed

fast pagoda Aug 5, 2025, 10:08 PM

#

was reading this thread and found a few that support it but it's mostly thru frontends

#

https://www.reddit.com/r/LocalLLaMA/comments/1k63qy6/any_llm_backends_that_autounload_models_like/

From the LocalLLaMA community on Reddit

Explore this post and more from the LocalLLaMA community

glass flower Aug 5, 2025, 10:08 PM

#

hmm vllm has /sleep and /wake_up endpoints.. so you can probably just have a slim python script that is proxy

fast pagoda Aug 5, 2025, 10:08 PM

#

speaking of ai slopshit i found that stable diffusion is like 2x faster on my 3080 on this cachyOS install

#

than it was on windows

#

so that's cool

#

check this out

#

https://github.com/BoredBrownBear/text-generation-webui-model_ducking

GitHub

GitHub - BoredBrownBear/text-generation-webui-model_ducking: An ext...

An extension for oobabooga/text-generation-webui that automatically unloads and reloads your model. - BoredBrownBear/text-generation-webui-model_ducking

#

that's the whole point of this thing

#

but it's for oogabooga text gen webui and sillytavern

cosmic sphinx Aug 5, 2025, 10:10 PM

#

ok chat this is first time im trying a MoE local model

#

what do you increase number of experts for

fast pagoda Aug 5, 2025, 10:11 PM

#

dont

#

just dont change number of experts usually

#

it can cause it to be weird and/or shit

#

it's fine tuned with the built in number of experts

cosmic sphinx Aug 5, 2025, 10:12 PM

#

neuroOhISay

fast pagoda Aug 5, 2025, 10:12 PM

#

more experts you would think is better automatically but there is usually a ssweet spot

#

the router itself is the problem with that

#

i mean theoretically more experts = better

#

but unless you're on base model only and planning on fine tuning further with your chosen number of experts it'll probably get unstable quickly

olive sable Aug 5, 2025, 10:15 PM

#

the more i use c++, the more i question if i suck at programming or if c++ sucks as a language

#

and then i relaize the anser is both

fast pagoda Aug 5, 2025, 10:15 PM

#

i was gonan say

#

it's both

#

lol

olive sable Aug 5, 2025, 10:15 PM

#

it was never a valid question

fast pagoda Aug 5, 2025, 10:15 PM

#

hmmm claude has opus 4.1 now i didnt even notice that happened

#

pre-empt gpt5 i guess

#

LULE agi

olive sable Aug 5, 2025, 10:16 PM

#

but ye anyways i have now made the seperate files not contain functions, but a class with functions and variables cuz it jsut works better that way idk

#

chay would not be proud cuz of the OOP

fast pagoda Aug 5, 2025, 10:17 PM

#

if it's not haskell i don't think chay is proud

#

ever

cosmic sphinx Aug 5, 2025, 10:17 PM

#

ok im runnin in a lil bit of trouble here
I think lm studio prioritizes taking up my normal RAM over VRAM for some rsn

#

like it fills up close to 100% way too fast

olive sable Aug 5, 2025, 10:17 PM

#

i started using capitals for the first letter of a classname tho, that is huge

fast pagoda Aug 5, 2025, 10:17 PM

#

cosmic sphinx ok im runnin in a lil bit of trouble here I think lm studio prioritizes taking u...

choose the gpu offload layers

#

appropriately

#

it'll default to like

#

8

#

when there's way more than that

#

you would configure that when you're initializing the model

cosmic sphinx Aug 5, 2025, 10:18 PM

#

its literally set to 20/24

fast pagoda Aug 5, 2025, 10:18 PM

#

oh

#

try 24/24 LUL

opaque wharf Aug 5, 2025, 10:19 PM

#

The brute force way lmao

cosmic sphinx Aug 5, 2025, 10:19 PM

#

gonna lower the context

fast pagoda Aug 5, 2025, 10:19 PM

#

it should put whatever doesnt fit in system ram anyways

#

using anything more than like 8k context locally balloons memory usage pretty fast yeah

cosmic sphinx Aug 5, 2025, 10:20 PM

#

???????

#

ok fine system, you win

olive sable Aug 5, 2025, 10:20 PM

#

i cant read that, is that ram?

cosmic sphinx Aug 5, 2025, 10:20 PM

#

yes

olive sable Aug 5, 2025, 10:20 PM

#

ah

fast pagoda Aug 5, 2025, 10:20 PM

#

your problem is that says память and not memory

cosmic sphinx Aug 5, 2025, 10:20 PM

#

4k length it is LULW

olive sable Aug 5, 2025, 10:21 PM

#

i may not have a brain gentleman but i have an idea

fast pagoda Aug 5, 2025, 10:21 PM

#

an idea and a dream

olive sable Aug 5, 2025, 10:21 PM

#

remove your ram and then it cant use your ram

fast pagoda Aug 5, 2025, 10:21 PM

#

thonk

#

the perfect plan

cosmic sphinx Aug 5, 2025, 10:22 PM

#

am scared

#

it says 15 gb is taken but FUCKING WHERE

#

im fairly certain i toggled off the reserve memory checkbox

#

am really not clocking where is that 15 gb coming out from

fast pagoda Aug 5, 2025, 10:25 PM

#

waht size model

cosmic sphinx Aug 5, 2025, 10:25 PM

#

the 20b one

fast pagoda Aug 5, 2025, 10:25 PM

#

what is your VRAM

cosmic sphinx Aug 5, 2025, 10:25 PM

#

16 gb

fast pagoda Aug 5, 2025, 10:26 PM

#

what quant of model

#

i would expect you to fit like a 20b in q4

cosmic sphinx Aug 5, 2025, 10:26 PM

#

?? im sure oai didnt release any quants yet

#

fast pagoda Aug 5, 2025, 10:26 PM

#

oh yeah

#

so

#

it's gonna need like

#

40gb

cosmic sphinx Aug 5, 2025, 10:27 PM

#

nah no way fr fr

#

@sick owl did u scam me

fast pagoda Aug 5, 2025, 10:27 PM

#

i dunno i'd epxect it to somehow use that much

#

for a 20b

sick owl Aug 5, 2025, 10:28 PM

#

cosmic sphinx <@1113590687636656208> did u scam me

?

#

unsloth/gpt-oss-20b-GGUF

#

ggml-org/gpt-oss-20b-GGUF

#

The model is MXFP4 by default

cosmic sphinx Aug 5, 2025, 10:28 PM

#

ok so i did download the wrong model POG

fast pagoda Aug 5, 2025, 10:29 PM

#

yeah a 20b if it's full precision

#

bf16

#

needs 2 bytes/param

#

which is like 40-45 gb of vram

cosmic sphinx Aug 5, 2025, 10:29 PM

#

but hey it works

sick owl Aug 5, 2025, 10:29 PM

#

fast pagoda yeah a 20b if it's full precision

Its 20B

fast pagoda Aug 5, 2025, 10:29 PM

#

err

#

i meant 20b

cosmic sphinx Aug 5, 2025, 10:29 PM

#

sort of, but im scared for my RAM so im gonna delete this shit

sick owl Aug 5, 2025, 10:29 PM

#

Oh gotcha

cosmic sphinx Aug 5, 2025, 10:29 PM

#

sick owl unsloth/gpt-oss-20b-GGUF

is it released already tho

sick owl Aug 5, 2025, 10:29 PM

#

cosmic sphinx is it released already tho

Yes

#

I'm literally running it neurOMEGALUL

cosmic sphinx Aug 5, 2025, 10:30 PM

#

thonk

fast pagoda Aug 5, 2025, 10:31 PM

#

make sure ur using flash attention and kv offloading stuff

sick owl Aug 5, 2025, 10:31 PM

#

cosmic sphinx <:thonk:602047082064510986>

Repo stats probably haven't been updated yet

#

But Unsloth are my go-to quant providers

fast pagoda Aug 5, 2025, 10:31 PM

#

unsloth are based

sick owl Aug 5, 2025, 10:31 PM

#

And if you go to the ggml-org one they're literally the guys who make llama.cpp neurOMEGALUL

cosmic sphinx Aug 5, 2025, 10:31 PM

#

f16 is the only available so ima go for this one ig

fast pagoda Aug 5, 2025, 10:31 PM

#

just a couple real smart mfs mathing it up

#

f16 is thicc

#

that's gonna need 40gb

sick owl Aug 5, 2025, 10:31 PM

#

fast pagoda that's gonna need 40gb

"F16" here is the mixed precision format the model was trained in

#

Its actually mostly 4 bit

fast pagoda Aug 5, 2025, 10:32 PM

#

oh

sick owl Aug 5, 2025, 10:32 PM

#

Comes in at around 14GB

fast pagoda Aug 5, 2025, 10:32 PM

#

i was thinking it was a full f16 weight model

#

i havent looked into it at all yet

sick owl Aug 5, 2025, 10:32 PM

#

Nah, the embedding and output layers are FP16 but most of the model was trained in FP4

fast pagoda Aug 5, 2025, 10:32 PM

#

interesting

#

guess they determined that was enough

sick owl Aug 5, 2025, 10:32 PM

#

Very, its the first release like this I've seen

#

They merged in support for the MXFP4 format in llama.cpp too

fast pagoda Aug 5, 2025, 10:33 PM

#

makes sense with how q4 is like the default i keep running into

#

everyone keeps measuring what their model will run on with fineprint of **in q4

cosmic sphinx Aug 5, 2025, 10:33 PM

#

if that one was 12 t/s unoptimized I wonder how fast this one gonna run

fast pagoda Aug 5, 2025, 10:33 PM

#

12 t/s inc

sick owl Aug 5, 2025, 10:34 PM

#

cosmic sphinx if that one was 12 t/s unoptimized I wonder how fast this one gonna run

Very

cosmic sphinx Aug 5, 2025, 10:34 PM

#

download on this 1 is holy fucking slow tho

sick owl Aug 5, 2025, 10:34 PM

#

sick owl Very

Its ridiculously fast

fast pagoda Aug 5, 2025, 10:34 PM

#

wpmder wjat this lm studio linux download is inb4 flatpak or appimage

#

god dammit it's .AppImage

sick owl Aug 5, 2025, 10:34 PM

#

Classic lmstudio

fast pagoda Aug 5, 2025, 10:35 PM

#

time to download a sketchy AUR version

sick owl Aug 5, 2025, 10:35 PM

#

Very much a closed source program so you get stuff like that but its good enough

cosmic sphinx Aug 5, 2025, 10:35 PM

#

3 MB/s evilDeadge

sick owl Aug 5, 2025, 10:35 PM

#

I still prefer to stick with llama.cpp though

fast pagoda Aug 5, 2025, 10:35 PM

#

i just like using it for very fast tests

#

cuz i havent set up any llm shit on this install of cachy

sick owl Aug 5, 2025, 10:35 PM

#

cosmic sphinx 3 MB/s <:evilDeadge:1150735483396161586>

You are downloading in llama.cpp itself right?

#

Because that's a lot faster, they throttle browser downloads on huggingface iirc

cosmic sphinx Aug 5, 2025, 10:35 PM

#

im downloading in the fucking lm studio

sick owl Aug 5, 2025, 10:35 PM

#

o7

#

In that case you just have a slow connection to their servers

fast pagoda Aug 5, 2025, 10:36 PM

#

no viruses here

#

surely not

cosmic sphinx Aug 5, 2025, 10:36 PM

#

the og model got yeeted in fast but i guess hf is fucked up

sick owl Aug 5, 2025, 10:36 PM

#

minibotSus

fast pagoda Aug 5, 2025, 10:36 PM

#

actually they look normal thankfully

sick owl Aug 5, 2025, 10:36 PM

#

Optimal sampler params here btw

cosmic sphinx Aug 5, 2025, 10:37 PM

#

Mistral GPT pog

sick owl Aug 5, 2025, 10:37 PM

#

https://huggingface.co/unsloth/gpt-oss-20b-GGUF/blob/main/params

params · unsloth/gpt-oss-20b-GGUF at main

fast pagoda Aug 5, 2025, 10:38 PM

#

#

lol this is already the default

#

that gets shown

cosmic sphinx Aug 5, 2025, 10:38 PM

#

Native MXFP4 quantization: The models are trained with native MXFP4 precision for the MoE layer, making gpt-oss-120b run on a single H100 GPU and the gpt-oss-20b model run within 16GB of memory

within 16GB of memory

#

https://tenor.com/KoqH.gif

Tenor

#

only if you use like 2k context

fast pagoda Aug 5, 2025, 10:39 PM

#

gotta be your connection

cosmic sphinx Aug 5, 2025, 10:40 PM

#

fast pagoda gotta be your connection

thats not the

#

unsloth

#

neuro7

#

u fell down the same slope I did

fast pagoda Aug 5, 2025, 10:40 PM

#

it was recommended by lm studio surely it'll be fine LULE

sick owl Aug 5, 2025, 10:41 PM

#

BF16 ono

fast pagoda Aug 5, 2025, 10:41 PM

#

#

now i know it's your connection for sure

#

cuz i get 70MB/s while downloading both

#

for each

#

engine revving sound

#

i need to get a 10g ethernet card

cosmic sphinx Aug 5, 2025, 10:42 PM

#

fast pagoda now i know it's your connection for sure

Erm

sick owl Aug 5, 2025, 10:44 PM

#

Weird

fast pagoda Aug 5, 2025, 10:44 PM

#

need new ethernet

sick owl Aug 5, 2025, 10:44 PM

#

You got a VPN on?

#

Could be that

cosmic sphinx Aug 5, 2025, 10:45 PM

#

sick owl You got a VPN on?

not really a vpn, just a bypass script

fast pagoda Aug 5, 2025, 10:45 PM

#

lmstudio should be connecting to hf direct anyways

cosmic sphinx Aug 5, 2025, 10:45 PM

#

restarted app & download, seems sorta better

fast pagoda Aug 5, 2025, 10:45 PM

#

it must be something with hf where you are maybe

cosmic sphinx Aug 5, 2025, 10:45 PM

#

12 mb/s

#

i'll download it rn and toy around with tmrw

#

1:46 AM over here

fast pagoda Aug 5, 2025, 10:47 PM

#

lmstudio is unusable

#

wayland icon

#

instead of the app icon

#

it's over

#

did you maybe have this toggled @cosmic sphinx

#

i forgot about this setting

olive sable Aug 5, 2025, 10:48 PM

#

you know you fucked up when you change a file, and then you get 20 errors from completely different files EvilDIESOFCRINGE

fast pagoda Aug 5, 2025, 10:48 PM

#

nah that just means your 20 other files are wrong

#

not the one that just changed

olive sable Aug 5, 2025, 10:48 PM

#

surely

#

NeuroClueless

fast pagoda Aug 5, 2025, 10:48 PM

#

big_brain

olive sable Aug 5, 2025, 10:49 PM

#

fast pagoda Aug 5, 2025, 10:49 PM

#

exactly

#

i feel like i read some anecdote recently about someone who was having inexplicable errors in his code for something and it turned out to be a cpu bug

olive sable Aug 5, 2025, 10:49 PM

#

neurOMEGALUL

sick owl Aug 5, 2025, 10:51 PM

#

fast pagoda i feel like i read some anecdote recently about someone who was having inexplica...

That would actually drive me insane in the truest sense

fast pagoda Aug 5, 2025, 10:51 PM

#

now i find the reason i didnt want to dive in to running this crap on linux

#

exciting times ahead

olive sable Aug 5, 2025, 10:51 PM

#

i can imagine intel saying "do no under any circumstances do these 25 instructions after each other or your cpu will literally kill itself"
with the shit they've been doing lately

fast pagoda Aug 5, 2025, 10:51 PM

#

i think this just means it's running out of VRAM

fast pagoda Aug 5, 2025, 10:52 PM

#

olive sable i can imagine intel saying "do no under any circumstances do these 25 instructio...

they just need a longer instruction pipeline

#

bigger branch predictor

#

how aboout 120 instead of 40 y'all

#

just predict a year ahead

olive sable Aug 5, 2025, 10:52 PM

#

oh i forgot they've been doing those "core ultra" cpu's

#

to me i just thought 14900KS is the latest and greatest they have

sick owl Aug 5, 2025, 10:53 PM

#

After locking the model in memory at 128k tokens

#

This is while watching twitch mind

fast pagoda Aug 5, 2025, 10:54 PM

#

olive sable to me i just thought 14900KS is the latest and greatest they have

yeah they changed the naming again for the lulz so now it's once again impossible to tell what is what unless you already kjnow

glass flower Aug 5, 2025, 10:54 PM

#

im downloading the unsloth version annytfSittu lets see if ollama can run it

fast pagoda Aug 5, 2025, 10:54 PM

#

y'all i was so close today too

olive sable Aug 5, 2025, 10:54 PM

#

apparently 290K or whatever the fuck its called has a built in npu

sick owl Aug 5, 2025, 10:55 PM

#

olive sable Aug 5, 2025, 10:55 PM

#

i feel like thats anti-marketing, nobody wants that

sick owl Aug 5, 2025, 10:55 PM

#

glass flower im downloading the unsloth version <:annytfSittu:1247083219753369610> lets see i...

Be aware ollama blows up the context cache size

nocturne olive Aug 5, 2025, 10:55 PM

#

olive sable apparently 290K or whatever the fuck its called has a built in npu

Not like it helps anyone if you can't get good memory bandwidth

fast pagoda Aug 5, 2025, 10:55 PM

#

should've just done it

nocturne olive Aug 5, 2025, 10:55 PM

#

In LLM inference bandwidth is always the limiting factor

opaque sigil Aug 5, 2025, 10:55 PM

#

until it isn't FOCUS

fast pagoda Aug 5, 2025, 10:55 PM

#

they wanted over 3k still

#

so i said fk that

nocturne olive Aug 5, 2025, 10:56 PM

#

opaque sigil until it isn't <:FOCUS:1168267148523737239>

And when is it not? On any modern GPU they run out of bandwidth before compute

olive sable Aug 5, 2025, 10:56 PM

#

its kinda funny how all these big companies are now boasting features that activly make me not want to get their product

nocturne olive Aug 5, 2025, 10:56 PM

#

olive sable its kinda funny how all these big companies are now boasting features that activ...

AI slop?

fast pagoda Aug 5, 2025, 10:56 PM

#

was tempted on the 24gb intel part though

#

that i should've done

#

ddint know this model was out at the time tho

#

that makes sense why claude put out 4.1 opus today suddenly

olive sable Aug 5, 2025, 10:57 PM

#

nocturne olive AI slop?

kinda. im fine with ai in ms paint to remove backgrounds and shit, but dont use my data please

#

no recall

#

absolutly not

fast pagoda Aug 5, 2025, 10:57 PM

#

hey that's why i moved to linux

nocturne olive Aug 5, 2025, 10:57 PM

#

Linux is peak

fast pagoda Aug 5, 2025, 10:57 PM

#

i really dont feel like getting thought policed randomly cuz i opened the wrong shit when it took a screenshot

nocturne olive Aug 5, 2025, 10:57 PM

#

If I want AI features, I can just host my own model on my own hardware and keep my own data as my own

cosmic sphinx Aug 5, 2025, 10:57 PM

#

fast pagoda did you maybe have this toggled <@126638787462692865>

no
the og model is just too large and unoptimized
thats why we have based unsloth

olive sable Aug 5, 2025, 10:57 PM

#

im moving to linux soon enough, but i do need windows for my college program so ill be dualbooting

fast pagoda Aug 5, 2025, 10:57 PM

#

run windows in a virtual machine lule

nocturne olive Aug 5, 2025, 10:58 PM

#

If I ever need Windows again I'm gonna just VM it

olive sable Aug 5, 2025, 10:58 PM

#

nah

glass flower Aug 5, 2025, 10:58 PM

#

ollama can't run the gguf version of it

olive sable Aug 5, 2025, 10:58 PM

#

im doing a game-dev program, i dont want the performance to suck as

fast pagoda Aug 5, 2025, 10:58 PM

#

for a non-graphical application idk why vm wouldn't be more than fine

#

be the one to make it work in proton

#

//wine

#

//bottles

nocturne olive Aug 5, 2025, 10:58 PM

#

olive sable im doing a game-dev program, i dont want the performance to suck as

Then just run on WINE or using the Linux versions of software

fast pagoda Aug 5, 2025, 10:59 PM

#

why run windows when i could have a system clock THIS verbose

sick owl Aug 5, 2025, 10:59 PM

#

Okay seems like the unsloth quant is fucked

#

GGML org it is

fast pagoda Aug 5, 2025, 10:59 PM

#

tis a bit drunk

cosmic sphinx Aug 5, 2025, 11:00 PM

#

sick owl Okay seems like the unsloth quant is fucked

why thebfucj jm downloadkng this thing

olive sable Aug 5, 2025, 11:00 PM

#

nocturne olive Then just run on WINE or using the Linux versions of software

i will need Visual Studio Enterprise, Microsoft Office, Adobe Creative Cloud, Visual studio community, Unity, Figma and Blender

opaque sigil Aug 5, 2025, 11:00 PM

#

my condolences

fast pagoda Aug 5, 2025, 11:00 PM

#

visual studio is the killer there probably

olive sable Aug 5, 2025, 11:00 PM

#

and a drawing tablet for some reason too

nocturne olive Aug 5, 2025, 11:00 PM

#

Adobe and Office are the two problems I see

fast pagoda Aug 5, 2025, 11:00 PM

#

those too

steel lily Aug 5, 2025, 11:00 PM

#

olive sable i will need `Visual Studio Enterprise, Microsoft Office, Adobe Creative Cloud, V...

I wouldn't wish VS upon my worst enemy 🫡

fast pagoda Aug 5, 2025, 11:00 PM

#

idk i never expect visual studio fat to work properly on linux

cosmic sphinx Aug 5, 2025, 11:00 PM

#

sick owl GGML org it is

tell us how that runs, i may consider testing tmrw

sick owl Aug 5, 2025, 11:01 PM

#

cosmic sphinx tell us how that runs, i may consider testing tmrw

Will do neuro7

opaque sigil Aug 5, 2025, 11:01 PM

#

fast pagoda idk i never expect visual studio fat to work properly on linux

it's not like it has a reason to exist on linux with how tightly integrated it is with windows development tbf

fast pagoda Aug 5, 2025, 11:01 PM

#

whhere's the lobotomized and incoherent mradermacher q2 quant

#

perhaps merged with a random qwen model for no reason too

nocturne olive Aug 5, 2025, 11:02 PM

#

technically with both GPUs combined I could load the 120B model at Q2

olive sable Aug 5, 2025, 11:03 PM

#

olive sable and a drawing tablet for some reason too

the lenovo tablet i got a year ago already has a pen digitiser thing built in along with a magnet wireless charger for it on the back, i just need the pen for it still cuz it wasnt included with the tablet itslef for some reason

fast pagoda Aug 5, 2025, 11:03 PM

#

opaque sigil it's not like it has a reason to exist on linux with how tightly integrated it i...

oh yeah definitely it's just one of the things that i expect to fit the "never, absolutely not, definitely won't be a smooth experience" type thing when trying to use it on *nix

nocturne olive Aug 5, 2025, 11:03 PM

#

nocturne olive *technically* with both GPUs combined I could load the 120B model at Q2

But not like I need it, I have no use for an LLM on my local system

opaque sigil Aug 5, 2025, 11:04 PM

#

as expensive as it was, i absolutely love my surface pen neuroPogHD

olive sable Aug 5, 2025, 11:04 PM

#

the tablet itself was only 240 bucks of aliexpress, im also getting the pen on there

fast pagoda Aug 5, 2025, 11:05 PM

#

there are some mega cheap drawing tablets on amazon

#

i wonder how shit they are

olive sable Aug 5, 2025, 11:05 PM

#

if i were to get them from a local company the tablet would be 700 and the pen 140

sick owl Aug 5, 2025, 11:05 PM

#

Okay, even with the fucked up unsloth quant GPT oss outperformed Qwen 3 32B

fast pagoda Aug 5, 2025, 11:05 PM

#

kinda thought about getting one the other day

sick owl Aug 5, 2025, 11:05 PM

#

sick owl Okay, even with the fucked up unsloth quant GPT oss outperformed Qwen 3 32B

This is the real deal

opaque sigil Aug 5, 2025, 11:05 PM

#

also i sure love how there are like a dozen different standards for digital pens neuroCry

#

can we not

olive sable Aug 5, 2025, 11:06 PM

#

fast pagoda there are some mega cheap drawing tablets on amazon

imo you're better of with aliexpress than amazon in terms of dropshipping

fast pagoda Aug 5, 2025, 11:06 PM

#

yeah i use amazon to find the item

#

and then buy it on ali

#

lol

olive sable Aug 5, 2025, 11:06 PM

#

neurOMEGALUL

fast pagoda Aug 5, 2025, 11:06 PM

#

might as well get it from the same place

#

but amazon easier to sift through

fast pagoda Aug 5, 2025, 11:07 PM

#

opaque sigil also i sure love how there are like a dozen different standards for digital pens...

relevant xkcd

cosmic sphinx Aug 5, 2025, 11:07 PM

#

sick owl Okay, even with the fucked up unsloth quant GPT oss outperformed Qwen 3 32B

honestly
how exactly is it fucked up

fast pagoda Aug 5, 2025, 11:08 PM

#

gibberish

sick owl Aug 5, 2025, 11:08 PM

#

cosmic sphinx honestly how exactly is it fucked up

Tokenisation bugs

olive sable Aug 5, 2025, 11:08 PM

#

opaque sigil also i sure love how there are like a dozen different standards for digital pens...

yep, even lenovo in their tablet ecosystem has 5+ different pens that don't work together. And Lenovo also likes naming things in the Chinese market differently than the western market.
the xiaoxin pad pro 2023 i got is called the tab P12 here, and its 3x the price

fast pagoda Aug 5, 2025, 11:08 PM

#

i wonder how the unsloth version ended up larger

#

guess just from saving it in flat f16

#

assuming that's what was done

#

the mxfp4 weights wouldnt be real f16 weights but would still be slightly larger in disk i assume

cosmic sphinx Aug 5, 2025, 11:09 PM

#

unsloth barely was tested so far it seems

#

so we were running in blind

fast pagoda Aug 5, 2025, 11:10 PM

#

that's par for the course

#

model releases

#

everyone releases some form of it

#

fix later

cosmic sphinx Aug 5, 2025, 11:11 PM

#

I mean, knowing its THE OAI model, I thought more ppl would run it thru by now

fast pagoda Aug 5, 2025, 11:11 PM

#

it's been like

#

6 hours

#

hasnt it

#

lol

sick owl Aug 5, 2025, 11:12 PM

#

Unsloth also had the sampler parameters wrong ICANT

cosmic sphinx Aug 5, 2025, 11:12 PM

#

unsloth version released 40 minutes ago despair

cosmic sphinx Aug 5, 2025, 11:13 PM

#

sick owl Unsloth also had the sampler parameters wrong <:ICANT:1093292528066904195>

temp 1.0 sillycat

sick owl Aug 5, 2025, 11:13 PM

#

cosmic sphinx temp 1.0 <:sillycat:1085011552530346126>

Its a reasoning model, they get weirdge with this stuff

fast pagoda Aug 5, 2025, 11:13 PM

#

sick owl Unsloth also had the sampler parameters wrong <:ICANT:1093292528066904195>

actually classic

#

i feel like this particular param thing does happen like

#

every time

#

with unsloth lol

#

love them but they love shipping the wrong inference params

rigid snow Aug 5, 2025, 11:18 PM

#

ok is the 20b good at whatever nerd slop they trained it on

#

math coding whatever

fast pagoda Aug 5, 2025, 11:18 PM

#

probably p good for programming but idk why you'd do a local model for brogramming

cosmic sphinx Aug 5, 2025, 11:18 PM

#

fast pagoda probably p good for programming but idk why you'd do a local model for brogrammi...

minimal chances of data leaks to bro corpos

fast pagoda Aug 5, 2025, 11:19 PM

#

blud is working on top secret code

cosmic sphinx Aug 5, 2025, 11:19 PM

#

if I worked in the defense industry

fast pagoda Aug 5, 2025, 11:19 PM

#

i figured out long ago that anthropic has no use for my shitty code

rigid snow Aug 5, 2025, 11:19 PM

#

fast pagoda blud is working on top secret code

don’t give it an email tool

fast pagoda Aug 5, 2025, 11:19 PM

#

well yeah defense industry would be a little different if u need a clearnace

cosmic sphinx Aug 5, 2025, 11:20 PM

#

but even then it'd be hard to work from my own pc

sick owl Aug 5, 2025, 11:20 PM

#

fast pagoda probably p good for programming but idk why you'd do a local model for brogrammi...

It can do agentic stuff like browser tasks too which is neat

cosmic sphinx Aug 5, 2025, 11:20 PM

#

honestly yeah, making it work as a local agent

rigid snow Aug 5, 2025, 11:20 PM

#

rigid snow don’t give it an email tool

i’ve seen schizo gh issues filed by claude

fast pagoda Aug 5, 2025, 11:20 PM

#

every time i've used something to agentic anything i'm sat there like wow i could've done this myself already

#

i guess i don't have that much to delegate away

cosmic sphinx Aug 5, 2025, 11:21 PM

#

opus 4.1 is such a meme upgrade evilWheeze

fast pagoda Aug 5, 2025, 11:21 PM

#

2% better on swe bench == agi

cosmic sphinx Aug 5, 2025, 11:21 PM

#

fast pagoda every time i've used something to agentic anything i'm sat there like wow i coul...

consider it a sneak peek to the future possibilities

fast pagoda Aug 5, 2025, 11:21 PM

#

yeah i mean i know it's poggers that it's possible i just don't necessarily have anything for an agent to do atm

#

other than research

#

i got gpt pro on a whim since i've been testing the $2 billion dollar subscriptions one by one for amonth

#

and i'm struggling to find things to make use of it

sick owl Aug 5, 2025, 11:22 PM

#

sick owl Unsloth also had the sampler parameters wrong <:ICANT:1093292528066904195>

Okay I'm getting mixed signals, they recommend unsloths samplers on some guides at their website

#

And then the ones I just screenshot on their github repo

fast pagoda Aug 5, 2025, 11:23 PM

#

in the same article?

#

i figure the article would be out of date

#

i'd trust what's on the repo

cosmic sphinx Aug 5, 2025, 11:23 PM

#

yeah no1 upgrades articles fast enough

fast pagoda Aug 5, 2025, 11:24 PM

#

im malding that i tried the $200 gemini subscription right before they put out deep think mode because i got it thinkin deep think was imminent

#

but i was 1 month ahead

#

might've done the same thing with gpt5 tbh

#

because i see this out

#

but when gpt5

#

although sama was posting gpt5 screenshots on xitter the other day

cosmic sphinx Aug 5, 2025, 11:25 PM

#

fast pagoda and i'm struggling to find things to make use of it

I love how sam just gave a sneak peek in that video, where bros whole desktop was cluttered with txt files

'can u move all these to trash pls thx'

Blud press Ctrl+A and Del

fast pagoda Aug 5, 2025, 11:25 PM

#

ngl ive had claude code do that same task

#

"help, too many files, fix plz"

#

i guess i do have agentic tasks i just give them all to claude code

#

and dont consider them that

#

because they dont need to use a visual thing

#

they just do it cli

cosmic sphinx Aug 5, 2025, 11:27 PM

#

on the other hand, if like 70% of the 100 files on desktop were empty, but the other 30% had important content, then if you could trust the agent to look which had the content and save them instead of deleting, that'd be cooler of a demo

#

altho even that is sorta ez when u just sort by file size

rigid snow Aug 5, 2025, 11:27 PM

#

cosmic sphinx I love how sam just gave a sneak peek in that video, where bros whole desktop wa...

reminds me of the promptboard thing i made vedalCry *presses cmd-space* “textedit pls” *waits 10 seconds for it to type “textedit”*

fast pagoda Aug 5, 2025, 11:27 PM

#

i did that w/ claude code with a bunch of versions of something i'd written but kept saving in different directories with useless differentiators for a name

#

so i had it open each one and figure out which was furthest ahead

#

and then order them in order of like

#

up-to-date-ness

cosmic sphinx Aug 5, 2025, 11:28 PM

#

its nice that for this model u can select the reasoning effort, like for o series models

#

if that acc gives some improvement

fast pagoda Aug 5, 2025, 11:28 PM

#

well

#

max reasoning can actually cause some degradation

#

depending on the task

#

so that's what i imagine you'd use it for

#

low effort for low effort stuff

#

max for hard stuff

#

if the task at hand is too simple sometimes they overthink on max reasoning mode

cosmic sphinx Aug 5, 2025, 11:30 PM

#

"What is 2+2"

Thought for 3 minutes 48 seconds...

fast pagoda Aug 5, 2025, 11:30 PM

#

7

#

cuz it considered 48 different possibilities of what you mean by 2+2

sick owl Aug 5, 2025, 11:30 PM

#

Not experienced the same bugs with the ggml org quant

sick owl Aug 5, 2025, 11:30 PM

#

cosmic sphinx "What is 2+2" > Thought for 3 minutes 48 seconds...

Lemme guess, high reasoning effort neurOMEGALUL

fast pagoda Aug 5, 2025, 11:30 PM

#

and came to the conclusion that you're actually tricking it 3 ways

#

"the user is asking a simple math question, so the answer would be 2+2 = 4

BUT WHAT IF...."

sick owl Aug 5, 2025, 11:31 PM

#

Classic

fast pagoda Aug 5, 2025, 11:31 PM

#

get the answer

#

#

every time

olive sable Aug 5, 2025, 11:33 PM

#

simple math question so 2+2=4
but what if the user lives in a non-Euclidean dimension? ARGendoHmm

opaque wharf Aug 5, 2025, 11:33 PM

#

What am I looking at here now?

fast pagoda Aug 5, 2025, 11:33 PM

#

ai slop schizophrenia

cosmic sphinx Aug 5, 2025, 11:33 PM

#

opaque wharf What am I looking at here now?

discussing openai open source slop

opaque wharf Aug 5, 2025, 11:33 PM

#

Nice

fast pagoda Aug 5, 2025, 11:33 PM

#

that's what OSS stands for

#

in gpt-oss

#

not open source software but open source slop

#

the article on hf is titled Welcome GPT OSS, the new open-source model family from OpenAI!

#

family

#

unless just meaning 20 and 120b

#

maybe more inc

#

at some point

olive sable Aug 5, 2025, 11:37 PM

#

no no no

#

wait wait wait

fast pagoda Aug 5, 2025, 11:37 PM

#

sam's in high reasoning mode

olive sable Aug 5, 2025, 11:37 PM

#

CidNo

#

thats me, thats my pfp

#

interesting

#

didnt know i had that emote

fast pagoda Aug 5, 2025, 11:38 PM

#

ur kirito??

olive sable Aug 5, 2025, 11:38 PM

#

no

#

cid kagenou

fast pagoda Aug 5, 2025, 11:38 PM

#

oh

#

wow he's the same guy

#

that's crazy

#

ive never seen eminence in shadow

olive sable Aug 5, 2025, 11:38 PM

#

you know how these days you have power fantasy isekai slop?

fast pagoda Aug 5, 2025, 11:39 PM

#

are u aware ur oshi is a kirito clone

olive sable Aug 5, 2025, 11:39 PM

#

the eminence in shadow took that concept and ran with it to such a high degree its become one of the best recent power fantasy isekais

#

he's not just a kirito clone, he's the kirito clone

fast pagoda Aug 5, 2025, 11:40 PM

#

isn't that just overlord

olive sable Aug 5, 2025, 11:40 PM

#

well

#

overlord takes itself seriously

#

this one doesnt

fast pagoda Aug 5, 2025, 11:40 PM

#

Suseg does it

olive sable Aug 5, 2025, 11:40 PM

#

mostly

stark needle Aug 5, 2025, 11:41 PM

#

Gpt oss is so bad someone pls kill me the training is going so poorly compared to any other model pls

sick owl Aug 5, 2025, 11:41 PM

#

Hmm

#

Disappointed in oss 20b so far

fast pagoda Aug 5, 2025, 11:42 PM

#

i imagine for fine tuning that something trained in weirdo mxfp4 weights and f16 mixed would be fucked up using finetuning tools not expecting that

sick owl Aug 5, 2025, 11:42 PM

#

Ima give it time to see whether there's just bugs that need smoothing out

fast pagoda Aug 5, 2025, 11:42 PM

#

me every time i try a 20b class model

glass flower Aug 5, 2025, 11:42 PM

#

sick owl Ima give it time to see whether there's just bugs that need smoothing out

kinda funny if there are bugs.. they already delayed it LULE

fast pagoda Aug 5, 2025, 11:43 PM

#

which is why i gave up on using local ai for anything real until i have wayyy thiccer hardware

olive sable Aug 5, 2025, 11:43 PM

#

fast pagoda <:Suseg:1185617273453559928> does it

the eminence in shadow literally has characters named po tato and skel etal
also the mc has alter egos names john smith and mundane mann

glass flower Aug 5, 2025, 11:43 PM

#

annytfSittu qwen3-coder 30b is good for local ai

#

it barely doesn't fit into my gpu vram... but it runs fairly quickly

#

the normal qwen3 models are also good local ai's

opaque wharf Aug 5, 2025, 11:45 PM

#

olive sable the eminence in shadow literally has characters named `po tato` and `skel etal` ...

Pete Saman

#

The guy who delivers pizza shall from now on be called Pete

olive sable Aug 5, 2025, 11:45 PM

#

KEKW

sick owl Aug 5, 2025, 11:46 PM

#

fast pagoda me every time i try a 20b class model

I still consider Mistrals 20b class models stellar

fast pagoda Aug 5, 2025, 11:47 PM

#

olive sable the eminence in shadow literally has characters named `po tato` and `skel etal` ...

ok this may be worth watching nvm

#

i will no longer shit on eminence in shadow

olive sable Aug 5, 2025, 11:47 PM

#

the bad guy is literally named perv ass hat

fast pagoda Aug 5, 2025, 11:47 PM

#

reminds me of one of my favourite manga of all time

#

thinking of what it's called i cannot name it here though rip

olive sable Aug 5, 2025, 11:48 PM

#

neuro7

fast pagoda Aug 5, 2025, 11:50 PM

#

tfw rsync'd 2TB to a new storage drive and then thought let's verify the integrity real quick, that's a good idea

#

and it's a HDD

#

it's been reading for sooooo long

stark needle Aug 5, 2025, 11:51 PM

#

fast pagoda i imagine for fine tuning that something trained in weirdo mxfp4 weights and f16...

It is

#

Bad

#

I have it finetuning on a h200 rn

fast pagoda Aug 5, 2025, 11:51 PM

#

classic

#

how's the graph looking

#

loss

stark needle Aug 5, 2025, 11:52 PM

#

Bad

fast pagoda Aug 5, 2025, 11:52 PM

#

kek

stark needle Aug 5, 2025, 11:52 PM

#

Like a 4b model

#

Higher a bit

fast pagoda Aug 5, 2025, 11:52 PM

#

high then super suspiciously low forever?

#

and then it comes out way overcooked

#

that's been my experience

#

with wrong params anyways

stark needle Aug 5, 2025, 11:53 PM

#

Nah

#

I have robust strategy

#

But

fast pagoda Aug 5, 2025, 11:53 PM

#

not robust enough

stark needle Aug 5, 2025, 11:53 PM

#

Loss is higher like

fast pagoda Aug 5, 2025, 11:53 PM

#

SAD

opaque wharf Aug 5, 2025, 11:53 PM

#

Why is there so many orange here

stark needle Aug 5, 2025, 11:53 PM

#

Orange

fast pagoda Aug 5, 2025, 11:53 PM

#

if not orange

#

why

opaque wharf Aug 5, 2025, 11:53 PM

#

We need more green

#

#programming needs to touch grass more

olive sable Aug 5, 2025, 11:54 PM

#

ye fair

opaque wharf Aug 5, 2025, 11:54 PM

#

You're blue and already visit beach many times

stark needle Aug 5, 2025, 11:54 PM

#

fast pagoda <:SAD:718207731185745922>

2.3 vs 1.4 loss

#

1.4 is qwen 3 30b a3b

#

It's a big diff

#

55 vs 70 accuracy

#

Im trying to see if i can fix somehow

fast pagoda Aug 5, 2025, 11:55 PM

#

2.3 is high

#

also depends kinda on the model

unless the test inferences are also schizo

#

wow pop! OS i thought would be running cosmic by default

#

and it looked like it

#

but it's gnome

stark needle Aug 5, 2025, 11:56 PM

#

fast pagoda also depends kinda on the model unless the test inferences are also schizo

Both are active 3b so

fast pagoda Aug 5, 2025, 11:56 PM

#

and just looks exactly like cosmic

#

well, the other way around

#

they made cosmic look exactly like their gnome flavour

#

wow

#

twitch L

#

does this if launching on zen browser

#

cant figure out that it's just ff

opaque wharf Aug 5, 2025, 11:59 PM

#

Am I missing anything by using vanilla firefox?

fast pagoda Aug 5, 2025, 11:59 PM

#

not really

#

i just use it because i like the vertical tabs

#

and ff can't currently be configured that way without a lot of addon slop it seems

#

because i tried

opaque wharf Aug 6, 2025, 12:00 AM

#

#

???

fast pagoda Aug 6, 2025, 12:00 AM

#

well yes you can have that but it wont let you jam literally everything in the sidebar like zen

opaque wharf Aug 6, 2025, 12:00 AM

#

Bro, wtf are you doing to the poor browser now catdespair

fast pagoda Aug 6, 2025, 12:00 AM

#

i dont even have a top bar

#

cuz i wanted it to be like a portal type thing so there's nothing on the top of the window, it's all on the sidebar which is hidden

opaque wharf Aug 6, 2025, 12:01 AM

#

Ahh, I see

#

More docs should be written in middle English evilWheeze

For yon other intricate joint forms, such as the ball joint, we remain in discourse regarding its potential implementation. Yet, a series motor doth rank last upon the list, owing to the gimbal lock that dost constrain the true movements attainable by a mortal.

fast pagoda Aug 6, 2025, 12:04 AM

#

i wouldn't want king arthur to be unable to comprehend my documentation that's true

cosmic sphinx Aug 6, 2025, 12:07 AM

#

fast pagoda i will no longer shit on eminence in shadow

its the #2 isekai comedy after konosuba for me

fast pagoda Aug 6, 2025, 12:07 AM

#

well mucho loved konosuba

#

speaking of which i need to watch the megumin spinoff

#

see this is why i use zen cuz the entire ui can be blasted onto this left bar

#

#

and then it's like this without that hovered

#

so my space is used for the actual page

#

ignore my too-thicc border

#

i need to fix that

opaque wharf Aug 6, 2025, 12:12 AM

#

I honestly like the minimalist look. But I use a vertical monitor anyway so I have a lot of vertical space neurOMEGALUL

fast pagoda Aug 6, 2025, 12:14 AM

#

so do i

#

on the left side

#

that's actually the main reason

#

because it was a pain in the ass

#

if i'm in mega slouch 3am mode

#

to get the mouse up to the top

#

lool

#

so i was like fuck that i want everything in this sidebar
and ff wouldn't let me

#

and now i'm coping

#

okay..... maybe twitch is mad that my user agent says linux? this is updated ff wtf

#

reeee what is the latest windows ff update version

cosmic sphinx Aug 6, 2025, 12:18 AM

#

sick owl Disappointed in oss 20b so far

funniest thing about this model might be the safety settings that they cooked into the model

fast pagoda Aug 6, 2025, 12:18 AM

#

wait did they really

#

i mean

#

i guess they always were gonna cook safety shit in

#

but like you can adjust it at inference time?

sick owl Aug 6, 2025, 12:18 AM

#

Yeah this release is rough

fast pagoda Aug 6, 2025, 12:18 AM

#

natively

cosmic sphinx Aug 6, 2025, 12:18 AM

#

yeah im reading all about it

fast pagoda Aug 6, 2025, 12:18 AM

#

woof

cosmic sphinx Aug 6, 2025, 12:18 AM

#

its disgusting

faint ferry Aug 6, 2025, 12:18 AM

#

I'm having a little trouble with my AI, which I coded, and it's having trouble using expressions in VTube Studio. Can someone give me a little help

#

Im runing it in python

olive sable Aug 6, 2025, 12:19 AM

#

uhhh

#

im not an ai dev but i feel like we'd need more info about how your system works

faint ferry Aug 6, 2025, 12:20 AM

#

It uses open ai

#

gpt

opaque wharf Aug 6, 2025, 12:20 AM

#

fast pagoda reeee what is the latest windows ff update version

firefox 141.0-1 on arch

#

So that seems the latest version

olive sable Aug 6, 2025, 12:21 AM

#

faint ferry gpt

well ye, but how does the expresion system work?

fast pagoda Aug 6, 2025, 12:22 AM

#

i won't even do it with ff

#

i think it's just mad because it's reporting ff for arch or something

#

but even if i spoof ff

#

it says no

#

and if i use ff

#

spoofing windows ff

#

no

#

i guess they dont want me to ever log in

#

it literally prevents you logging in at all

opaque wharf Aug 6, 2025, 12:23 AM

#

I don't have twitch so can't exactly reproduce it lol

fast pagoda Aug 6, 2025, 12:23 AM

#

what the fk

#

1984

opaque wharf Aug 6, 2025, 12:23 AM

#

I think toast encountered that issue once

#

Or was it tsurai?

fast pagoda Aug 6, 2025, 12:23 AM

#

oh shit suddenly 46 updates on paru

#

time to break system brb

faint sandal Aug 6, 2025, 12:24 AM

#

#programming message

fast pagoda Aug 6, 2025, 12:24 AM

#

maybe it's mad about the html5 player i have installed

#

never had that issue before with it tho