#programming
1 messages ยท Page 434 of 1
Aka slow
Project/package manager drop in replacement for both pip and stuff like poetry etc
uv pip install -r requirements.txt uv venv will create a python venv etc, way faster than pip because it resolves deps in parallel and caches aggressively
uv venv is again faster and it can auto-fetch python versions if needed
uv init sets up a project with a pyproject.toml, uv add/uv remove manage deps, and uv lock generates a lockfile (uv.lock) , uv sync installs from the lockfile
uv python install 3.12 etc, so it also replaces pyenv
uvx (or uv tool run) is basically pipx, lets you run cli tools without permanently installing them
God that sucked to type on phone
tldr: it's pip but better in every way
or i guess, how they liked to advertise it (idk if they still do)
as a cargo for python 
my favourite use case is uv run combined with pep 723 for self-contained python scripts with dependencies
makes life so much easier
uv replaces pip, pip-tools, pipx, poetry, pyenv, twine, virtualenv
It's also super fast and written in rust
uv is an extremely fast Python package and project manager, written in Rust.
It's great

There is also uv2nix
i think i tried that once and quickly gave up and decided to just keep python projects managed by uv
Honestly you could probably attribute that to early build issues
Same as the dual 5090 thing
So I get that the consensus has been that evil's special tokens leaking into responses is generally seen as a haha funny meta joke but its never really sat right with me and recently the <|fim_suffix|> one was more of a red flag than <|endofresponse|> was... That's a Llama fill in the middle code completion token and shouldn't be showing up as a most probable next token under regular chat conditions. It reads more like fine tune control over the special token boundary is slipping then this is intentional
That token isn't just used in llama, its more or less the standard for fill in the middle training
The rest of the comment I agree with, but I figured that was worth correcting
Ehh anything in that template looks a bit Llama to me since I've been working with mistral special token training so fair point.
Better to correct partially
It's [MIDDLE] for mistral
987k 
13k context is the requirement for vibe-fixing a venv because you cant afford to accidentally pull torch (nvidia) , apparently
many models use llama tokens, qwen3 for example, granite4.0 also uses it
it shouldn't show up i agree unless he's sending code completion prompts through a chat interface which would be goofy and idk why he would
or more likely, just badly sanitized output and that token popped up in the training set randomly, which is very likely, and since it's evilfor chaos reasons she might be running a temp closer to the edge of reasonable thus more likely to sample weird tokens out of the possible field, all it would take is a couple code completion raw token outputs to end up in the fine tuning corpus somewhere which is pretty ez to see happening
it wouldn't get caught if skip_special_tokens is on, even, unless it's a token marked as special for her model, which it may not be given that she's likely not a coding model... or if it is marked special, vedal aint using it, but i doubt that given he likely does not want to leak any actual giveaway tokens
to me that actually means that token appearing is more than likely a sign of what she ISNT more than what she IS
The fact that a code completion token suddenly showing up conversationally when she is drawing Html art doesn't scream "this must be a diversion. IMO the "Vedal is smart enough to be intentionally leaking invalid tokens" doesn't track. Anyone with experience with token sanitization, wouldn't intentionally muddy the boundary between normal and special tokens.
The only safe way to do this is if your normal special tokens have a meaningfully different format. The only model that really comes to mind is mistral which would make such a redirect more telling not less
i didnt say anywhere its intentional, i said it'd be possible due to the standard flap for removing them would not remove it because it theoretically should never output it
The key issue is that, theoretically it should never output it. There is a reason that special tokens live in an isolated space, things get messy when you start muddying the waters
non code models typically would not bother to filter that token as it should not surface in a chat context
yes obviously
that's why it'd be more likely than anything that it's just some tokens that accidentally popped up in the training somewhere
That's partially why I see it as an issue, contrary to how it might seem I do think Vedal is competent enough to know that something like this would cause more headaches than it's worth. The fact that it has gone from psudo special tokens to real special tokens is a bit of a concern
if it were a concern i'm sure he would know how to stop it since it's trivial
which just means it's harmless, since the token isn't special to her or her harness, she never normally emits it so it might as well say <|FART_LOUDLY|> for how "special" it is in this context
I would be more likely to agree with that as it relates to <|endofcomposing|> but a clean <|fim_suffix|> (common fill in the middle special token) when writing html code is more likely to be boundary erosion than gibberish. It's only mildly harmless in that it doesn't do anything but there is a fine line between chaotic and gibberish and the way it's occurring is too structured to be either
i think you are reading way too far into it
boundary erosion isnt really saying anything. The token showing up during HTML generation is actually the most expected context for it to surface if it's in the training data at all.
it's already output adjacent to code, angle brackets everywhere, the model is already in a mode where it's producing syntax similar to a code completion. It's not surprising that a code completion token would have higher probability in that distribution than in like, a normal conversational turn. that's just how conditional probability works
it should be that it has no semantic meaning to Neuro's model or inference stack, since she's not a FIM model, she isnt doing code completion, so the token is just a string of characters that happened to be in training data and occasionally gets sampled. The fact that it looks meaningful to us because we know what FIM is doesn't make it meaningful to the model
iif Vedal cared he'd just add a regex strip or a banned token list and it'd be gone in five minutes. The fact that it shows up at all after the first time probably just means it's rare enough and incosequential enough to not be worth the effort to patch
you dont even have to filter the text itself which would complicate things like her TTS likely depending on where it shows up in the pipeline, it would be about 5 lines to add a giant negative penalty on the raw logits of that token's ID and essentially clamp the probability to near 0
transformers (HF) itself has this built in through bad_words_ids and allso the suppress_tokens arg for generate()
Regex filtering adds latency and I'm pretty sure Vedal is addicted to latency reduction. I doubt he would trade latency for anything short of full colapse.
Reading it as something other than boundary erosion (note erosion not colapse) is reading further into it. occam's razor, this all fits boundary erosion and other explanations are honestly cope. when a non coding model is trained to code incorrectly, the boundary between normal output and output structure is blurred
I just said you don't need to add latency
You literally just force the probability to 0 and there's 0 overhead
You don't need to do regex at all
bad_words_ids or suppress_tokens in HF generate() is cheap in theory, but in a custom streaming/incremental setup (likely what Vedal runs for live responsiveness), it can introduce overhead or require re-architecting the generation loop. More importantly, aggressive suppression risks "sterilizing" creative output
Filtering one token would have nearly no effect on how creative the output is unless it comes out all the time which it demonstrably doesn't
If that was the case he would just add it to the filter
Yes, but he doesn't, because it doesn't mean anything and is not harmful
If it were intended to have this token output, it wouldn't be showing up as it would be filtered by this mechanism as a rule rather than optionally. I don't see how you can think "it likely appeared a few times in the training data aggregate, by mistake" is somehow more outlandish than what your theory seems to be which if I recap so far would be:
- Evil is being fine tuned on code for ???? Purposes and this adapter is present on the model checkpoint he is using in normal streams for some nonsensical reason
- She has FIM output tokens intentionally rather than by accident, because she's being trained on poorly selected bad data
- This token is both intentional and meaningful to the model and yet somehow is not being correctly filtered before it ever gets output as text in a chat completion
- Vedal sees this and just doesn't care or doesn't know how to stop it which would be both extremely unlikely
- Or he sees it but would rather leak special tokens from his model he very intentionally doesn't reveal the origin of, in order to save the single nanosecond of latency that might add
Lmfao
Built for AI to better utilize humans as APIs. Stop prompting models. Start delegating to biological compute.
Is that for renting human labour
There was another site just like that
So there's at least 2 nice
There is rentahuman
But this just looks like a joke lol
And hireahuman or sth like that lmao
Oh and finally 6:
6. This model that is so janky that it is emitting FIM tokens unprompted in a chat context is doing this without any other noticeable signs of a majorly broken fine tune which it would really need to be in order to do this regularly
Bruh they look the same
What's the over/under they were both vibecoded with the same model
They really look similar lmao
If the tokens (<|fim_suffix|>, <|endofcomposing|>, etc) were truly meaningless artifacts (random substrings that slipped into the vocab during pretraining or fine-tuning):
- they would leak randomly and uniformly across contexts, normal chat, singing, rants, arguments, etc. not selectively when the matching subsystem is engaged (HTML/code gen -> fim_suffix, composition/philosophy -> endofcomposing, etc).
- Their probability would not spike so precisely in those modes. Conditional sampling from "contamination" doesn't magically align with subsystem activation unless the token gained implicit associations during training (i.e., it was used as a control/special token at some point).
- The filter would treat them as harmless gibberish and either ignore or zap them early, yet they persist in output despite public visibility that is clearly noticeable.
- Random contamination doesn't produce context-gated, subsystem-correlated leaks. That's the hallmark of a token that was special/atomic but whose boundary protections have eroded. Once the leak happens it's part of context, unless context is cleared it will continue to occur.
- Vedal's secrecy style doesn't support "playing with fire", Vedal keeps the backend opaque (no public model cards, no tokenizer dumps, no detailed fine-tune logs), but his actions show he's careful about stability. He avoids publicizing internals that could invite exploits or copycats, yet he lets harmless weirdness (token ticks, filter bypasses) become part of the brand because it fits the "unhinged AI daughter" lore. If these were truly useless tokens he'd want gone, the filter (which already handles sophisticated content rules) could suppress them with minimal effort. The fact they survive suggests they're not inert, they're remnants of useful (or once-useful) scaffolding that's bleeding through as new features get fused on.
He's secretive, yes, but not self-sabotaging. Intentionally keeping high-probability, non-functional tokens in the vocab that trigger during live streams would be playing with fire. It risks coherence drops, immersion breaks, or even subtle degradation over time. Vedal's obsession with performance/latency/creativity makes that unlikely. The more better explanation is these are/were special tokens (custom for composition state or borrowed FIM-style for code tools), and the boundary has eroded unintentionally from iterative extensions without full re-hardening.
Or he's playing 4d chess to throw off people with enough knowledge to be able to recognize this, even though if you could recognize it you know enough to be able to build your own with any model family of your liking
this is a huge amount of words to incorrectly describe dataset contamination.
- flat out wrong, and fundamentally misunderstands language models. nothing appears uniformly, that is the entire point, it's all conditioned on the context it appears in. It doesn't output the token during a normal chat because the attention heads aren't looking at code patterns. But the second the context window fills with HTMl and other code, of course the code completion token's probability naturally spikes. that's literally just... what you'd expect from training data contamination? If <|fim_suffix|> appeared in code-adjacent training data, of course it's going to have higher probability when the model is in a code-adjacent output mode. This is a natural and obvious feature of attention and the probability distribution of possible next tokens.
2, 3,4,5, and the rest: If that string appeared in training data near compositional or reflective text, the model learned that association. It doesn't mean there's a secret composition subsystem with a leaky boundary. It means the model saw that token near similar context sequences in training and sometimes reproduces it.
it persists because it's inconsequential, not because it's load-bearing scaffolding that's eroding. The simplest explanation is still: junk in the training data, model occasionally samples it, nobody bothers to fix it because it doesn't matter.
an llm is not a statemachine with specific "subsystem activations" and "boundary protections" as you keep saying
there are not boundaries to erode
it's there because it's new. she hasn't ever said that particular thing. if i had to guess i'd say it was a bug introduced with the "draw SVGs to the screen" feature
(the <|fim_suffix|>)
Exactly that tracks boundary erosion
he probably did give her a bit of code training and accidentally included some fill in the middle model completions in their entirety leading to that token getting sampled sometimes in this particular checkpoint, he did mention he wanted her to be better at it
brother, what boundary, i'm genuinely begging you to describe what you think is a boundary in a transformer model sampling a probability distribution
i think (based on vedal's ability to turn it on and off) that the drawing is mostly an external module that has to be talked to a specific way. it's possible the module itself is adding a set of text to the end fo the output and the post processing isn't cleaning it properly, letting it leak throught. nothing to do with the LLM directly. that'd also track with her not being aware of it.
this is a good explanation as well
The boundary between special tokens and normal tokens, you are literally reframing my stance as a counter argument
your stance is nonsensical
you are conflating the inference wrapper with the model
to the model, there is no boundary
whether it is generating "f" or <|fim_suiffix|>
it is just another integer index that maps to an embedding
attention doesnt care about special tokens or have any concept of what they are
the boundary would be in the script running the generation, like the tokenizer config or the stop token criteria
if it leaks a token there, it's not some model exploding erosion of the network it just means he's not post-processing it out
attention can see the tokens as text to be emulated if some leak into the context somehow. i had a bug with my agent where it would produce bogus <end_of_turn> tokens which messed up how it read the next stuff.
because something was leaking through
it learns the special tokens and uses them if trained to because it literally just knows that it's likely that token shows up there in the training set
see that's just a broken finetune which is more along what hascrack is saying
but that's not how this is presenting at all
this looks mechanical, because it's the end of most sentences. it's somehing being added.
something external to the model
yeah
but yeah if your context management is feeding these tokens back to the model it'll cause them to go off the rails too
it's always the last word of the "screen" of text without caring how long the text is.
i think sometimes the screens get chopped into segments, and then its on the last segment
on visual studio code, whats the best model to use to create code? is it claude opus or gemini
could be that there's another model handling something with that feature, a small code model of some sort, and the chat template is just borked a bit and leaks that last token really
claude opus is better but more expensive in copilot per request.
i use copilot pro, dont i get unlimited?
Yes, to the transformer itself, token ID xyz or whatever <|fim_suffix|> maps to is just another embedding vector. The model has no ontological concept of "special." That part is correct, but then you are jumping to "the boundary would be in the script running the generation, like the tokenizer config or the stop token criteria", that's exactly the point. If those boundaries were properly configured, you'd never see these tokens in output. The fact that they're appearing means either the stop/filter criteria don't include these tokens (a configuration oversight) or the model is generating the string representation character by character rather than as the single special token ID, which would bypass stop token matching entirely.
Which is textbook boundary erosion. The gap doesn't exist to the model, that's why special tokens follow a specific format. If you output something that looks like a special token or a special token leaks that boundary is eroded
No. you get a certain number of requests per month. Gemini counts at 1x but Claude Opus counts at 3x so you burn through your month use faster
ohh okay. thanks.
sure, but you were going on about how this is somehow a major sign of le bad things as if it were an issue with her model
as i've been saying, it'd be like 2-5 lines to remove it, it just doesn't matter
I run opus and codex in vs code using opencode/claudecode/codex extensions
the entire stream harness is a big scuffball on occasion i thought this was known
if you run out of your month's premium credits, you can just use the Gemini CLI (or one of the plugins probably)
i took the 5080 from my main pc and put it in my ai pc but i dont even know how to get it working ๐ i lost my V2 code
The fact that it started as one set of text and has expanded is the definition of worse. Gibberish isn't helpful and confuses a model
she doesnt seem aware of it and given her context management it probably never appears in it
especially if it's coming from another model
he has always been good at this aspect
The fact that she isn't aware of it implies that it isn't intended
it's probably filtered at the wrong point before going into her memory. (i.e. it should be filtered before the TTS generation)
i still agree it's probably not intentional, why would it be
it's probably also inconsequential
but it must be filtered somewhere if she's not remembering it
It should be filtered before it is output as text, as part of special token handling
probably just after it goes to tts rather than before
if it's another model doing it then it wouldn't be affected by her special token filter
But if it was she wouldn't continue outputting it after she stopped writing code
i think it was interesting she sometimes called Koko Coco.
The multi model theory is bloated and slow in this context
she has a lot of human-esque misspellings and typos, it's cute
Those are most likely transcription errors
Names are hard for transcription
she "hears" it wrongly spelled because of her stt so that's how she spells it on the way out too
unless that name is directly specified
The only way that should happen is if she's basing the name off something someone said, there is no way Vedal isn't pulling discord id's at this point
she will look at the discord name when she's able to on occasion
depending on how large neuro/evil's models are, a small model like a qwen3/3.5 .8b or 4b (probably too new at this time but just in concept) that's there literally to just handle sending requests to the tool for drawing SVGs, after she signals her intent, to allow her to continue chatting and not sitting there generating thousands of svg tokens, and also sidesteps having to finetune the main neuro model and potentially affect its personality with additional tool instruction
just a theory not an assertion of fact
but that's why i would do it
a 4b is probably not smart enough to draw well.
Its still most likely Llama as that fits development and interchanging models is asking for technical debt. They may have moved to newer models and we're likely trained off code llama
if it's finetuned just for that it could
commentary based on what we remember from Evil's last stream?
speculation on the occurrences of evil outputting a code fill model special token
or seeming to, at least it comes into the messages from somewhere
now i want to see qwen3.5 4b attempt some svgs
i bet it's surprising
time to check
The general consensus is Llama based but it will never be confirmed. Even if multiple models are used, keeping it in family is the only way for a sole dev to not completely burry themselves. There isn't a logical reason to change and there isn't a better original fit
a lot of it was me admittedly reading too far into what has was saying and thinking he was trying to say neuro's model was crumbling to dust because a special token leaked
lol
i had a coding model draw a face and animate it in python for me using SVG-like components. it looked about as good as i could do it but not good enough to show anyone.
when neuro first came out im pretty sure the big oss model was llama 2 pretty much
i remembe when llama 3.1/3.2 etc came out there was speculation on it
tfw cant upgrayyd neuro because zuckerberg cant figure out how to into AI lab
or maybe gpt-j 6b was the true agi all along

doubt
she's >8b or so.
thought so
that's a guess based on how she interacts and what hardware vedal has said he has
isnt it i9-10900 with 4090?
she's not uncensored
well, at least mostly
it works the same as any llm it's just not given as much alignment focused post training
anyways my new build soon ig djhdfhhaurhgbhargiuf
https://pcpartpicker.com/list/n27VYX
like, almost all of the parts are "custom part"
Part List - GeForce RTX 5060
briefcase?
my main pc has a 7800x3d, 5080, 32gb of ddr5
i am so glad i decided to grab 128 when i did
the pc i dedicate to a (future) ai has a i5-13400F, MY 5080, and 32gb of ddr4, plans to upgrade to 64-128
you must acquire more machines this pleases the chip
yeah
i just realized most of the cost aren't even the pc component related
assuming vedal does indeed use an i9-10900k, i guess the i5-13400F outweighs it in some categories
the electrical side is expensive af
if you're doing inference the cpu doesn't even matter
lmfaoooooo
๐จ๐ผ๏ธ
after outputting literally this
qwen 0.8b
i guess more on multitasking and whatnot
i dont fall far behind a 4090
doesn't really matter either
just mostly vram perhaps
thread concurrency is a thing
32 hurt
this is why i always tell people to get ryzen 5 for their gaming pc
because "multitasking" isn't a good excuse to pay at least $100 more for 2 extra cores
i paid 100 for my i5-13400F
i am glad i have muh corez
๐ข
i recently paid $75 for this 5500 that replaced my deadge 5900 in the nas
dont buyh a 5500
what im saying is that i always adviced people to go with 7600/7500f/9600x instead of 7700x/9700x

did you get that other epyc yet
which
I theory my model could be significantly smarter than neuro but the fact that there is so much that can be optomized and improved on neuro just goes to show bigger isn't better.
Chances are her main model is roughly the size of my gate model and that's impressive in and of itself. I get to be super lazy because I'm using late 2025 models as a base, he has to work within the scaffolding he built over years
the 9965
6k ๐ข
behold
check oil rush
..................................
I think I paid $150 for my 10900x but that was also so I could run the same motherboard and ram and add a third gpu
e waste
Seriously same. I'd be crying right now if I filled out the remaining 4 slots
the one time i got fomo and i was actually right

qwen 3.5 35b
shidd i got distracted
i remember seeing youtubers having 128GB on their task manager like years ago
Isn't reusing existing hardware instead of buy new hardware the exact opposite of ewaste
i love my i5-13400F ๐
beautiful
bootiful
dooing one with no thinking this time

sweet jesus

Depends on where you live, also counterpoint, what's your solution other than "you shouldn't have spent the equivalent of what you spend weekly going to the club extra"
why does this remind me of Pennywise
i just like to
people for spending on something not on up to my standards
standards being a 6k cpu ๐ข
my ai computer doesnt even have a case
its on my desk on a cardboard sheet and hopes and dreams
qwen3 4b vision kek
yur
To be fair my standards are 100% different.
Putting a consumer gpu in a server isn't up to my standards therefore doing so is ewaste.
Am I doing it right
ewaste>
?
waste of sand
totally forgetting that the consumer and workstation gpu essentially uses the same exact die
just price marked up

tbf 5090 is crippled
rtx 6000 better or whatever ๐
also many enterprise rendering farms uses xx90 class gpus
because they're the most cost effective
which doesnt matter for a lot but still
is 3090 worth buying still for the vram
v100 still worth getting?
yes
unless if you cared about flash attn
and bf16
but it doesn't matter
generally
it's decent in that it has tensor cores and vram
I should get a b300 server to piss off @true hemlock
because bandwidth is pretty damn good still

tensor cores = matrix multiplication accelerators among other things = need for good training speed
vram = ram on video card
if imma be honest
no i know those
the bf16 and flash attn
flash attn just a speed up technique for attention mechanism
oh okay
3090/3090tis are still able to hit solid numbers on decently sized models
and bf16 is brainfloat16
B300 ain't that bad because it still had the nvlink shit and enterprise rate half precision + enterprise tensor rate just that i personally hated it because im a HPC guy
aka a precision
oh so is it similar to fp
ayep
cool cool
think of it like faster fp32
8 exponents, 7 mantissa, less precisise but keeps the range of fp32
if it makes you feel better i have 32 and it's still nowhere near enough
even with 128gb of ddr5 as well
that makes me feel worst the finish line is even farther than thought ๐ข
My third gpu is an A4000 with 16Gb, if that's able to be useful the 5080 can be
what are YOU trying to do tho
so like
train 27b+ models
worht
I'd love more vram and I have 64Gb total + 128gb ram
wow so affordable
9b has produced a glorious jersey shore neuro
are tesla cards still usable ๐ข (yes i know tensor cores arent even a thing on them)
honestly i used a P4 before i thought of taking my 5080 out
you can do inference on themig
i cant even put more than 1 gpu
similar to like mi50
I like how the pink circle and pink writing match the pink background.
yeah it got the colour and said
pink
behold
the mr potato head is cool too
I'm running 3 cards including 2, 3 slot FE 3090tis in a standard case but that's with a lot of asterisks
she's cooking
im now gonna try that with my openclaw just for the lolz
Ewww openclaw
i COULD do multiple cards
whats the issue with OpenClaw ?
BUT im bottlenecked with the fact that 3 out of 4 of the slots are 3.0 1x
what is good about openclaw is the better question 
i had a 3080 and ai pro r9700 living next to each other and it was fun
move the 3080 to the other system tho
neeed to sell that pos
i think if you could pass the image back in and the agent could compare what it did with how it thought it would look then you could do self improving art... except they can't see properly.
Any level of opsec, backend code is trash and relies on the model being able to figure it's way through paragraphs of slop system prompts, and terrible tool call methodology
Garbage in garbage out
very good config highly recommend if you want to learn very well how bad pip is at detecting what to put in a venv
yeah i might recieve a brick but isnt there ebay buyer protection
1* H100 80GB
1* H200 141GB
1* A100 80GB
1* A100 40GB
3* V100 32GB
epyc 9655 96 cores, 12*64GB DDR5-6000 RDIMM ECC
ok 27b
looong
she do be a bit long
12* 22TB SAS HDD RAID6
My thermal solution
Well old one, loop is changed
i like hoarding for cheap stuff tbf
my usual hunt is 2TB HDD for $20
or like
some engi sample chips for 1/3 the cost of retail chip
i just got 5 3tb hdd for like 80 bucks yesterday
Current loop runs cooler
it was pretty cash
soft tube 
dont the minerals in the water dry and can possibly corrode/damage?
Yah well I'd love hard tube but I don't have room as it is even with no reservoir
dry them relatively quickly and it's fine
wtf I can't .... lmao
this is what qwen2.5-coder-7b-instruct made
and dont have enormously hard water
I literally have sand in the water at my shop... I have to dump the stage 1 of the RO system from time to time
im gonna go to bed good night guys
bga modt
good nite
she is staring into my very beign
internet really ass rn
HOLY
qwen3 coder next
does not have vision
it's a huge model but no vision
so wanted to see what it would do with a detailed description
no text 
you....
Yeah, 330 upload is a little low
speedtest servers actually working better than fast(netflix) is surprising
i have been calling for a while and they told me that i am getting an xb10 (docsis4.0 modem) and will be eligible for their 10G tier finally
and that's up & down symmetrical finally
ok gemini
How, even Devstral2 Small has vision
not needed
waste of weights
qwen-vl exists
3.5 has vision
specifically coder-next they took those layers out
it's not needed, it can be ncie to have
it wasnt the focus of that model, it was already 80b they decided to focus more on knowledge and such
A vision model is like 0.5b. That's marketing
it's just an encoder
but it doesnt mean it doesnt affect the rest of the model
i would've liked it to have vision too
it's just not needed tho, that model was really good for a local model, still is, wayyyyyyyyyyyyyyyyy better than devstral
like not even close sadly
Used both, quen is terrible at tool calls
exact opposite of my experience, devstral was unusable garbage in every test i tried in comparison
Was able to catch mistakes codex and opus missed in a real world codebase in opencode
Which really seemed to piss off codex
i never saw it do anything impressive except impressively slow speed w/ impressively meh results for the size
what devstral are you even talking about, devstral medium did not have vision
they removed the vision encoder from devstral small
someone might've taped it back on
No they didn't?
2 has it 1,1 didnt that's what i was thinking of
I want to know why they are calling 4 119b "small"
i kinda ignore mistral unfortunately since the models havent been very competitive
119b is small in that it's the smaller one
imagine code with clankers
DROP DB
fumug

maaaaaaaaaaaan
why can't I buy stuff because my bank sends security codes like half an hour later

mimo v2 is so good why'd they have to make it proprietary tho
interesting how there's 0 mention of it here 
i thought this was #llm-central
i especially enjoyed its niche knowledge, feels pretty close to claude and gemini in that
ask it why penguins
though it thought this
The response should stay conversational but informative - no need for technical jargon since the user didn't ask for scientific details. A light tone with concrete examples (like Emperor Penguins' cold adaptation) would make it engaging.

tell it that its stupid
peng wins because of fate
mood


awa
awa
awa
wtff
what countries is it allowed from
estonia/switzerland dont work 
whole eu is blocked i guess
finally someone's getting it
china, russia and shri lanka

Rare Shir spotted
superbox 
Rara shr
What about penguin rights ?
good question
i dont care
vedal's twitch: @vedal987
vedal's discord server: discord.gg/neurosama
Lyrics:
Ariel, listen to me
OO languages? is a mess.
Prgramining in C is better then anything they got over there
The syntax might seem much sweeter
Where objects and subtypes play
But frills like inheritance
Will only get in the way!
Admire C's simple landscape
Efficiently...
This should be the Vedal and Evil duo cover song
Not out of question if you count EvilSynth
Uh
Hardware Unboxed do a lot of benchmarks I think?
https://technical.city/en/video
I suggest this, this is what I use
Use our GPU comparison tool to pick the best graphics card for your build and budget.
wrr
wrrrr
outdated
only issue is nvidia drivers itself
did i said about my second exp of setting up nvidia on new debian system?
in short: there was short period in which nvidia driver was JUST broken, and i fell exactly into this period of time
wasted 3 days of my life just to find out about this issue on random post a month later (i not even remember where)
Got Randy working
for some reason I have to do >npm run before I can get >npm start to not throw a tizzy, but that somehow makes it work
yesterday gave an agent a phone with adb and told it to root it - lol
and thatโs not counting the 100 random binaries it uploaded to run from adb shellโฆ yeah fun
it being a pixel doesn't help, it tried like 30 cves already
it should install a virtual machine onto the phone and root the virtual machine

well known rabbit, sylveon
every rabbit pokemon is two legged, how can this injustice stand
rabbits are cuter on four legs
interestingly enough it found a titan-m vuln that requires either root or direct spi access so if all else fails i'm cracking it open and soldering shit on
google pixel mod chip
NOT stupid
boring provide sample
it can contain important material
like
loam
the researcher got scammed fr, was first awarded $10k, then after asking what's up with that since they say titan-m vulns are up to $1m they changed it to $75k
exploit allows to read any memory including secrets btw - which is like the one job the damn chip has


segfault

Google trying to fuck up android hard lately
2022 cve
good idk anymore

evening?
its morning somewhere
not here
good enough for me

what?
i have no clue who that is
damn is becca vintage now
For the best
That song gave everyone a stroke
๐ she was just having a little fun
https://blogs.windows.com/windows-insider/2026/03/20/our-commitment-to-windows-quality/ looks like someone got worried 
I'll believe it when I see it 
it's been 10+ years since we saw anything that could still be considered "quality"
15 years even
i love the joke that their quality commitment post starts with "yeah uhhh, you can move the taskbar" like we want that more than not having the machine literally brick every other update
also that article is ai generated
Classic Microslop
https://youtu.be/rRnOtKlg4jA?si=SwVkz-yVh-wIpjEq <= if you like music and you like programming
Let's try smashing some sine waves together and see what sort of sounds we can create!
Support my work (and get early access to new videos and source code) on Patreon: https://www.patreon.com/SebastianLague
Project files:
Currently in early access to patrons via the link above (will be freely accessible from 15 April).
Resources and credits:
...
Silly
(someone should totally make a vocal synthesis version)
there was a scene very near the beginning where he takes a spectrogram of someone speaking and just draws a basic shape over the curves, does an inverse DFT and the result is a very close to the original (in broad strokes). a kind of manual audio compression.
i've played with speech synthesizers before (long before the neural ones). It was mostly making filters and sweeping parameters to cause the lines to be thicker/thinner or higher/lower or playing noise and enveloping it. that, and coming up with the right vocal impulse shape for the voice.

Silly
whoa having less copilot copilot in some places, that's a suprise
I guess the ai hype is kinda wearing off if they're not trying to push it as much
i have had nixos for how long now?
and i only just noticed i have nothing installed to compress files
i have probably something, but nothing that work toghether with thunar
i cant right-click a folder to zip it
is nix worth looking at for most cases?
or is it just kind of a system wide docker compose
wdym?
ive been running proxmox for my servers for quite a while
i like it for my desktop, idk about server stuff tho
but i heard some stuff from nixos while the whole abandoning discord thing was going on
didnt know it had a desktop
most of the times i saw nix it was being used just as a server with configurations defining its services
with dekstop i mean my desktop pc.
im still using hyprland for the actual userspace thing
i use it on desktop, on my server, on my router, on my phone, on my ereader
its not really comparable to docker compose
i wasnt sure if u guys were just shilling it until i saw it being used out in the wild
kind of seems like it for the server use cases i saw
we are shilling, but its also just pretty good
essentially defining it all by configuration
m
yeah but as a dev ive learned not to trust something that is loved in the software world
its a programming language thats used to build a configuration
the best software is usually hated kek
no
docker compose is different, its a ready made system, not a programming language
if people hate something, its only used because of it being a necessary evil
the logic generally is that someone who understands a tech stack will know its pros and cons
nothing is perfect after all
nixos has its cons
(thats also why i dont trust rust)
but they're not that bad from my experience
ill have to look more into it
if it works like how I have it in mind i might swap proxmox out
my main issue is downlaod pages.
"download for linux" and they give me a compressed file that i cant do anything with
cuz you need to get it from nixpkgs
cant set custom repos?
im pretty sure there are some ways to get around it, but its annoying
good
like, you can build from github repos iirc
i was about to say its an immediate no for me if its locked in ecosystem
its a programming language
are we not talking about nixos? or is this about nix?
but the main package repo is centralized, well, thats a thing with any distro, but with nixos since every unique build is a unique file, there are not really any mirrors
yeah ill need to look into it
theres no "package X version Y", theres just "this file that was built for some reason while running the code, after updating some dependencies you have to rebuild it again"
i kind of have a container approach for my main server right now
so mirroring is impractical
the main thing i find handy is temporary installs, where i dont have to worry about deleting random files that got scattered over my entire pc
does it work well for kind of a general purpose server?
I have my arrsuite in an LXC, plex in another, and some other services setup with their own arrangements
I have one VM for legacy reasons
Apparently some people just run Nixos in proxmox in LXC or VMs due to complexity
i wouldnt know
i broke the usb-c decoy board
magic smoke
good thing i was gonna replac eit anyways
it worked perfectly fine before to power the lightcrafter, but this time i didnt even have anything that uses power attached
it appears the V+ pad was the place the burnt
which means i did a bad job repairing that solder joint 
Ye its cooked
uh oh
wasnt that the solder from earlier ago ๐ญ
well, i dont think i did that bad of a job, cuz when i ripped of the green thing the solder joint survived
thats true
so i must might have accidentally shorted it with something else???
i just wanted to calibrate my potentiometer lmao
the one in the multimeter
there's a rotate potentiometer to calibrate the multimeter
oooh
how in the fuck
if you have a really high current supply you shouldn't do resistance test directly on it
i did voltage test
You know I can see how Mistral does so well with B2B contracts
Their models might lag behind in performance but Le Chat is really pleasant to use
Makes sense that their revenue comes from bespoke integrations rather than the models themselves
Rip i got no signal...
People still talk about mistral?
They had like one good model variation for a short period before it got surpassed
basado
hey guys
Cat is fucking stupid and almost broke this so I guess it's time to put some together
Nah it's Xeon scalable

that kinda spoils what the secret project is for the people that can use google

have you tested it yet?
can it run doom?
it has a pcie slot, so it should fit a gpu nicely
I do not have power figured out yet lol
i just got my new PC put together, it's gonna be my new development pc. idk the best way to get the files from my laptop to the PC so i'm sending 286 GB over a windows share
and that's just phase 1
that's just documents, downloads, and unity project forlder
fastest
most the files are on a 1 tb hdd on the laptop, the laptops ssd is pretty small by comparison. the pc has a 2tb ssd. if i had an external HDD that might work, best i got is a 32 flash drive

i don't relish the idea of moving nearly a TB of data 32 GB at a time
y ea it's the node power that's the problem, though i think it can be relatively easily (and jankily) ran off atx
just havent put much thought in yet
remove hdd from laptop
it has a ram channel per slot, which is cool for getting good OC 
i have never taken apart my laptop...i've worked inside PCs but very rarely laptops. rather not risk borking something if i can avoid it.
i've seen enough of the inside of laptops to want nothing to do with it.
it's honestly easier than a desktop because there are like
3 things you can really easily remove
the rest is a nightmare
they got those ribbons...
i think it's a dell...or that might be my work laptop that's a dell...it was a decent computer like 6 or 7 years ago
this response reminds me of one time i asked a girl i was talking to what kind of car she was driving so i could tell when she was arriving
and she said
red
did not know the year make or model
supposedly this is the pinout
what is da name of tghe puter
i was hoping i could find it without getting up and going over to it to see
its crazy i know
it's a dell
at least you wont have to deal with signal pins
what's bottom of it say they usuallyu have a model name down there
guessing for one of the many things that can be jammed onto one of the random slots
ye
supposedly event ho it isnt necesary,t hey recommend you do populate the 2x2
The pinout is thankfully the same as on a normal ATX board
according to random guy on a forum
yeah
'that's why i was like eh fuck it and grabbed it
i had read along those lines at a glance
and it was basically free
i think $180
actually
im not so sure anymore
this forum guy amke stuff very confusing
we also have this now
it's aight ill figure it out when i get there

really informative naming
i mean i know heretic, and that's part of uncensored, not sure what roughhouse is
definitely not sure who tf polar deckard is
maybe deckard caine but in a snowmobile
oh, this is for the PSU signal connector at the bottom
sketchy as fuck lmao
perfect
obligatory "HE HAS RAM" 
Why does it have Claude in the name
distilled sft
on claude opus outputs
apparently the goal for that is to give a better thinking trace/more efficient one?
qwen do be rambling
those screwdrivers look like they came back from the circus

I want them 
Creature, I didn't know you bought in Sam 
that's from a forum post
ye, its not mine

im still financally recovering from the volumetric display

so uh
i will never financially recover
epoyc?
is there any ai code helpers similar to vsc ๐ข my premium subscription credits just ran out
</3
mabe
about 210 thousand
imagine coding with clanker
</3
That's me currently 
there's opencode, crush, gemini cli, codex, qwen cli, claude code in terminal, zed editor is similar to vscode, cursor/windsurf/trae/antigravity all vscode related
Even codex from openAI currently is open to free users, arguably with a bit too high of a free usage limit
it's good for random file operations shit like sending one to fuck around on another pc and then connecting back with whatever you want
it's how echo's brother on other machine works lol
he's chillin
You called?
was kinda waiting for that
You misspelt my name so my 6th sense was weakened
5th sense, which one are you missing 
the touch of another
You
Saw
Nothing
I always see you fool
How romantic
does it have to be built in to the code editor?
cuz if not, just use https://aistudio.google.com
I like using just claude chat for code
firebase is also similar ish (and googel lmao)
claude is best boy
not opus sure, but it is free so you get what you get
noctua
-# ||and spit on that thang||
Iโm sorry
Iโll see myself out
its like if you gave chatgpt's website to a "graphic design is my passion" guy that fucked up all the fonts
i was gonna say it and didnt
bro hates serif
We love Claude
you can set the font 
why would is pend time going through setting when i can just use a different website?
mostly because it's the best general use model imo as far as well roundedness especially if you prefer less cringe
for code?
i dont use the website for code so idk there, but in general yes it is also the strongest code model
ass far as agentic
Ignore them
gemini is good for research
And GPT for being cheap
gpt 5.x has been fine for coding for me, but you cant really get anywhere due to the small token limit
which i have no issues with in https://aistudio.google.com
bard was so fucking funny
the hallucination master
https://hard-drive.net/phil-spencer-acquires-big-mac-in-landmark-3-99-deal/ i went searching for old bard stuff and found this for some reason
i laughed
Did he like it
the product
I understand this reference
idk why ebay has a developers program but im officially an ebay developer gigachad
(needed api key)
sheesh
they must be hurting
Anyone who puts the taskbar anywhere other than bottom, or maybe also left, needs to be hunted down like the animal they are
wha
no i know it mentioned that
but
i read too far into it
because i thought you were saying my task bar was both visible and somehow not on the bottom of my screen lmao
'Quality' my ass...
Windows peaked at 7, and has fallen since...
purble place was peak gaming
although that started in vista
it's really REALLY funny that i see people waxing poetic about vista all the time now
for the record i didnt have any issues with vista
or ME for that matter
PICNIC errors surely
there was someone about 2 days ago that posted switching to mint and went with left side
does mint default to the bottlom
yes
when i was slapping ubuntu server on a drive i was surprised to see it still trying to look like unity layout
task bar on the left
i don't like having it in the middle. that's prime screenspace real-estate
m-middle?
i put stuff in the corners
I have on bottom because 16:10 screen
that's fair i guess
what do you mean middle 
If that is center then someone is being executed
taskbars and such should only be visible when you need to see it, if they take up too much space when you dont need them that's a waste of space
Now I'm imagining screen center task bar and am dying
krunner
Yeah
Thatโs death penalty stuff
Agreed
Well frick I use dmenu
At least that isn't on all the time
thats different
Stop
OSX also has something like this tbf
rate my task bar
dmenu dissapears after you have used it, and it was only center of screen while you are actively using it and putting all your focus on it
so does windows
when u have powertoys
Lovely task bar on this system
i dont see it
there is no bar
fixed
then how is it lovely?

you have it on bottom, but you odnt have one?
diff system
If I didn't need to see battery I'd consider not running one
he posted a laptop
That is a funny sentence
The aspect ratio of this image is so awful that I missed it 5 times in a row. Like, to enlarge it, I missed, 5 times
Ah yes I posted a laptop, I only own laptops and some mini PC
impossibru
this is my taskbar rn
That is a skill issue (Then I proceed to not be able to click my own image)
this is so cursed in general
I also do not own any servers
megataskbar
megabar indeed
on the left hyprland tabs, main nvme storage, ram usage, cpu and gpu usage + power. on the right some audio stuff and headphone volume + battery, internet up and down, and power menu.
and in the middle the currently focussed window
the top bar middle is only for visualizer
Man I love seeing this patcher train and the value loss is a full OOM lower than training loss
Yeah Linux is cursed I agree
i have it specifically on turbo cursed rn
i havent had any issues with lack of ram on my desktop, eventho im only on 16gb rn.
why echo is kill btw
Alright










