#programming
1 messages · Page 481 of 1
Tf is a zigbee
zigbee is awesome
Is it related to zig
no
a bee that goes zig then zag
i replaced all the wifi iot stuff with zigbee
its so much better
im just using a Pi as a bridge that runs Zigbee2MQTT in a docker container
which connects to the home assistant server to the MQTT broker
Seems obscure
no its not
its actually simple
and works so much easier and more reliable compared to tasmota & esphome
another benefit is the batteries in my smart home devices now work for several years
i never had to replace a single battery in years
O its just a cheaper zwave
uh its similar to zwave
oh yeah i forgot i have a pi running pi hole and some other bs i prolly set up years ago and never checked on again
lol
i currently use Blocky to block network level ads and resolve dns
it's go based and should be faster than pihole or adguard home
i was using unbound before but realized i dont even need it lol
idk for me blocky somehow feels faster in name resolution
i never actually measured it tho
I just use bind9 so I have more control and I can do some other dns optimizations
i actually just removed unbound like 2 days ago because i figured i dont actually need a fully recursive dns server
yeah i basically just use it as a dns for my own devices cause i dont wanna affect the rest of my fam with dns blocking
I like to cache stuff and do some other tricks
blocky can also do all that
blocky Documentation
Though i am hesitant to run stuff that isn't completely low level as I REALLY need speed
And as fast as go is it ain't C
idk to me blocky is faster than unbound
i never really used BIND. Only unbound.
:3

Why C when you can use assembly
speakin of servers/services heres what im running now. media server has jellyfin and audiobookshelf.
since i got time do yall have any suggestions on what would be fun to setup?
also if you guys like live tv i recomend checking out tunarr or stuff similair. U can use jellyfin/plex media and basically setup your own iptv station and add custom commercials (sadly rn its only between episodes)
Why use assembly when you can use 1928 adding machine
Time to write my own authoritative DNS server in ASM
It's probably been done in ye olden days
Why use OMV
*arr stack, plex / jellyfin, home-assistant
Seems redundant if ur running a arrsuite
who needs a nas os anyway
all you need is storage, zfs, nfs / smb
Thats why I just use proxmox
maybe iscsi
bro imma be completely real with you. i dont remember. this was years ago. but prolly just cause that was what was recomended and it seemed the easiesty
I dont think u can secure nfs
lol the memory usage btop is telling me cannot be real
Yeah so an external tool
native also works via uid / gid mapping although that is not really secure
i mean yea
-# ai psychosis thread is progressing…
ive thought of arr stack and home assistant, but i dont really need or want any smart home stuff, and i tried setting up an arr stack but it didnt seem worth the effort in my use case( adding a show or anime like once a month)
nfs is much faster but only for local trusted networks usually
Yeah
i trust my network so i use nfs
Its a case of less features for faster local use
More security and less speed for more open uses
pretty much
but i also use it e.g. to stream movies from my plex vm which mounts the nfs share
so speed is a concern
Thats what im doing too
not really
and iSCSI is block storage for vms e.g.
for omv I should just remove it and replace with just a service? and what if i add more hdds in the future and want to do a raid/zfs configuration
just configure your zfs via cli
or use this:
https://github.com/webzfs/webzfs
A modern web-based management interface for ZFS pools, datasets, snapshots, and SMART disk monitoring built with Python FastAPI and HTMX. Runs on Linux, FreeBSD, and NetBSD. - webzfs/webzfs
Which would still be preferred to omv
once i fully migrate my zfs pool into the main server i will test out webzfs
oh cool. is omv that bad? lmao
truenas scale is definetly much better
ohhhh
oh they renamed TrueNAS Scale to TrueNAS Community
Yeah
Chat new use case found for dgx apark I'll just run deepseek v4
Still more free than unraid
for an open source nas that just works with minimal effort truenas scale is probably as good as it gets
Real
Its good to have storage setup and mostly dealt with so u can focus on the suite itself
yea
i currently just mount the zfs raid from the old server which runs TrueNAS core over the network onto the new server running FreeBSD lol
one day im gonna replace the raid controller with a flashed one so i can finally migrate the zfs array into the new server 
mfw i have an entire server running just because im too lazy to replace the RAID controller
i could literally import the zfs pool with one command lmao

yea pretty much

do u guys use any of the arrsuite urselves? is it actually worth setting up, cause i tried with sonarr and i didnt get that far before it didnt seem worth it
i use Sonarr, Radarr, Prowlarr, Sabnzbd, etc
use trashguides for setting it up correctly
ako
how did you get here
bro because you know if u touch it. the "not broken" parts will completely fall apart and u have a day of troubleshooting in your future
aight ill look into it
Sonarr radarr cloudflare-resolver qbittorrent prowlarr etc
sometimes its good to replace stuff that was not broken lmao
Some other microservices for Metadata management
kometa
all of that stuff has packages on freebsd i love it
Tautulli is nice if you use plex
lmao yea
Its kind of just a middleman for giving information
i mostly use it to monitor transcoding and bandwidth
Gives me all that on homepage
i sometimes needed it when sabnzbd did not unpack archives on its on properly lol
you mean Bazarr ?
a lot of stuff from the seven seas is packed and password protected
Not where I sail
huh
mostly everything on usenet is password protected
Yeah I haven't used usenet in like 2 decades
damn
hearing privacy and torrents in the same sentence is rare
I mean VPNs
well yea ok or a anon seedbox

Fire hazard 101 
You know I completely forgot meta owns manus now
You'd think they'd make more of a song and dance about their one useful AI product
gotta love the safety slippers
And it is optional!
Lucas from Resident Evil 7 vibes
Anyone else enjoy the brrrrrrrr noise your gpu makes in ML tasks
Makes your decidedly digital machine feel kind of analog for a bit
VRAM coil whine
it's obnoxiously loud on my 3090
It's 9 am. Claude just told me to go to sleep 
based claude
Engine running again 
Why must modern cars have to use a computer in the first place 
If Ellie wont make the cfrb i will
Yeah Claude is right
Time to get some sleep
German engineering 
Imma kneecap the German engineer 
@opaque sigil come here 
better control of ignition timings and shit
I know. Just hate the fact that they tied it with serialized part
Gemma 4 E2B is surprisingly useful to have on hand
Sure its braindead but its just stable enough to actually have utility as a mini on device assistant for lower power hardware
I like my assistants smarter than braindead
This is more the "intern you send to the photocopier" type of assistant
Idk, for that i prefer the small qwens3.5 (2-4B), you can even get decent results with qwen 3.5 0.8B, but reasoning off and "short answers" syspromt.. gemma 4 is super cool for long context, but its kinda useless (from my experience)
ive been using 31b for its tool use but i might drop down to 4
or just the default latest , which is 8b
Unironically quicker this way
ye its quicker to use the browser

Also I get the added benefit of not providing extra traffic to corpos
wow incredible I can do the same thing as my calculator app, but using more power and sometimes wrong
You can't just leave your calculator app in the background and ask it questions on the fly while doing other stuff
And a calculator app only does one thing, this also does OCR, surprisingly good translation, web search

Yes you can actually
It has native audio input
Its a fairly novel architecture
ok??? it calculates instantly
So does this, its 2 billion parameters 
I don't have to submit a ticket and wait for triage to figure out the temperature
i get my temperatures from figma
i put the question in the document and the colleague edits it later
There is no waiting, that's the point
You make it sound like a process when its literally just asking my pc a question out loud and a pop up gives me the answer
i still don't get the appeal
What's not to like?
calculating temperature has been a solved problem for years
Correct
I've been able to use google assistant for that for the last like, 6 years?#
Also true
and its always been right
Google assistant is also similarly useful
Its just way more limited in its functionality
now I have the option of instead having it sometimes lie
yknow maybe I should try this myself, plot a graph and see what it thinks about temperature
I mean hell feel free, its a tiny download at Q8 or even BF16
Also google assistant lied too, it yoinked its results from the web and regurgitated them without even attempting to interpret context
On account of, y'know, not having context
I mean don't get me wrong this isn't a miracle model, it is braindead
But its decent enough for quick web retrieval, currency, temperature, timezone etc. conversion, ocr, image classification
Its a nice utility to have on a low power device
idk if I would use LLMs for temperature conversions but they are quite handy in lots of similar tasks that are somewhat manual, but pretty tedious to do otherwise
I have it on in the background in this laptop so its quicker than googling
Obviously its not computationally elegant or unique and it does fuck up, its the broad scope of the simple tasks it can handle that makes it cool
The E4B is even better in that regard if you have an NPU or resource constrained dedicated GPU
What kind of graph were you thinking of plotting? I can run it locally and save you the trouble of a download for fun
I do like pushing these dumb models
Don't get me wrong btw, I wouldn't use it for anything other than the simplest tasks 
I mean, feel free
I'd basically be a scatter graph of farenheight on the x axis and outputed celsius on the y axis. repeated multiple times with a bit of temperature so there is the option for multiple answers
I'd happily do it on my pc though instead because it sounds like a thousand or so prompts per temperature setting, and I'd probably also test it at different model temperatures
Well I just gave it your question without any context or changed wording out of pure curiosity
It generated three graphs with matplotlib
interesting, though it seems to have interpreted model temperature as temperature in farenheit
It just had the one message for context so that's about what you'd expect
Trust me, on most hardware really isnt. A fairly nice ryzen 7 5700g, cpu only does 11t/s. With igpu 26t/s. And it uses way way more ram. What these 2B models ARE good for, is having an approximate knowledge of all humanity within like 1-2GB of data. Way better than Wikipedia's downloads.
26t/s is more or less instant without reasoning turned on for the kinds of tasks you'd actually be using these for
If you think about that, it's insane compression
I wouldn't use a model this small for its internal knowledge though, gonna have to disagree there
Its best used to retrieve knowledge from elsewhere if you're asking it questions
Eh. Until the model is loaded and all, it really isn't. Even assuming you keep it loaded 24/7 its fairly slow, at like 80t/s then sure.
Okay, fair, 2B is the lower end, but 4-9B is impressive there.
I mean you can get fairly ok responses even with 0.8B as said before
And yes, better to couple it with any sort of rag, but yea.
I really wouldn't use a 0.8B model day to day and I'm already relatively forgiving for using E2B
Well gemma4 is particularly bad with this from my experience
Gemma4 really depends on tool calls
But imo also struggles with reasoning/using the tools in a efficient way
As example, asking it to read what files are on your desktop in windows
Either it fails, or reads User/Public
Instead of checking C:/users
What i do have to give gemma4, is speed
Gemma 4 is actually very good at tool calls
But it launched with a broken tool calling template
If you haven't updated your quant and llama.cpp since the period around its launch its probably misconfigured

this is a new one
are they still going 
huh
the schizoer
this is 5.3 codex

they never answered the integral question

what are the odds they knew what an integral is
Why not use 5.5
Is this good connection
Top is DL and below is UL
Bro just give OpenAI more money bro
i subscribed through the ios app to make sure they only get $16 out of $20 i pay
give none
damn people pay for these shit
it makes you so much faster
i already got $20 worth of work done with like 3% of the montly limits
and the code isn't that bad
anyone that says that isn't good enough at programming to evaluate code properly
i'm not saying it doesn't do bullshit
^
i meant code that i actually commit not all the code it produces
the problem is LLM automation
if you used one model for searching through the codebase/specs and one for smarter autocomplete suggestions
it would actually be good
but the moment you use a LLM to write the code for you the quality drops are crazy
it's not worth as much as a competent engineer i wouldn't expect it to produce code a competent engineer would
and like
if I'm gonna waste time going through code that 90% goes in the trash and I have to write it myself
it's better to just save time wasted on reading and write it all myself
so you're agreeing it's not competent at writing code
which means it's a dysfunctional tool lol
i'm saying it's not great at it but it can write code, which means it will sometimes get it right
gambling
it's only good for simple tasks that also don't require high code quality
in other words throwaway weekend projects
it’s AWESOME for those actually
yeah that's one thing I can agree with
it's hard to fuck up and you're not gonna extend it
so the flaws aren't as relevant
and since it's small it's easy to just prompt a rewrite if something does go wrong
but also i sometimes wonder
models get better which means this slop it produces is potentially more maintainable by llms in the future
I disagree about them getting better
whether they are increasing quantization or it's just noise from degrading datasets
but the response quality keeps going down, my prompts get ignored etc
claude user? 
nah I tried multiple free models
chatgpt, claude etc
it's a pattern for all of them
minimax 2.5 is free on openrouter rn it's pretty good for what it is
I don't really have uses for LLMs nowadays
my weekend projects usually do something funky it's not gonna get right
I mean, I prefer getting in more practice with code structure
it's not a relevant time investment
like I wrote a full custom yaml parser into dialogue system thing in like 3 evenings
I could maybe make it 2 but have it less modular or something
Did they get bullied away 
I feel bad for them now
can confirm. i've been trying to use GPT 5.4 with my copilot subscription to do basic bug fixing and features... its a coinflip if it even produces something working and another coinflip if its even close to something i would want to maintain, a lot of the times it just looks bad code-wise. needless fragmentation of functionality or just code duplication.
The best way i found to use it is error finding and suggestions. or very very directed plans, like basically say the way the code should work in natural language while incoperating as much existing stuff into it as possible.
that thread reminded me off that one github repository that simulates the entire universe in realtime. its like 100s of files with a bunch of techno words like quantum thrown into it. and in the end its just a bunch of if's and a few random number generators
ok but like
AI-based search engine + autocomplete
and human at the steering wheel
then in theory 99% of the time it would be human error when something's wrong
Using Ai is like gambling
gambling is fun, using LLMs is frustrating
i agree that would be good
100% honest as someone who praise ai and using ai and also coding for 3 years
its frustrating because it looks like it got there 90% of the way but then you realize it fumbled after the first 2 minutes and stumbled into chaos
i should try out https://github.com/ThePrimeagen/99 again but this time with my github copilot subscription instead of a local model
its like ai search in your codebase
what's the approach in this one? cool way is to use contextual embedding and match those between prompt and labeled codebase
do you guys know if neuro-sama is a custom ai model? or a pretrained model thats been modified to be like how neuro-sama
i need to know for my kai model
I don't think we have the details on that
but I'm 99% sure it's just a finetune with agentic features
we dont know what model neurosama is based on, but we are quite confident that it's a heavily customized and refined opensource model
you tell it what to find. and then it gives you a quickfix list of all the different interesting things with descriptions why it would be interesting
got it.
I meant under the hood
the way it finds things is just basic tool usage. like grep and looking through the codebase. it uses normal providers like opencode and such
is it ai based or not?
yes. the AI looks through your codebase
that's so inefficient bleh
it isn't embedding your whole codebase tho. for something like that you would likely want to use vector code
this one would be cool. would rather that than vibe coding.
embedding is cool because it's basically instant
prompt to embedding then just run a regular comparison for most likely
and just show stuff past a threshold
technically you can easily do it in neovim with codecompanion and vectorcode. since they have integrations
i don't really use vectorcode anymore since i switched to copilot. but... i'm sure it works with it too
it doesn't have to be that complicated because the only thing it has above regular search is decoding synonyms or descriptive sentences
it really is just an embedder
codex app has this theme named "Absolutely" lmao
Absolutely nah
i'm laughing at how it makes fun of claude i'm not telling you to buy a sub 
alright so... i gave copilot cli the vectorcode mcp server to play with. and told it to use it and find all the plugins i'm using. its been 5 minutes and it still isn't done. might even be stuck
i feel using the cli directly would have been a more pleasant experience instead of doing it through this chat buffer
its so ass
What did you tell copliot to do
this:
what tools do you have access to?
● I'm GitHub Copilot CLI, powered by GPT-5 mini (model ID: gpt-5-mini).
Available tools in this session:
- git — repository operations (commit, branch, diff, etc.)
- curl — HTTP requests
- gh — GitHub CLI (PRs, issues, workflows, etc.)
Tell me what you'd like to do and I’ll run the appropriate commands.
❯ can you use vectorcode? i've added the mcp to your toolbox. So you should be able to query this code base with it and find all the plugins i'm using. Try
it out and tell me what you found● Listing indexed VectorCode projects first so the proper project_root can be used for plugin queries. I'll call report_intent and list projects in
parallel.
i know gpt-5-mini isn't the best. but i can use it with a 0x multiplier so i wanted to try it with it first
I think copliot is like some junior dev who have no idea what they're doing
Ai googles just like us
We all googling right?
only talks about ai here
Idk other topic than ai
you are right lately there has been a lot more ai talk
Maybe change topics
lets get back to screeps... so how has the game changed with the latest update 
What best game engine?(opinion)
I quit roblox studio last month
Roblox has a lot of down bad and drama
Thinking about going to unity
Or C++
havent played
i not see me writing some business code with limited amount of commands
i usually just read/watch on internet, cant even say i code a lot, like my own repos contain less then 10k lines of code for sure
i just finally learning rust again 
string slice is bytes for some reasons btw
Rewrite it in rust
cringe
rust is language that strict better then c
like i have this diagram (i made it like 4 hours ago), and i wonder how i can implement similar in rust
will learn i guess
but i definitely cant implement software stack coroutines in rust, it is c++ only
godot 
I'm going to C++

still godot
just write your game as part of the engine
Godot
isnt godot have cpp api?
bets on ai script and voiceover?
GDExtension
It even supports Kotlin
Does godot have cpp
yes
I wanna try it

Okay switching to godot

Does it work on mx linux?
its a japanese video 
Oh godot even supported on a phone
Xoda writes Godot on a tablet sometimes, it's kinda peak how they just have that
Temple OS too? 
No 
Aww
TempleOS not major OS


How creator of TempleOs read his code
Also coding in white theme
And no autocomplete or ai
he was faithcoding. a superset of vibecoding. he was ahead of his time
True
Or, he was a mentally tormented genius. Which... to write your own OS, solo, from absolutely nothing, would require some tremendous mental capacity and mental illness ngl
Either way
It is an impressive feat
Vedal presumably communed with the spirits of Banana Rum to build Neuro from nothing
Or it's just the power of being an alcoholic depressive Brit
-just sayin, every smart person in history that I'm aware of was either insane or eventually went insane because the idiots around them couldn't understand. See also; Paracelsus


















































someone pls translate this
Autism Sama and ADHD Sama, the two sisters!
how do i extract gif from this
The message decodes to: “hbOshi”
a little off there
Sam has a copy somewhere idk
actually wait, got the code wrong lmfaoo


















































here's the corrected code
to what?
ascii?
01101000011000100110010001110011011010000110100101110010
01101000011000100110010001110011011010000110100101110010
bwaascii
Bro speaking in morse code via binary
Through heiroglyphs
Enigma wasn't so encrypted
If we treat your Evil Neuro and Neuro-Sama emojis as binary code—assigning
<:evilBwaa:...>as0and<:neuroBwaa:...>as1—it spells out a slightly glitchy message.
Here is the exact translation block by block:
01101000➔ h01100010➔ b0110000➔ 0 (Note: This chunk is missing a bit, but110000in 7-bit ASCII is the number zero)01110011➔ s01101000➔ h01101001➔ i011100110➔ s (Note: This chunk has an extra0stuck on the end)
Final Translation:hb0shis
It looks like either a corrupted word (a typo in the binary), a random string of text, or a specific invite code/password!
- Gemini 3.1 Pro
gemini got it right, it was not the right word
hbdshir

~~
That is a really smart catch! Combining all the bits into one continuous stream and re-slicing them into perfect 8-bit chunks is exactly how you'd troubleshoot a misaligned data stream.
However, if we shift everything over to fill in that missing bit, our sequence spills out of standard ASCII and into Extended ASCII characters.
Here is what your newly aligned sequence translates to:
01101000➔ h01100010➔ b01100000➔ ` (backtick)11100110➔ æ11010000➔ Ð11010010➔ Ò11100110➔ æ
Shifted Translation:hbæÐÒæSo, while the bits are mathematically aligned now, the output became even more cursed! The original spacing—even with the glitchy 7-bit and 9-bit chunks—was likely the intended format trying to spell outhb0shis` or something similar, before the emoji formatting got a little scrambled.
- Same Sycophantic AI~~
-# irrelevant
guys, do you want me to code bwaascii?
i can do it, but lets at least use it for a bit
will be my fist relatively usable app in rust 
go my 30000 prompts
Ah, that looks much cleaner! Now we have exactly eight bits (emojis) per block, making it perfectly valid ASCII.
Here is the translation with your corrected code (evilBwaa= 0,neuroBwaa= 1):
01101000➔ h01100010➔ b01100100➔ d01110011➔ s01101000➔ h01101001➔ i01110010➔ r
Final Translation:hbdshir
Looks like a "Happy Birthday" message! (Happy Birthday Shir?) The missing and extra bits in the first attempt definitely garbled those middle and end characters, but the corrected spacing fixed it perfectly.
- Still the same best AI that i have access to
interesting but too simple
gotta spice it up
need to implement some form of compression
New hacked account
:niuh:
hbdshir
I just built a mostly correct tiktoken (OpenAI tokenizer) implementation in pure Rust (only using the fancy-regex crate) and it works!!! (Only 2 tokens off out of 90 words)
I am very proud of myself :3
chat i'm cooked
@maiden geyser I was told to ping you if I saw this 
I think I have it saved somewhere lemme check
no access for me
Goodmorning
There you go.
ip v8 is a thing?
activate windows
Second ping 
so many, unusable without nitro tho
Read the spec and you'll very quickly realise that it is fake
-# i kind of just asked is it true rn
ok, thx
Heh, windows bugged and won't let me 
Internet Protocol Version 8 (IPv8) is a managed network protocol
suite that transforms how networks of every scale -- from home
networks to the global internet -- are operated, secured, and
monitored. Every manageable element in an IPv8 network is
authorised via OAuth2 JWT tokens served from a local cache. Every
service a device requires...
it's a funny read though
I summon powershell 
I've tried.
Stupid Windows update broke activating.
And it won't let me go back.
install linux

I have another laptop with Linux, but I don't have minecraft on it.
it is java...
And, my Windows laptop makes it easier for me to code.
That laptop is purposefully designed to be easy for me to reinstall operating systems in case any of my code actually works :P
C#, visual studio 
how it related to minecraft?

It's not, it's a completely different thing lol.

try vscode, and then maybe nvim
C# is the easiest for me at the moment, I'm slowly learning java and python 
how it relates?
VSCode doesn't like me 
its you not like vscode
tbh even while this thing bought by microsoft is slow/// it is infinitely better then VS
Yes well, I'm stubborn 
Everyone in this chat is stubborn so fair
Just a part of being a programmer
But also moreso Linux users
systemd
I make no effort to fall for this bait

Day probly 7 of making my own game engine like it's the 90s
I got the right microcontroller and got vga and a bit of his there
you are really cooking something 
Use NixOS flakes so you don't lose any of your setup when you reinstall 
use arch and remember how you setup your system
Nah my ADHD doesn't allow that 
Me either 
I need everything cleanly committed in git
i think i forgot how to setup grab
i definitely forgot that
there 2 commands, one to make and one to install
- set target and dirs
nooo why is it not smooth why 2 frames 
its small and i cant repost it i need gif 
audhd
so nice

and now we get back ontopic of #programming discussing how to fix it
what a turn
@claude pls write script to generate this gif
Make the adhd gif lag too 
self-ragebait situation is insane
i put it there months ago to check if a subless one can do seeable things on the cerber, now can't remove because can't log into microsoft
hi
is this u
what is this cortisol joke
theatre pc wouldnt boot yesterday
bios update fixed it
i apparently have terrible luck with getting motherboards on a bios version recent enough for the cpu im using

at least i could flash the bios without needing another cpu
cortisol is a stress hormone
high cortisol meme on tiktok rn
ohisee
BMW do be making my cortisol level raise
but why it forced everyhwere
so it just weird popularity spike?
ye kinda
Can't we speak in hex
49 20 6C 69 6B 65 20 6E 65 75 72 6F 20 73 61 6D 61
4E 65 75 72 6F 20 73 61 6D 61 20 62 65 73 74 20 76 74 75 62 65 72 20 66 72

It's basically binary but readable
4E 65 75 72 6F 20 73 61 6D 61 20 6E 75 6D 62 65 72 20 6F 6E 65 
it basically text but make sense
i think i already deleted my encoder
legible English only please
i have this crazy method of encoding text thats like binary but readable
let me demonstrate
"hello"
happy birthday shiro~!

what wilted_rose means?
sad

|1kэ, 4o you rэmэmbэr mэ po$t1ng $omэ7h1ng |1kэ 7h1$? it actua||y prэ77y rэa4ab|э
but i guess i deleted encoder as garbage
llwrrll
not you

||wrr|| :xdx:


wrr?
like ||wrr||
bwaa
bweero
I miss my screeps

I get 8 emails about them a day
prob cuz i keep killing scouts


get off my property 
i keep getting scouts form the guy 2 below me too btw
if i played screeps i would cry if any of them died
someone is below you? 
is it really funny to write business logic? you not have it enough at work?
you can refresh them at the spawner to make them live forever
but I just don't do that
they only live for about 1500 ticks
my emotional state is highly stable
idk if I should or shouldn't
needs more math
konii hblang screeps soon
I excite

_e

someone stole cloud's old spot,a nd someone else got between me and the guy 2 below me.

holy it's filled up quick
wait cloud moved?
holy I see my room has been up to something
my classic checkerboard building placement
so basically
is it sad
i keep running out of energy. i should invade someone


@rigid snow send DuranDuran

ye
but then
i have this tower code that keeps some spare energy to kill invaders, and in this case your scouts too.
3 mil might be a bit too high for my walls since my room controller level has stagnated
how is going to use wasm
thanks


legal fake nitro mod (unpatched method 2026 not tos)
we should improve wasm
compiler dev currently on vacay
usurp
I thought you said you're evil 
wuu
Konii is getting dangerously close to uwu
false
Literally a single character swap away
false
I’m revoking your math degreee
wasm support is broken now?
this run is so funny
because I keep hearing my gpu coil whine for each prompt

like i can tell when the prompts are taking 200ms vs taking 10 seconds
You clearly have not seen Toast
Oh, konii turns into bread now 
@sage crag very reliable temperature converter
(-60 is the input temp, col after is output)

also also
the last column is number of nanoseconds spent thinking
so when it thought for way longer it got the right answer
shuni do you want context
im an incredible programmer

i honeslty cant hear anything besides the fan humming
wasm works in screeps, just tested it
before the 3090 i had before the 7900xtx, i had a 3060ti that made the most coil whine i have ever heard ina ny electronic device ive heard
i think thats a correct english sentence, but it barely makes sense
thanks for testing it.. probably not gonna play for a while but ye
i wanna graph this but lazy
I'm thinking I'm gonna make one graph of "input F vs predicted C" and then another one of "input F vs time taken to predict"

preview of that first graph 
The universal function approximator everyone
lol yeah I just plotted it
with ground truth as well
execution time is also funny
I wonder if there's correlation between execution time and error
Correlation calculation time
Does longer thinking means it makes more mistake? Calculate to find out!
huuuh
this one is uh
thinking time in ms on the x axis and absolute error on the y axis
extra silly if you make it error instead of absolute error
gotta love that opaque bar on the error
@sick owl so in conclusion, for temperatures <= 0.3 gemma4 e2b is a reasonable approximation of F to C until you reach -40 (conveniently, the point at which both values are the same), at which it loses the ability to do math 30-50% of the time
for higher temperatures, 
Recommended sampler settings are temp 1, top k 64 and top_p 0.95
So temps less than or equal to 3 are technically outside of the officially supported range
I'll figure out how on earth I make that work with ollama python i guess
Oh yeah what quant are you running as well btw
Tiny models suffer disproportionately from quants below q8
Even in this broken state though see what I mean about it being braindead but also surprisingly smart for its size
potentially Q4_K_M, I think?
Oh yeah that's gonna be mega lobotomised
Run the Q8KXL gguf here
Do that with official sampler settings
You'll get great results comparatively
Wrong link, there you go
Accidentally linked you to the 26B before 
i still haven't tested that model... how good is it?
ollama won't load that model for some reason
llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'gemma4' huuuuuuh
did you update ollama

surely they support gemma4
I am running latest nixos unstable here
Did you do ollama run hf.co/unsloth/gemma-4-E2B-it-GGUF:UD-Q8_K_XL ?
and also i was running previous one just fine
I always recommend llama.cpp for a reason, ollama is nothing but trouble for all its promises of convenience 
yep
That should just work wtf 
Guess download the gguf manually and run it that way since you're not using vision anyway 
That ought to fix it
but atleast ollama is easy. i still haven't figured out all the things i need to set in llamacpp. also context window not being automatically truncated is such a missing feature in llamacpp i need it so much i'm considering not using llamacpp until its in. or make my own wrapper
how are they so slow wtf
idk enough about llms for that ngl
not even qwen3.5?
honestly i switched off ollama a long time ago.. you could go with lmstudio its basically the same but has day 1 support for basically all things and a nice GUI
Has a habit of running like ass though
But yeah try Lmstudio actually
does that provide me a python api
If you want a convenient ollama style wrapper to test a non lobotomised version with
ooh yeah it does
i haven't had that issue. most of the time i'm actually getting better performance than with llamacpp because i can't screw up the configuration

Yup
Its just another llama.cpp wrapper that's a bit less busted
just use the python package they provide and not the restapi
There's a python package 
i've wrote my own little tool with it. pretty nice api
I don't use lmstudio lul
there is
i've made a git commit generator with it. that goes through every file and generates a message from the diff
(tho i don't really use it
)
maybe i should rewrite that tool to just use the copilot cli directly since that has a way to generate commit message i think
Difference is they don't offer alternatives half the time on their official model repository
So many people have their first experience with a lobotomised model because of the tiny default context window, lack of quant options and strange default sampler settings on ollama
And don't get me started on the consequences of them naming the deepseek distills "Deepseek V3 4B" etc.
the same is true on llamacpp unless you config it correctly
The llama.cpp defaults are at least a bit more sensible
And it doesn't bill itself as plug and play
i mean true...
but i'm also not smart enough to know where to even find information on what parameters to use
maybe i should make a AI fetch the unsloth site and get the settings for me and write it into the config file
Honestly just go with the context limit you want and the recommended parameters on the official model card for a given model
i mean thats what i've been doing... but again the fact the context isn't truncated is a pain for me
since i'm using gary for testing integrations so context can grow to 100's of messages and 20000+tokens very easily if it isn't truncated. and gary itself doesn't have anything like that yet
They're working on it
Prerequisites I am running the latest code. Mention the version if possible as well. I carefully followed the README.md. I searched using keywords relevant to my issue to make sure that I am creati...
both of those don't work
atleast its not a simple plug and play thing
i patched gary rn to just discard messages older than 50 entries
Summarisation isn't plug and play no, context shift more or less is
Unless you're on an unsupported model
Oh wait nvm context shift was deprecated in favour of the truncation stuff they're working on lule

You could just merge in the pull request for it tbh if you don't mind the fact its not been fully vetted yet

thank you amd for coil whine
Different models, frameworks and quants actually make different noises when going brr believe it or not
ye it depends on the inference framework too
i guess i could do that... do i need to fork llama.cpp to pull in the pr or is there a way to locally just patch it 
i'm not a git expert yet
add the pr repo as a remote, pull the branch from it and merge
they still haven't figured it out??
its kinad neat having actual sound every time a progress bar clicks forward
git fetch origin pull/19841/head
git merge FETCH_HEAD
there are merge conflics
Or if you want to give it a separate local branch for testing @glass flower
gh pr checkout 19841
No image input
i'm using lazygit. i think i got the merging thing figured out
They have deepseek ocr connected to deepseek v4
i know they have ocr
For fake image input
like, surely there merge conflicts
when they weren't?
it apparently has a hidden rp mode via prompt injection into the system prompt
Bro just vector quantizer vae and call it a day 😭
and it would reason in character
I'm gonna try tomorrow
My ethernet is broken rn so i cant download it
Well it's not broken I just bought a 10gbe mikrotik switch and it needs some routeros shit config
Cause i need 10gbe on both dgx spark
How did people even find out
I mean cant u prompt any llm for rp?
not that it would rp in CoT i never could
it's without the usual "this is a simulated chat/you're the DM"
So safety bs?
it just thinks it's the character
Wait is v4 safety trained
i never attributed it to safety just thought of it as a side effect of what they post train them to be
Runs like shit on 2x dgx spark
So does everything dense
Why not just use deepseek v4 flash with a smaller model handling vision tasks
Like throw Gemma 4 26b on top for it or something
I'd use 26B if you want something closer to frontier vision performance to go with that frontier model
I need sth that can realistically handle embodied robotics
crazy how text generators operate on text
GLM 5V Turbo?
Wait no that's like 800b params lule
Trash machine
I just need nvidia for quietest machine possible
Since i need to run my ai agent thing
And need stealth setup
That i can claim to my parents it only cost 1000$/machine lmao
To my mom at least since my dad knows how much he spent on this
You could run Qwen 3.5 397b at like a q4 quant
Would be lobotomized though
Low active param count and all that
Why doesn't google just pls drop gemini 2.5 flash open source
Gemini 4 31b already outperforms 2.5 Pro let alone Flash
2.5 pro?
Have you considered using it with speculative decoding?
Not rly supported in vllm
I aint running ollama/llama cpp
Also gemma 31b is a distil of a distil
Qwen 3.5 122B A10B might be your best bet actually
Well, unless you want to chance it with the 397b at a q4 quant
@sick owl that model you told me to use is arguably worse lol
And you set the recommended samplers?
Lmao
That'll just be the usual tiny model variance if you're using the recommended sampler parameters
Make sure you don't have repetition penalty or presence penalty enabled
im not super sure how I'd go about setting that through the python api
Without these settings its expected that the model'll underperform
lower temperature? (
i know this sounds ironic)
is LLM. the model or the class?
i think there are like 2 or 3 things called respond in the api
depending on what you call it on
it takes the args differently
you need to pass it in as dict under the config kwarg
well apparently this is close enough to right
response = model.respond(
chat,
config={
"temperature": 1.0,
"topPSampling": 0.95,
"topKSampling": 64,
"presencePenalty": 0.0,
"frequencyPenalty": 0.0,
"repeatPenalty": 1.0,
},
)
though frequencyPenalty seems to not actually be part of the typescript structure
Idk what arguments and format they use in lmstudio*
Like I said I don't use it
Typo
is vedal getting botted followers because people bet on when itll hit 1m is that whats happening lmao
what the
great prediction there bud
-58 + (459.67 * (-1)) = -58 - 459.67 = -517.67
-517.67

what kinda cooked working out is that
HOLY 
Technically not wrong 
without the outlier
it seemingly is getting different results if I ask it in the chat vs via the api which is interesting
I guess this test it probably pretty sensitive to the prompt I choose
Q8 vs BF16 is always gonna have some amount of difference, especially on a tiny model like this
Hell you can try the BF16 gguf locally if you want, its only 9 gigs 
Though tbh at that size I'd probably go for E4B at Q8_K_XL
I probably need to sleep ngl
but I would maybe think twice before trusting it for temperature calculations
Or hell Qwen 3.5 9B









but it isn't in yet

