#programming | Neuro-sama Headquarters | Page 22

gritty dust Jun 1, 2025, 4:30 PM

#

bro that happens to me every time I write something for science lol

#

at least teachers here can't use AI detectors for marking since they give false positives

olive sable Jun 1, 2025, 4:32 PM

#

here they can

#

and they also use ai to make their test questions now

#

ive seen them do it, they told me they do it

tender river Jun 1, 2025, 4:33 PM

#

a person i know went to uni before chatgpt but dropped out and then enrolled again and said its now a completely different experience with much less material comprehension neuroDespair

rough bloom Jun 1, 2025, 4:34 PM

#

using LLMs to help make questions seems fine to me
AI detectors though neuromegadance

dense cosmos Jun 1, 2025, 4:34 PM

#

My clg experience was basically going SCHIZO

#

My brain cells are returning to me now

real sierra Jun 1, 2025, 4:40 PM

#

grab_R maximum_suffer grab_L

#

found you

noble zodiac Jun 1, 2025, 4:41 PM

#

I'm gonna lose my marbles

real sierra Jun 1, 2025, 4:41 PM

#

the mystery of the broken ALU is fixed

#

time to re-run the right shift code

#

it worked

#

so all the problems ive been experiencing

#

were because of this stupid bad trace

olive sable Jun 1, 2025, 4:43 PM

#

well, at least it works now

#

neuroHypers

real sierra Jun 1, 2025, 4:44 PM

#

im amazed it didnt cause more problems tbh

#

given its position tho

#

you would have to do an operation that enables the swap flag, and have differing values in bits 10 and 11 of the D register

grave fractal Jun 1, 2025, 4:45 PM

#

real sierra grab_R maximum_suffer grab_L

ALERT finally

real sierra Jun 1, 2025, 4:45 PM

#

to even have a chance of noticing it

#

well thats exciting

#

now i have right shift

#

i guess i implement that division code now

grave fractal Jun 1, 2025, 4:46 PM

#

real sierra i guess i implement that division code now

good luck evilCheer

real sierra Jun 1, 2025, 4:47 PM

#

is there a reason for returning 32767 in particular if the divisor is 0?

grave fractal Jun 1, 2025, 4:48 PM

#

real sierra is there a reason for returning `32767` in particular if the divisor is 0?

he's just returning the maximum value of a signed short

#

as an error signal

real sierra Jun 1, 2025, 4:49 PM

#

NeurOhISee

#

surrounded by so many experts i feel like an idiot

#

neurOMEGALUL

grave fractal Jun 1, 2025, 4:50 PM

#

real sierra surrounded by so many experts i feel like an idiot

nah you better than me neuroHeart

#

i'm just a computer engineering student trying my best to help neurOMEGALUL, while you guys are just vibin'

real sierra Jun 1, 2025, 4:52 PM

#

theres an annoying quirk with this architecture

#

for loading literals from code, data instructions are used

#

these are indicated by the highest bit being 0

#

which means you only get to load 15 bits

#

Copege

#

if i ever want max short as a literal, i have to left shift 7FFF and add 1

#

agony
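
A small Python sketch of the literal quirk described above, since the thread's own assembly dialect isn't documented here. It assumes 16-bit words and a literal field limited to the low 15 bits; the helper names are hypothetical. One thing it makes visible: 0x7FFF itself fits in 15 bits, and the shift-and-add-1 trick produces 0xFFFF, the all-ones word.

```python
WORD_MASK = 0xFFFF    # assumed 16-bit machine word
LITERAL_MAX = 0x7FFF  # largest value a 15-bit literal field can hold


def fits_in_literal(value: int) -> bool:
    """True if the value can be loaded by a single data instruction."""
    return 0 <= value <= LITERAL_MAX


def shift_left_add_one(value: int) -> int:
    """Model 'left shift, then add 1' using only addition, wrapped to 16 bits."""
    return (value + value + 1) & WORD_MASK


all_ones = shift_left_add_one(0x7FFF)
print(hex(all_ones))
```

Any constant with the top bit set needs a similar two-step build under these assumptions.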

haughty briar Jun 1, 2025, 4:54 PM

#

chat chat chat, look

olive sable Jun 1, 2025, 4:54 PM

#

my brother in christ

real sierra Jun 1, 2025, 4:54 PM

#

tbh for some form of rudimentary error handling

haughty briar Jun 1, 2025, 4:54 PM

#

:3

olive sable Jun 1, 2025, 4:54 PM

#

wtf is that?

real sierra Jun 1, 2025, 4:54 PM

#

ive been considering just making labels to jump to

real sierra Jun 1, 2025, 4:54 PM

#

haughty briar chat chat chat, look

wait what the hell

haughty briar Jun 1, 2025, 4:54 PM

#

olive sable wtf is that?

:3

real sierra Jun 1, 2025, 4:54 PM

#

i expect real neuro in 6 months

#

good luck

olive sable Jun 1, 2025, 4:55 PM

#

bro is collecting human body parts to make robots

haughty briar Jun 1, 2025, 4:55 PM

#

calm down its the animatronics they used for harry potter

olive sable Jun 1, 2025, 4:55 PM

#

so when is five nights at neuros opening?

haughty briar Jun 1, 2025, 4:55 PM

#

but i am making somewhat of a similar thing though

haughty briar Jun 1, 2025, 4:56 PM

#

olive sable so when is five nights at neuros opening?

lowk

#

gimme time

real sierra Jun 1, 2025, 4:57 PM

#

i was just gonna halt if a divide-by-zero happens

#

with a jump to a label that looks something like

@ Exception_DivideByZero
goto Exception_DivideByZero;

#

not much of a "halt" but given i can follow the code while it runs, it's enough for me to see

rigid snow Jun 1, 2025, 4:59 PM

#

neuro pog high definition

#

neuroPogHD

olive sable Jun 1, 2025, 5:01 PM

#

neuroPogHD

noble zodiac Jun 1, 2025, 5:03 PM

#

Im gonna commit triple fault

real sierra Jun 1, 2025, 5:12 PM

#

wait what are you cooking with this condition

olive sable Jun 1, 2025, 5:13 PM

#

3% in 14 hours.
i dont trust that 150 hours left for a second.
at this rate it will take 460 hours
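
The estimate above is a straight-line extrapolation from progress so far. A minimal Python sketch of that arithmetic, assuming the 3% and 14 hours are exact and the training rate stays constant (the function name is hypothetical):

```python
def eta_hours(fraction_done: float, hours_elapsed: float) -> tuple[float, float]:
    """Return (total_hours, remaining_hours) assuming a constant rate of progress."""
    if not 0 < fraction_done <= 1:
        raise ValueError("fraction_done must be in (0, 1]")
    total = hours_elapsed / fraction_done
    return total, total - hours_elapsed


total, remaining = eta_hours(0.03, 14)
print(round(total), round(remaining))
```

Under those assumptions the total comes out in the same ballpark as the 460 hours mentioned, far above the 150 hours the progress bar reported.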

tender river Jun 1, 2025, 5:13 PM

#

real sierra wait what are you cooking with this condition

hi t neuroPogHD

real sierra Jun 1, 2025, 5:13 PM

#

surely the last condition is just t < 0

stray dragon Jun 1, 2025, 5:13 PM

#

hi chayleaf

real sierra Jun 1, 2025, 5:13 PM

#

by like

#

...math

hasty jungle Jun 1, 2025, 5:15 PM

#

nocturne olive Jun 1, 2025, 5:16 PM

#

olive sable 3% in 14 hours. i dont trust that 150 hours left for a second. at this rate it w...

How big was the model again?

#

My PC has a GPU with more compute than yours and it still took 3 days on just 150M on a fairly small dataset

olive sable Jun 1, 2025, 5:17 PM

#

"model_dim": 1536,
"depth": 32,
"heads": 16,

nocturne olive Jun 1, 2025, 5:20 PM

#

I don't know how to convert that to params

olive sable Jun 1, 2025, 5:21 PM

#

me neither

real sierra Jun 1, 2025, 5:21 PM

#

@sage crag

# math_div(a, b) returns the quotient a/b.
function math_div 2;
    push_arg 1;
    pop_d;
    if_else_d JEQ math_div_bad_divisor math_div_good_divisor;
    @ math_div_bad_divisor
        goto Exception_DivideByZero;
    @ math_div_good_divisor
    push_arg 0;
    push_arg 0;
    pop_d;
    if_else_d JLT math_div_neg_a math_div_pos_a;
    @ math_div_neg_a
        neg;
    @ math_div_pos_a
    push_arg 1;
    push_arg 1;
    pop_d;
    if_else_d JLT math_div_neg_b math_div_pos_b;
    @ math_div_neg_b
        neg;
    @ math_div_pos_b
    pop_d;
    push_d;
    push_d;
    push_value 1;
    call math_div_recursive;
    push_retval;
    push_arg 0;
    push_arg 1;
    pop_d;
    if_else_d JLT math_div_sign_flip math_div_sign_flip_end;
    @ math_div_sign_flip
        neg;
    @ math_div_sign_flip_end
    pop_d;
    if_else_d JLT math_div_q_flip math_div_q_flip_end;
    @ math_div_q_flip
        neg;
    @ math_div_q_flip_end
    return;
        
# internal backing call for math_div(a, b)
# math_div_recursive(ua, ub, t, d)
function math_div_recursive 4;
    push_arg 0;
    push_arg 2;
    push_arg 2;
    add;
    sub;
    pop_d;
    if_else_d JGT math_div_recursive_test math_div_recursive_pos;
    @ math_div_recursive_test
        push_arg 0;
        push_arg 2;
        sub;
        pop_d;
        if_else_d JGT math_div_recursive_break math_div_recursive_neg;
        @ math_div_recursive_break
            push_value 0;
            return;
        @ math_div_recursive_neg
        push_arg 3;
        push_arg 0;
        sub;
        push_arg 1;
        push_arg 1;
        push_value 1;
        call math_div_recursive;
        push_retval;
        push_arg 3;
        add;
        return;
    @ math_div_recursive_pos
    push_arg 0;
    push_arg 1;
    push_arg 2;
    push_arg 2;
    add;
    push_arg 3;
    push_arg 3;
    add;
    call math_div_recursive;
    push_retval;
    return;

#

(sorry for wall of text)

olive sable Jun 1, 2025, 5:22 PM

#

np

real sierra Jun 1, 2025, 5:22 PM

#

ported your code tho

#

wonder if it actually works

#

well i hit start

#

its doing something for sure

#

not sure what

#

failure Sadge

#

i probably made an oops somewhere

inner pike Jun 1, 2025, 5:24 PM

#

real sierra its doing something for sure

the hidden bitcoin miner:

real sierra Jun 1, 2025, 5:25 PM

#

trying to do 15 / 3 and the result im getting is uhhhh

#

0xE004

#

Concerned

rough bloom Jun 1, 2025, 5:26 PM

#

olive sable "model_dim": 1536 "depth": 32 "heads": 16,

should be something like ~1-2B I think depending on the other hyperparameters like vocab size, embedding size, head size, and ffn size

olive sable Jun 1, 2025, 5:27 PM

#

"vocab_size": 50257
idk embed size and head size
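
A rough way to turn those hyperparameters into a parameter count, as a Python sketch. It assumes a GPT-2-style decoder: four model_dim x model_dim attention matrices per layer, an MLP with hidden size 4 x model_dim, tied input/output embeddings, and biases and layer norms ignored. None of that is confirmed for this particular model. Under these assumptions the head count does not change the total, because the heads split model_dim between them.

```python
def estimate_params(model_dim: int, depth: int, vocab_size: int,
                    ffn_mult: int = 4, tied_embeddings: bool = True) -> int:
    """Approximate weight count of a decoder-only transformer (biases/norms ignored)."""
    attention = 4 * model_dim * model_dim        # Q, K, V and output projections
    mlp = 2 * ffn_mult * model_dim * model_dim   # up and down projections
    embedding = vocab_size * model_dim
    if not tied_embeddings:
        embedding *= 2                           # separate output head
    return depth * (attention + mlp) + embedding


estimate = estimate_params(model_dim=1536, depth=32, vocab_size=50257)
print(f"{estimate / 1e9:.2f}B parameters")
```

With the posted values this lands a little under 1B, at the low end of the 1-2B guess; a wider feed-forward layer or untied embeddings would push it higher.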

nocturne olive Jun 1, 2025, 5:27 PM

#

That's gonna be a lot of compute depending on how big the dataset is and how many epochs it's doing

olive sable Jun 1, 2025, 5:27 PM

#

10 epochs

nocturne olive Jun 1, 2025, 5:28 PM

#

Have fun with your way overfitted model with too much compute spent on it

#

With LLM pretraining usually you want to do at most 1 epoch I believe

hoary lion Jun 1, 2025, 5:28 PM

#

Yes

#

We go less than 1

#

Bc it is too big

#

Don't pretrain more than 1

rough bloom Jun 1, 2025, 5:29 PM

#

nodd never repeat any data, it only causes overfitting
usually there's way more data available than you have compute to actually use

olive sable Jun 1, 2025, 5:30 PM

#

evilShrug @prime ridge

tender river Jun 1, 2025, 5:30 PM

#

real sierra @sage crag # math_div(a, b) returns the quotient a/b. function...

math_div(a, b) {
    if args[1] == 0 {
        throw;
    }
    a = args[0];
    if args[0] < 0 {
        a = -a;
    }
    b = args[1];
    if args[1] < 0 {
        b = -b;
    }
    c = b;
    d = 1;
    ret = math_div_recursive(a, b, c, d);
    a = args[0];
    if args[1] < 0 {
        a = -a;
    }
    if a < 0 {
        ret = -ret;
    }
    return ret;
}

math_div_recursive(a, b, c, d) {
    if sub(args[0], add(args[2], args[2])) > 0 {
        if args[0] - args[2] > 0 {
            return 0;
        }
        return math_div_recursive(args[3] - args[0], args[1], args[1], a);
    }
    return math_div_recursive(args[0], args[1], args[2] + args[2], args[3] + args[3]);
}
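
For comparison, a Python sketch of the doubling-and-subtract scheme this pseudocode appears to be aiming at. This is a guess at the intent, not the code that was actually ported: the function names, the order of the comparisons, and truncation toward zero for negative operands are all assumptions. It only needs addition, subtraction and comparison, which matches the ALU operations listed in the thread.

```python
def div_recursive(a: int, b: int, t: int, d: int) -> int:
    """Quotient a // b for a >= 0 and b > 0, where t == b * d and d is a power of two."""
    if a < t:
        return 0                                   # nothing left to subtract
    if a >= t + t:
        return div_recursive(a, b, t + t, d + d)   # double the step while it still fits
    return d + div_recursive(a - t, b, b, 1)       # take the step, restart from b


def math_div(a: int, b: int) -> int:
    """Signed division truncating toward zero; raises on a zero divisor."""
    if b == 0:
        raise ZeroDivisionError("divide by zero")
    quotient = div_recursive(abs(a), abs(b), abs(b), 1)
    return -quotient if (a < 0) != (b < 0) else quotient


print(math_div(15, 3), math_div(15, 15), math_div(-7, 2))
```

Resetting t to b and d to 1 after each subtraction keeps the recursion simple at the cost of re-doubling from scratch every time.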

nocturne olive Jun 1, 2025, 5:30 PM

#

Whoever is telling you to train that model clearly doesn't quite know how training a model should be done

hoary lion Jun 1, 2025, 5:30 PM

#

I am not so sure why python libraries are trying to escape pythonic way

tender river Jun 1, 2025, 5:30 PM

#

ported back to pseudocode (my translation might have errors though neuroPogHD )

hoary lion Jun 1, 2025, 5:30 PM

#

Totally not frustrated by flax glueless

real sierra Jun 1, 2025, 5:31 PM

#

PepeKneel chayleaf

hoary lion Jun 1, 2025, 5:31 PM

#

I can't even use print() easily 😭

rough bloom Jun 1, 2025, 5:31 PM

#

hoary lion Totally not frustrated by flax glueless

you should try Equinox glueless

hoary lion Jun 1, 2025, 5:31 PM

#

See tbh

#

Everything is because of jax actually

#

JIT neuroDeadge

rough bloom Jun 1, 2025, 5:32 PM

#

yeah

#

a JIT is necessary though, both because Python is kinda slow and because that way Tensor operations can be fused and launched more efficiently

hoary lion Jun 1, 2025, 5:34 PM

#

I think there is this fundamental tradeoff between easily debuggable code and performant code

rough bloom Jun 1, 2025, 5:36 PM

#

I don't think that has to exist, jax JIT-compiled code and normal eagerly evaluated code look pretty much the same (other than the static shape restriction, but that doesn't affect debugging, it just causes pain getting stuff to work at all)

#

current ML frameworks are just too shit

real sierra Jun 1, 2025, 5:36 PM

#

oh i misread konii's code

#

need to add d

#

maybe it works this time

#

PauseSama

#

catdespair or not...

rough bloom Jun 1, 2025, 5:39 PM

#

neuroGanbare

real sierra Jun 1, 2025, 5:40 PM

#

looks like it got 13 as an answer

hoary lion Jun 1, 2025, 5:40 PM

#

Or maybe it's just python

real sierra Jun 1, 2025, 5:40 PM

#

to 15/3

hoary lion Jun 1, 2025, 5:40 PM

#

Python might be a bad language for ml

real sierra Jun 1, 2025, 5:41 PM

#

it gets 15/15 right

#

its ok, i believe in your code konii neuroHyperBounce

#

i probably just ported it wrong

#

why is it wrong tho Dentge

#

it looks right

hoary lion Jun 1, 2025, 5:44 PM

#

Maybe I should start a mini educative stuff

#

Like tensor manipulation 101

real sierra Jun 1, 2025, 5:45 PM

#

Caught found another bug

#

monkaOMEGA its working

#

https://gyazo.com/ed0383db7a20b3ceb4c51826393b774f.mp4

#

its actually very impressive

#

thank you for the help

#

do you wanna help me devise square root hmm

#

i was thinking about it earlier and i feel like theres a shortcut somewhere

#

because

#

log2 is easy* with binary
*precision not guaranteed

#

so i think you can just

#

log2(x^(1/2))
= (1/2)log2(x)

#

= shr(msb(x))

#

so like

#

i think if you just do 2^that

#

you get something resembling a square root?

#

but im not sure on the accuracy

#

log2 might be too imprecise

#

in fact log2 is only one of 16 possible values isnt it
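
A Python sketch of the log2 shortcut being discussed, to show how coarse it is. It assumes msb(x) means the index of the highest set bit and that x is a non-zero 16-bit value; the function names are hypothetical.

```python
def msb(x: int) -> int:
    """Index of the highest set bit, i.e. floor(log2(x)) for x > 0."""
    return x.bit_length() - 1


def sqrt_estimate(x: int) -> int:
    """2 ** (msb(x) >> 1): halve the integer log2, then exponentiate again."""
    if x <= 0:
        raise ValueError("x must be positive")
    return 1 << (msb(x) >> 1)


for x in (15, 16, 255, 1000):
    print(x, sqrt_estimate(x))
```

The estimate is always a power of two, and the true square root lies between the estimate and twice the estimate. That makes it a reasonable starting bracket for a binary search rather than a finished answer.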

#

bin search time salute

#

YES

#

ALU supports add, sub, inc, dec, and, or, xor, not

#

just added code for n-bit shifts to left/right

#

left, by far

#

do you want to review it neurOMEGALUL

#

if not then it's the only algorithm, which makes it optimal

#

neuroEZ

#

salute anyone else using this chat

#

# math_shl(x) returns x left-shifted by one position.
function math_shl 1;
    push_arg 0;
    push_arg 0;
    add;
    return;
    
# math_shln(x, n) returns x left-shifted by n positions.
function math_shln 2;
    push_arg 1;
    pop_d;
    if_else_d JEQ math_shln_done math_shln_go;
    @ math_shln_done
        push_arg 0;
        return;
    @ math_shln_go
    push_arg 0;
    push_value 1;
    push_arg 1;
    sub;
    call math_shln;
    push_retval;
    push_retval;
    add;
    return;

# math_shr(x) returns x right-shifted by one position.
function math_shr 1;
    push_arg 0;
    push_value 1;
    push_value 2;
    call math_shr_recursive;
    push_retval;
    return;
    
# math_shrn(x, n) returns x right-shifted by n positions.
function math_shrn 2;
    push_arg 0;
    push_value 1;
    push_value 1;
    push_arg 1;
    call math_shln;
    push_retval;
    call math_shr_recursive;
    push_retval;
    return;

# internal backing call for math_shr(x)
# math_shr_recursive(v, mask_out, mask_in) performs a right shift by copying bits using masks.
function math_shr_recursive 3;
    push_arg 2;
    pop_d;
    if_else_d JGE math_shr_recursive_if math_shr_recursive_else
    @ math_shr_recursive_if
        push_arg 0;
        push_arg 1;
        push_arg 1;
        add;
        push_arg 2;
        push_arg 2;
        add;
        call math_shr_recursive;
        push_retval;
        goto math_shr_recursive_end;
    @ math_shr_recursive_else
        push_value 0;
        goto math_shr_recursive_end;
    @ math_shr_recursive_end
    push_arg 0;
    push_arg 2;
    and;
    pop_d;
    if_else_d JNE math_shr_recursive_shift math_shr_recursive_ret
    @ math_shr_recursive_shift
        push_arg 1;
        or;
    @ math_shr_recursive_ret
    return;

#

yeah thats what i have mhm

#

NOWAYING

#

it cant be that easy

#

surely

#

ok let me port this

#

what's the i stand for

#

ICANT i see

#

like, loading a literal into memory somewhere?

#

literals can be put into A register in one instruction
it takes 7 instructions to push a literal onto the stack

#

salute the stack is all software too unfortunately

#

still porting

#

POGGIES

#

wtf it works

#

your code was so nice and concise

#

i feel like my port doesnt do it justice

#

# math_sqrt(x) returns the square root of x.
function math_sqrt 1;
    push_arg 0;
    push_value 1;
    push_value 0;
    call math_sqrt_recursive;
    push_retval;
    return;
    
# internal backing call for math_sqrt(x)
# math_sqrt_recursive(n, odd, count)
function math_sqrt_recursive 3;
    push_arg 0;
    push_arg 1;
    sub;
    pop_d;
    if_else_d JGT math_sqrt_recursive_break math_sqrt_recursive_go;
    @ math_sqrt_recursive_break
        push_arg 2;
        return;
    @ math_sqrt_recursive_go
    push_arg 1;
    push_arg 0;
    sub;
    push_arg 1;
    pop_d;
    LD INC_D;
    LD INC_D;
    push_d;
    push_arg 2;
    pop_d;
    LD INC_D;
    push_d;
    call math_sqrt_recursive;
    push_retval;
    return;

#

even looking at this, i cant figure out how its working

#

oh

#

i see it now

#

clever
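
The trick in the port above seems to be that the first k odd numbers sum to k squared, so counting how many successive odd numbers can be subtracted from n gives the integer square root. A Python sketch of the same idea written as a loop, with parameter names borrowed from the math_sqrt_recursive comment; treat it as an illustration, not a line-by-line translation.

```python
def isqrt_by_odds(n: int) -> int:
    """Floor of the square root of n >= 0, found by subtracting 1, 3, 5, ..."""
    if n < 0:
        raise ValueError("n must be non-negative")
    odd, count = 1, 0
    while n >= odd:
        n -= odd      # remove the next odd number
        odd += 2      # two increments, like the paired INC_D in the port
        count += 1    # one more odd number subtracted
    return count


print([isqrt_by_odds(n) for n in (0, 1, 15, 16, 17, 99, 100)])
```

It takes about sqrt(n) iterations, so at most 255 steps for a 16-bit input.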

#

# math_shr(x) returns x right-shifted by one position.
function math_shr 1;
    push_arg 0;
    push_value 1;
    push_value 2;
    call math_shr_recursive;
    push_retval;
    return;
    
# math_shrn(x, n) returns x right-shifted by n positions.
function math_shrn 2;
    push_arg 0;
    push_value 1;
    push_value 1;
    push_arg 1;
    call math_shln;
    push_retval;
    call math_shr_recursive;
    push_retval;
    return;

# internal backing call for math_shr(x)
# math_shr_recursive(v, mask_out, mask_in) performs a right shift by copying bits using masks.
function math_shr_recursive 3;
    push_arg 2;
    pop_d;
    if_else_d JGE math_shr_recursive_if math_shr_recursive_else
    @ math_shr_recursive_if
        push_arg 0;
        push_arg 1;
        push_arg 1;
        add;
        push_arg 2;
        push_arg 2;
        add;
        call math_shr_recursive;
        push_retval;
        goto math_shr_recursive_end;
    @ math_shr_recursive_else
        push_value 0;
        goto math_shr_recursive_end;
    @ math_shr_recursive_end
    push_arg 0;
    push_arg 2;
    and;
    pop_d;
    if_else_d JNE math_shr_recursive_shift math_shr_recursive_ret
    @ math_shr_recursive_shift
        push_arg 1;
        or;
    @ math_shr_recursive_ret
    return;

#

it just goes thru the high bits of the input one bit at a time

#

and copies it to the output

#

crazy how bad shr is via software when its literally just a few wires in hardware
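
The mask-copying idea can be modelled in a few lines of Python. This sketch assumes 16-bit unsigned words and uses a loop instead of recursion; mask_in and mask_out mirror the names in the comment on math_shr_recursive, and everything else is hypothetical.

```python
def shr_by_masks(value: int, n: int = 1, width: int = 16) -> int:
    """Logical right shift by n, built only from AND, OR and doubling the masks."""
    result = 0
    mask_out = 1          # bit being written in the result
    mask_in = 1 << n      # bit being read from the input, n positions higher
    while mask_in < (1 << width):
        if value & mask_in:
            result |= mask_out
        mask_out += mask_out   # doubling stands in for a left shift by one
        mask_in += mask_in
    return result


print(hex(shr_by_masks(0x8000)), hex(shr_by_masks(0x1234, 4)))
```

Each output bit costs one test and one OR, which is why a shift that is free in wiring turns into a loop over the whole word in software.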

#

division is definitely the largest function here

#

its worse than sqrt somehow

#

im amazed any of these work honestly

#

i was kinda expecting to be working on math functions for ages

#

so i didnt plan what i wanted to do with them

#

doom glueless

trim valve Jun 1, 2025, 6:28 PM

#

glueless

prime ridge Jun 1, 2025, 6:28 PM

#

olive sable 3% in 14 hours. i dont trust that 150 hours left for a second. at this rate it w...

That training doesn't have enough data for a model that size

real sierra Jun 1, 2025, 6:29 PM

#

surely doom fits in my 16 bits of addressable memory

prime ridge Jun 1, 2025, 6:29 PM

#

I'm still zipping all the new data

#

idk why it's taking so long

olive sable Jun 1, 2025, 6:29 PM

#

neuro7

#

i was just testing on my side

#

looks like batch size 16 is fine

#

ran for 15 hours without going over 24GB vram

real sierra Jun 1, 2025, 6:30 PM

#

on this instruction set?

#

susge

old totem Jun 1, 2025, 6:31 PM

#

chat how do you feel

real sierra Jun 1, 2025, 6:32 PM

#

old totem chat how do you feel

how do YOU feel

olive sable Jun 1, 2025, 6:32 PM

#

prime ridge That training doesn't have enough data for a model that size

i stopped it at step 59125
btw, what about the epochs? they said 10 is too much and it should be 1 or less

old totem Jun 1, 2025, 6:32 PM

#

real sierra on this instruction set?

hello shiro neuroAYAYA

#

long time no see

maiden geyser Jun 1, 2025, 6:32 PM

#

old totem chat how do you feel

prime ridge Jun 1, 2025, 6:32 PM

#

olive sable i stopped it at step 59125 btw, what about the epochs? they said 10 is too much ...

should be 1

old totem Jun 1, 2025, 6:32 PM

#

real sierra how do YOU feel

brain blast

prime ridge Jun 1, 2025, 6:32 PM

#

don't train it rn

olive sable Jun 1, 2025, 6:32 PM

#

how do i feel about what?

prime ridge Jun 1, 2025, 6:32 PM

#

ur wasting gpu time

old totem Jun 1, 2025, 6:32 PM

#

maiden geyser

type shit fr

real sierra Jun 1, 2025, 6:32 PM

#

old totem hello shiro neuroAYAYA

neuroWaveA always nice to see you :>

prime ridge Jun 1, 2025, 6:32 PM

#

that's the old code

olive sable Jun 1, 2025, 6:32 PM

#

ye i stopped it already

#

you've changed the code again?

prime ridge Jun 1, 2025, 6:33 PM

#

The data is still zipping. It's so slow

#

it's been zipping for almost like 5 days

#

it's zipping like 5 files a second

old totem Jun 1, 2025, 6:34 PM

#

maiden geyser

i might fall in this rabbit hole i should NOT get used to

nocturne olive Jun 1, 2025, 6:34 PM

#

prime ridge it's been zipping for almost like 5 days

Concatenate the data files and zip it in like an hour instead
You'll save way more time than you'd lose by stopping it now

prime ridge Jun 1, 2025, 6:34 PM

#

If sam was on linux i'd just do a tar

#

woulda taken like a single day max

#

but .zip is so horrible

nocturne olive Jun 1, 2025, 6:35 PM

#

prime ridge but .zip is so horrible

Then use 7z or something?

prime ridge Jun 1, 2025, 6:35 PM

#

I don't have 7z

#

also IO is super slow because it's on an external usb drive

nocturne olive Jun 1, 2025, 6:35 PM

#

Also the 7zip tool can basically handle almost all packed formats

real sierra Jun 1, 2025, 6:35 PM

#

maiden geyser

mf chief() {
  mf ts be cap rn
  ts fr vibin fr yikes rn
  sussin (ts finna cap) {
    ts deadass ongod rn
  }
  yeet rn
}

nocturne olive Jun 1, 2025, 6:36 PM

#

nocturne olive Also the 7zip tool can basically handle almost all packed formats

Just throw it into a tar if you want, Sam can probably easily unpack it

prime ridge Jun 1, 2025, 6:36 PM

#

alr bet

nocturne olive Jun 1, 2025, 6:36 PM

#

But also just don't make your dataset like 30 billion small files
Concatenate them

olive sable Jun 1, 2025, 6:37 PM

#

7 zip can extract tar no?

prime ridge Jun 1, 2025, 6:37 PM

#

bit late for that lol

nocturne olive Jun 1, 2025, 6:37 PM

#

olive sable 7 zip can extract tar no?

It can extract basically anything

tender river Jun 1, 2025, 6:37 PM

#

real sierra # math_sqrt(x) returns the square root of x. function math_sqrt 1; pus...

ideally you'd implement it like this

function sqrt 1;
push_value 0
push_value 1
push_arg 0
push_value 1
sub
pop_d
if jge @loop @exit
@loop
  push_value 2
  add
  ; ??? somehow increment second last value on stack by 1
  ???????
  push_d
  ; ?????? i need to push the second last value on stack to stack again how do i do it
  ??????
  jmp @loop
@exit
pop_d
return

real sierra Jun 1, 2025, 6:37 PM

#

but i thought i was the best shr cryign

nocturne olive Jun 1, 2025, 6:37 PM

#

prime ridge bit late for that lol

Why would it be? You can just write a Python script to read all the contents into a single file, no?

prime ridge Jun 1, 2025, 6:37 PM

#

I already have like 100 million files in a single directory

olive sable Jun 1, 2025, 6:37 PM

#

catdespair

#

100 million files is def gonna take a couple hours to extract, maybe even a full day

#

could you remind me how much storage i need?

real sierra Jun 1, 2025, 6:38 PM

#

tender river ideally you'd implement it like this function sqrt 1; push_value 0 push_valu...

POGGIES new sqrt dropped

trim valve Jun 1, 2025, 6:38 PM

#

prime ridge I already have like 100 million files in a single directory

I have made this mistake before and may or may not have output a single 500MB json file

prime ridge Jun 1, 2025, 6:38 PM

#

I mean I cancelled it

prime ridge Jun 1, 2025, 6:38 PM

#

olive sable could you remind me how much storage i need?

120gb

#

most of that is file metadata

olive sable Jun 1, 2025, 6:38 PM

#

oh, thats way less than i was expecting

tender river Jun 1, 2025, 6:38 PM

#

real sierra POGGIES new sqrt dropped

its old

prime ridge Jun 1, 2025, 6:38 PM

#

like 80% of it is metadata lmfao

real sierra Jun 1, 2025, 6:38 PM

#

; ?????? i need to push the second last value on stack to stack again how do i do it

#

https://tenor.com/view/thats-the-neat-part-you-dont-invincible-gif-27194608

tender river Jun 1, 2025, 6:39 PM

#

real sierra https://tenor.com/view/thats-the-neat-part-you-dont-invincible-gif-27194608

can a function return multiple values on stack or would that mess up the return code

real sierra Jun 1, 2025, 6:39 PM

#

tender river can a function return multiple values on stack or would that mess up the return ...

the way it's written now, only single returns

prime ridge Jun 1, 2025, 6:40 PM

#

atp I might just have to retokenize 😭

#

ughhhhh

trim valve Jun 1, 2025, 6:40 PM

#

tbf 100m files shouldn't be that bad to process into smaller files

tender river Jun 1, 2025, 6:40 PM

#

real sierra the way it's written now, only single returns

neuro7 makes sense you use recursion everywhere its the only way to have more than a couple variables

real sierra Jun 1, 2025, 6:40 PM

#

YES

#

recursion my beloved

trim valve Jun 1, 2025, 6:41 PM

#

I mean the number 100 million and python should rarely be in the same sentence

real sierra Jun 1, 2025, 6:41 PM

#

Tomfoolery stack frames? you mean variables?

trim valve Jun 1, 2025, 6:41 PM

#

but still should be doable

prime ridge Jun 1, 2025, 6:41 PM

#

not even python is the issue

#

the IO is insanely slow

#

it's on an archive external drive

#

15 mb/s writes

nocturne olive Jun 1, 2025, 6:41 PM

#

prime ridge I already have like 100 million files in a single directory

Easy:

File("out.txt").bufferedWriter().use { writer ->
  for (file in (File("path/to/files").listFiles())!!) {
    writer.write(file.readText())
  }
}

trim valve Jun 1, 2025, 6:41 PM

#

prime ridge 15 mb/s writes

catdespair

tender river Jun 1, 2025, 6:41 PM

#

real sierra recursion my beloved

you wont need it if you implement arbitrary stack offsets neuroPogHD or multiple return at least

real sierra Jun 1, 2025, 6:41 PM

#

is there a quicker way to do squaring than multiplication

trim valve Jun 1, 2025, 6:42 PM

#

is this like trying hard to max out i/o speeds

prime ridge Jun 1, 2025, 6:42 PM

#

The kernel is having a heart attack and the drive is on its deathbed

real sierra Jun 1, 2025, 6:42 PM

#

tender river you wont need it if you implement arbitrary stack offsets <:neuroPogHD:105777879...

i think malloc is probably in my future somewhere

#

then you can return a pointer to a struct

trim valve Jun 1, 2025, 6:42 PM

#

neuroSob

tender river Jun 1, 2025, 6:42 PM

#

malloc is useless

real sierra Jun 1, 2025, 6:42 PM

#

i mean

nocturne olive Jun 1, 2025, 6:43 PM

#

nocturne olive Easy: File("out.txt").bufferedWriter().use { writer -> for (file in (Fil...

This is all the code required to read the contents of all files in a directory and write them all into a new file

real sierra Jun 1, 2025, 6:43 PM

#

im more just looking for the notion of abstracting away ram management

#

then my code can just say "yeah i need this much memory for my crap"

tender river Jun 1, 2025, 6:43 PM

#

its called stack

prime ridge Jun 1, 2025, 6:43 PM

#

nocturne olive Easy: File("out.txt").bufferedWriter().use { writer -> for (file in (Fil...

Yeah but how would the code delimit new files?

real sierra Jun 1, 2025, 6:43 PM

#

perish the stack

prime ridge Jun 1, 2025, 6:43 PM

#

it's not text or anything it's a .npz file of integers

real sierra Jun 1, 2025, 6:43 PM

#

chayleaf please i want malloc

trim valve Jun 1, 2025, 6:43 PM

#

hm

real sierra Jun 1, 2025, 6:43 PM

#

it will give you multiple return

#

albeit indirectly

nocturne olive Jun 1, 2025, 6:44 PM

#

prime ridge Yeah but how would the code delimit new files?

You can just add a prefix/postfix to the write command

tender river Jun 1, 2025, 6:44 PM

#

❌ no malloc allowed

#

malloc bad

real sierra Jun 1, 2025, 6:44 PM

#

NOOO

#

ReallyMad

trim valve Jun 1, 2025, 6:44 PM

#

honestly I mildly want to suggest sqlite but I don't remember if that can efficiently handle binary data

real sierra Jun 1, 2025, 6:44 PM

#

@sage crag write malloc

tender river Jun 1, 2025, 6:44 PM

#

i wrote a garbage collector recently neuroPogHD

real sierra Jun 1, 2025, 6:44 PM

#

ill just call part of ram the heap

prime ridge Jun 1, 2025, 6:44 PM

#

also can't be read as text since it's binary

real sierra Jun 1, 2025, 6:44 PM

#

its fine

nocturne olive Jun 1, 2025, 6:45 PM

#

prime ridge also can't be read as text since it's binary

Then read binary and add delimiting bytes where you need them

tender river Jun 1, 2025, 6:45 PM

#

real sierra ill just call part of ram the heap

📎 gc.hb

real sierra Jun 1, 2025, 6:45 PM

#

other than the stack, heap, and a tiny address range for my own stuff, nobody else should be touching ram

#

so it should be fine

trim valve Jun 1, 2025, 6:45 PM

#

otherwise diy a binary archive format that's like
uint(total length),uint(metadata length),bytearray(metadata),bytearray(data)

real sierra Jun 1, 2025, 6:45 PM

#

tender river

monkaOMEGA

prime ridge Jun 1, 2025, 6:45 PM

#

nocturne olive Then read binary and add delimiting bytes where you need them

nah but like the code needs to load these files independently anyways

nocturne olive Jun 1, 2025, 6:45 PM

#

prime ridge nah but like the code needs to load these files independently anyways

Why? It seems like your approach completely and utterly sucks

real sierra Jun 1, 2025, 6:45 PM

#

i cant do garbage collection

prime ridge Jun 1, 2025, 6:45 PM

#

can't just load 120gb into vram 😭

trim valve Jun 1, 2025, 6:45 PM

#

??

prime ridge Jun 1, 2025, 6:46 PM

#

nocturne olive Why? It seems like your approach completely and utterly sucks

it'll work but it's slow asf

real sierra Jun 1, 2025, 6:46 PM

#

my performance will never survive

#

its malloc or nothing

tender river Jun 1, 2025, 6:46 PM

#

real sierra its malloc or nothing

malloc is just a worse version of garbage collection

nocturne olive Jun 1, 2025, 6:46 PM

#

prime ridge can't just load 120gb into vram 😭

Most of that is probably filesystem overhead instead of actual data if most files are under 4KB

prime ridge Jun 1, 2025, 6:46 PM

#

atp shipping the drive internationally seems like the move

trim valve Jun 1, 2025, 6:46 PM

#

apollo can I just double check my understanding of the situation rq

prime ridge Jun 1, 2025, 6:46 PM

#

nocturne olive Most of that is probably filesystem overhead instead of actual data if most file...

it's 16 billion tokens

trim valve Jun 1, 2025, 6:47 PM

#

you have:

100m tiny files
stored on a slow drive
you want:
some easy way to transfer all of these files to sam

tender river Jun 1, 2025, 6:47 PM

#

tender river malloc is just a worse version of garbage collection

its like GC except you dont know what the user code will do so you have to do all sorts of tricks to properly allocate and deallocate memory

nocturne olive Jun 1, 2025, 6:47 PM

#

prime ridge it's 16 billion tokens

And is each token a single Int32?

prime ridge Jun 1, 2025, 6:47 PM

#

yeah

#

so a lotttt of overhead

#

I think it's actually more like 200 or 300 on the drive

#

it takes a long time to even check
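
Back-of-the-envelope numbers for the figures quoted above, as a Python sketch. The token count, the file count and 4 bytes per token come from the chat and are approximate; the 4 KiB allocation unit is an assumption about the filesystem, and per-file .npz container overhead is ignored.

```python
def dataset_footprint(tokens: int, files: int, bytes_per_token: int = 4,
                      block_size: int = 4096) -> dict[str, float]:
    """Raw payload size versus a lower bound on disk usage for many small files."""
    raw_bytes = tokens * bytes_per_token
    avg_file_bytes = raw_bytes / files
    # Each file occupies at least one allocation unit, however small it is.
    blocks_per_file = max(1, -(-int(avg_file_bytes) // block_size))
    return {
        "raw_gb": raw_bytes / 1e9,
        "avg_file_bytes": avg_file_bytes,
        "min_on_disk_gb": files * blocks_per_file * block_size / 1e9,
    }


stats = dataset_footprint(tokens=16_000_000_000, files=100_000_000)
print(stats)
```

Under these assumptions the token data itself is about 64 GB, while the same data spread over 100 million small files could occupy several times that, which is consistent with the point about filesystem overhead.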

tender river Jun 1, 2025, 6:48 PM

#

real sierra its fine

just have two stacks, one for functions and one for values, that should solve any problems with not being able to use more than one variable, you wont even need an args pointer anymore

trim valve Jun 1, 2025, 6:48 PM

#

evilShrug if you want me to try throwing together a terrible rust program with far too much multithreading I'd be happy to give it a shot if you can give me an output format

prime ridge Jun 1, 2025, 6:48 PM

#

I cancelled the check at like 120 gb

nocturne olive Jun 1, 2025, 6:49 PM

#

prime ridge yeah

Then you'd have an exceptionally easy time concatenating them all into a single big binary file while keeping separation, just do

collection of chunk size 4-byte Int32s
repeat for every file
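
One way to sketch the length-prefixed layout suggested above, in Python. The record format here is an assumption chosen for illustration: a 4-byte little-endian byte count followed by the file's raw bytes, repeated for every file. The function names are hypothetical, and the demo writes throwaway files to a temporary directory rather than touching real data.

```python
import io
import struct
import tempfile
from pathlib import Path
from typing import BinaryIO, Iterable, Iterator

LENGTH = struct.Struct("<I")  # 4-byte little-endian unsigned length prefix


def pack_files(paths: Iterable[Path], out: BinaryIO) -> int:
    """Append each file to out as (length, payload); returns the record count."""
    count = 0
    for path in paths:
        payload = path.read_bytes()          # binary read, no text decoding
        out.write(LENGTH.pack(len(payload)))
        out.write(payload)
        count += 1
    return count


def iter_records(src: BinaryIO) -> Iterator[bytes]:
    """Yield each payload back out, one record at a time."""
    while True:
        header = src.read(LENGTH.size)
        if not header:
            return                           # clean end of archive
        if len(header) < LENGTH.size:
            raise ValueError("truncated length prefix")
        (size,) = LENGTH.unpack(header)
        payload = src.read(size)
        if len(payload) < size:
            raise ValueError("truncated record")
        yield payload


def roundtrip_demo() -> list[bytes]:
    """Pack three small temporary files and read them back in order."""
    with tempfile.TemporaryDirectory() as tmp:
        paths = []
        for i, data in enumerate([b"\x01\x00\x00\x00", b"", b"abc"]):
            path = Path(tmp) / f"chunk_{i}.bin"
            path.write_bytes(data)
            paths.append(path)
        archive = io.BytesIO()
        pack_files(paths, archive)
        archive.seek(0)
        return list(iter_records(archive))


print(roundtrip_demo())
```

Because records are read one at a time, the loader never has to hold the whole archive in memory, so the files can still be consumed independently.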

prime ridge Jun 1, 2025, 6:49 PM

#

trim valve you have: - 100m tiny files - stored on a slow drive you want: - some easy way t...

yeah

trim valve Jun 1, 2025, 6:49 PM

#

yeah then unless you just give sam a disk image of the drive you're gonna have to cope with it taking a while to do loads of small reads

#

is it a flash drive or spinning metal

prime ridge Jun 1, 2025, 6:49 PM

#

loading 100m files will still take a while but I guess I gotta just bite the bullet atp

olive sable Jun 1, 2025, 6:50 PM

#

on my side everything should be fine, nvme ssd and stuff.
even if there is not enough space there i also have 2 empty sata ssd's

prime ridge Jun 1, 2025, 6:50 PM

#

trim valve is it a flash drive or spinning metal

it's an external usb drive

prime ridge Jun 1, 2025, 6:50 PM

#

olive sable on my side everything should be fine, nvme ssd and stuff. even if there is not e...

nah ur golden dw

#

it isnt even that much data tbh

#

im just a bit stupid

#

mostly lazy actually

nocturne olive Jun 1, 2025, 6:50 PM

#

Making your dataset 100 million small binary files rather than one big raw text file and including the tokenizer in your trainer code seems stupid

real sierra Jun 1, 2025, 6:50 PM

#

tender river just have two stacks, one for functions and one for values, that should solve an...

HOLY two stacks

prime ridge Jun 1, 2025, 6:50 PM

#

nocturne olive Making your dataset 100 million small binary files rather than one big raw text ...

I mean it was a lot easier to implement at the time

trim valve Jun 1, 2025, 6:50 PM

#

nocturne olive Making your dataset 100 million small binary files rather than one big raw text ...

everyone does stupid things, you have to learn somehow lol

prime ridge Jun 1, 2025, 6:51 PM

#

not a lot easier

#

but slightly

nocturne olive Jun 1, 2025, 6:51 PM

#

prime ridge I mean it was a lot easier to implement at the time

Just send the raw data and tokenize it at the target

tender river Jun 1, 2025, 6:51 PM

#

real sierra HOLY two stacks

you can have one grow from the top of the addressable memory and one from the bottom
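
A rough sketch of that idea, assuming a flat memory array where one stack grows up from address 0 and the other grows down from the top. The class name `TwoStacks` and its methods are made up for illustration; the real machine would do this with two stack-pointer registers rather than a Python object.

```python
class TwoStacks:
    """Two stacks sharing one block of memory, growing toward each other."""

    def __init__(self, size):
        self.mem = [0] * size
        self.lo = 0         # next free slot of the stack growing upward
        self.hi = size - 1  # next free slot of the stack growing downward

    def _check_space(self):
        # The stacks collide once the two free-slot pointers cross.
        if self.lo > self.hi:
            raise MemoryError("stacks collided")

    def push_lo(self, value):
        self._check_space()
        self.mem[self.lo] = value
        self.lo += 1

    def push_hi(self, value):
        self._check_space()
        self.mem[self.hi] = value
        self.hi -= 1

    def pop_lo(self):
        if self.lo == 0:
            raise IndexError("lower stack is empty")
        self.lo -= 1
        return self.mem[self.lo]

    def pop_hi(self):
        if self.hi == len(self.mem) - 1:
            raise IndexError("upper stack is empty")
        self.hi += 1
        return self.mem[self.hi]
```

The appeal of this layout is that neither stack needs a fixed size up front: either one can use all the free space until the two pointers meet.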

prime ridge Jun 1, 2025, 6:51 PM

#

I will I will

real sierra Jun 1, 2025, 6:51 PM

#

heap PRAYING heap PRAYING heap PRAYING heap PRAYING

trim valve Jun 1, 2025, 6:51 PM

#

trim valve evilShrug if you want me to try throwing to throw togethe...

this offer still stands

prime ridge Jun 1, 2025, 6:51 PM

#

I know the int32 values are 0-50000 inclusive

real sierra Jun 1, 2025, 6:51 PM

#

CJcomputer

trim valve Jun 1, 2025, 6:52 PM

#

oh damn you could use u16s then

prime ridge Jun 1, 2025, 6:52 PM

#

so ig I just use 4294967296 as the delim?

real sierra Jun 1, 2025, 6:52 PM

#

pov: stacks fighting over 0x0FFF

trim valve Jun 1, 2025, 6:52 PM

#

half the filesize glueless

olive sable Jun 1, 2025, 6:52 PM

#

my wifi is bwaadow

prime ridge Jun 1, 2025, 6:52 PM

#

I could indeed

nocturne olive Jun 1, 2025, 6:52 PM

#

prime ridge so ig I just use 4294967296 as the delim?

Yeah, that'd work if you know the maximum allocated range of the actual data
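
One caveat on the number itself: 4294967296 is 2^32, which is one more than the largest unsigned 32-bit value (4294967295), and a signed Int32 tops out at 2147483647, so it would not fit in a 4-byte slot. A sketch of the same delimiter idea combined with the u16 suggestion, assuming tokens are in 0-50000 and using 65535 as a sentinel; `DELIM`, `encode_docs` and `decode_docs` are hypothetical names, and little-endian byte order is an assumption.

```python
import struct

# Hypothetical sentinel: the largest uint16, safely outside the 0..50000 token range.
DELIM = 0xFFFF


def encode_docs(docs):
    """Pack lists of token ids as little-endian uint16, one DELIM after each list."""
    out = bytearray()
    for tokens in docs:
        if any(t < 0 or t >= DELIM for t in tokens):
            raise ValueError("token does not fit in uint16 or collides with DELIM")
        out += struct.pack(f"<{len(tokens)}H", *tokens)
        out += struct.pack("<H", DELIM)
    return bytes(out)


def decode_docs(blob):
    """Inverse of encode_docs: split the uint16 stream on DELIM."""
    values = struct.unpack(f"<{len(blob) // 2}H", blob)
    docs, current = [], []
    for v in values:
        if v == DELIM:
            docs.append(current)
            current = []
        else:
            current.append(v)
    return docs
```

Each token costs 2 bytes instead of 4, and each document costs one extra 2-byte delimiter.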

prime ridge Jun 1, 2025, 6:52 PM

#

Can same use gzip?

hoary lion Jun 1, 2025, 6:52 PM

#

Bwaa

trim valve Jun 1, 2025, 6:52 PM

#

probably

olive sable Jun 1, 2025, 6:52 PM

#

probably

prime ridge Jun 1, 2025, 6:52 PM

#

sam

trim valve Jun 1, 2025, 6:53 PM

#

7zip handles most stuff

prime ridge Jun 1, 2025, 6:53 PM

#

alr cool

#

fuck it gimme 10 min

nocturne olive Jun 1, 2025, 6:53 PM

#

trim valve 7zip handles most stuff

Yeah, it's a great software

olive sable Jun 1, 2025, 6:53 PM

#

bwaa

prime ridge Jun 1, 2025, 6:53 PM

#

oh it's only 8 million files

#

we chillin

#

I forgot I changed ctx

tender river Jun 1, 2025, 6:53 PM

#

real sierra pov: stacks fighting over 0x0FFF

if you have two stacks you can implement forth neuroPogHD
https://www.forth.com/wp-content/uploads/2018/11/thinking-forth-color.pdf

real sierra Jun 1, 2025, 6:54 PM

#

oh god

olive sable Jun 1, 2025, 6:54 PM

#

prime ridge oh it's only 8 million files

rip 87.5% of the dataset neuro7

real sierra Jun 1, 2025, 6:54 PM

#

instant headache upon seeing the typeface

tender river Jun 1, 2025, 6:54 PM

#

real sierra instant headache upon seeing the typeface

you havent seen https://www.forth.com/wp-content/uploads/2018/01/Starting-FORTH.pdf

prime ridge Jun 1, 2025, 6:55 PM

#

olive sable rip 87.5% of the dataset neuro7

no no no it's fine. It's still 16 billion tokens

#

that's enough to train comfortably

olive sable Jun 1, 2025, 6:55 PM

#

ah ok

trim valve Jun 1, 2025, 6:55 PM

#

trim valve I have made this mistake before and may or may not have output a single 500MB js...

the slowest part of this was saving the json somehow, not actually opening the files

prime ridge Jun 1, 2025, 6:55 PM

#

just a larger ctx

real sierra Jun 1, 2025, 6:55 PM

#

tender river you havent seen https://www.forth.com/wp-content/uploads/2018/01/Starting-FORTH....

CatDespair what the hell

#

im not reading all that

prime ridge Jun 1, 2025, 6:55 PM

#

trim valve the slowest part of this was saving the json somehow, not actually opening the f...

oh hellll no

real sierra Jun 1, 2025, 6:55 PM

#

malloc()

prime ridge Jun 1, 2025, 6:55 PM

#

that's not even bad

real sierra Jun 1, 2025, 6:55 PM

#

free()

#

realloc()

#

perfect

prime ridge Jun 1, 2025, 6:55 PM

#

one of my json files was 50gb

trim valve Jun 1, 2025, 6:55 PM

#

smh

prime ridge Jun 1, 2025, 6:55 PM

#

90% was wasted metadata

rare bramble Jun 1, 2025, 6:56 PM

#

how does a json file get that big ICANT

prime ridge Jun 1, 2025, 6:56 PM

#

it was from the dataset so not even my fault

hoary lion Jun 1, 2025, 6:56 PM

#

Io time

olive sable Jun 1, 2025, 6:56 PM

#

as long as it doesnt have to train for more than a month im good.
im going on vacation again on the 28th and i think it would be nice if it finished by then

tender river Jun 1, 2025, 6:56 PM

#

real sierra perfect

surely a memory allocator is like a couple hundred lines of assembly code glueless

trim valve Jun 1, 2025, 6:56 PM

#

rare bramble how does a json file get that big ICANT

people using json because its easy to use

prime ridge Jun 1, 2025, 6:56 PM

#

olive sable as long as it doesnt have to train for more than a month im good. im going on va...

realistically a max of 2 weeks

olive sable Jun 1, 2025, 6:56 PM

#

NODDERS

trim valve Jun 1, 2025, 6:56 PM

#

without thinking about the problems it brings

nocturne olive Jun 1, 2025, 6:56 PM

#

prime ridge just a larger ctx

Welp, gotta recalculate the batch size again

tender river Jun 1, 2025, 6:56 PM

#

tender river surely a memory allocator is like a couple hundred lines of assembly code <:glue...

and not a couple tens of thousands of C code glueless

prime ridge Jun 1, 2025, 6:56 PM

#

if it takes longer just save the checkpoint and i'll finish it

real sierra Jun 1, 2025, 6:56 PM

#

tender river surely a memory allocator is like a couple hundred lines of assembly code <:glue...

the rom and ram are separate 16-bit addressable spaces, so no worries there

real sierra Jun 1, 2025, 6:57 PM

#

tender river and not a couple tens of thousands of C code glueless

Aware ok maybe minor concern

prime ridge Jun 1, 2025, 6:57 PM

#

nocturne olive Welp, gotta recalculate the batch size again

I had to anyway cuz I implemented latent attention and flash attention

real sierra Jun 1, 2025, 6:57 PM

#

PagBounce

#

MALLOC

prime ridge Jun 1, 2025, 6:57 PM

#

oh yeah btw @olive sable u need to compile flash attention v2

olive sable Jun 1, 2025, 6:57 PM

#

okay

rare bramble Jun 1, 2025, 6:57 PM

#

trim valve people using json because its easy to use

neuroMad people should learn to use compressed data formats

tender river Jun 1, 2025, 6:57 PM

#

steal memory from the host PC and attach it to the virtual one neuroPogHD

prime ridge Jun 1, 2025, 6:57 PM

#

I'll find how to do that after I start fixing the ds

trim valve Jun 1, 2025, 6:57 PM

#

rare bramble neuroMad people should learn to use compressed data forma...

nah just build faster pcs with more memory

#

that solves all issues glueless

real sierra Jun 1, 2025, 6:58 PM

#

Sadgi

#

even rocks would have syscalls

tender river Jun 1, 2025, 6:58 PM

#

real sierra Sadgi

anyway forth is literally the simplest high level language and it doesnt require an allocator (trust me allocators are hard)

nocturne olive Jun 1, 2025, 6:59 PM

#

prime ridge I had to anyway cuz I implemented latent attention and flash attention

This whole LLM pretraining venture seems quite pointless
There's like 30 thousand LLMs you can just prompt to get whatever you want with much higher quality than any home-made LLM can, LLMs take way longer to train than even NeuroSynth, and LLMs are honestly way less interesting than NeuroSynth

prime ridge Jun 1, 2025, 6:59 PM

#

nocturne olive This whole LLM pretraining venture seems quite pointless There's like 30 thousan...

Not pointless

tender river Jun 1, 2025, 6:59 PM

#

you dont even have an allocator neurOMEGALUL just an arena

olive sable Jun 1, 2025, 6:59 PM

#

`If your machine has less than 96GB of RAM and lots of CPU cores, ninja might run too many parallel compilation jobs that could exhaust the amount of RAM. To limit the number of parallel compilation jobs, you can set the environment variable MAX_JOBS:`
hey thats me, i have 16 cores and less than 96GB of ram

nocturne olive Jun 1, 2025, 6:59 PM

#

prime ridge Not pointless

Well, what's the point then?

prime ridge Jun 1, 2025, 6:59 PM

#

Nobody wants a cringy LoRA tuned llm

tender river Jun 1, 2025, 6:59 PM

#

well it is an allocator

#

but it doesnt have free

rare bramble Jun 1, 2025, 7:00 PM

#

tender river anyway forth is literally the simplest high level language and it doesnt require...

you dont need an allocator, just write to random memory addresses and assume that no one else is using them at the time neuroEZ

real sierra Jun 1, 2025, 7:00 PM

#

tender river anyway forth is literally the simplest high level language and it doesnt require...

it looks really weird

nocturne olive Jun 1, 2025, 7:00 PM

#

prime ridge Nobody wants a cringy LoRA tuned llm

What's wrong with those?

real sierra Jun 1, 2025, 7:00 PM

#

neuroCross

prime ridge Jun 1, 2025, 7:00 PM

#

olive sable `If your machine has less than 96GB of RAM and lots of CPU cores, ninja might ru...

just set the workers to a normal amount

prime ridge Jun 1, 2025, 7:00 PM

#

nocturne olive What's wrong with those?

Extremely inefficient

trim valve Jun 1, 2025, 7:00 PM

#

olive sable `If your machine has less than 96GB of RAM and lots of CPU cores, ninja might ru...

glueless surely you're not going to use more than 6GB of ram per job

prime ridge Jun 1, 2025, 7:01 PM

#

requires instruction tuning and more parameters

#

You aren't going to see a 0.5b model outperform larger models without pretraining

nocturne olive Jun 1, 2025, 7:01 PM

#

prime ridge Extremely inefficient

Whar
It's wayyyy more efficient than home-pretraining a tiny model

olive sable Jun 1, 2025, 7:01 PM

#

tbh i have no clue how to compile flash attention V2.
cant i just pip install?

prime ridge Jun 1, 2025, 7:01 PM

#

nocturne olive Whar It's *wayyyy* more efficient than home-pretraining a tiny model

not per parameter

prime ridge Jun 1, 2025, 7:01 PM

#

olive sable tbh i have no clue how to compile flash attention V2. cant i just pip install?

Yes

#

pip will compile it for you

#

lemme find the package because I think it's specific

hoary lion Jun 1, 2025, 7:02 PM

#

I think he is talking about the vram

#

@nocturne olive

nocturne olive Jun 1, 2025, 7:02 PM

#

prime ridge You aren't going to see a 0.5b model outperform larger models without pretrainin...

Yeah, of course not
But you can just grab an off-the-shelf pretrained model and finetune that

prime ridge Jun 1, 2025, 7:02 PM

#

git clone https://github.com/Dao-AILab/flash-attention.git
cd flash-attention
pip install .

prime ridge Jun 1, 2025, 7:02 PM

#

nocturne olive Yeah, of course not But you can just grab an off-the-shelf pretrained model and ...

have fun running that locally 🤡

#

if you want a good local model. It has to be custom

nocturne olive Jun 1, 2025, 7:03 PM

#

prime ridge have fun running that locally 🤡

Llama 1B
Llama 8B
Llama 13B

prime ridge Jun 1, 2025, 7:03 PM

#

they suck

#

I already tried it

#

especially 1b

nocturne olive Jun 1, 2025, 7:03 PM

#

A home-made model is gonna suck even more

prime ridge Jun 1, 2025, 7:03 PM

#

Nope

olive sable Jun 1, 2025, 7:04 PM

#

NeuroDown

hoary lion Jun 1, 2025, 7:04 PM

#

@prime ridge there is a repo to convert any GQA to MLA
Use qwen 3B ish model and finetune it after converting it to MLA

rare bramble Jun 1, 2025, 7:04 PM

#

neuroLookUp

hoary lion Jun 1, 2025, 7:04 PM

#

Classic

prime ridge Jun 1, 2025, 7:04 PM

#

u need pytorch

hoary lion Jun 1, 2025, 7:04 PM

#

We hate torch dependency

olive sable Jun 1, 2025, 7:04 PM

#

i thought i had pytorch

hard shale Jun 1, 2025, 7:05 PM

#

olive sable i thought i had pytorch

maybe in another py-env

prime ridge Jun 1, 2025, 7:05 PM

#

yeah

nocturne olive Jun 1, 2025, 7:05 PM

#

And how would you know that your tiny model trained on amateur collected data can beat a model made by a big company with loads of data scientists cleaning up the dataset?

prime ridge Jun 1, 2025, 7:05 PM

#

hoary lion @prime ridge there is a repo to convert any GQA to MLA Use qwen 3B ish ...

sooooo many parameters are completely wasted

#

it doesn't need to know how to solve differential equations to chat like a human

hoary lion Jun 1, 2025, 7:06 PM

#

What

prime ridge Jun 1, 2025, 7:06 PM

#

literally burning compute

olive sable Jun 1, 2025, 7:06 PM

#

cuda 11.8 for 3090?

trim valve Jun 1, 2025, 7:06 PM

#

prime ridge it doesn't need to know how to solve differential equations to chat like a human

but I want my ai girlfriend to do my homework whilst rping !!!

nocturne olive Jun 1, 2025, 7:06 PM

#

olive sable cuda 11.8 for 3090?

CUDA 12 is recommended

#

CUDA 12.1 is my go-to usually

olive sable Jun 1, 2025, 7:06 PM

#

12.6 or 12.8?

prime ridge Jun 1, 2025, 7:06 PM

#

yeah should be cuda 12

hoary lion Jun 1, 2025, 7:07 PM

#

Yeah ofc it will lost some but it is better cram

nocturne olive Jun 1, 2025, 7:07 PM

#

olive sable 12.6 or 12.8?

Doesn't matter, it's backwards compatible anyway I think

hoary lion Jun 1, 2025, 7:07 PM

#

Much more efficient than starting from scratch imo

nocturne olive Jun 1, 2025, 7:07 PM

#

nocturne olive Doesn't matter, it's backwards compatible anyway I think

Though check what CUDA Torch is available for

hoary lion Jun 1, 2025, 7:08 PM

#

12.6 is stable

olive sable Jun 1, 2025, 7:08 PM

#

these are the options

prime ridge Jun 1, 2025, 7:08 PM

#

hoary lion Much efficient than starting from scratch imo

Duh. But once it's trained it will have exceptionally low memory usage while having the performance of models orders of magnitude better

hoary lion Jun 1, 2025, 7:08 PM

#

Okay lol good luck then

prime ridge Jun 1, 2025, 7:08 PM

#

Don't select linux

olive sable Jun 1, 2025, 7:08 PM

#

ye

#

fuck it, 12.6 it is

nocturne olive Jun 1, 2025, 7:09 PM

#

prime ridge Duh. But once it's trained it will have exceptionally low memory usage while hav...

"while having the performance of models orders of magnitude better"

hoary lion Jun 1, 2025, 7:09 PM

#

Tbh neither do I know the architecture, so can't really confirm "it would be magnitude better" in both perf and compute/mem

prime ridge Jun 1, 2025, 7:10 PM

#

Idk how being trained on wikipedia is supposed to improve the accuracy of a model meant to sound human

#

that's just not true

#

garbage in garbage out

olive sable Jun 1, 2025, 7:10 PM

#

i dont mind training it anyways, lets just do it

prime ridge Jun 1, 2025, 7:10 PM

#

exactly

nocturne olive Jun 1, 2025, 7:10 PM

#

nocturne olive > "while having the performance of models orders of magnitude better" <:Clueless...

Sure, it may require less compute and memory, but it'll overall not be better if the data is not significantly well curated

hoary lion Jun 1, 2025, 7:11 PM

#

~~in your dreams~~ oof
Can you learn english with discord messages, thats my question here

prime ridge Jun 1, 2025, 7:11 PM

#

It will learn to imitate the data

#

that's how ML works

nocturne olive Jun 1, 2025, 7:11 PM

#

olive sable i dont mind training it anyways, lets just do it

If only I had a 3090 like you so I could just train NeuroSynth models whenever I feel like without it taking 30 hours

hoary lion Jun 1, 2025, 7:11 PM

#

I think wiki is just for priming basic knowledge, being coherent

nocturne olive Jun 1, 2025, 7:12 PM

#

Here's my prediction:
the model will output English words, but without any sense behind them, and be unable to properly keep a conversation

hoary lion Jun 1, 2025, 7:12 PM

#

Imagine a super brainrotted llm spamming non existent emoji

#

neuroBwaa

trim valve Jun 1, 2025, 7:13 PM

#

hoary lion Imagine a super brainrotted llm spamming non existent emoji

me :3

hoary lion Jun 1, 2025, 7:13 PM

#

trim valve me :3

Us core

nocturne olive Jun 1, 2025, 7:13 PM

#

nocturne olive Here's my prediction: the model will output English words, but without any sense...

-# If I'm right, that'd be very silly

prime ridge Jun 1, 2025, 7:13 PM

#

who cares. It'd be hilarious anyways

#

plus I doubt it

#

a lot of the data comes from "smart people" servers too

nocturne olive Jun 1, 2025, 7:14 PM

#

Is your data in any way grammatically consistent?

prime ridge Jun 1, 2025, 7:14 PM

#

prolly 50%

prime ridge Jun 1, 2025, 7:14 PM

#

nocturne olive Is your data in any way grammatically consistent?

somewhat

#

it isnt an english major. It will sound natural

hoary lion Jun 1, 2025, 7:14 PM

#

Nahh it's done for
50% neurOMEGALUL

prime ridge Jun 1, 2025, 7:14 PM

#

it's like 70% brainrot and 30% coherent

nocturne olive Jun 1, 2025, 7:14 PM

#

By the way, if you're training off of Discord messages, you're breaking the Discord TOS and you should consider not doing that so you don't have the same happen to you as happened to Shapes

prime ridge Jun 1, 2025, 7:14 PM

#

worst case

prime ridge Jun 1, 2025, 7:15 PM

#

nocturne olive By the way, if you're training off of Discord messages, you're breaking the Disc...

All I see are numbers 0-50,000

#

doesn't seem like discord to me

nocturne olive Jun 1, 2025, 7:15 PM

#

What matters is what the numbers are derived from

prime ridge Jun 1, 2025, 7:15 PM

#

probably math

#

glueless

nocturne olive Jun 1, 2025, 7:16 PM

#

If it's numbers generated by putting Discord messages through math formulas, you're in trouble

real sierra Jun 1, 2025, 7:16 PM

#

hmm

#

doom seems hard to fit in this computer

prime ridge Jun 1, 2025, 7:16 PM

#

no ofc not

real sierra Jun 1, 2025, 7:16 PM

#

but bad apple could be possible...

trim valve Jun 1, 2025, 7:17 PM

#

glueless apollo you just have a very lucky random number generator right

nocturne olive Jun 1, 2025, 7:17 PM

#

prime ridge no ofc not

Surely
-# Well, have fun with the Discord lawyers

hoary lion Jun 1, 2025, 7:17 PM

#

Return back to slack glueless

prime ridge Jun 1, 2025, 7:17 PM

#

just rng bud

olive sable Jun 1, 2025, 7:17 PM

#

Ah yes, it is but it isnt the same

hard shale Jun 1, 2025, 7:18 PM

#

trim valve glueless apollo you just have a very lucky random number ...

~~pseudo~~ rngs are very useful

prime ridge Jun 1, 2025, 7:18 PM

#

https://tenor.com/view/ahoklollmao-noooo-nooo-noo-no-gif-17139315586294057944


trim valve Jun 1, 2025, 7:18 PM

#

rngs power my bogosort

#

still need to optimise that bad boy

hoary lion Jun 1, 2025, 7:18 PM

#

I hate manually managed rngs

prime ridge Jun 1, 2025, 7:18 PM

#

Bad apple is a gift from the gods im convinced

trim valve Jun 1, 2025, 7:18 PM

#

poor thing obliterates my cpu

hoary lion Jun 1, 2025, 7:18 PM

#

hoary lion I hate manually managed rngs

Jax you again

trim valve Jun 1, 2025, 7:18 PM

#

glueless and still runs awfully for some reason

#

you would think

#

but alas

rough bloom Jun 1, 2025, 7:19 PM

#

hoary lion Jax you again

jax.random.split neuroSCHIZO

prime ridge Jun 1, 2025, 7:19 PM

#

no need for llm. Just stream the rng

#

it's already accurate

trim valve Jun 1, 2025, 7:19 PM

#

I think I tried that and it was the same speed

#

evilShrug

#

it was fast enough

hoary lion Jun 1, 2025, 7:20 PM

#

Love when Shuni randomly spawns as i mention jax/flax

#

Who would be spawning if i mention tf

trim valve Jun 1, 2025, 7:21 PM

#

glueless

#

lemme check the speed rq

prime ridge Jun 1, 2025, 7:22 PM

#

DAMN it's fast

trim valve Jun 1, 2025, 7:22 PM

#

it is

olive sable Jun 1, 2025, 7:22 PM

#

my brother in christ, pytorch ***is*** installed

prime ridge Jun 1, 2025, 7:22 PM

#

90 min to encode the entire dataset of pseudo random numbers

#

weird

real sierra Jun 1, 2025, 7:22 PM

#

i heard bogosort

prime ridge Jun 1, 2025, 7:23 PM

#

olive sable my brother in christ, pytorch ***is *** installed

worst case it's not required but makes it like 3-4x faster

hexed grove Jun 1, 2025, 7:23 PM

#

yall i need some advice

real sierra Jun 1, 2025, 7:23 PM

#

are there any other important math functions im missing

prime ridge Jun 1, 2025, 7:23 PM

#

check pins

real sierra Jun 1, 2025, 7:23 PM

#

pow might be nice

#

any bright ideas on how to do pow efficiently

#

log_n CatDespair

#

surely just

burnt aurora Jun 1, 2025, 7:24 PM

#

is adding arduino libraries supposed to take 5min and not be done

real sierra Jun 1, 2025, 7:24 PM

#

log2(x) / log2(n)

#

ReallyGunShoot precision

trim valve Jun 1, 2025, 7:24 PM

#

prime ridge Jun 1, 2025, 7:24 PM

#

my build times can be up to 10 min for arduino sometimes

trim valve Jun 1, 2025, 7:24 PM

#

my beloved

prime ridge Jun 1, 2025, 7:24 PM

#

depends on the code

trim valve Jun 1, 2025, 7:24 PM

#

I love how it slowly but surely drops from 9bn/s to 8bn/s as my cpu cooks itself

prime ridge Jun 1, 2025, 7:25 PM

#

@olive sable it's tokenizing and should be finished in 90-120 min. I gotta finish my paper just ping me if u need anything

hexed grove Jun 1, 2025, 7:25 PM

#

i need to integrate live2d into my c code, live2d doesnt have C bindings what do i doooo

olive sable Jun 1, 2025, 7:25 PM

#

okp

trim valve Jun 1, 2025, 7:25 PM

#

uh wait how hard is a 16 item list again

nocturne olive Jun 1, 2025, 7:25 PM

#

hexed grove i need to integrate live2d into my c code, live2d doesnt have C bindings what do...

Make C bindings

stark needle Jun 1, 2025, 7:26 PM

#

prime ridge Idk how being trained on wikipedia is supposed to improve the accuracy of a mode...

it improves generalization

trim valve Jun 1, 2025, 7:26 PM

#

ah ok that is nonideal actually

hexed grove Jun 1, 2025, 7:26 PM

#

nocturne olive Make C bindings

how hard should that be (i do not know c++ )

trim valve Jun 1, 2025, 7:27 PM

#

wait does someone here remember how to do basic stats

#

I forgot all of it glueless

real sierra Jun 1, 2025, 7:27 PM

#

nope

nocturne olive Jun 1, 2025, 7:27 PM

#

hexed grove how hard should that be (i do not know c++ )

I don't know, depends on the complexity of things you're making bindings for

trim valve Jun 1, 2025, 7:28 PM

#

if I have a 1 / 16! chance of getting my list in the correct order, and I'm doing 8 billion attempts per second what's the average time to have a 90% chance of the list being sorted

real sierra Jun 1, 2025, 7:28 PM

#

i have mul

#

but its just repeated addition

#

i dont think i can get much better
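
With right shift now working, shift-and-add multiplication is one option that replaces up to b additions with at most one add per bit of b. A minimal sketch, assuming a 16-bit word that wraps on overflow (the word size is a guess, not something stated here); `mul16` and `MASK` are hypothetical names, and the left shift is written as `a + a` in case only a right shift exists in hardware.

```python
MASK = 0xFFFF  # assumed 16-bit word size


def mul16(a, b):
    """Multiply two 16-bit values with shift-and-add, wrapping like 16-bit hardware."""
    a &= MASK
    b &= MASK
    result = 0
    while b:
        if b & 1:                 # lowest bit of b set: add the current multiple of a
            result = (result + a) & MASK
        a = (a + a) & MASK        # double a (a left shift by one, done as an add)
        b >>= 1                   # move on to the next bit of b
    return result
```

For 16-bit operands the loop runs at most 16 times, whereas repeated addition can take up to 65535 iterations. Whether that is a win depends on how expensive a single right shift is on the target machine.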

trim valve Jun 1, 2025, 7:29 PM

#

trim valve if I have a 1 / 16! chance of getting my list in the correct order, and I'm doin...

glueless I swear I used to know how to do this

stark needle Jun 1, 2025, 7:29 PM

#

nocturne olive This whole LLM pretraining venture seems quite pointless There's like 30 thousan...

this is so real

trim valve Jun 1, 2025, 7:29 PM

#

I would say you can use the standard normal distribution or something but evilShrug
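
A quick way to get a number for the bogosort question, assuming every shuffle is independent with success probability p = 1/16!. The number of attempts until the first success then follows a geometric distribution rather than a normal one, so the chance of at least one success in n attempts is 1 - (1 - p)^n, and solving for 90% gives n = ln(0.1) / ln(1 - p). The function name and its parameters are hypothetical.

```python
import math


def seconds_for_success(n_items=16, attempts_per_sec=8e9, target=0.9):
    """Seconds of shuffling needed to reach `target` probability of a sorted list."""
    p = 1 / math.factorial(n_items)          # chance that one shuffle is sorted
    # Solve 1 - (1 - p)**n >= target for n; log1p keeps precision for tiny p.
    attempts = math.log(1 - target) / math.log1p(-p)
    return attempts / attempts_per_sec


if __name__ == "__main__":
    secs = seconds_for_success()
    print(f"{secs:.0f} s, about {secs / 60:.0f} min")
```

Under these assumptions the answer is roughly 6000 seconds, so on the order of 100 minutes at 8 billion attempts per second. The expected time to the first success is shorter, 16! / 8e9 or about 44 minutes, because the 90% mark sits well past the mean of a geometric distribution.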

nocturne olive Jun 1, 2025, 7:29 PM

#

stark needle this is so real

Silly

prime ridge Jun 1, 2025, 7:29 PM

#

Nobody likes yet another derivation of GPT. Enough said

#

that's super boring

#

or Llama

olive sable Jun 1, 2025, 7:30 PM

#

no module named torch bwaaaaa

nocturne olive Jun 1, 2025, 7:30 PM

#

LLMs in general are pretty boring

trim valve Jun 1, 2025, 7:30 PM

#

sam how are you installing torch?

nocturne olive Jun 1, 2025, 7:30 PM

#

nocturne olive LLMs in general are pretty boring

Now vocal synthesis, that's where it's at

prime ridge Jun 1, 2025, 7:30 PM

#

olive sable no module named torch bwaaaaa

it's fine. You don't need it. It would just make it a bit faster

trim valve Jun 1, 2025, 7:30 PM

#

nocturne olive LLMs in general are pretty boring

I personally prefer eating rocks

olive sable Jun 1, 2025, 7:30 PM

#

trim valve sam how are you installing torch?

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu126

trim valve Jun 1, 2025, 7:30 PM

#

hm

#

try pip freeze I'm curious
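
A small check that can sit alongside `pip freeze`, in case the "no module named torch" error comes from pip and python pointing at different interpreters (which is only one possible cause). It uses only the standard library; `describe_env` is a hypothetical helper name.

```python
import importlib.util
import sys


def describe_env(module_name="torch"):
    """Report which interpreter is running and whether it can see `module_name`."""
    spec = importlib.util.find_spec(module_name)
    return {
        "interpreter": sys.executable,              # the python binary actually in use
        "in_venv": sys.prefix != sys.base_prefix,   # True inside a virtual environment
        "module_found": spec is not None,
        "module_location": getattr(spec, "origin", None),
    }
```

Running `describe_env()` with the same `python` that fails, and comparing `interpreter` against the path printed by `pip --version`, shows whether the two agree.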

prime ridge Jun 1, 2025, 7:30 PM

#

are you in a env btw

olive sable Jun 1, 2025, 7:30 PM

#

nope

hoary lion Jun 1, 2025, 7:31 PM

#

nocturne olive Now vocal synthesis, that's where it's at

See we have a man of culture

olive sable Jun 1, 2025, 7:31 PM

#

i dont really use envs all that often

trim valve Jun 1, 2025, 7:31 PM

#

glueless do I leave my desktop attempting to bogosort 16 numbers or do I let it rest

real sierra Jun 1, 2025, 7:31 PM

#

i dont suppose those bit hacks require shr or shl

#

monkaLaugh

nocturne olive Jun 1, 2025, 7:31 PM

#

hoary lion See we have a man of culture

This is the peak of silliness

trim valve Jun 1, 2025, 7:32 PM

#

honestly every time I see ai voice stuff it reminds me I need to train voice irl

#

get back on the grind

stark needle Jun 1, 2025, 7:32 PM

#

prime ridge or Llama

uh isn't that what ur doing basically

prime ridge Jun 1, 2025, 7:32 PM

#

not at all?

noble zodiac Jun 1, 2025, 7:33 PM

#

wish I could sing but my vocal cords are fricked up

trim valve Jun 1, 2025, 7:33 PM

#

😔

#

with effort they can get a bit better

stark needle Jun 1, 2025, 7:33 PM

#

prime ridge not at all?

how is it different? From my understanding it's also a transformer

prime ridge Jun 1, 2025, 7:33 PM

#

transformer != llama

stark needle Jun 1, 2025, 7:33 PM

#

llama is a transformer

prime ridge Jun 1, 2025, 7:33 PM

#

Ok surely this is bait 🪤

olive sable Jun 1, 2025, 7:33 PM

#

SCHIZO i have no clue what is happening nor what im doing

#

imma watch some anime

noble zodiac Jun 1, 2025, 7:34 PM

#

trim valve with effort they can get a bit better

a bit aint enough to stop ears from bleeding. Ah well, it do be what it do be

tender river Jun 1, 2025, 7:34 PM

#

real sierra it looks really weird

forth is a lovely language neuroPogHD
you should learn it

: SQRT
    0 SWAP 1
    BEGIN
        2DUP >= WHILE
        DUP 2 +
        ROT ROT -
        ROT 1 +
        ROT ROT
        SWAP
    REPEAT
    DROP DROP ;

25 SQRT CR .

prime ridge Jun 1, 2025, 7:34 PM

#

@olive sable yeah ur good we can just wait for the encoding to finish

stark needle Jun 1, 2025, 7:34 PM

#

prime ridge Ok surely this is bait 🪤

llama is literally the name of The Architecture, which at the end doesn't matter that much

trim valve Jun 1, 2025, 7:34 PM

#

noble zodiac a bit aint enough to stop ears from bleeding. Ah well, it do be what it do be

i mean

#

evilShrug

#

you gotta start somewhere

noble zodiac Jun 1, 2025, 7:35 PM

#

I rather play instruments than fight with biology

tender river Jun 1, 2025, 7:35 PM

#

tender river forth is a lovely language neuroPogHD you should learn i...

otherwise, you will forever be remembered by me as the java programmer ReallyInnocent

real sierra Jun 1, 2025, 7:35 PM

#

tender river forth is a lovely language neuroPogHD you should learn i...

stacrobatics

prime ridge Jun 1, 2025, 7:35 PM

#

stark needle llama is literally the name pf The Architecture, which at the end doesn't matter...

llama is an LLM developed by Meta not an architecture

trim valve Jun 1, 2025, 7:35 PM

#

glueless why not both

hoary lion Jun 1, 2025, 7:35 PM

#

It is???

noble zodiac Jun 1, 2025, 7:35 PM

#

there are only so many hours in the day

hoary lion Jun 1, 2025, 7:35 PM

#

Have you not heard of llamafy

stark needle Jun 1, 2025, 7:35 PM

#

prime ridge llama is an LLM developed by Meta not an architecture

ur referring to the llama weights

trim valve Jun 1, 2025, 7:35 PM

#

yeah fair

nocturne olive Jun 1, 2025, 7:36 PM

#

noble zodiac I rather play instruments than fight with biology

Meanwhile me, I'm quite good at making vocals, but suck at instrumentation

noble zodiac Jun 1, 2025, 7:36 PM

#

that has nothing to do with singing

prime ridge Jun 1, 2025, 7:36 PM

#

stark needle ur referring to the llama weights

yeah. LLaMA != Llama

trim valve Jun 1, 2025, 7:37 PM

#

I can sing really well in my head glueless

nocturne olive Jun 1, 2025, 7:37 PM

#

noble zodiac that has nothing to do with singing

Technically it does
I know how to make the computer sing pretty decently

trim valve Jun 1, 2025, 7:37 PM

#

god bless the discord emote autocomplete

trim valve Jun 1, 2025, 7:37 PM

#

nocturne olive Technically it does I know how to make the computer sing pretty decently

ok but

#

like ur throat isn't a computer

signal trout Jun 1, 2025, 7:37 PM

#

trim valve I can sing really well in my head glueless

real

trim valve Jun 1, 2025, 7:37 PM

#

you have to put in effort

prime ridge Jun 1, 2025, 7:37 PM

#

either way it's not just a fine tuned model

trim valve Jun 1, 2025, 7:37 PM

#

instead of letting your rock think extra hard

signal trout Jun 1, 2025, 7:37 PM

#

trim valve you have to put in effort

both require effort, and both are valuable skills neuroPray

nocturne olive Jun 1, 2025, 7:38 PM

#

trim valve instead of letting your rock think extra hard

I like when my rock gets to think real hard about singing

trim valve Jun 1, 2025, 7:38 PM

#

yeah sure but not comparable skills

signal trout Jun 1, 2025, 7:38 PM

#

I couldn't imagine doing what superbox does; very impressive stuff

trim valve Jun 1, 2025, 7:38 PM

#

good

nocturne olive Jun 1, 2025, 7:39 PM

#

signal trout I couldn't imagine doing what superbox does; very impressive stuff

We're only just leading Neuro singing voice cloning and exploring completely untapped potential in the field

olive sable Jun 1, 2025, 7:39 PM

#

https://tenor.com/view/llama-animal-gif-27259607


#

llama

stark needle Jun 1, 2025, 7:39 PM

#

prime ridge either way it's not just a fine tuned model

i'm open to agreeing but I'm still not quite understanding the exact point of not pretraining and instead working directly with chat data and what makes this exactly better than just fine-tuning

#

Is there any documentation that ya got?

signal trout Jun 1, 2025, 7:39 PM

#

nocturne olive We're only just leading Neuro singing voice cloning and exploring completely unt...

"only" looking pretty impressive neuroPray

noble zodiac Jun 1, 2025, 7:39 PM

#

trim valve yeah sure but not comparable skills

agreed, completely different things

real sierra Jun 1, 2025, 7:40 PM

#

there must be some better way to do powers tho

#

surely

hoary lion Jun 1, 2025, 7:40 PM

#

help

nocturne olive Jun 1, 2025, 7:40 PM

#

signal trout "only" looking pretty impressive neuroPray

Silly
Either way, NeuroSynth is gonna be good
Did you see BETA-3?

hoary lion Jun 1, 2025, 7:40 PM

#

my pc has less than 8 gb of space

#

its dying

trim valve Jun 1, 2025, 7:40 PM

#

😭

signal trout Jun 1, 2025, 7:40 PM

#

nocturne olive Silly Either way, NeuroSynth is gonna be good Did you see BETA-3?

I don't think so

tender river Jun 1, 2025, 7:40 PM

#

real sierra there must be some better way to do powers tho

binpow is the fastest you can get with software

trim valve Jun 1, 2025, 7:40 PM

#

so glad I overspecced my storage

nocturne olive Jun 1, 2025, 7:40 PM

#

hoary lion my pc has less than 8 gb of space

Delete any big games you have installed
And if you have Unity you can free up like 50GB by getting rid of it

hoary lion Jun 1, 2025, 7:41 PM

#

I already did 😭

#

today is the day my pc is going to be stacked

nocturne olive Jun 1, 2025, 7:41 PM

#

signal trout I don't think so

Here's some NS-B-3 highly tuned action

prime ridge Jun 1, 2025, 7:41 PM

#

stark needle Is there any documentation that ya got?

A model with a single intended purpose that has been pretrained for that purpose will outperform a model trained on irrelevant data of the same size. It's the classic example of "garbage in, garbage out".

#

Training data is the #1 most important factor of model performance

#

Just look at how Llama 4 seemed good on llmarena despite being bad elsewhere since it was made for that task

#

literally the #1 principle in ML

tender river Jun 1, 2025, 7:42 PM

#

real sierra there must be some better way to do powers tho

something like this, surely i didnt make a mistake

int pow(int a, int b) {
    if (b == 0) return 1;
    int ret = pow(a, b / 2);
    ret *= ret;
    if (b % 2 == 1) ret *= a;
    return ret;
}

maiden geyser Jun 1, 2025, 7:42 PM

#

what's next, zaporozhian cossack division?

real sierra Jun 1, 2025, 7:43 PM

#

shr1 recurses 15 times salute not sure the mul savings are worth
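For a rough sense of the trade-off raised here, the sketch below counts the multiplications performed by the recursive `pow` posted above and compares that with a plain loop that multiplies `b` times. The helper names are hypothetical, and the cost of each halving step (the right shift that is expensive on this particular machine) is deliberately not modeled.

```c
/* Multiplications done by the recursive pow(a, b) posted above:
 * every level with b != 0 squares once, and multiplies by a
 * one more time when b is odd. */
unsigned binpow_mul_count(unsigned b) {
    if (b == 0) return 0;
    return binpow_mul_count(b / 2) + 1u + (b % 2u);
}

/* Multiplications done by a plain loop that starts at 1
 * and multiplies by a exactly b times. */
unsigned naive_mul_count(unsigned b) {
    return b;
}
```

Under these assumptions an exponent of 15 costs 8 multiplications instead of 15, so the saving only becomes large for bigger exponents, and each recursion level also pays for one halving.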

tender river Jun 1, 2025, 7:43 PM

#

we wrote the same algo so it must be correct neuroPogHD

prime ridge Jun 1, 2025, 7:43 PM

#

neuro7

real sierra Jun 1, 2025, 7:44 PM

#

this is strangely very relevant to the code im writing rn

stark needle Jun 1, 2025, 7:44 PM

#

prime ridge A model with a single intended purpose that has been pretrained for that purpose...

but it's not directly irrelevant data, when the way grammar works, etc is similar. You may be thinking of the weaknesses of instruction tuned models, those sure yea they got problems but on base models you don't necessarily have that problem

signal trout Jun 1, 2025, 7:45 PM

#

nocturne olive Here's some NS-B-3 highly tuned action

NeurOhISee seems promising

hoary lion Jun 1, 2025, 7:45 PM

#

wait

#

who TF installed games at my c drive

trim valve Jun 1, 2025, 7:45 PM

#

me

#

duh

hexed grove Jun 1, 2025, 7:45 PM

#

hoary lion who TF installed games at my c drive

i sshed in

#

whoops

nocturne olive Jun 1, 2025, 7:46 PM

#

signal trout NeurOhISee seems promising

Still running on synthetic data by the way, NeuroSynth-1.0 will be using organic data

hoary lion Jun 1, 2025, 7:46 PM

#

😠 istg i definitely set it to other directory than c on my steam

prime ridge Jun 1, 2025, 7:46 PM

#

stark needle but it's not directly irrelevant data, when the way grammar works, etc is simila...

I am GRPO tuning the base model afterwards. Also, just to be so serious, grammar does not matter almost at all in my model...

signal trout Jun 1, 2025, 7:46 PM

#

nocturne olive Still running on synthetic data by the way, NeuroSynth-1.0 will be using organic...

what do you mean by "synthetic" data?

nocturne olive Jun 1, 2025, 7:46 PM

#

signal trout what do you mean by "synthetic" data?

Data made with Neuro RVC and existing compatible datasets

hoary lion Jun 1, 2025, 7:46 PM

#

migrating my limbus company to D or E/F

#

byebye

real sierra Jun 1, 2025, 7:47 PM

#

this one i do know

#

think you're missing some parentheses tho

tender river Jun 1, 2025, 7:47 PM

#

tender river something like this, surely i didnt make a mistake int pow(int a, int b) { ...

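: POW ( a b -- a^b )
    DUP 0 =
    IF 2DROP 1 ELSE      \ b is 0: discard a and b, result is 1
        2DUP
        2 /
        RECURSE          \ ( a b r ) where r = a^(b/2)
        DUP *            \ ( a b r*r )
        SWAP             \ ( a r*r b ) bring b back up for the parity test
        2 MOD 1 = IF * ELSE NIP THEN
    THEN ;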

#

untested

stark needle Jun 1, 2025, 7:48 PM

#

prime ridge I am GRPO tuning the base model afterwards. Also, just to be so serious, grammar...

But grpo assumes the model output is good and you got a high quality autorater on which the model doesn't hack the rewards

real sierra Jun 1, 2025, 7:48 PM

#

i would still never write it without parentheses, it scares me

tender river Jun 1, 2025, 7:48 PM

#

tender river POW : DUP 0 = IF DROP 1 ELSE 2DUP 2 / P...

also binpow is trivial to translate to an iterative approach too
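A minimal sketch of what the iterative version could look like in C. The name `binpow_iter` and the choice of `uint32_t` are assumptions for illustration; results wrap modulo 2^32 on overflow, and the exponent is treated as unsigned.

```c
#include <stdint.h>

/* Iterative binary exponentiation: walk the exponent's bits from
 * least significant to most significant, squaring the base each step
 * and multiplying it into the result whenever the current bit is set. */
uint32_t binpow_iter(uint32_t base, uint32_t exp) {
    uint32_t result = 1;
    while (exp > 0) {
        if (exp & 1u) {
            result *= base;   /* this bit of the exponent contributes */
        }
        base *= base;         /* base now holds base^(2^k) for the next bit */
        exp >>= 1;            /* move on to the next bit */
    }
    return result;
}
```

This does the same amount of multiplying as the recursive form but needs no call stack; it still halves the exponent once per loop iteration.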

prime ridge Jun 1, 2025, 7:49 PM

#

stark needle But grpo assumes the model output is good and you got a high quality autorater o...

Why are you so insistent that the model will be bad? I literally already trained it at 30m parameters and it was "eh". Considering that it was only 30m parameters and could at least form sentences, that is extremely telling, don't you think? At a higher scale it will be significantly better

hoary lion Jun 1, 2025, 7:49 PM

#

bc

#

grpo is for like

#

super big big model yk

#

like frontier

#

unmatchable

prime ridge Jun 1, 2025, 7:49 PM

#

GRPO is just RL. It don't need to be big. I'm not using it for test time compute

hoary lion Jun 1, 2025, 7:49 PM

#

and you are objectively wrong with using GRPO on heavily non-factual stuffs

#

it does not work

#

how are you going to measure the perf? ELO or something? then we back to RLHF

prime ridge Jun 1, 2025, 7:50 PM

#

GRPO is for tool usage. Specifically, actions taken to avoid being detected as an AI

#

im not using it as a replacement for fine tuning

#

specific words might indicate doubt of human interaction

nocturne olive Jun 1, 2025, 7:51 PM

#

Tool use on such a tiny model? That's gonna go horribly

prime ridge Jun 1, 2025, 7:51 PM

#

im not 100% set on GRPO btw. Just an experiment

#

holy fuck

#

let a man experiment

#

I am curious

hoary lion Jun 1, 2025, 7:51 PM

#

...

#

neuroDeadge

prime ridge Jun 1, 2025, 7:51 PM

#

I know the base model will be good but idk about fine tuning

stark needle Jun 1, 2025, 7:52 PM

#

prime ridge Why are you so insistent that the model will be bad? I literally already trained...

A 10M model overfit on TinyStories produced barely coherent sentences even on extremely simplified and perfect data. The model was also clueless about anything remotely outside its exact distribution

prime ridge Jun 1, 2025, 7:52 PM

#

where is 10m coming from

hoary lion Jun 1, 2025, 7:52 PM

#

tinystories

#

its the model name

#

ig

trim valve Jun 1, 2025, 7:52 PM

#

is it really an ai discussion in #programming if people don't try to tell you 1000 reasons why you're wrong and a terrible person

prime ridge Jun 1, 2025, 7:52 PM

#

ofc 30m was overfitted

prime ridge Jun 1, 2025, 7:53 PM

#

trim valve is it really an ai discussion in #programming if people don't try to t...

on god

#

like damn bro. let a man live

hoary lion Jun 1, 2025, 7:53 PM

#

trim valve is it really an ai discussion in #programming if people don't try to t...

NeuroClueless we just don't want gpu to melt for things that won't work

#

give me the 3090!! ill change the world trust

trim valve Jun 1, 2025, 7:53 PM

#

NeuroClueless why do you care

nocturne olive Jun 1, 2025, 7:53 PM

#

hoary lion give me the 3090!! ill change the world trust

Nah, give me 3090 and I'll make the best Neuro vocal synthesizer ever

hoary lion Jun 1, 2025, 7:53 PM

#

ooh wait

#

we have karaoke

#

ima listen

prime ridge Jun 1, 2025, 7:54 PM

#

Regardless, im still going to learn how to actually make an LLM instead of just yoinking a pretrained model

hoary lion Jun 1, 2025, 7:54 PM

#

fk im lagging

stark needle Jun 1, 2025, 7:55 PM

#

fair enough, but training only on conversational data is objectively a poor idea; even Google's LaMDA was still like 60% web and only 40% conversational, which is already an insane ratio

olive sable Jun 1, 2025, 7:57 PM

#

hoary lion give me the 3090!! ill change the world trust

$800

#

take it or leave it

#

xdx

olive sable Jun 1, 2025, 7:58 PM

#

nocturne olive Nah, give me 3090 and I'll make the best Neuro vocal synthesizer ever

$750

trim valve Jun 1, 2025, 7:58 PM

#

how about $1

olive sable Jun 1, 2025, 7:58 PM

#

xdx

stark needle Jun 1, 2025, 7:58 PM

#

for 800$ u bet i take it😭😭

nocturne olive Jun 1, 2025, 7:58 PM

#

olive sable $750

I don't have that much money (if I did I would definitely get that)

olive sable Jun 1, 2025, 7:58 PM

#

trim valve how about $1

for $1 i could give you a singular core

hoary lion Jun 1, 2025, 7:58 PM

#

its a steal btw

trim valve Jun 1, 2025, 7:58 PM

#

can I take one of the capacitors from the board

#

glueless

#

DISCORD

stark needle Jun 1, 2025, 7:59 PM

#

???

trim valve Jun 1, 2025, 7:59 PM

#

PLEASE

olive sable Jun 1, 2025, 7:59 PM

#

nocturne olive I don't have that much money (if I did I would definitely get that)

damn, are 3090's that expensive now?

#

i paid 675 for this

stark needle Jun 1, 2025, 7:59 PM

#

olive sable damn, are 3090's that expensive now?

1.1k used here

olive sable Jun 1, 2025, 7:59 PM

#

welpsagiri

stark needle Jun 1, 2025, 7:59 PM

#

It's a fucking scam

nocturne olive Jun 1, 2025, 7:59 PM

#

olive sable i paid 675 for this

Cheapest I've seen in Finland is 650€, but more usually they're in the 700€ range

trim valve Jun 1, 2025, 7:59 PM

#

powercolour please can you release the 9070xt reaper I'm getting desperate

olive sable Jun 1, 2025, 7:59 PM

#

stark needle 1.1k used here

my entire pc used was 1.3K

rigid snow Jun 1, 2025, 8:00 PM

#

nocturne olive I don't have that much money (if I did I would definitely get that)

i don't understand why one wouldn't rent gpu compute when it isn't that expensive

hoary lion Jun 1, 2025, 8:00 PM

#

brother what

#

3100 canadian dollars

stark needle Jun 1, 2025, 8:00 PM

#

trim valve powercolour please can you release the 9070xt reaper I'm getting desperate

1999$ after scalper pricing

hoary lion Jun 1, 2025, 8:00 PM

#

2000+ USD

nocturne olive Jun 1, 2025, 8:00 PM

#

rigid snow i don't understand why one wouldn't rent gpu compute when it isn't that expensiv...

I need it consistently, for me it'd almost certainly be cheaper overall to just own the hardware

stark needle Jun 1, 2025, 8:00 PM

#

Or something 😭😭

trim valve Jun 1, 2025, 8:01 PM

#

¯_(ツ)_/¯

nocturne olive Jun 1, 2025, 8:01 PM

#

nocturne olive I need it consistently, for me it'd almost certainly be cheaper overall to just ...

-# Also I hate cloud stuff

stark needle Jun 1, 2025, 8:01 PM

#

Apparently

trim valve Jun 1, 2025, 8:01 PM

#

doubt it'd be that much

#

but also I have zero idea how monopoly dollars translate to gbp

hoary lion Jun 1, 2025, 8:01 PM

#

the lowest bid in EBAY is 840 USD rn

stark needle Jun 1, 2025, 8:01 PM

#

China got access to western 5090s somehow so now they are extremely stacking themselves with all 5090s on the market

hoary lion Jun 1, 2025, 8:01 PM

#

like what 😭 😭

trim valve Jun 1, 2025, 8:01 PM

#

hoary lion the lowest bid in EBAY is 840 USD rn

is this for a 3090 or the card I was talking about

#

glueless

hoary lion Jun 1, 2025, 8:02 PM

#

3090

#

what card?

olive sable Jun 1, 2025, 8:02 PM

#

#

They really arent that expensive here

hoary lion Jun 1, 2025, 8:02 PM

#

nahh

#

send me there rn

trim valve Jun 1, 2025, 8:02 PM

#

neuroPray ok nvm me then I'm just lost in this mess of a chat

rigid snow Jun 1, 2025, 8:03 PM

#

nocturne olive I need it consistently, for me it'd almost certainly be cheaper overall to just ...

how many hours of training per day do you realistically need, even if you iterate super fast

stark needle Jun 1, 2025, 8:03 PM

#

trim valve is this for a 3090 or the card I was talking about

Cheapest

trim valve Jun 1, 2025, 8:03 PM

#

#

but pre orders are not exactly how I want to live my life

nocturne olive Jun 1, 2025, 8:04 PM

#

rigid snow how many hours of training per day do you realistically need, even if you iterat...

Well, I trained NeuroSynth-BETA-3 for 30 hours straight, and I'll have to soon do it again for a special append for BETA-3.1

stark needle Jun 1, 2025, 8:04 PM

#

I wonder if there's somehow an underground dark web network for china illegal gpu trafficking😭😭

trim valve Jun 1, 2025, 8:04 PM

#

also neither of those are the card I was talking about

#

I specifically need the powercolor reaper

#

nothing else will fit in my case

stark needle Jun 1, 2025, 8:04 PM

#

Cause how come everywhere GPUs are out of stock

olive sable Jun 1, 2025, 8:04 PM

#

olive sable

i dont mind buying and selling 3090's to yall. i just aint paying for shipping

hoary lion Jun 1, 2025, 8:04 PM

#

stark needle I wonder if there's somehow an underground dark web network for china illegal gp...

i am definitely joining it if I had a chance

trim valve Jun 1, 2025, 8:05 PM

#

olive sable i dont mind buying and selling 3090's to yall. i just aint paying for shipping

glueless also like surely there are fees involved for shipping it across borders

nocturne olive Jun 1, 2025, 8:05 PM

#

olive sable

599€?? Holy cheap

stark needle Jun 1, 2025, 8:05 PM

#

hoary lion i am definitely joining it if I had a chance

SAME😭😭

olive sable Jun 1, 2025, 8:05 PM

#

trim valve glueless also like surely there are fees involved for shi...

sometimes ye, like 21% import tax

nocturne olive Jun 1, 2025, 8:05 PM

#

olive sable i dont mind buying and selling 3090's to yall. i just aint paying for shipping

I wonder how much the shipping from wherever you are to Finland would be

stark needle Jun 1, 2025, 8:05 PM

#

3192999$ shipping

#

prob 40-50$

trim valve Jun 1, 2025, 8:05 PM

#

sounds like a good deal

hoary lion Jun 1, 2025, 8:06 PM

#

wait I actually did keep a note of

trim valve Jun 1, 2025, 8:06 PM

#

does that shipping include throwing the package down a flight of stairs

hoary lion Jun 1, 2025, 8:06 PM

#

chinese 4090 48GB version

stark needle Jun 1, 2025, 8:06 PM

#

import tax but idk how import tax works between EU countries

opaque sigil Jun 1, 2025, 8:06 PM

#

There's none

stark needle Jun 1, 2025, 8:06 PM

#

ok that's based

opaque sigil Jun 1, 2025, 8:06 PM

#

Unless that changed

#

Used to apply to the UK too but not anymore since they left the EEA ReallyMad

olive sable Jun 1, 2025, 8:07 PM

#

nocturne olive I wonder how much the shipping from wherever you are to Finland would be

this is without a possible import tax tho

opaque sigil Jun 1, 2025, 8:07 PM

#

I miss ordering custom cables from the uk

nocturne olive Jun 1, 2025, 8:07 PM

#

stark needle + import tax but idk how import tax works between EU countries

If this dumb AI summary is correct, it's not a thing
But I don't know if I can trust it

stark needle Jun 1, 2025, 8:07 PM

#

my dad had to import his DGX GB10 order via italy cause it's not even sold in Switzerland 😭😭

trim valve Jun 1, 2025, 8:07 PM

#

glueless order some to me and I can take them to the netherlands then ship them

olive sable Jun 1, 2025, 8:07 PM

#

nocturne olive If this dumb AI summary is correct, it's not a thing But I don't know if I can t...

then it should be fine

trim valve Jun 1, 2025, 8:08 PM

#

glueless surely that is a valid way to avoid fees

hoary lion Jun 1, 2025, 8:08 PM

#

trim valve glueless order some to me and I can take them to the neth...

totally won't snatch in between glueless

olive sable Jun 1, 2025, 8:08 PM

#

send me 650 and il buy it xdx

#

trust

hoary lion Jun 1, 2025, 8:08 PM

#

shadow

#

how much is 5090 rn

#

in your country

stark needle Jun 1, 2025, 8:08 PM

#

olive sable send me 650 and il buy it xdx

Bitcoin address ahh reseller

nocturne olive Jun 1, 2025, 8:09 PM

#

olive sable this is without a possible import tax tho

Hm, maybe
I'll consider it when I have more money if you can find me a good 3-slot one for which it's possible to find a pair later
I really, really need a 3090 for NeuroSynth training

olive sable Jun 1, 2025, 8:09 PM

#

i will if you want me to, im just in debt rn so i need the money beforehand

stark needle Jun 1, 2025, 8:10 PM

#

hoary lion how much is 5090 rn

2.5-2.8k CHF (3000$-3400$) but out of stock everywhere

#

Apparently companies do the conversion of usd price -> chf price 1:1 catdespair

nocturne olive Jun 1, 2025, 8:10 PM

#

nocturne olive Hm, maybe I'll consider it when I have more money if you can find me a good 3-sl...

Just gotta figure out where I'm gonna get all that money
-# ~~definitely not NeuroSynth commissions, nobody wants those anyway~~

olive sable Jun 1, 2025, 8:10 PM

#

3.4K here as well

#

for 5090

hoary lion Jun 1, 2025, 8:10 PM

#

i have no fucking clue how variant can 5090 be

#

is this canadian dollars or nah i dont even know

#

probably US since it's bestbuy

stark needle Jun 1, 2025, 8:11 PM

#

7381748382$ gpu😭😭

nocturne olive Jun 1, 2025, 8:11 PM

#

nocturne olive Just gotta figure out where I'm gonna get all that money -# ~~definitely not Neu...

Why does money have to be so hard?

olive sable Jun 1, 2025, 8:11 PM

#

nocturne olive Just gotta figure out where I'm gonna get all that money -# ~~definitely not Neu...

imagine after game-jam 3.
i get a bill:

usage of neurosynth:
$800 or 1 RTX 3090 24GB

trim valve Jun 1, 2025, 8:11 PM

#

glueless

nocturne olive Jun 1, 2025, 8:12 PM

#

olive sable imagine after game-jam 3. i get a bill: usage of neurosynth: $800 or 1 RTX ...

That's so silly

olive sable Jun 1, 2025, 8:12 PM

#

neuroAwareA

#

dont actually send me a bill btw

nocturne olive Jun 1, 2025, 8:12 PM

#

But no, I've always operated on a "pay if you feel like it" model

#

Though I may sometimes provide stuff for free too easily

trim valve Jun 1, 2025, 8:12 PM

#

sam can I mail you an invoice for an rtx ada 6000 please

#

I want one

stark needle Jun 1, 2025, 8:13 PM

#

meanwhile @viral oasis (hi op) with 1 4090 and 2 3090s😭😭

trim valve Jun 1, 2025, 8:13 PM

#

NeuroClueless

opaque sigil Jun 1, 2025, 8:13 PM

#

olive sable dont actually send me a bill btw

that invoice for a gh200 is on its way

stark needle Jun 1, 2025, 8:14 PM

#

my dad also has like 5 jetson orin nx on backorder😭😭 each of em is like 600$ and worse perf than rtx 2060

olive sable Jun 1, 2025, 8:14 PM

#

trim valve I want one

none for sale here so bwaa

olive sable Jun 1, 2025, 8:14 PM

#

stark needle meanwhile @viral oasis (hi op) with 1 4090 and 2 3090s😭😭

hi operator lol

#

didnt know she was in this server

trim valve Jun 1, 2025, 8:15 PM

#

olive sable none for sale here so bwaa

smh

stark needle Jun 1, 2025, 8:15 PM

#

olive sable didnt know she was in this server

she's everywhere

#

even in your walls

olive sable Jun 1, 2025, 8:15 PM

#

damn

#

kinda weird

#

dont do that

stark needle Jun 1, 2025, 8:15 PM

#

how did i get 4 bwaas in 1 second

olive sable Jun 1, 2025, 8:15 PM

#

elvyn

#

the moment 1 bwaa was typed a 2nd immediately appeared

#

and im just terminally online

trim valve Jun 1, 2025, 8:17 PM

#

I am stalking this chat because its more fun than revision

olive sable Jun 1, 2025, 8:17 PM

#

i dont remember what i was doing tbh

hoary lion Jun 1, 2025, 8:17 PM

#

i think i found all the 5090 that shadow can't find

olive sable Jun 1, 2025, 8:18 PM

#

oh ye, i have to read a book and make a collage about it by tuesday

#

fuck

trim valve Jun 1, 2025, 8:18 PM

#

i have to revise for an exam on wednesday :3

#

(I am not looking forward to it)

opaque sigil Jun 1, 2025, 8:18 PM

#

what's the exam about neuroHypers

olive sable Jun 1, 2025, 8:18 PM

#

understandable

trim valve Jun 1, 2025, 8:18 PM

#

marketing

opaque sigil Jun 1, 2025, 8:18 PM

#

ew

olive sable Jun 1, 2025, 8:18 PM

#

marketing?

trim valve Jun 1, 2025, 8:18 PM

#

yeah exactly

olive sable Jun 1, 2025, 8:18 PM

#

what?

trim valve Jun 1, 2025, 8:19 PM

#

i hate the subject so damn much

olive sable Jun 1, 2025, 8:19 PM

#

supply and demand type of shit?

hoary lion Jun 1, 2025, 8:19 PM

#

what the fuck lmao
chinese stores definitely have more gpus, even the ones the US did not allow them to have

trim valve Jun 1, 2025, 8:19 PM

#

no way more brainrotted than that

real sierra Jun 1, 2025, 8:19 PM

#

@sage crag neuroDinkDonk wakey wakey i need some convenience functions for using a display

trim valve Jun 1, 2025, 8:19 PM

#

its basically just "people buy stuff"

hoary lion Jun 1, 2025, 8:19 PM

#

see that dgx and h100 🤦‍♂️

real sierra Jun 1, 2025, 8:19 PM

#

i attempted to write some but they arent working Sadgi

#

theres a display that treats a region of ram as a video buffer

olive sable Jun 1, 2025, 8:20 PM

#

trim valve its basically just "people buy stuff"

yep we do, what about it?

real sierra Jun 1, 2025, 8:20 PM

#

i can write data into it

#

but i want a nice way to just pick an (x, y) coordinate

trim valve Jun 1, 2025, 8:20 PM

#

idk something about how to make it so people buy your stuff instead of other people's

#

but like I really don't like the subject nor the lecturers

#

glueless if my math is right I only need to get 5% on this exam to pass though

olive sable Jun 1, 2025, 8:21 PM

#

if i make it cheaper, and people know its cheaper, and the quality doesnt suck, then people will buy it

real sierra Jun 1, 2025, 8:21 PM

#

current config is 64x64 pixels, monochrome, each 16-bit word of memory controls 16 pixels, in order from left to right, top to bottom
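A minimal sketch of the address half of the (x, y) lookup for this layout, assuming the rows are stored back to back starting at word offset 0 of the video buffer. The constant and function names are hypothetical.

```c
/* Layout described above: 64x64 monochrome, 16 pixels per 16-bit word,
 * words ordered left to right, then top to bottom. */
enum {
    SCREEN_W        = 64,
    SCREEN_H        = 64,
    PIXELS_PER_WORD = 16,
    WORDS_PER_ROW   = SCREEN_W / PIXELS_PER_WORD   /* 4 words per row */
};

/* Offset, in words from the start of the video buffer, of the word
 * that contains pixel (x, y). */
unsigned pixel_word_offset(unsigned x, unsigned y) {
    return y * WORDS_PER_ROW + x / PIXELS_PER_WORD;
}
```

Adding this offset to the display's base address gives the word to modify; `(y * 64 + x) / 16` computes the same value.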

olive sable Jun 1, 2025, 8:21 PM

#

ez

trim valve Jun 1, 2025, 8:21 PM

#

olive sable if i make it cheaper, and people know its cheaper, and the quality doesnt suck, ...

well

#

you see

olive sable Jun 1, 2025, 8:21 PM

#

see

#

evilStare

#

im see

trim valve Jun 1, 2025, 8:21 PM

#

that is one of the many things you are told

olive sable Jun 1, 2025, 8:22 PM

#

uhuh

trim valve Jun 1, 2025, 8:22 PM

#

but the subject basically boils down to "idk wing it"

#

with the exam basically just being a history lesson

olive sable Jun 1, 2025, 8:22 PM

#

https://tenor.com/view/skeleton-reaction-information-my-reaction-to-that-information-my-honest-reaction-to-that-information-gif-613275461876515673

real sierra Jun 1, 2025, 8:24 PM

#

for starters shouldnt that be (y * screen_width) + x

trim valve Jun 1, 2025, 8:24 PM

#

like I'm expected to write an essay answer to one of these questions in like 30 minutes

real sierra Jun 1, 2025, 8:24 PM

#

unless your y axis is horizontal

#

but secondly, the problem is that one word in memory is 16 pixels

#

so if i want to turn an individual pixel on

tender river Jun 1, 2025, 8:26 PM

#

shiro i'm so sorry

#

you have to use right shift

real sierra Jun 1, 2025, 8:26 PM

#

nuhuh

#

you can avoid right shift

#

by taking 16 - (x % 16)

#

and doing left shift

#

still i dont know why what i have isnt working

#

susge
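A sketch of the left-shift idea above, assuming bits are numbered 0 to 15 and the leftmost pixel of each word is the most significant bit (the log does not state the bit order). Under that numbering the shift amount is `15 - (x % 16)`; `16 - (x % 16)` would ask for a shift of 16 when `x % 16` is 0, which pushes the bit out of a 16-bit word. The function name is hypothetical.

```c
#include <stdint.h>

/* Mask selecting pixel x inside its 16-bit word, built with a left
 * shift only. Assumes the leftmost pixel maps to bit 15 (MSB). */
uint16_t pixel_mask_msb_first(unsigned x) {
    unsigned shift = 15u - (x % 16u);   /* x % 16 is the same as x & 15 */
    return (uint16_t)(1u << shift);
}
```

If the hardware instead puts the leftmost pixel in bit 0, the shift amount is simply `x % 16`.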

stark needle Jun 1, 2025, 8:27 PM

#

Idk how common it is but ive been told by my dad that in russia the pc stores are extremely premium looking in comparison to the west (at least the ones he visited)

real sierra Jun 1, 2025, 8:28 PM

#

# fill_pixel(x, y)
function fill_pixel 2;
    push_arg 1;
    push_value 64;
    call math_mult;
    push_retval;
    push_arg 0;
    add;
    call place_pixel;
    push_value 0;
    return;

function place_pixel 1;
    push_arg 0;
    push_value 8;
    call math_div;
    push_retval;
    push_value DISPLAY_BASE;
    add;
    push_value 0b0000000000000001;
    push_arg 0;
    push_value 15;
    and;
    push_value 16;
    sub;
    call math_shln;
    push_retval;
    pop_memory;
    push_value 0;
    return;

tender river Jun 1, 2025, 8:28 PM

#

shiro can you teach me erlang i dont know it

real sierra Jun 1, 2025, 8:28 PM

#

i also dont know erlang

tender river Jun 1, 2025, 8:28 PM

#

you can learn it neuroPogHD

real sierra Jun 1, 2025, 8:28 PM

#

neuroSisyphus

trim valve Jun 1, 2025, 8:28 PM

#

can we combine our fragments of erlang knowledge into a full understanding glueless

real sierra Jun 1, 2025, 8:29 PM

#

i know nothing about erlang

#

literally 0

tender river Jun 1, 2025, 8:29 PM

#

i know that pleroma is written in erlang (may or may not be true)

trim valve Jun 1, 2025, 8:29 PM

#

0 is more than a number less than zero

#

so we're not losing anything

tender river Jun 1, 2025, 8:29 PM

#

and i know that its based on message passing with in-process mutable state but no shared memory between processes

rigid snow Jun 1, 2025, 8:30 PM

#

stark needle Idk how common it is but ive been told by my dad that in russia the pc stores ar...

lmao what, did he visit boutique prebuilt stores or what, i can't call any of what i visited "premium"

tender river Jun 1, 2025, 8:31 PM

#

idk what premium means in this context

rigid snow Jun 1, 2025, 8:32 PM

#

like not even remotely

#

just regular ass stores

tender river Jun 1, 2025, 8:32 PM

#

i mean it will be cleaner than a supermarket

stark needle Jun 1, 2025, 8:35 PM

#

rigid snow lmao what, did he visit boutique prebuilt stores or what, i can't call any of wh...

idk

#

He was in st petersburg

tender river Jun 1, 2025, 8:35 PM

#

its better to work in whole words instead of individual pixels anyway

#

maybe have a function that bitands or bitors a mask to a certain word

#

that way you can do maths in your head instead of making your computer do it neuroPogHD

#

thats the optimal way

#

please understand, you are replaceable, your computer is one of a kind

#

it deserves care and adoration

#

and precomputed maths
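The masking idea above can be sketched as a pair of read-modify-write helpers. This is only an illustration: `vram`, `word_or`, and `word_and` are hypothetical names, and the video buffer is modeled as a plain array of 16-bit words. The point is that OR-ing or AND-ing a mask changes only the selected pixels, whereas storing the mask directly would overwrite the other pixels that share the word.

```c
#include <stdint.h>

/* Turn on every pixel whose bit is set in mask; other pixels are untouched. */
void word_or(uint16_t *vram, unsigned word_offset, uint16_t mask) {
    vram[word_offset] |= mask;
}

/* Keep only the pixels whose bit is set in mask; pass an inverted
 * mask to turn specific pixels off. */
void word_and(uint16_t *vram, unsigned word_offset, uint16_t mask) {
    vram[word_offset] &= mask;
}
```

With precomputed masks, drawing a horizontal run of pixels becomes one `word_or` per word instead of one call per pixel.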

rigid snow Jun 1, 2025, 8:38 PM

#

stark needle He was in st petersburg

i'm in moscow so if anywhere is premium it's here

real sierra Jun 1, 2025, 8:39 PM

#

tender river and precomputed maths

this compiler agrees with you

#

the reason my instruction tokens are so garbage is because +, -, *, /, and any other arithmetic symbol is evaluated at compile time by the precompiler

#

so i cant use them as tokens in my instructions :(

tender river Jun 1, 2025, 8:40 PM

#

did you want to make an apl or something

#

why would you need them in your identifiers

real sierra Jun 1, 2025, 8:40 PM

#

no i just wanted to write "D+A" instead of "ADD_DA"

#

looks nicer

stark needle Jun 1, 2025, 8:40 PM

#

@hoary lion ur not gonna believe it

tender river Jun 1, 2025, 8:41 PM

#

real sierra no i just wanted to write "D+A" instead of "ADD_DA"

makes sense, well you can force whitespace between identifiers to recover that

stark needle Jun 1, 2025, 8:41 PM

#

i'm pretraining a llm with qlora 😭 😭 😭

#

AND ITS WORKING

real sierra Jun 1, 2025, 8:43 PM

#

will try to adapt this since my attempt isnt working Sadgi

tender river Jun 1, 2025, 8:44 PM

#

i wonder if that means my stack breaker broke

#

let me see

#

the holy trinity

#

it broke as i expected

#

its fine i can fix it

#

this is totally a productive use of my time and very important for using the language

#

wait i think it crashes even without running the shellcode

real sierra Jun 1, 2025, 8:53 PM

#

are you sure this is right

#

uuh

tender river Jun 1, 2025, 8:53 PM

#

can you confirm

main := fn(): uint {
    start := "abcdefgh".ptr
    len := 8
    ptr: ^u8 = @syscall(0x9, 0, 4096, 7, 0x22, ~0, 0)
    i := 0
    loop if i == len break else {
        (ptr + i).* = (start + i).*
        i += 1
    }
    return 0
}
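For readers decoding the magic numbers, here is a rough C equivalent of the snippet above, assuming it targets Linux on x86-64, where syscall number 9 is `mmap`. Under that assumption, 7 is `PROT_READ | PROT_WRITE | PROT_EXEC`, 0x22 is `MAP_PRIVATE | MAP_ANONYMOUS`, and `~0` is the file descriptor -1. The function name is hypothetical, and some hardened systems refuse writable and executable mappings, so the call may fail.

```c
#define _DEFAULT_SOURCE
#include <stddef.h>
#include <string.h>
#include <sys/mman.h>

/* Map one anonymous 4096-byte page that is readable, writable and
 * executable, then copy len bytes from src into it.
 * Returns the page, or NULL if len is too large or mmap fails. */
void *copy_to_rwx_page(const void *src, size_t len) {
    if (len > 4096) {
        return NULL;
    }
    void *page = mmap(NULL, 4096,
                      PROT_READ | PROT_WRITE | PROT_EXEC,   /* 7 */
                      MAP_PRIVATE | MAP_ANONYMOUS,          /* 0x22 */
                      -1, 0);
    if (page == MAP_FAILED) {
        return NULL;
    }
    memcpy(page, src, len);   /* same effect as the byte-by-byte loop */
    return page;
}
```

Unlike the original, this version checks the result of the mapping before writing to it.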

real sierra Jun 1, 2025, 8:53 PM

#

these lines are a bit dubious

tender river Jun 1, 2025, 8:55 PM

#

oh its because the old version of lily checked out the old commit

faint sandal Jun 1, 2025, 8:56 PM

#

thanks ida for the descriptive error